Abstract: The saying goes, “All roads lead to Rome.” In programming, however, and in constraint programming in particular, the route taken to reach a solution is crucial. In an ideal declarative world, ...
SlopCodeBench evaluates coding agents under iterative specification refinement: the agent implements a spec, then extends its own code as the spec changes. This exposes behaviors that single-shot ...