SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement

Python