SlopCodeBench: Measuring Code Erosion Under Iterative Specification Refinement

71 stars
17 forks
Python
243 views