SWE-bench & SWE-bench Verified Benchmarks

SWE-bench
In their 2023 paper "SWE-bench: Can Language Models Resolve Real-World GitHub Issues?", researchers from Princeton University, Princeton Language and Intelligence, and University of...