SWE-bench
SWE-bench (Lite, Verified, Multimodal, Multilingual) all in one place!
SWE-bench/SWE-bench_Verified • Viewer • Updated Apr 29 • 500 • 147k • 8
SWE-bench/SWE-bench_Multilingual • Viewer • Updated Aug 26 • 300 • 11.2k • 6
SWE-bench/SWE-bench_Multimodal • Viewer • Updated Apr 29 • 612 • 1.06k • 7
SWE-bench/SWE-bench_Lite • Viewer • Updated Apr 29 • 323 • 8.04k • 9

SWE-agent-LM
A collection of language models trained on SWE-smith + (mini-)SWE-agent for SWE-bench tasks
SWE-bench/SWE-agent-LM-32B • Text Generation • 33B • Updated May 12 • 642 • 68
SWE-bench/SWE-agent-LM-7B • Text Generation • 8B • Updated Jul 13 • 178 • 4
SWE-bench/SWE-Rater-32B • 33B • Updated Jun 1 • 10 • 3

SWE-smith
SWE-smith datasets of task instances for different programming languages
SWE-bench/SWE-smith-py • Viewer • Updated 2 days ago • 50.9k • 26
SWE-bench/SWE-smith-go • Viewer • Updated 2 days ago • 8.21k • 12
SWE-bench/SWE-smith-java • Viewer • Updated 2 days ago • 11 • 5
SWE-bench/SWE-smith-rs • Viewer • Updated 2 days ago • 1 • 5