DCAgent2/swebench-verified-sample-100_Qwen3-Coder-30B-A3B-Instruct-FP8_20251126 Viewer • Updated 24 days ago • 99 • 14
DCAgent2/swebench-verified-sample-100_Qwen3-Coder-30B-A3B-Instruct-FP8_20251126 Viewer • Updated 24 days ago • 99 • 14
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 36
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 36