Scale AI
company
Verified
AI & ML interests
None defined yet.
Papers
Agentic Rubrics as Contextual Verifiers for SWE Agents
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
Datasets
20
ScaleAI/audiomc • 452 • 398 • 4
ScaleAI/SciPredict • 405 • 97 • 1
ScaleAI/PRBench • 1.65k • 615 • 6
ScaleAI/MCP-Atlas • 500 • 615 • 6
ScaleAI/VisualToolBench • 1.2k • 70 • 2
ScaleAI/dummy_mcp • 16 • 12
ScaleAI/researchrubrics • 101 • 154 • 17
ScaleAI/swe-oec-claude-expert • 1.27k • 61 • 1
ScaleAI/TutorBench • 1.47k • 215 • 3
ScaleAI/SWE-bench_Pro • 731 • 16.2k • 47