Berkeley Function Calling Leaderboard
View the Berkeley Function-Calling Leaderboard
Gotta rank 'em all!
View the Berkeley Function-Calling Leaderboard
More advanced and challenging multi-task evaluation
Compact LLM Battle Arena: Frugal AI Face-Off!
Text to Video and Image to Video Arena & Leaderboard
AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Ranking of LLMs for agentic tasks
Uncensored General Intelligence Leaderboard
Compare two AI models' answers to document questions
Explore code-generation model leaderboards and task details
Can AI Code? An LLM leaderboard inclquantized models.
Duplicate this leaderboard to initialize your own!
Embedding Leaderboard
Explore LLM performance across hardware configurations
View and filter LLM hallucination leaderboard
VLMEvalKit Evaluation Results Collection
Explore LLM performance with a leaderboard
Display hardware performance leaderboard
A Comprehensive Benchmark for Generative Writing
Evaluating LLMs by their role-playing capabilities.
Explore and compare speechβrecognition model benchmarks