Leaderboards - a Felladrin Collection

Leaderboards

updated Dec 13, 2025

Gotta rank 'em all!

Running

123

Berkeley Function Calling Leaderboard

🏃

View the Berkeley Function-Calling Leaderboard
Running on CPU Upgrade

241

MMLU-Pro Leaderboard

🥇

More advanced and challenging multi-task evaluation
Running

352

GPU Poor LLM Arena

🏆

Compact LLM Battle Arena: Frugal AI Face-Off!
Running

192

Video Generation Leaderboard

📊

Text to Video and Image to Video Arena & Leaderboard
Running

Featured

87

Music Arena Leaderboard

🎵

AI Music Arena & Leaderboard (Suno, Udio, Google, Meta, +)
Running on CPU Upgrade

445

Agent Leaderboard

💬

Ranking of LLMs for agentic tasks
Running

1.49k

UGI Leaderboard

📢

Uncensored General Intelligence Leaderboard
Running on Zero

31

SLM RAG Arena

🤼

Compare two AI models' answers to document questions
Running

230

BigCodeBench Leaderboard

🥇

Explore code-generation model leaderboards and task details
Running

450

Can Ai Code Results

🏆

Can AI Code? An LLM leaderboard including quantized models.
Running

10

Web Bench Leaderboard

🥇

Duplicate this leaderboard to initialize your own!
Running on CPU Upgrade

7.03k

MTEB Leaderboard

🥇

Embedding Leaderboard
Running

Featured

583

LLM-Perf Leaderboard

🏆

Explore LLM performance across hardware configurations
Running on CPU Upgrade

190

LLM Hallucination Leaderboard

🚀

View and filter LLM hallucination leaderboard
Running on CPU Upgrade

990

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection
Running

16

LLM Inference Benchmark

🥇

Explore LLM performance with a leaderboard
Running

18

Edge LLM Leaderboard

🌖

Display hardware performance leaderboard
Running

19

WritingBench

🏆

A Comprehensive Benchmark for Generative Writing
Running

2

RPEval

🏆

Evaluating LLMs by their role-playing capabilities.
Running on CPU Upgrade

Featured

1.22k

Open ASR Leaderboard

🏆

Explore and compare speech‑recognition model benchmarks