HyperCLOVA X SEED Collection HyperCLOVA X SEED is NAVER's lightweight open-source lineup with a strong focus on Korean language performance • 6 items • Updated 25 days ago • 41
SciCode: A Research Coding Benchmark Curated by Scientists Paper • 2407.13168 • Published Jul 18, 2024 • 17
Falcon-H1 Collection Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 39 items • Updated 9 days ago • 59
Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance Paper • 2507.22448 • Published Jul 30, 2025 • 69
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 13 days ago • 25
3LM Arabic Benchmark Collection Arabic benchmark datasets https://arxiv.org/pdf/2507.15850 • 6 items • Updated Nov 6, 2025 • 3
3LM: Bridging Arabic, STEM, and Code through Benchmarking Paper • 2507.15850 • Published Jul 21, 2025 • 5
Arabic LLM Models Collection A collection of general purpose Arabic LLM models from SILMA AI • 2 items • Updated Jul 7, 2025 • 2
Article ABBL: NextGen LLM Benchmark & Leaderboard for evaluating Arabic models May 18, 2025 • 3
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 26 days ago • 44
Bolmo Collection Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated 26 days ago • 12
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 7 items • Updated 2 days ago • 121
Pearl Collection PEARL: A Multimodal Culturally-Aware Arabic Instruction Dataset • 4 items • Updated Oct 27, 2025 • 6
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 85
Devstral 2 Collection A couple of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases, edit multiple files, and power SWE Agents. • 3 items • Updated Dec 9, 2025 • 39