8 30

Arslan

cowgoesmoo

volf52

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Zyphra/ZAYA1-8B

liked a model 6 days ago

ibm-granite/granite-4.1-8b

updated a collection 17 days ago

reasoning-gym

View all activity

Organizations

liked a model 2 days ago

Zyphra/ZAYA1-8B

9B • Updated 2 days ago • 110k • 467

liked a model 6 days ago

ibm-granite/granite-4.1-8b

Text Generation • 9B • Updated 9 days ago • 39.4k • 170

updated a collection 17 days ago

reasoning-gym

Collection

Datasets generated using https://github.com/open-thought/reasoning-gym (with Qwen3-instruct templates) • 15 items • Updated 17 days ago

liked a model 17 days ago

moonshotai/MoonViT-SO-400M

Image Feature Extraction • 0.4B • Updated Apr 17, 2025 • 4.97k • 42

liked a model about 1 month ago

llava-hf/llava-v1.6-mistral-7b-hf

Image-Text-to-Text • 8B • Updated Dec 22, 2025 • 567k • 308

liked 2 Spaces about 2 months ago

Evaluation Guidebook

📝

317

Explore LLM benchmark trends over time

The Smol Training Playbook

📚

3.17k

The secrets to building world-class LLMs

liked 2 models 4 months ago

LiquidAI/LFM2-2.6B

Text Generation • 3B • Updated Mar 30 • 6.38k • 188

IQuestLab/IQuest-Coder-V1-40B-Instruct

Text Generation • 40B • Updated Mar 4 • 27.4k • 291

liked 2 models 5 months ago

nvidia/NitroGen

Reinforcement Learning • Updated Feb 5 • 532

NousResearch/nomos-1

Text Generation • Updated Jan 10 • 485 • 148

upvoted a collection 5 months ago

GLM-4.6

Collection

2 items • Updated Mar 2 • 53

upvoted 5 papers 5 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 449

liked a model 7 months ago

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 4, 2025 • 2.94M • 3.23k

liked a model 9 months ago

google-bert/bert-large-uncased

Fill-Mask • 0.3B • Updated Feb 19, 2024 • 1.1M • • 147

upvoted a paper 9 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129