Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2602.10560

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 106
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 44
Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 45

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29

Agentic / LLm stuff

Agentic Uncertainty Quantification

Paper • 2601.15703 • Published Jan 22 • 9
From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Paper • 2601.15690 • Published Jan 22 • 4
Agentic Confidence Calibration

Paper • 2601.15778 • Published Jan 22 • 6
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29

Stuff I'm going to read

about 5 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 160
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 52
Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 71
Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

Paper • 2601.19895 • Published Jan 27 • 25

Agent Knowledge

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 30 days ago • 69
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29
SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37
Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

Paper • 2602.02007 • Published Feb 2 • 16

Good agents related space, model, dataset

Good agents related space, model, dataset collection

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 44.5k • • 1.4k
Running

30

GLM 4.5V Demo App

🏃

30

Demo App of dmg file
nvidia/Cosmos-Reason1-7B

Image-Text-to-Text • Updated Dec 10, 2025 • 71.4k • 235
Running

MCP

Featured

159

Web Search MCP

🔎

159

Search and extract web content for LLM ingestion

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 106
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78
In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 44
Matryoshka Diffusion Models

Paper • 2310.15111 • Published Oct 23, 2023 • 45

Stuff I'm going to read

about 5 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published Jan 6 • 160
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Paper • 2601.07832 • Published Jan 12 • 52
Motion Attribution for Video Generation

Paper • 2601.08828 • Published Jan 13 • 71
Post-LayerNorm Is Back: Stable, ExpressivE, and Deep

Paper • 2601.19895 • Published Jan 27 • 25

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29

Agent Knowledge

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 30 days ago • 69
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29
SimpleMem: Efficient Lifelong Memory for LLM Agents

Paper • 2601.02553 • Published Jan 5 • 37
Beyond RAG for Agent Memory: Retrieval by Decoupling and Aggregation

Paper • 2602.02007 • Published Feb 2 • 16

Agentic / LLm stuff

Agentic Uncertainty Quantification

Paper • 2601.15703 • Published Jan 22 • 9
From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Paper • 2601.15690 • Published Jan 22 • 4
Agentic Confidence Calibration

Paper • 2601.15778 • Published Jan 22 • 6
When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 28 days ago • 29

Good agents related space, model, dataset

Good agents related space, model, dataset collection

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 44.5k • • 1.4k
Running

30

GLM 4.5V Demo App

🏃

30

Demo App of dmg file
nvidia/Cosmos-Reason1-7B

Image-Text-to-Text • Updated Dec 10, 2025 • 71.4k • 235
Running

MCP

Featured

159

Web Search MCP

🔎

159

Search and extract web content for LLM ingestion

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs