-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
Collections
Discover the best community collections!
Collections including paper arxiv:2605.06614
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 76 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
Paper • 2510.08002 • Published • 24 -
Self-Improving LLM Agents at Test-Time
Paper • 2510.07841 • Published • 10 -
The Denario project: Deep knowledge AI agents for scientific discovery
Paper • 2510.26887 • Published • 8
-
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
Paper • 2604.04804 • Published • 35 -
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 34 -
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
Paper • 2603.09716 • Published -
SkillOS: Learning Skill Curation for Self-Evolving Agents
Paper • 2605.06614 • Published • 45
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147
-
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 70 -
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
Paper • 2502.06060 • Published • 38 -
MLGym: A New Framework and Benchmark for Advancing AI Research Agents
Paper • 2502.14499 • Published • 195 -
SurveyX: Academic Survey Automation via Large Language Models
Paper • 2502.14776 • Published • 100
-
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
Paper • 2604.04804 • Published • 35 -
XSkill: Continual Learning from Experience and Skills in Multimodal Agents
Paper • 2603.12056 • Published • 34 -
AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
Paper • 2603.09716 • Published -
SkillOS: Learning Skill Curation for Self-Evolving Agents
Paper • 2605.06614 • Published • 45
-
ClawEnvKit: Automatic Environment Generation for Claw-Like Agents
Paper • 2604.18543 • Published • 30 -
Near-Future Policy Optimization
Paper • 2604.20733 • Published • 76 -
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
Paper • 2604.20987 • Published • 21 -
PATRA: Pattern-Aware Alignment and Balanced Reasoning for Time Series Question Answering
Paper • 2602.23161 • Published
-
ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling
Paper • 2603.25746 • Published • 155 -
TAPS: Task Aware Proposal Distributions for Speculative Sampling
Paper • 2603.27027 • Published • 144 -
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models
Paper • 2603.25716 • Published • 156 -
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Paper • 2603.27538 • Published • 147
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 276 -
Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks
Paper • 2510.08002 • Published • 24 -
Self-Improving LLM Agents at Test-Time
Paper • 2510.07841 • Published • 10 -
The Denario project: Deep knowledge AI agents for scientific discovery
Paper • 2510.26887 • Published • 8