Collections
Collections including paper arxiv:2510.13998

- TradingAgents: Multi-Agents LLM Financial Trading Framework
  Paper • 2412.20138 • Published • 16
- MinerU: An Open-Source Solution for Precise Document Content Extraction
  Paper • 2409.18839 • Published • 40
- MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
  Paper • 2509.22186 • Published • 144
- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 134

- Paper2Web: Let's Make Your Paper Alive!
  Paper • 2510.15842 • Published • 27
- Paper2Video: Automatic Video Generation from Scientific Papers
  Paper • 2510.05096 • Published • 119
- BitNet Distillation
  Paper • 2510.13998 • Published • 59
- Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
  Paper • 2510.27623 • Published • 13

- SINQ: Sinkhorn-Normalized Quantization for Calibration-Free Low-Precision LLM Weights
  Paper • 2509.22944 • Published • 80
- Robot Learning: A Tutorial
  Paper • 2510.12403 • Published • 122
- UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
  Paper • 2510.13344 • Published • 63
- Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
  Paper • 2510.06308 • Published • 55

- Your Group-Relative Advantage Is Biased
  Paper • 2601.08521 • Published • 151
- Agent Lightning: Train ANY AI Agents with Reinforcement Learning
  Paper • 2508.03680 • Published • 134
- mHC: Manifold-Constrained Hyper-Connections
  Paper • 2512.24880 • Published • 305
- BitNet Distillation
  Paper • 2510.13998 • Published • 59

- The Art of Scaling Reinforcement Learning Compute for LLMs
  Paper • 2510.13786 • Published • 32
- Attention Is All You Need for KV Cache in Diffusion LLMs
  Paper • 2510.14973 • Published • 42
- BitNet Distillation
  Paper • 2510.13998 • Published • 59
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model
  Paper • 2510.19430 • Published • 52

- Demystifying Reinforcement Learning in Agentic Reasoning
  Paper • 2510.11701 • Published • 32
- Self-Improving LLM Agents at Test-Time
  Paper • 2510.07841 • Published • 10
- Making Mathematical Reasoning Adaptive
  Paper • 2510.04617 • Published • 23
- DocReward: A Document Reward Model for Structuring and Stylizing
  Paper • 2510.11391 • Published • 27