-
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
Paper • 2604.16029 • Published • 23 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 57 -
REFRAG: Rethinking RAG based Decoding
Paper • 2509.01092 • Published • 9 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 90
Collections
Discover the best community collections!
Collections including paper arxiv:2604.15804
-
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Paper • 2603.17187 • Published • 139 -
Attention Residuals
Paper • 2603.15031 • Published • 184 -
MOSS-TTS Technical Report
Paper • 2603.18090 • Published • 12 -
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
Paper • 2603.23516 • Published • 49
-
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 172 -
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Paper • 2505.22453 • Published • 46 -
UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning
Paper • 2505.23380 • Published • 22 -
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
Paper • 2505.21523 • Published • 13
-
PersonaVLM: Long-Term Personalized Multimodal LLMs
Paper • 2604.13074 • Published • 45 -
Elucidating the SNR-t Bias of Diffusion Probabilistic Models
Paper • 2604.16044 • Published • 74 -
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems
Paper • 2604.04936 • Published • 26 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 57
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Paper • 2508.00414 • Published • 96 -
Continuous Autoregressive Language Models
Paper • 2510.27688 • Published • 74 -
MiMo-Embodied: X-Embodied Foundation Model Technical Report
Paper • 2511.16518 • Published • 26
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25
-
Cut Your Losses! Learning to Prune Paths Early for Efficient Parallel Reasoning
Paper • 2604.16029 • Published • 23 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 57 -
REFRAG: Rethinking RAG based Decoding
Paper • 2509.01092 • Published • 9 -
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation
Paper • 2604.18486 • Published • 90
-
PersonaVLM: Long-Term Personalized Multimodal LLMs
Paper • 2604.13074 • Published • 45 -
Elucidating the SNR-t Bias of Diffusion Probabilistic Models
Paper • 2604.16044 • Published • 74 -
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems
Paper • 2604.04936 • Published • 26 -
Qwen3.5-Omni Technical Report
Paper • 2604.15804 • Published • 57
-
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
Paper • 2603.17187 • Published • 139 -
Attention Residuals
Paper • 2603.15031 • Published • 184 -
MOSS-TTS Technical Report
Paper • 2603.18090 • Published • 12 -
MSA: Memory Sparse Attention for Efficient End-to-End Memory Model Scaling to 100M Tokens
Paper • 2603.23516 • Published • 49
-
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper • 2508.06471 • Published • 211 -
Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training
Paper • 2508.00414 • Published • 96 -
Continuous Autoregressive Language Models
Paper • 2510.27688 • Published • 74 -
MiMo-Embodied: X-Embodied Foundation Model Technical Report
Paper • 2511.16518 • Published • 26
-
Qwen2.5-Omni Technical Report
Paper • 2503.20215 • Published • 172 -
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO
Paper • 2505.22453 • Published • 46 -
UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning
Paper • 2505.23380 • Published • 22 -
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
Paper • 2505.21523 • Published • 13
-
Can Large Language Models Understand Context?
Paper • 2402.00858 • Published • 24 -
OLMo: Accelerating the Science of Language Models
Paper • 2402.00838 • Published • 85 -
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 153 -
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
Paper • 2401.17072 • Published • 25