Scaling Open-Ended Reasoning to Predict the Future Paper • 2512.25070 • Published about 23 hours ago • 11
IQuestLab/IQuest-Coder-V1-40B-Loop-Instruct Text Generation • 40B • Updated about 24 hours ago • 27 • 64
Rethinking Sample Polarity in Reinforcement Learning with Verifiable Rewards Paper • 2512.21625 • Published 7 days ago • 3
TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 15 days ago • 88
Openhands Trajectories Collection Dataset of 67,074 OpenHands trajectories collected with Qwen3-Coder-480B-A35B-Instruct and two RFT checkpoints trained on the data • 3 items • Updated 9 days ago • 6
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published 14 days ago • 82
VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse Paper • 2512.14531 • Published 16 days ago • 11
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics Paper • 2512.12602 • Published 18 days ago • 40
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management Paper • 2512.12967 • Published 18 days ago • 103
Bolmo Collection Artifacts for the Bolmo release: https://allenai.org/papers/bolmo. • 4 items • Updated 9 days ago • 12
Olmo 3.1 Collection The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets... • 9 items • Updated 9 days ago • 40