A Survey of Reinforcement Learning for Large Reasoning Models Paper • 2509.08827 • Published Sep 10 • 190
ByteDance Papers Collection ByteDance papers collection • 131 items • Updated about 13 hours ago • 22
barc0/200k_HEAVY_gpt4o-description-gpt4omini-code_generated_problems Viewer • Updated Nov 2, 2024 • 139k • 124 • 11
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published Mar 12 • 74
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published Feb 13 • 37