Rui-Jie Zhu's picture

Rui-Jie Zhu

ridger

·

AI & ML interests

None yet

Recent Activity

upvoted a collection 2 days ago

Nemotron-Cascade 2

liked a dataset 12 days ago

stepfun-ai/Step-3.5-Flash-SFT

upvoted a collection 21 days ago

View all activity

Organizations

upvoted a collection 2 days ago

Nemotron-Cascade 2

Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation • 4 items • Updated 3 days ago • 38

upvoted a collection 21 days ago

Qwen3.5

21 items • Updated 18 days ago • 1.32k

upvoted a paper 21 days ago

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published 23 days ago • 177

upvoted 5 papers about 2 months ago

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published Feb 3 • 27

LoopViT: Scaling Visual ARC with Looped Transformers

Paper • 2602.02156 • Published Feb 2 • 12

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published Feb 2 • 259

ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation

Paper • 2601.21420 • Published Jan 29 • 42

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published Jan 28 • 66

upvoted a collection 2 months ago

OpenThinker-Agent

5 items • Updated Dec 6, 2025 • 8

upvoted 3 papers 3 months ago

Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space

Paper • 2512.24617 • Published Dec 31, 2025 • 65

Universal Reasoning Model

Paper • 2512.14693 • Published Dec 16, 2025 • 43

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published Dec 16, 2025 • 121

upvoted 2 papers 4 months ago

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 38

Motif 2 12.7B technical report

Paper • 2511.07464 • Published Nov 7, 2025 • 39

upvoted 3 papers 5 months ago

Kimi Linear: An Expressive, Efficient Attention Architecture

Paper • 2510.26692 • Published Oct 30, 2025 • 133

Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229

Parallel Loop Transformer for Efficient Test-Time Computation Scaling

Paper • 2510.24824 • Published Oct 28, 2025 • 17

upvoted a collection 5 months ago

Ouro

a family of pre-trained Looped Language Models. • 4 items • Updated Oct 29, 2025 • 26

upvoted a paper 5 months ago

Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets

Paper • 2510.19944 • Published Oct 22, 2025 • 22

upvoted a paper 6 months ago

Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation

Paper • 2509.25849 • Published Sep 30, 2025 • 48