2 18 24

Zhen Dong

zhendongucb

https://dong-zhen.com/

AI & ML interests

None yet

Recent Activity

authored a paper 12 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

authored a paper 12 days ago

Can World Models Benefit VLMs for World Dynamics?

authored a paper 12 days ago

NVIDIA Nemotron 3: Efficient and Open Intelligence

View all activity

Organizations

authored 6 papers 12 days ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 51

Can World Models Benefit VLMs for World Dynamics?

Paper • 2510.00855 • Published Oct 1, 2025 • 3

NVIDIA Nemotron 3: Efficient and Open Intelligence

Paper • 2512.20856 • Published Dec 24, 2025 • 44

Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2512.20848 • Published Dec 23, 2025 • 43

TS-Attn: Temporal-wise Separable Attention for Multi-Event Video Generation

Paper • 2604.19473 • Published Apr 21

Agents' Last Exam

Paper • 2606.05405 • Published 20 days ago • 358

upvoted a paper 12 days ago

Agents' Last Exam

Paper • 2606.05405 • Published 20 days ago • 358

liked 2 datasets 6 months ago

nvidia/Nemotron-Math-Proofs-v1

Viewer • Updated Jan 5 • 925k • 731 • 122

nvidia/Nemotron-CC-v2.1

Viewer • Updated Dec 22, 2025 • 3.8B • 5.12k • 129

upvoted a collection 6 months ago

Nemotron-Pre-Training-Datasets

Collection

Large scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 11 days ago • 168

liked a dataset 6 months ago

nvidia/Nemotron-Agentic-v1

Preview • Updated Dec 15, 2025 • 5k • 168

liked 2 models 6 months ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

Text Generation • 32B • Updated Mar 15 • 364k • • 350

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16

Text Generation • 32B • Updated Mar 15 • 1.26M • • 773

liked a dataset 10 months ago

nvidia/Llama-Nemotron-VLM-Dataset-v1

Viewer • Updated Oct 22, 2025 • 2.86M • 3.63k • 166

liked a dataset 11 months ago

nvidia/Llama-Nemotron-Post-Training-Dataset

Viewer • Updated May 8, 2025 • 3.91M • 4.4k • 677

liked a model 11 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-FP8

Text Generation • 50B • Updated Oct 15, 2025 • 243k • 28

upvoted 2 collections 11 months ago

NexusRaven V2

Collection

9 items • Updated Mar 2 • 3

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 12 items • Updated 11 days ago • 78

liked a model 11 months ago

nvidia/Llama-3_3-Nemotron-Super-49B-v1_5

Text Generation • 50B • Updated Oct 15, 2025 • 799k • • 234

authored a paper 11 months ago

R-KV: Redundancy-aware KV Cache Compression for Reasoning Models

Paper • 2505.24133 • Published May 30, 2025 • 2

Zhen Dong

AI & ML interests

Recent Activity

Organizations

zhendongucb's activity