1 26 10

Tianchen Zhao

A-suozhang

https://a-suozhang.xyz

A-suozhang

AI & ML interests

efficient deep learning

Recent Activity

liked a model 10 days ago

deepseek-ai/DeepSeek-V3.1-Terminus

upvoted a paper about 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

liked a model about 2 months ago

miromind-ai/MiroThinker-v1.0-8B

View all activity

Organizations

upvoted a paper about 2 months ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published Nov 11, 2025 • 105

upvoted a collection about 2 months ago

MiroThinker-v0.2

Collection

Better performance in multi-hop search and multilingual tasks. • 8 items • Updated Nov 9, 2025 • 7

upvoted 4 papers 3 months ago

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

Paper • 2510.06710 • Published Oct 8, 2025 • 39

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3, 2025 • 97

Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Paper • 2509.23202 • Published Sep 27, 2025 • 27

SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear Attention

Paper • 2509.24006 • Published Sep 28, 2025 • 118

upvoted an article 6 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Feb 11, 2025

•

upvoted 4 papers 7 months ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3, 2025 • 58

SageAttention2++: A More Efficient Implementation of SageAttention2

Paper • 2505.21136 • Published May 27, 2025 • 45

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Paper • 2505.21600 • Published May 27, 2025 • 71

upvoted 3 papers 9 months ago

A Unified Agentic Framework for Evaluating Conditional Image Generation

Paper • 2504.07046 • Published Apr 9, 2025 • 30

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published Apr 8, 2025 • 77

AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation

Paper • 2503.19693 • Published Mar 25, 2025 • 76

upvoted 2 papers 10 months ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published Mar 12, 2025 • 45

Chain of Draft: Thinking Faster by Writing Less

Paper • 2502.18600 • Published Feb 25, 2025 • 50

upvoted 2 papers about 1 year ago

MarDini: Masked Autoregressive Diffusion for Video Generation at Scale

Paper • 2410.20280 • Published Oct 26, 2024 • 23

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2406.02540 • Published Jun 4, 2024 • 3

upvoted 2 papers over 1 year ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data

Paper • 2408.10119 • Published Aug 19, 2024 • 17

Tianchen Zhao

AI & ML interests

Recent Activity

Organizations

A-suozhang's activity

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment