1 22 4

Lancer

ruixiangma

AI & ML interests

None yet

Recent Activity

new activity 21 days ago

NucleusAI/Nucleus-Image:Diffusers example for low vram?

updated a model about 1 month ago

ruixiangma/LongCat-AudioDiT-1B-Diffusers

published a model about 1 month ago

ruixiangma/LongCat-AudioDiT-1B-Diffusers

View all activity

Organizations

None yet

upvoted 4 papers 3 months ago

upvoted a paper 4 months ago

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

Paper • 2601.17058 • Published Jan 22 • 190

upvoted a collection 4 months ago

Qwen3-ASR

Collection

4 items • Updated Jan 29 • 66

upvoted 5 papers 4 months ago

Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification

Paper • 2601.15808 • Published Jan 22 • 20

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published Jan 22 • 92

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published Jan 8 • 170

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 231

Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning

Paper • 2601.03872 • Published Jan 7 • 45

upvoted a paper 5 months ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 156

upvoted an article 6 months ago

Article

Continuous batching from first principles

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 385

upvoted 2 articles 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 312

Article

mem-agent: Equipping LLM Agents with Memory Using RL

driaforall

•

Oct 9, 2025

• 33

upvoted an article 8 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

NormalUhr

•

Aug 9, 2025

• 118

upvoted a paper 8 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2, 2025 • 238

upvoted a paper 10 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4, 2025 • 167

upvoted an article 12 months ago

Article

Vision Language Models (Better, faster, stronger)

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 611

upvoted an article about 1 year ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

NormalUhr

•

Feb 7, 2025

• 293

Lancer

AI & ML interests

Recent Activity

Organizations

ruixiangma's activity

Continuous batching from first principles

Supercharge your OCR Pipelines with Open Models

mem-agent: Equipping LLM Agents with Memory Using RL

From GRPO to DAPO and GSPO: What, Why, and How

Vision Language Models (Better, faster, stronger)

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge