1 17 88

Maojia Song

OrangeEye

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

Evaluating Gemini Robotics Policies in a Veo World Simulator

upvoted a paper 22 days ago

Dyna-Mind: Learning to Simulate from Experience for Better AI Agents

upvoted a paper 22 days ago

SIMA 2: A Generalist Embodied Agent for Virtual Worlds

View all activity

Organizations

upvoted a paper 14 days ago

Evaluating Gemini Robotics Policies in a Veo World Simulator

Paper • 2512.10675 • Published 19 days ago • 16

upvoted 3 papers 22 days ago

upvoted a paper about 1 month ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20 • 108

upvoted a paper about 2 months ago

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Paper • 2508.18321 • Published Aug 24 • 2

liked a Space about 2 months ago

The Smol Training Playbook

📚

2.74k

The secrets to building world-class LLMs

upvoted a paper 3 months ago

Demystifying deep search: a holistic evaluation with hint-free multi-hop questions and factorised metrics

Paper • 2510.05137 • Published Oct 1 • 5

authored a paper 3 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 117

upvoted a paper 3 months ago

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 117

updated a dataset 4 months ago

declare-lab/KAIROS_EVAL

Viewer • Updated Aug 31 • 3k • 225 • 2

published a dataset 4 months ago

declare-lab/KAIROS_EVAL

Viewer • Updated Aug 31 • 3k • 225 • 2

liked a dataset 4 months ago

macabdul9/hle_text_only

Viewer • Updated Feb 18 • 2.37k • 9.18k • 1

liked a dataset 5 months ago

malaysia-ai/GH200-ARM64-vLLM-wheel

Updated Sep 20 • 87 • 1

upvoted a paper 5 months ago

WebWatcher: Breaking New Frontier of Vision-Language Deep Research Agent

Paper • 2508.05748 • Published Aug 7 • 141

upvoted an article 7 months ago

Article

🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI?

Dec 25, 2024

•

liked 2 datasets 8 months ago

nvidia/Nemotron-CrossThink

Preview • Updated May 1 • 246 • 112

JoeYing/ReTool-SFT

Viewer • Updated Apr 29 • 2k • 572 • 49

upvoted a paper 8 months ago

Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling

Paper • 2504.13169 • Published Apr 17 • 39

liked a dataset 9 months ago

cais/hle

Viewer • Updated Sep 10 • 2.5k • 22.8k • 629