Huan Sun's picture

1 8 6

Huan Sun

huansun

·

http://web.cse.ohio-state.edu/~sun.397/

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

liked a dataset 6 months ago

osunlp/Mind2Web-2

upvoted a paper 6 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

View all activity

Organizations

upvoted a paper 2 months ago

Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents

Paper • 2510.24702 • Published Oct 28 • 28

upvoted a paper 6 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26 • 51

upvoted a paper 7 months ago

RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

Paper • 2505.21936 • Published May 28 • 1

upvoted a paper 10 months ago

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

Paper • 2502.14296 • Published Feb 20 • 45

upvoted a paper over 1 year ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 41

upvoted 2 papers about 2 years ago

Mind2Web: Towards a Generalist Agent for the Web

Paper • 2306.06070 • Published Jun 9, 2023 • 19

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI

Paper • 2311.16502 • Published Nov 27, 2023 • 37

upvoted a paper over 2 years ago

MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Paper • 2309.05653 • Published Sep 11, 2023 • 10