Alan Wang

kaiw7
AI & ML interests

None yet

Organizations

ab

upvoted a paper 3 months ago

Self-Improvement in Multimodal Large Language Models: A Survey

Paper • 2510.02665 • Published Oct 3, 2025 • 20
upvoted 2 papers 4 months ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29, 2025 • 140

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3, 2025 • 33
upvoted a paper 10 months ago

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30, 2025 • 57
upvoted a paper 11 months ago

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Paper • 2502.06782 • Published Feb 10, 2025 • 15
upvoted a paper about 1 year ago

MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks

Paper • 2410.10563 • Published Oct 14, 2024 • 37
upvoted a paper over 1 year ago

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation

Paper • 2406.07686 • Published Jun 11, 2024 • 17