Faves - a fysp Collection

fysp 's Collections

Faves

Faves

updated Dec 19, 2025

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Paper • 2509.16197 • Published Sep 19, 2025 • 56
InternRobotics/VLAC

Robotics • 2B • Updated 3 days ago • 43 • 39
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence

Paper • 2509.12203 • Published Sep 15, 2025 • 20
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning

Paper • 2509.15937 • Published Sep 19, 2025 • 20
PersonaX: Multimodal Datasets with LLM-Inferred Behavior Traits

Paper • 2509.11362 • Published Sep 14, 2025 • 5
google/gemma-3n-E2B-it-litert-lm

Text Generation • Updated Dec 8, 2025 • 19.4k • 300