Tong Wu's picture

15 5

Tong Wu

wutong16

wutong16

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties

updated a model about 2 months ago

wutong16/upload_1108

published a model about 2 months ago

wutong16/upload_1108

View all activity

Organizations

None yet

authored 19 papers about 1 year ago

V3Det: Vast Vocabulary Visual Detection Dataset

Paper • 2304.03752 • Published Apr 7, 2023 • 1

GPT4Point: A Unified Framework for Point-Language Understanding and Generation

Paper • 2312.02980 • Published Dec 5, 2023 • 9

Alpha-CLIP: A CLIP Model Focusing on Wherever You Want

Paper • 2312.03818 • Published Dec 6, 2023 • 34

HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image

Paper • 2312.04543 • Published Dec 7, 2023 • 22

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Paper • 2312.15011 • Published Dec 22, 2023 • 18

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

Paper • 2401.04092 • Published Jan 8, 2024 • 21

Large-Vocabulary 3D Diffusion Model with Transformer

Paper • 2309.07920 • Published Sep 14, 2023

ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance

Paper • 2403.12409 • Published Mar 19, 2024 • 10

Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials

Paper • 2404.16829 • Published Apr 25, 2024 • 4

Bootstrap3D: Improving 3D Content Creation with Synthetic Data

Paper • 2406.00093 • Published May 31, 2024 • 1

MotionClone: Training-Free Motion Cloning for Controllable Video Generation

Paper • 2406.05338 • Published Jun 8, 2024 • 41

OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation

Paper • 2301.07525 • Published Jan 18, 2023

V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results

Paper • 2406.11739 • Published Jun 17, 2024

Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images

Paper • 2407.06191 • Published Jul 8, 2024 • 13

LayerPano3D: Layered 3D Panorama for Hyper-Immersive Scene Generation

Paper • 2408.13252 • Published Aug 23, 2024 • 26

3DTopia: Large Text-to-3D Generation Model with Hybrid Diffusion Priors

Paper • 2403.02234 • Published Mar 4, 2024

3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion

Paper • 2409.12957 • Published Sep 19, 2024 • 21

BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way

Paper • 2410.06241 • Published Oct 8, 2024 • 10

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Paper • 2410.09732 • Published Oct 13, 2024 • 54