MoCA3D Checkpoint

This release provides the pretrained MoCA3D checkpoint used in the MoCA3D paper.

Paper

This repository release corresponds to:

MoCA3D: Monocular 3D Bounding Box Prediction in the Image Plane
Changwoo Jeon, Rishi Upadhyay, Achuta Kadambi (arXiv:2603.19538v1, 2026).

Abstract:

Monocular 3D object understanding is reformulated from a 2D RoI-to-3D lifting task into a pixel-space geometry recovery task. MoCA3D predicts projected 3D box corners and per-corner depths without requiring camera intrinsics at inference time, using dense corner heatmaps and depth maps from a tight 2D box input. It is class-agnostic, optimized for image-plane geometry fidelity, and evaluated with Pixel-Aligned Geometry (PAG) metrics (projected-corner and depth consistency).

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Paper for jeoncwcw/MoCA3D