CLIP Behaves like a Bag-of-Words Model Cross-modally but not Uni-modally Paper • 2502.03566 • Published Feb 5, 2025 • 4
YOLOv1 to YOLOv10: The fastest and most accurate real-time object detection systems Paper • 2408.09332 • Published Aug 18, 2024 • 2