WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents Paper • 2510.01354 • Published Oct 1, 2025 • 3
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark Paper • 2402.04788 • Published Feb 7, 2024
Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents Paper • 2505.23450 • Published May 29, 2025 • 9