AI · Neutral · arXiv – CS AI · 7h ago · 6/10
RoboBenchMart: Benchmarking Robots in Retail Environment
Researchers introduced RoboBenchMart, an open-source simulated benchmark for evaluating robotic systems in retail dark-store environments. The study finds that current state-of-the-art vision-language-action (VLA) models struggle with complex grocery manipulation tasks, which suggests that they generalize poorly beyond tabletop scenarios.