AINeutralarXiv – CS AI · 10h ago6/10
🧠
Robusto-2: Benchmarking Humans & VLMs for Autonomous Driving in Lima & New York City
Researchers benchmark Vision Language Models (VLMs) and human drivers from Lima and New York City on autonomous driving comprehension tasks using dashcam footage, finding that VLMs and humans diverge in responses but geography has minimal impact due to the extreme out-of-distribution nature of challenging driving scenarios in these underserved markets.
🏢 Hugging Face