AI · Neutral · arXiv – CS AI · 3d ago · 5/10
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents
Researchers introduce MA-EgoQA, a benchmark for evaluating how well AI models can understand multiple egocentric video streams from embodied agents simultaneously. The benchmark includes 1.7k questions across five categories and reveals that current approaches struggle with system-level understanding across multiple agents.