AI · Neutral · arXiv – CS AI · 8h ago · 6/10
NoRA: Evaluating Grounded Reasonableness in Visual First-person Normative Action Reasoning
Researchers introduce NoRA, a visual reasoning benchmark that evaluates whether AI models can generate and justify appropriate actions in first-person video scenarios through explicit reasoning graphs. The benchmark reveals that current multimodal language models struggle to construct complete action spaces and properly ground decisions in visible evidence, highlighting a critical gap between selecting plausible actions and explaining them through verifiable reasoning.