AINeutralarXiv โ CS AI ยท 6h ago0
๐ง
How effective are VLMs in assisting humans in inferring the quality of mental models from Multimodal short answers?
Researchers developed MMGrader, an AI system to assess student mental models from multimodal responses using concept graphs. Testing 9 open AI models showed they achieved only 40% accuracy compared to human evaluators, indicating current limitations in educational AI assessment tools.