Measuring What Matters -- or What's Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors
Researchers tested a dual-architecture LLM-based automated scoring system for educational assessments and found it generally robust to construct-irrelevant factors such as meaningless text padding and spelling errors. The study suggests that properly designed LLM-based scoring systems can score reliably, though off-topic responses were heavily penalized.