AINeutralarXiv โ CS AI ยท 14h ago6/10
๐ง
Assessing the Pedagogical Readiness of Large Language Models as AI Tutors in Low-Resource Contexts: A Case Study of Nepal's K-10 Curriculum
A comprehensive study evaluates four state-of-the-art LLMs (GPT-4o, Claude Sonnet 4, Qwen3-235B, Kimi K2) for use as AI tutors in Nepal's K-10 curriculum, revealing significant pedagogical gaps despite high technical accuracy. The research identifies critical failure modes including inability to simplify complex concepts for young learners and poor cultural contextualization, concluding that current LLMs require human oversight and curriculum-specific fine-tuning before classroom deployment in low-resource regions.
๐ง GPT-4๐ง Claude๐ง Sonnet