y0news
AnalyticsDigestsRSSAICrypto
#cognitive-assessment1 article
1 articles
AIBearisharXiv โ€“ CS AI ยท 5h ago
๐Ÿง 

Baseline Performance of AI Tools in Classifying Cognitive Demand of Mathematical Tasks

A research study tested 11 AI tools on their ability to classify the cognitive demand of mathematical tasks, finding they achieved only 63% accuracy on average with no tool exceeding 83%. The tools showed systematic bias toward middle-category classifications and struggled with reasoning about underlying cognitive processes versus surface textual features.

๐Ÿข Perplexity๐Ÿง  ChatGPT๐Ÿง  Claude