AI · Neutral · arXiv – CS AI · 5h ago · 6/10
CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions
Researchers introduce CrowdMath, a dataset of 164 expert-annotated collaborative mathematical problem-solving discussions from MIT PRIMES and Art of Problem Solving (2016-2025). Frontier AI models reach 83-88% accuracy at predicting the next post, but they struggle to identify the functional role each contribution plays in the mathematical reasoning. This reveals a gap between solving isolated problems and comprehending how collaborative research progresses.