AI · Neutral · arXiv – CS AI · 4d ago · 6/103
Understanding the Role of Training Data in Test-Time Scaling
This research paper analyzes test-time scaling in large language models. It finds that longer chains of thought (CoTs) can reduce training data requirements, but can harm performance when the skills a task requires are absent from the training data. The study provides a theoretical framework showing that training on diverse, relevant, and challenging tasks yields the best test-time scaling performance.