AINeutralarXiv โ CS AI ยท 10h ago6/10
๐ง
Temperature-Dependent Performance of Prompting Strategies in Extended Reasoning Large Language Models
Researchers systematically evaluated how sampling temperature and prompting strategies affect extended reasoning performance in large language models, finding that zero-shot prompting peaks at moderate temperatures (T=0.4-0.7) while chain-of-thought performs better at extremes. The study reveals that extended reasoning benefits grow substantially with higher temperatures, suggesting that T=0 is suboptimal for reasoning tasks.
๐ง Grok