AIBullisharXiv – CS AI · 14h ago7/10
🧠
Reasoning with Sampling: Cutting at Decision Points
Researchers introduce Entropy-Cut Metropolis-Hastings, an algorithm that improves sampling from power distributions in language models by identifying key decision points using entropy analysis rather than random sampling positions. The method achieves stronger reasoning performance across multiple benchmarks without requiring additional training or reinforcement learning.