AI · Bullish · arXiv – CS AI · 9h ago · 6/10
InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning
Researchers propose InfoDensity, a reinforcement learning reward framework that optimizes Large Language Models for efficient reasoning by measuring information density rather than just output length. The method tracks entropy trajectories to identify high-quality intermediate reasoning steps, achieving better accuracy-efficiency trade-offs on mathematical and general reasoning benchmarks.
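To make the idea of information density concrete, here is a minimal sketch of one way such a score could be computed. It is not the paper's reward formula, which this summary does not give. It assumes we can obtain the model's distribution over candidate answers before reasoning starts and after each reasoning step, and it defines density as the total entropy reduction divided by the number of tokens generated. The function names `shannon_entropy` and `information_density` and both input formats are hypothetical.

```python
import math


def shannon_entropy(probs):
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def information_density(answer_distributions, step_token_counts):
    """Entropy reduction per generated token over a reasoning trace.

    Illustrative assumption, not the paper's definition.

    answer_distributions: one distribution over candidate answers before
        any reasoning, followed by one after each reasoning step.
    step_token_counts: number of tokens generated in each reasoning step.
    """
    if len(answer_distributions) != len(step_token_counts) + 1:
        raise ValueError("need one more distribution than reasoning steps")

    total_tokens = sum(step_token_counts)
    if total_tokens == 0:
        return 0.0

    # Entropy trajectory: uncertainty about the answer after each step.
    entropies = [shannon_entropy(d) for d in answer_distributions]

    # Total uncertainty removed, normalized by the tokens spent removing it.
    return (entropies[0] - entropies[-1]) / total_tokens


# Two traces that resolve the same 2 bits of uncertainty over four answers.
trajectory = [
    [0.25, 0.25, 0.25, 0.25],  # before reasoning: 2 bits
    [0.5, 0.5, 0.0, 0.0],      # after step 1: 1 bit
    [1.0, 0.0, 0.0, 0.0],      # after step 2: 0 bits
]
concise = information_density(trajectory, [10, 10])
verbose = information_density(trajectory, [40, 40])
```

Under this assumed definition, the concise trace scores higher than the verbose one even though both reach the same final certainty, which is the kind of distinction a pure length penalty cannot express in terms of what each step contributed.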