AINeutralarXiv – CS AI · 8h ago6/10
🧠
Rethinking Entropy Minimization in Test-Time Adaptation for Autoregressive Models
Researchers present a unified mathematical framework for Test-Time Adaptation (TTA) in autoregressive generative models, decomposing entropy minimization into token-level policy gradient and entropy losses. Validated on Whisper ASR across 20+ domains, the approach demonstrates consistent performance improvements and reconciles previously disparate adaptation methods under a single theoretical foundation.