AI · Bullish · arXiv – CS AI · 15h ago · 7/10
Learning When to Think While Listening in Large Audio-Language Models
Researchers introduce a learnable control system for Large Audio-Language Models that dynamically decides when to reason during real-time speech interactions. The approach balances responsiveness against accuracy by optimizing the transparency of intermediate reasoning, achieving a 2.7% accuracy improvement while reducing latency on benchmark tasks.