#speech-language-models (1 article)
AI · Bullish · arXiv – CS AI · 7h ago · 7/10

Do We Need Distinct Representations for Every Speech Token? Unveiling and Exploiting Redundancy in Large Speech Language Models

Researchers demonstrate that large speech language models contain significant redundancy in their token representations, particularly in deeper layers. By introducing Affinity Pooling, a training-free token merging technique, they achieve a 27.48% reduction in prefilling FLOPs and up to 1.7× memory savings while maintaining semantic accuracy, challenging the assumption that every speech token needs a fully distinct representation for acoustic processing.
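The core idea behind training-free token merging can be sketched as follows. This is a minimal illustration, not the paper's actual Affinity Pooling algorithm: it assumes a simple greedy scheme that averages adjacent hidden states whose cosine similarity (affinity) exceeds a threshold, which shrinks the sequence length (and thus prefill FLOPs and KV-cache memory) wherever neighboring speech tokens carry redundant representations.

```python
import numpy as np

def merge_redundant_tokens(hidden, threshold=0.9):
    """Greedily merge adjacent token vectors whose cosine similarity
    exceeds `threshold`; merged tokens are replaced by their running mean.
    A training-free, single-pass sketch over one layer's hidden states.
    """
    pooled = [hidden[0].astype(float)]
    counts = [1]  # how many tokens each pooled slot has absorbed
    for vec in hidden[1:]:
        prev = pooled[-1]
        sim = prev @ vec / (np.linalg.norm(prev) * np.linalg.norm(vec) + 1e-8)
        if sim > threshold:
            # redundant neighbor: fold it into the running mean
            counts[-1] += 1
            pooled[-1] = prev + (vec - prev) / counts[-1]
        else:
            # distinct token: start a new slot
            pooled.append(vec.astype(float))
            counts.append(1)
    return np.stack(pooled)

# Two near-identical adjacent tokens collapse into one; the distinct
# third token survives, so 3 tokens pool down to 2.
tokens = np.array([[1.0, 0.0], [0.99, 0.01], [0.0, 1.0]])
merged = merge_redundant_tokens(tokens, threshold=0.9)
print(merged.shape)  # (2, 2)
```

The function name, greedy adjacency rule, and mean-merge choice are illustrative assumptions; the paper's method may use a different affinity measure or merging schedule.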