AIBullisharXiv – CS AI · 10h ago7/10
🧠
CORTIS: Text-Only Adaptation of Spoken Language Models for Task-Oriented Voice Agents
Researchers introduce CORTIS, a framework that enables spoken language models (SLMs) to handle task-oriented voice agent functions using only text-based training data, eliminating the need for expensive paired speech-target annotations. The approach matches or outperforms traditional ASR-LLM cascades while demonstrating superior robustness under acoustic degradation.