y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#speech-llm News & Analysis

3 articles tagged with #speech-llm. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles
AIBullisharXiv โ€“ CS AI ยท Mar 276/10
๐Ÿง 

X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs

Researchers propose X-OPD, a Cross-Modal On-Policy Distillation framework to improve Speech Large Language Models by aligning them with text-based counterparts. The method uses token-level feedback from teacher models to bridge performance gaps in end-to-end speech systems while preserving inherent capabilities.

AINeutralarXiv โ€“ CS AI ยท Mar 114/10
๐Ÿง 

VoxEmo: Benchmarking Speech Emotion Recognition with Speech LLMs

Researchers introduce VoxEmo, a comprehensive benchmark for evaluating Speech Large Language Models on emotion recognition tasks across 35 emotion corpora and 15 languages. The benchmark addresses evaluation challenges in open text generation and introduces novel protocols that better align with human subjective emotion perception.