y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#audio-language-models News & Analysis

3 articles tagged with #audio-language-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles
AINeutralarXiv โ€“ CS AI ยท Mar 117/10
๐Ÿง 

MUGEN: Evaluating and Improving Multi-audio Understanding of Large Audio-Language Models

Researchers introduce MUGEN, a comprehensive benchmark revealing significant weaknesses in large audio-language models when processing multiple concurrent audio inputs. The study shows performance degrades sharply with more audio inputs and proposes Audio-Permutational Self-Consistency as a training-free solution, achieving up to 6.74% accuracy improvements.

AINeutralarXiv โ€“ CS AI ยท Mar 175/10
๐Ÿง 

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

Researchers introduce SAKE, the first benchmark for editing auditory attribute knowledge in large audio-language models without requiring full retraining. The study reveals significant limitations in current editing methods, particularly with auditory generalization and sequential editing, while finding that fine-tuning modality connectors offers better performance than editing LLM backbones directly.