y0news
#audio-ai2 articles
2 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago4
๐Ÿง 

Hello-Chat: Towards Realistic Social Audio Interactions

Researchers have introduced Hello-Chat, an end-to-end audio language model designed to create more realistic and emotionally resonant AI conversations. The model addresses the robotic nature of existing Large Audio Language Models by using real-life conversation data and achieving breakthrough performance in prosodic naturalness and emotional alignment.

AINeutralarXiv โ€“ CS AI ยท 4h ago0
๐Ÿง 

AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

Researchers introduce AudioCapBench, a new benchmark for evaluating how well large multimodal AI models can generate captions for audio content across sound, music, and speech domains. The study tested 13 models from OpenAI and Google Gemini, finding that Gemini models generally outperformed OpenAI in overall captioning quality, though all models struggled most with music captioning.