#voice-assistants News & Analysis

6 articles tagged with #voice-assistants. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

6 articles

AIBullisharXiv – CS AI · May 47/10

🧠

Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment

Researchers demonstrate that minimal subsets of just 50 examples (0.3% of data) can reliably evaluate large audio models with 93%+ correlation to full benchmarks. By training regression models on human-preference-aligned subsets, they achieve 98% correlation with user satisfaction—outperforming full benchmark evaluations—and release the HUMANS benchmark as an efficient LAM evaluation tool.

AINeutralarXiv – CS AI · Jun 236/10

🧠

WASIL: In-the-Wild Arabic Spoken Interactions with LLMs

Researchers released WASIL, a dataset of 8,529 Arabic spoken interactions with LLMs including audio, transcriptions, and user feedback, to address how speech recognition errors degrade voice assistant performance. The dataset includes a 2,000-turn test set covering Modern Standard Arabic and four dialects, with annotations distinguishing between genuine unanswerability and ASR-induced failures, enabling more accurate evaluation of voice AI systems.

AIBullishTechCrunch – AI · May 126/10

🧠

Thinking Machines wants to build an AI that actually listens while it talks

Thinking Machines is developing an AI model that processes user input and generates responses simultaneously, mimicking real-time conversation rather than the current turn-based interaction model used by existing AI systems. This architectural shift could fundamentally change how users interact with AI assistants.

AINeutralarXiv – CS AI · May 116/10

🧠

MIST: Multimodal Interactive Speech-based Tool-calling Conversational Assistants for Smart Homes

Researchers introduce MIST, a synthetic dataset and framework for training voice-based AI assistants to control IoT devices in smart homes. The work reveals significant performance gaps between open and closed-weight multimodal LLMs on complex, real-world smart home tasks requiring spatiotemporal reasoning and mixed-initiative interaction.

AIBullisharXiv – CS AI · Mar 36/104

🧠

A Contemporary Overview: Trends and Applications of Large Language Models on Mobile Devices

Large language models (LLMs) are increasingly being deployed on mobile devices, enabling applications like voice assistants, real-time translation, and intelligent recommendations. Advancements in hardware and 5G infrastructure allow for efficient local inference while improving data privacy and reducing cloud dependency.

AINeutralarXiv – CS AI · Mar 35/104

🧠

Convenience vs. Control: A Qualitative Study of Youth Privacy with Smart Voice Assistants

A study of 26 young Canadians reveals that smart voice assistants' complex privacy controls and lack of transparency discourage privacy-protective behaviors among youth. Researchers propose design improvements including unified privacy hubs, plain-language data labels, and clearer retention policies to empower young users while maintaining convenience.