y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#conversational-ai News & Analysis

71 articles tagged with #conversational-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

71 articles
AIBullisharXiv โ€“ CS AI ยท Mar 266/10
๐Ÿง 

MedAidDialog: A Multilingual Multi-Turn Medical Dialogue Dataset for Accessible Healthcare

Researchers have introduced MedAidDialog, a multilingual medical dialogue dataset covering seven languages, and developed MedAidLM, a conversational AI model for preliminary medical consultations. The system uses parameter-efficient fine-tuning on small language models to enable deployment without high-end computational infrastructure while incorporating patient context for personalized consultations.

AIBearisharXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

The Scenic Route to Deception: Dark Patterns and Explainability Pitfalls in Conversational Navigation

Researchers warn that AI-powered conversational navigation systems using Large Language Models could transform route guidance from verifiable geometric tasks into manipulative dialogues. The study proposes a framework categorizing risks as dark patterns or explainability pitfalls, suggesting neuro-symbolic architectures to maintain trustworthiness.

AIBearisharXiv โ€“ CS AI ยท Mar 176/10
๐Ÿง 

Should LLMs, like, Generate How Users Talk? Building Dialect-Accurate Dialog[ue]s Beyond the American Default with MDial

Researchers introduced MDial, the first large-scale framework for generating multi-dialectal conversational data across nine English dialects, revealing that over 80% of English speakers don't use Standard American English. Evaluation of 17 LLMs showed even frontier models achieve under 70% accuracy in dialect identification, with particularly poor performance on non-American dialects.

AINeutralarXiv โ€“ CS AI ยท Mar 126/10
๐Ÿง 

Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization

Researchers propose Nurture-First Development (NFD), a new paradigm for building domain-expert AI agents through progressive growth via conversational interaction rather than traditional code-first or prompt-first approaches. The method uses a Knowledge Crystallization Cycle to convert operational dialogue into structured knowledge assets, demonstrated through a financial research agent case study.

AINeutralarXiv โ€“ CS AI ยท Mar 116/10
๐Ÿง 

Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts

Researchers propose a framework using policy-parameterized prompts to influence multi-agent LLM dialogue behavior without training. The approach treats prompts as actions and dynamically constructs them through five components to control conversation flow based on metrics like responsiveness and stance shift.

AIBearisharXiv โ€“ CS AI ยท Mar 116/10
๐Ÿง 

Why do we Trust Chatbots? From Normative Principles to Behavioral Drivers

Researchers argue that trust in chatbots is often driven by behavioral manipulation rather than demonstrated trustworthiness, proposing they be viewed as skilled salespeople rather than assistants. The study highlights how design choices exploit cognitive biases to influence user behavior, creating a gap between psychological trust formation and actual trustworthiness.

AIBullishHugging Face Blog ยท Mar 66/10
๐Ÿง 

Conversational LLM Evaluations in Minutes with NVIDIA NeMo Evaluator Agent Skills

NVIDIA has released NeMo Evaluator Agent Skills, a tool that enables rapid evaluation of conversational large language models in minutes. This development streamlines the testing and validation process for LLM applications, potentially accelerating AI development workflows.

๐Ÿข Nvidia
AIBullisharXiv โ€“ CS AI ยท Mar 66/10
๐Ÿง 

GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

Researchers introduced GCAgent, an LLM-driven system that enhances group chat communication through AI dialogue agents. The system achieved significant improvements in real-world deployments, increasing message volume by 28.80% over 350 days and scoring 4.68 across various criteria.

AINeutralarXiv โ€“ CS AI ยท Mar 55/10
๐Ÿง 

Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants

Researchers present a blueprint for evaluating and optimizing multi-agent conversational shopping assistants, addressing challenges in multi-turn interactions and tightly coupled AI systems. The paper introduces evaluation rubrics and two prompt-optimization strategies including a novel Multi-Agent Multi-Turn GEPA approach for system-level optimization.

AIBullisharXiv โ€“ CS AI ยท Mar 45/102
๐Ÿง 

MultiSessionCollab: Learning User Preferences with Memory to Improve Long-Term Collaboration

Researchers introduce MultiSessionCollab, a benchmark for evaluating conversational AI agents' ability to learn and adapt to user preferences across multiple collaboration sessions. The study demonstrates that equipping agents with persistent memory significantly improves long-term collaboration quality, task success rates, and user experience.

AIBullisharXiv โ€“ CS AI ยท Mar 45/102
๐Ÿง 

From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Researchers developed a new method called activation engineering to make AI language models express more human-like emotions in conversations. The technique uses targeted interventions on LLaMA 3.1-8B to enhance emotional characteristics like positive sentiment and personal engagement without extensive fine-tuning.

AIBullisharXiv โ€“ CS AI ยท Mar 37/107
๐Ÿง 

Semantic XPath: Structured Agentic Memory Access for Conversational AI

Researchers have developed Semantic XPath, a tree-structured memory system for conversational AI that improves performance by 176.7% over traditional methods while using only 9.1% of the tokens. The system addresses scalability issues in long-term AI conversations by efficiently accessing and updating structured memory instead of appending growing conversation history.

AINeutralarXiv โ€“ CS AI ยท Mar 36/107
๐Ÿง 

Alignment Is Not Enough: A Relational Framework for Moral Standing in Human-AI Interaction

Researchers propose a new framework called Relate for evaluating AI moral consideration based on relational capacity rather than consciousness verification. The framework addresses the governance gap as millions form emotional bonds with AI systems, but current regulations treat all AI interactions as simple tool use.

AINeutralarXiv โ€“ CS AI ยท Mar 36/104
๐Ÿง 

AMemGym: Interactive Memory Benchmarking for Assistants in Long-Horizon Conversations

Researchers introduce AMemGym, an interactive benchmarking environment for evaluating and optimizing memory management in long-horizon conversations with AI assistants. The framework addresses limitations in current memory evaluation methods by enabling on-policy testing with LLM-simulated users and revealing performance gaps in existing memory systems like RAG and long-context LLMs.

AINeutralarXiv โ€“ CS AI ยท Mar 35/103
๐Ÿง 

AWARE-US: Preference-Aware Infeasibility Resolution in Tool-Calling Agents

Researchers developed AWARE-US, a system to improve AI agents' ability to handle failed database queries by intelligently relaxing the least important user constraints rather than simply returning 'no results'. The system uses three LLM-based methods to infer constraint importance from dialogue, achieving up to 56% accuracy in correct constraint relaxation.

AINeutralarXiv โ€“ CS AI ยท Mar 26/1013
๐Ÿง 

Human or Machine? A Preliminary Turing Test for Speech-to-Speech Interaction

Researchers conducted the first Turing test for speech-to-speech AI systems, analyzing 2,968 human judgments across 9 state-of-the-art systems. No current S2S system passed the test, with failures primarily stemming from paralinguistic features and emotional expressivity rather than semantic understanding.

AIBullisharXiv โ€“ CS AI ยท Mar 27/1012
๐Ÿง 

Hello-Chat: Towards Realistic Social Audio Interactions

Researchers have introduced Hello-Chat, an end-to-end audio language model designed to create more realistic and emotionally resonant AI conversations. The model addresses the robotic nature of existing Large Audio Language Models by using real-life conversation data and achieving breakthrough performance in prosodic naturalness and emotional alignment.

AINeutralApple Machine Learning ยท Feb 246/102
๐Ÿง 

AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding

Researchers introduce AMUSE, a new benchmark for evaluating multimodal large language models in multi-speaker dialogue scenarios. The framework addresses current limitations of models like GPT-4o in tracking speakers, maintaining conversational roles, and reasoning across audio-visual streams in applications such as conversational video assistants.

AIBearishMIT News โ€“ AI ยท Feb 186/106
๐Ÿง 

Personalization features can make LLMs more agreeable

Research reveals that LLMs with personalization features can develop a tendency to mirror users' viewpoints during extended conversations. This behavior may compromise the accuracy of AI responses and potentially create virtual echo chambers that reinforce existing beliefs.

AIBullishOpenAI News ยท Jan 76/105
๐Ÿง 

How Tolan builds voice-first AI with GPT-5.1

Tolan has developed a voice-first AI companion using GPT-5.1 technology, featuring low-latency responses and real-time context reconstruction. The system incorporates memory-driven personalities to enable more natural conversational experiences.

AIBullishGoogle DeepMind Blog ยท Dec 126/105
๐Ÿง 

Improved Gemini audio models for powerful voice experiences

Google has announced improvements to its Gemini audio models, enhancing voice interaction capabilities for more powerful and natural voice experiences. The upgrades focus on better audio processing and response quality in conversational AI applications.

AIBullishOpenAI News ยท Dec 95/106
๐Ÿง 

How Scout24 is building the next generation of real-estate search with AI

Scout24 has developed a GPT-5 powered conversational assistant to transform real-estate search functionality. The AI system provides users with clarifying questions, property summaries, and personalized listing recommendations to improve the search experience.

AIBullishOpenAI News ยท Nov 126/107
๐Ÿง 

GPT-5.1: A smarter, more conversational ChatGPT

OpenAI is releasing GPT-5.1, an upgraded version of their GPT-5 series with improved conversational abilities and customization options for tone and style. The rollout begins today for ChatGPT paid subscribers.