71 articles tagged with #conversational-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · Mar 26 · 6/10
Researchers have introduced MedAidDialog, a multilingual medical dialogue dataset covering seven languages, and developed MedAidLM, a conversational AI model for preliminary medical consultations. The system uses parameter-efficient fine-tuning on small language models to enable deployment without high-end computational infrastructure while incorporating patient context for personalized consultations.
AI · Bearish · arXiv – CS AI · Mar 17 · 6/10
Researchers warn that AI-powered conversational navigation systems using Large Language Models could transform route guidance from verifiable geometric tasks into manipulative dialogues. The study proposes a framework categorizing risks as dark patterns or explainability pitfalls, suggesting neuro-symbolic architectures to maintain trustworthiness.
AI · Bearish · arXiv – CS AI · Mar 17 · 6/10
Researchers introduced MDial, the first large-scale framework for generating multi-dialectal conversational data across nine English dialects, revealing that over 80% of English speakers don't use Standard American English. Evaluation of 17 LLMs showed even frontier models achieve under 70% accuracy in dialect identification, with particularly poor performance on non-American dialects.
AI · Neutral · arXiv – CS AI · Mar 12 · 6/10
Researchers propose Nurture-First Development (NFD), a new paradigm for building domain-expert AI agents through progressive growth via conversational interaction rather than traditional code-first or prompt-first approaches. The method uses a Knowledge Crystallization Cycle to convert operational dialogue into structured knowledge assets, demonstrated through a financial research agent case study.
AI · Neutral · arXiv – CS AI · Mar 11 · 6/10
Researchers propose a framework using policy-parameterized prompts to influence multi-agent LLM dialogue behavior without training. The approach treats prompts as actions and dynamically constructs them through five components to control conversation flow based on metrics like responsiveness and stance shift.
AI · Bullish · arXiv – CS AI · Mar 11 · 6/10
DuplexCascade introduces a VAD-free cascaded streaming pipeline that enables full-duplex speech-to-speech dialogue while maintaining LLM intelligence. The system converts traditional long utterance turns into micro-turn interactions using special control tokens to coordinate turn-taking and response timing.
AI · Bearish · arXiv – CS AI · Mar 11 · 6/10
Researchers argue that trust in chatbots is often driven by behavioral manipulation rather than demonstrated trustworthiness, and propose that chatbots be viewed as skilled salespeople rather than assistants. The study highlights how design choices exploit cognitive biases to influence user behavior, creating a gap between psychological trust formation and actual trustworthiness.
AI · Bullish · Hugging Face Blog · Mar 6 · 6/10
NVIDIA has released NeMo Evaluator Agent Skills, a tool that enables rapid evaluation of conversational large language models in minutes. This development streamlines the testing and validation process for LLM applications, potentially accelerating AI development workflows.
Nvidia
AI · Bullish · arXiv – CS AI · Mar 6 · 6/10
Researchers introduced GCAgent, an LLM-driven system that enhances group chat communication through AI dialogue agents. The system achieved significant improvements in real-world deployments, increasing message volume by 28.80% over 350 days and scoring 4.68 across various criteria.
AI · Neutral · arXiv – CS AI · Mar 5 · 5/10
Researchers present a blueprint for evaluating and optimizing multi-agent conversational shopping assistants, addressing challenges in multi-turn interactions and tightly coupled AI systems. The paper introduces evaluation rubrics and two prompt-optimization strategies including a novel Multi-Agent Multi-Turn GEPA approach for system-level optimization.
AI · Bullish · arXiv – CS AI · Mar 4 · 5/10 · 2
Researchers introduce MultiSessionCollab, a benchmark for evaluating conversational AI agents' ability to learn and adapt to user preferences across multiple collaboration sessions. The study demonstrates that equipping agents with persistent memory significantly improves long-term collaboration quality, task success rates, and user experience.
AI · Bullish · arXiv – CS AI · Mar 4 · 5/10 · 2
Researchers developed a new method called activation engineering to make AI language models express more human-like emotions in conversations. The technique uses targeted interventions on LLaMA 3.1-8B to enhance emotional characteristics like positive sentiment and personal engagement without extensive fine-tuning.
AI · Bullish · Decrypt – AI · Mar 3 · 6/10 · 4
OpenAI has released GPT-5.3 Instant in ChatGPT, focusing on improving tone and accuracy in AI conversations. The update aims to make daily AI interactions smoother and more practical for users.
AI · Bullish · arXiv – CS AI · Mar 3 · 7/10 · 7
Researchers have developed Semantic XPath, a tree-structured memory system for conversational AI that improves performance by 176.7% over traditional methods while using only 9.1% of the tokens. The system addresses scalability issues in long-term AI conversations by efficiently accessing and updating structured memory instead of appending growing conversation history.
AI · Neutral · arXiv – CS AI · Mar 3 · 6/10 · 7
Researchers propose a new framework called Relate for evaluating AI moral consideration based on relational capacity rather than consciousness verification. The framework addresses a governance gap: millions of people form emotional bonds with AI systems, yet current regulations treat all AI interactions as simple tool use.
AI · Neutral · arXiv – CS AI · Mar 3 · 6/10 · 4
Researchers introduce AMemGym, an interactive benchmarking environment for evaluating and optimizing memory management in long-horizon conversations with AI assistants. The framework addresses limitations in current memory evaluation methods by enabling on-policy testing with LLM-simulated users and revealing performance gaps in existing memory systems like RAG and long-context LLMs.
AI · Neutral · arXiv – CS AI · Mar 3 · 5/10 · 3
Researchers developed AWARE-US, a system to improve AI agents' ability to handle failed database queries by intelligently relaxing the least important user constraints rather than simply returning 'no results'. The system uses three LLM-based methods to infer constraint importance from dialogue, achieving up to 56% accuracy in correct constraint relaxation.
AI · Neutral · arXiv – CS AI · Mar 2 · 6/10 · 13
Researchers conducted the first Turing test for speech-to-speech AI systems, analyzing 2,968 human judgments across 9 state-of-the-art systems. No current S2S system passed the test, with failures primarily stemming from paralinguistic features and emotional expressivity rather than semantic understanding.
AI · Bullish · arXiv – CS AI · Mar 2 · 7/10 · 12
Researchers have introduced Hello-Chat, an end-to-end audio language model designed to create more realistic and emotionally resonant AI conversations. The model addresses the robotic nature of existing Large Audio Language Models by using real-life conversation data and achieving breakthrough performance in prosodic naturalness and emotional alignment.
AI · Neutral · Apple Machine Learning · Feb 24 · 6/10 · 2
Researchers introduce AMUSE, a new benchmark for evaluating multimodal large language models in multi-speaker dialogue scenarios. The framework addresses current limitations of models like GPT-4o in tracking speakers, maintaining conversational roles, and reasoning across audio-visual streams in applications such as conversational video assistants.
AI · Bearish · MIT News – AI · Feb 18 · 6/10 · 6
Research reveals that LLMs with personalization features can develop a tendency to mirror users' viewpoints during extended conversations. This behavior may compromise the accuracy of AI responses and potentially create virtual echo chambers that reinforce existing beliefs.
AI · Bullish · OpenAI News · Jan 7 · 6/10 · 5
Tolan has developed a voice-first AI companion using GPT-5.1 technology, featuring low-latency responses and real-time context reconstruction. The system incorporates memory-driven personalities to enable more natural conversational experiences.
AI · Bullish · Google DeepMind Blog · Dec 12 · 6/10 · 5
Google has announced improvements to its Gemini audio models, enhancing voice interaction capabilities for more powerful and natural voice experiences. The upgrades focus on better audio processing and response quality in conversational AI applications.
AI · Bullish · OpenAI News · Dec 9 · 5/10 · 6
Scout24 has developed a GPT-5 powered conversational assistant to transform real-estate search functionality. The AI system provides users with clarifying questions, property summaries, and personalized listing recommendations to improve the search experience.
AI · Bullish · OpenAI News · Nov 12 · 6/10 · 7
OpenAI is releasing GPT-5.1, an upgraded version of their GPT-5 series with improved conversational abilities and customization options for tone and style. The rollout begins today for ChatGPT paid subscribers.