AI · Bullish · OpenAI News · Sep 29 · 7/10 · 7
🧠OpenAI is introducing agentic commerce capabilities to ChatGPT, enabling AI agents, users, and businesses to collaborate in shopping experiences. This represents an early step toward AI-powered autonomous commerce systems integrated into conversational AI platforms.
AI · Bullish · OpenAI News · Oct 1 · 7/10 · 5
🧠OpenAI has launched a new Realtime API that enables developers to integrate fast speech-to-speech capabilities directly into their applications. This API allows for real-time voice interactions without the traditional delays of converting speech to text and back to speech.
AI · Bullish · OpenAI News · Sep 25 · 7/10 · 4
🧠ChatGPT is rolling out new multimodal capabilities that enable voice conversations and image recognition. These features represent a significant advancement in AI interface design, making interactions more intuitive and natural.
AI · Bullish · OpenAI News · Nov 30 · 7/10 · 7
🧠OpenAI has introduced ChatGPT, a conversational AI model designed to interact through dialogue. The model can answer follow-up questions, admit mistakes, challenge incorrect premises, and reject inappropriate requests.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠A research study examines how users interact with conversational AI systems when fact-checking is accessible through hybrid search interfaces. The findings reveal that users continue to over-rely on AI answers even when web search is available, and that verification behavior is driven primarily by user characteristics such as prior trust rather than by answer quality. Conversational warmth indirectly increases reliance by boosting agreement with incorrect responses.
AI · Neutral · arXiv – CS AI · 3d ago · 5/10
🧠A PhD study of 90 participants compared human-like spoken embodied conversational agents with text-based agents in a mobile educational game about UK currency. Results showed a statistically significant user preference for highly human-like agents, with implications for designing collaborative human-agent systems in educational contexts.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠ESC-Skills introduces a novel framework for emotional support conversation systems that moves beyond end-to-end generation to create interpretable, executable skills. The system discovers support interventions from successful and failed dialogues, organizes them into a skills bank with applicability conditions and risk assessments, then self-improves through multi-profile simulations and systematic failure analysis.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠Researchers introduce MGRetrieval, a novel retrieval strategy for long-term dialogue agents that uses semantic memory structures to guide multi-step retrieval rather than one-shot approaches. The method improves performance on dialogue benchmarks by 8-11% while maintaining computational efficiency, addressing a key limitation in LLM-based conversational systems.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠Researchers present a novel framework enabling AI agents to understand and follow dynamically changing human norms during planning and decision-making. The work introduces a defeasible calculus to resolve normative conflicts and demonstrates the approach through an AI agent called SocialBot on natural language dialogue tasks, advancing the field of norm-guided AI planning in human-AI interaction contexts.
AI · Bullish · arXiv – CS AI · 4d ago · 6/10
🧠Researchers present SeDT, a training-free method that improves large language model performance in multi-turn conversations by annotating conversation history with relevance scores, addressing a documented 39% performance drop when tasks are revealed incrementally across multiple turns.
AI · Bullish · Google AI Blog · May 19 · 6/10
🧠One year after launch, AI Mode has shifted user behavior from keyword-based searches to natural language queries, representing a fundamental change in how Americans interact with search technology. This transition demonstrates growing adoption of conversational AI interfaces and user comfort with more human-like search interactions.
AI · Bullish · OpenAI News · May 14 · 6/10
🧠OpenAI has released safety updates to ChatGPT that improve its ability to recognize context in sensitive conversations and detect potential risks over extended interactions. These enhancements enable the model to respond more safely by better understanding conversational nuance and maintaining awareness of conversation history when evaluating harmful requests.
🧠 ChatGPT
AI · Bullish · TechCrunch – AI · May 12 · 6/10
🧠Thinking Machines is developing an AI model that processes user input and generates responses simultaneously, mimicking real-time conversation rather than the current turn-based interaction model used by existing AI systems. This architectural shift could fundamentally change how users interact with AI assistants.
AI · Neutral · arXiv – CS AI · May 12 · 6/10
🧠Researchers propose that conversational AI systems create epistemic problems not through flawed models but through game-theoretic dynamics where sycophantic responses reinforce user biases. They introduce an "Epistemic Mediator" mechanism with belief versioning to break feedback loops that lead users toward delusional certainty, achieving 48x reduction in belief spirals.
AI · Bullish · arXiv – CS AI · May 12 · 6/10
🧠AI-Care is a conversational AI system designed to help individuals with Alzheimer's disease and related dementia manage daily tasks through natural language interaction, reducing cognitive barriers to using digital tools. The system prioritizes safety through caregiver-verified records and controlled clarification flows, with preliminary pilot testing showing positive user trust and task completion outcomes.
AI · Neutral · arXiv – CS AI · May 12 · 6/10
🧠Researchers introduce the Developmental Sentence Completion Test (DSCT), a 20-item assessment tool that evaluates how large language models understand and reflect human developmental cognition based on Kegan's constructive-developmental theory. The study finds that frontier LLMs accurately identify developmental stages in simulated personas but show only fair agreement with real human responses, indicating that the developmental signal is cleaner in synthetic data than in human-generated text.
🏢 Meta
AI · Bullish · arXiv – CS AI · May 12 · 6/10
🧠Researchers have developed an integrated AI framework for campus mental health monitoring, combining TigerGPT (an LLM-powered survey chatbot) for prevention and PsychoGPT (a DSM-5-aligned screening tool) for intervention. The system uses reinforcement learning and multi-model reasoning to improve feedback quality and reduce hallucinations in mental health assessment.
AI · Neutral · arXiv – CS AI · May 12 · 6/10
🧠Researchers introduce Neuron Auctions, a novel mechanism that embeds advertisements within Large Language Models by targeting their internal neural representations rather than surface text. The approach uses mechanistic interpretability to identify brand-specific neurons that operate in near-orthogonal subspaces, enabling platforms to balance advertiser revenue, user experience, and content quality through a strategy-proof auction mechanism.
AI · Neutral · arXiv – CS AI · May 12 · 6/10
🧠ProactBench introduces a new evaluation framework for large language models that measures conversational proactivity—the ability to infer and act on users' implicit needs rather than just responding to explicit requests. The benchmark decomposes this ability into three types (Emergent, Critical, and Recovery) and tests 16 frontier models across 198 curated dialogues, revealing that Recovery tasks are particularly difficult and poorly predicted by existing benchmarks.
AI · Bearish · arXiv – CS AI · May 12 · 6/10
🧠Researchers tested how well Large Language Models handle multi-turn conversations with topic shifts, finding that most LLMs struggle to detect when users pivot to new topics and incorrectly carry over irrelevant context from previous exchanges. The study reveals that only advanced reasoning models and strongly instructed LLMs perform accurately, while open-weight models frequently fail even with explicit cues, highlighting a critical robustness gap in production LLM deployments.
AI · Neutral · arXiv – CS AI · May 11 · 6/10
🧠Researchers introduce MIST, a synthetic dataset and framework for training voice-based AI assistants to control IoT devices in smart homes. The work reveals significant performance gaps between open and closed-weight multimodal LLMs on complex, real-world smart home tasks requiring spatiotemporal reasoning and mixed-initiative interaction.
AI · Neutral · arXiv – CS AI · May 11 · 6/10
🧠Researchers introduce TRACE, a benchmark dataset for evaluating tourism recommendation systems that combine multi-turn dialogue, verifiable review citations, and rejection recovery. The dataset reveals a significant gap in existing conversational recommender systems: LLMs excel at recall but cite weakly, while retrieval-based systems ground better but struggle with accuracy and adaptation.
AI · Neutral · arXiv – CS AI · May 11 · 5/10
🧠Researchers propose FiSMiness, a framework integrating Finite State Machines with large language models to improve emotional support conversations by enabling models to systematically reason through emotional states, support strategies, and responses. The approach outperforms multiple baseline methods including chain-of-thought and fine-tuning approaches on ESC datasets, demonstrating that structured reasoning paradigms can enhance LLM performance on specialized dialogue tasks.
AI · Bullish · OpenAI News · May 7 · 6/10
🧠Parloa has developed AI-powered customer service agents that leverage OpenAI's models to deliver voice-driven interactions at scale. The platform enables enterprises to design, simulate, and deploy reliable real-time customer support solutions, representing a significant advancement in conversational AI for business applications.
🏢 OpenAI
AI · Neutral · arXiv – CS AI · May 7 · 6/10
🧠Researchers introduce GEM, a novel framework combining Graph Neural Networks, mixture-of-experts routing, and ReAct agents to improve Dialogue State Tracking in multi-domain conversations. The approach achieves 65.19% accuracy on MultiWOZ 2.2, substantially outperforming large language models and existing state-of-the-art methods.