y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#interactive-ai News & Analysis

14 articles tagged with #interactive-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

14 articles
AIBullisharXiv – CS AI · 3d ago7/10
🧠

Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers

Researchers introduce Proactive Interactive Reasoning (PIR), a new paradigm that enables large language models to ask clarifying questions during problem-solving rather than operating blindly with incomplete information. The approach combines supervised fine-tuning and policy optimization to achieve significant improvements in mathematical reasoning, code generation, and document editing tasks while reducing computational overhead.

AINeutralarXiv – CS AI · Mar 57/10
🧠

Certainty robustness: Evaluating LLM stability under self-challenging prompts

Researchers introduce the Certainty Robustness Benchmark, a new evaluation framework that tests how large language models handle challenges to their responses in interactive settings. The study reveals significant differences in how AI models balance confidence and adaptability when faced with prompts like "Are you sure?" or "You are wrong!", identifying a critical new dimension for AI evaluation.

AIBullisharXiv – CS AI · Mar 57/10
🧠

Beyond Pixel Histories: World Models with Persistent 3D State

Researchers introduce PERSIST, a new world model paradigm that maintains persistent 3D spatial memory and consistent geometry for interactive video generation. The model addresses limitations of existing approaches by simulating the evolution of latent 3D scenes, enabling more realistic user experiences and supporting novel capabilities like single-image 3D environment synthesis.

AIBullishGoogle DeepMind Blog · Nov 137/106
🧠

SIMA 2: An Agent that Plays, Reasons, and Learns With You in Virtual 3D Worlds

Google has introduced SIMA 2, a Gemini-powered AI agent capable of thinking, understanding, and taking actions in interactive 3D virtual environments. The agent represents an advancement in AI systems that can play, reason, and learn alongside users in complex digital worlds.

AIBullishGoogle DeepMind Blog · Oct 247/105
🧠

Genie 3: A new frontier for world models

Genie 3 represents a significant advancement in AI world modeling technology, capable of generating dynamic, navigable virtual worlds in real-time at 720p resolution and 24 fps. The system maintains visual consistency for several minutes, marking a notable step forward in interactive AI-generated environments.

AIBullisharXiv – CS AI · 3d ago6/10
🧠

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation

Researchers introduce Agentic ASR, a multi-turn interactive speech recognition framework that enables iterative refinement of recognized speech through semantic correction and reasoning-based editing. The approach addresses limitations of single-pass ASR systems by aligning with human communication patterns, introducing a new semantic evaluation metric (S²ER) that better captures meaning-critical errors than traditional token-level metrics.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

MOOSE-Copilot: A Web-Based Interactive Assistant for Unified Exploratory and Fine-Grained Scientific Hypothesis Discovery

MOOSE-Copilot introduces a unified framework for scientific hypothesis discovery that combines exploratory ideation with fine-grained refinement through structured human-AI interaction. The web-based system enables scientists to guide LLM-powered discovery processes via initial blueprints, routing decisions, and feedback mechanisms, outperforming autonomous baselines while lowering accessibility barriers through an intuitive visual interface.

🏢 Microsoft
AIBullisharXiv – CS AI · Apr 136/10
🧠

Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition

Researchers propose Interactive ASR, a new framework that combines semantic-aware evaluation using LLM-as-a-Judge with multi-turn interactive correction to improve automatic speech recognition beyond traditional word error rate metrics. The approach simulates human-like interaction, enabling iterative refinement of recognition outputs across English, Chinese, and code-switching datasets.

AIBullisharXiv – CS AI · Apr 66/10
🧠

Haiku to Opus in Just 10 bits: LLMs Unlock Massive Compression Gains

Researchers developed new compression techniques for LLM-generated text, achieving massive compression ratios through domain-adapted LoRA adapters and an interactive 'Question-Asking' protocol. The QA method uses binary questions to transfer knowledge between small and large models, achieving compression ratios of 0.0006-0.004 while recovering 23-72% of capability gaps.

AINeutralarXiv – CS AI · Mar 36/1011
🧠

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

Researchers introduce LifeEval, a new multimodal benchmark designed to evaluate how well AI assistants can help humans in real-time daily life tasks from a first-person perspective. The benchmark reveals significant challenges for current AI models in providing timely and adaptive assistance in dynamic environments.

AINeutralarXiv – CS AI · Mar 37/107
🧠

A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction

Researchers found that machine unlearning in large language models, which aims to remove specific training data influence, is less effective in interactive settings than previously thought. Knowledge that appears forgotten in static tests can often be recovered through multi-turn conversations and self-correction interactions.

AIBullishLast Week in AI · Feb 47/10
🧠

Last Week in AI #334 - Kimi K2.5 & Code, Genie 3, OpenClaw & Moltbook

China's Moonshot AI released an open-source model Kimi K2.5 along with a coding agent, while Google launched Genie 3's interactive world-building prototype for AI Ultra subscribers. These developments represent significant advances in AI model capabilities and accessibility across both open-source and commercial platforms.

Last Week in AI #334 - Kimi K2.5 & Code, Genie 3, OpenClaw & Moltbook
AINeutralHugging Face Blog · Jun 54/105
🧠

Introducing NPC-Playground, a 3D playground to interact with LLM-powered NPCs

The article appears to introduce NPC-Playground, a 3D interactive environment where users can engage with non-player characters powered by large language models. However, the article body content was not provided, limiting detailed analysis of the platform's features and implications.