#hallucination-mitigation News & Analysis

24 articles tagged with #hallucination-mitigation. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

24 articles

AIBullisharXiv – CS AI · May 127/10

🧠

When Language Overwrites Vision: Over-Alignment and Geometric Debiasing in Vision-Language Models

Researchers identify a fundamental geometric flaw in decoder-based Vision-Language Models where visual embeddings become over-aligned with linguistic patterns, causing systematic hallucinations. The study introduces quantitative methods to characterize this bias and proposes training-free and fine-tuning solutions that reduce hallucinations across multiple benchmarks without computational overhead.

AIBullisharXiv – CS AI · May 17/10

🧠

CareGuardAI: Context-Aware Multi-Agent Guardrails for Clinical Safety & Hallucination Mitigation in Patient-Facing LLMs

CareGuardAI is a safety framework designed to mitigate clinical risks and hallucinations in patient-facing medical LLMs through dual risk assessment mechanisms. The system employs context-aware multi-agent guardrails that evaluate both clinical safety and factual reliability before releasing responses, outperforming GPT-4o-mini on specialized healthcare benchmarks.

🧠 GPT-4

AIBullisharXiv – CS AI · May 17/10

🧠

OmniDrive-R1: Reinforcement-driven Interleaved Multi-modal Chain-of-Thought for Trustworthy Vision-Language Autonomous Driving

OmniDrive-R1 is a new Vision-Language Model framework that addresses critical reliability failures in autonomous driving by combining perception and reasoning through an interleaved multi-modal chain-of-thought mechanism, achieving significant accuracy improvements (37.81% to 73.62%) without requiring dense localization labels.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Variational Visual Question Answering for Uncertainty-Aware Selective Prediction

Researchers demonstrate that variational Bayesian methods significantly improve Vision Language Models' reliability for Visual Question Answering tasks by enabling selective prediction with reduced hallucinations and overconfidence. The proposed Variational VQA approach shows particular strength at low error tolerances and offers a practical path to making large multimodal models safer without proportional computational costs.

AIBullisharXiv – CS AI · Apr 107/10

🧠

Faithful-First Reasoning, Planning, and Acting for Multimodal LLMs

Researchers propose Faithful-First RPA, a framework that improves multimodal AI reasoning by prioritizing faithfulness to visual evidence. The method uses FaithEvi for supervision and FaithAct for execution, achieving up to 24% improvement in perceptual faithfulness without sacrificing task accuracy.

AIBullisharXiv – CS AI · Apr 67/10

🧠

OSCAR: Orchestrated Self-verification and Cross-path Refinement

Researchers introduce OSCAR, a training-free framework that reduces AI hallucinations in diffusion language models by using cross-chain entropy to detect uncertain token positions during generation. The system runs parallel denoising chains and performs targeted remasking with retrieved evidence to improve factual accuracy without requiring external hallucination classifiers.

AIBullisharXiv – CS AI · Mar 127/10

🧠

Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

Researchers developed Adaptive Activation Cancellation (AAC), a real-time framework that reduces hallucinations in large language models by identifying and suppressing problematic neural activations during inference. The method requires no fine-tuning or external knowledge and preserves model capabilities while improving factual accuracy across multiple model scales including LLaMA 3-8B.

🏢 Perplexity

AIBullisharXiv – CS AI · Mar 46/103

🧠

Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

Researchers introduce VC-STaR, a new framework that improves visual reasoning in vision-language models by using contrastive image pairs to reduce hallucinations. The approach creates VisCoR-55K, a new dataset that outperforms existing visual reasoning methods when used for model fine-tuning.

AIBullisharXiv – CS AI · Mar 46/103

🧠

Self-Aug: Query and Entropy Adaptive Decoding for Large Vision-Language Models

Researchers developed a new training-free decoding strategy for Large Vision-Language Models that reduces hallucinations by using query-adaptive visual augmentation and entropy-based token selection. The method showed significant improvements in factual consistency across four LVLMs and seven benchmarks compared to existing approaches.

AIBullisharXiv – CS AI · 6d ago6/10

🧠

Traceable Knowledge Graph Reasoning Enables LLM-Assisted Decision Support for Industrial VOCs in the Steel Industry

Researchers developed Chat-ISV, an LLM-enhanced knowledge graph system that organizes fragmented steel industry VOCs literature into a queryable database with 27,180 nodes and 81,779 semantic edges. The system achieved 96.93% precision in answering specialized industrial questions, demonstrating a scalable approach to deploying reliable LLMs in domain-specific applications where hallucination risks are high.

AINeutralarXiv – CS AI · 6d ago6/10

🧠

MiRD: Reliable Set-Valued Prediction for Open-Ended Question Answering via Miscoverage Risk Decomposition

Researchers introduce MiRD, a two-stage framework that improves reliable prediction for open-ended question answering by separately addressing sampling failures and selection errors. The approach maintains calibration-set integrity while controlling hallucinations in AI models, outperforming existing conformal prediction methods across multiple datasets and models.

AINeutralarXiv – CS AI · May 126/10

🧠

Separate First, Fuse Later: Mitigating Cross-Modal Interference in Audio-Visual LLMs Reasoning with Modality-Specific Chain-of-Thought

Researchers propose SFFL, a framework that mitigates cross-modal interference in audio-visual language models by enforcing separate reasoning chains for each modality before fusion. The approach uses modality-preference labels and reinforcement learning to reduce hallucinations and achieves 5-11% performance improvements on benchmarks.

AINeutralarXiv – CS AI · May 126/10

🧠

Probing Cross-modal Information Hubs in Audio-Visual LLMs

Researchers have analyzed how audio-visual large language models (AVLLMs) process cross-modal information, discovering that integrated audio-visual data concentrates in specialized 'sink tokens' rather than distributing uniformly. This finding enables a training-free method to reduce hallucinations by leveraging these cross-modal information hubs.

AIBullisharXiv – CS AI · May 116/10

🧠

From Surface Learning to Deep Understanding: A Grounded AI Tutoring System for Moodle

Researchers have developed an AI Teaching & Learning Assistant, a Moodle plugin using Retrieval-Augmented Generation (RAG) to provide students with Socratic tutoring while enabling educators to supervise content generation. The system grounds LLM responses in teacher-provided materials to minimize hallucinations and misinformation, achieving high faithfulness scores (0.97) and strong user satisfaction (4.00/5.00 rating).

AINeutralarXiv – CS AI · May 16/10

🧠

Theory Under Construction: Orchestrating Language Models for Research Software Where the Specification Evolves

Researchers propose Comet-H, an AI system that orchestrates language models to generate research software by keeping mathematical theory, code, benchmarks, and documentation synchronized. The framework addresses hallucination and desynchronization failures in LLM-driven development, demonstrating effectiveness through a portfolio of 46 research repositories, with a static-analysis tool reaching F1=0.768 performance.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Mechanisms of Prompt-Induced Hallucination in Vision-Language Models

Researchers identify specific attention heads in vision-language models that cause prompt-induced hallucinations, where models favor textual instructions over visual evidence. By ablating these identified heads, they reduce hallucinations by 40% without retraining, revealing model-specific mechanisms underlying this failure mode.

AINeutralarXiv – CS AI · Apr 136/10

🧠

3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding

Researchers introduce 3D-VCD, an inference-time framework that reduces hallucinations in 3D-LLM embodied agents by contrasting predictions against distorted scene graphs. The method addresses failures specific to 3D spatial reasoning without requiring model retraining, advancing reliability in embodied AI systems.

AINeutralarXiv – CS AI · Apr 136/10

🧠

Noise-Aware In-Context Learning for Hallucination Mitigation in ALLMs

Researchers propose Noise-Aware In-Context Learning (NAICL), a plug-and-play method to reduce hallucinations in auditory large language models without expensive fine-tuning. The approach uses a noise prior library to guide models toward more conservative outputs, achieving a 37% reduction in hallucination rates while establishing a new benchmark for evaluating audio understanding systems.

AIBullisharXiv – CS AI · Apr 136/10

🧠

Constraining Sequential Model Editing with Editing Anchor Compression

Researchers propose Editing Anchor Compression (EAC), a framework that addresses degradation of large language models' general abilities during sequential knowledge editing. By constraining parameter matrix deviations through selective anchor compression, EAC preserves over 70% of model performance while maintaining edited knowledge, advancing the practical viability of model editing as an alternative to expensive retraining.

AIBullisharXiv – CS AI · Apr 76/10

🧠

Focus Matters: Phase-Aware Suppression for Hallucination in Vision-Language Models

Researchers developed a new method to reduce hallucinations in Large Vision-Language Models (LVLMs) by identifying a three-phase attention structure in vision processing and selectively suppressing low-attention tokens during the focus phase. The training-free approach significantly reduces object hallucinations while maintaining caption quality with minimal inference latency impact.

AIBullisharXiv – CS AI · Mar 176/10

🧠

Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding

Researchers propose Latent Entropy-Aware Decoding (LEAD), a new method to reduce hallucinations in multimodal large reasoning models by switching between continuous and discrete token embeddings based on entropy states. The technique addresses issues where transition words correlate with high-entropy states that lead to unreliable outputs in visual question answering tasks.

AIBullisharXiv – CS AI · Mar 96/10

🧠

HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models

Researchers introduce HiPP-Prune, a new framework for efficiently compressing vision-language models while maintaining performance and reducing hallucinations. The hierarchical approach uses preference-based pruning that considers multiple objectives including task utility, visual grounding, and compression efficiency.

AIBullisharXiv – CS AI · Mar 96/10

🧠

MAP: Mitigating Hallucinations in Large Vision-Language Models with Map-Level Attention Processing

Researchers developed MAP (Map-Level Attention Processing), a training-free method to reduce hallucinations in Large Vision-Language Models by treating hidden states as 2D semantic maps. The approach uses attention-based operations to better leverage factual information and improve consistency between generated text and visual inputs.

AIBullisharXiv – CS AI · Mar 36/104

🧠

SpotAgent: Grounding Visual Geo-localization in Large Vision-Language Models through Agentic Reasoning

Researchers introduce SpotAgent, a new framework that improves AI geo-localization by combining visual interpretation with external tool verification through agentic reasoning. The system addresses limitations of current Large Vision-Language Models that often make confident but ungrounded predictions when visual cues are sparse or ambiguous.