AINeutralarXiv – CS AI · May 126/10
🧠Researchers developed a reflective storytelling agent that combines large language models with knowledge graphs and argumentation theory to generate personalized narratives for older adults. Testing with 55 participants showed the system successfully identified personally relevant purposes in two-thirds of narratives, with argument-based grounding and hallucination detection significantly improving perceived consistency and clarity.
AIBullisharXiv – CS AI · May 126/10
🧠A new study challenges whether standard LLM benchmarks accurately measure hallucination detection performance. By having human adjudicators re-evaluate conflicting cases between original annotations and model predictions, researchers found that LLMs frequently made correct judgments that human annotators initially missed, suggesting single-pass human annotation may be insufficient for complex, ambiguous tasks.
🧠 GPT-5🧠 Gemini
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a proxy-analyzer framework that detects hallucinations in large language models by analyzing internal activations of a small open-weight reader model rather than the generator itself. The system achieves competitive or superior performance compared to existing methods across multiple model architectures, with notably consistent results showing that model size has minimal impact on detection accuracy.
🧠 GPT-4
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce CoCoReviewBench, a new benchmark dataset of 3,900 papers from ICLR and NeurIPS designed to reliably evaluate AI review systems. The benchmark addresses critical gaps in current evaluation methods by prioritizing correctness over mere overlap with human reviews, revealing that existing AI reviewers struggle with hallucinations and reasoning accuracy.
AIBullisharXiv – CS AI · May 96/10
🧠Researchers propose a two-stage approach to improve reliability in retrieval-augmented generation (RAG) systems by using conformal prediction to filter retrieved content and an attention-based classifier to detect factual inconsistencies. The framework achieves up to 6% answer quality improvement and 77% inconsistency detection, advancing toward certified RAG systems for production AI applications.
AINeutralarXiv – CS AI · May 76/10
🧠Researchers propose Adaptive Conformal Semantic Entropy (ACSE), a novel method for quantifying uncertainty in large language model outputs by measuring semantic diversity rather than relying solely on lexical or probabilistic measures. The approach uses conformal calibration to provide statistical guarantees on error rates, demonstrating significant performance improvements over existing uncertainty quantification baselines.
AINeutralarXiv – CS AI · Apr 206/10
🧠Researchers propose TPA (Token Probability Attribution), a new method for detecting hallucinations in Retrieval-Augmented Generation systems by attributing token generation to seven distinct sources rather than the traditional binary approach. The technique uses Part-of-Speech tagging to identify anomalies in how different linguistic categories are generated, achieving state-of-the-art detection performance.
AINeutralarXiv – CS AI · Apr 206/10
🧠Researchers propose VIB-Probe, a novel framework using Variational Information Bottleneck theory to detect and mitigate hallucinations in Vision-Language Models by analyzing internal attention mechanisms. The method identifies specific attention heads responsible for truthful generation and introduces an inference-time intervention strategy that outperforms existing detection baselines.
AINeutralarXiv – CS AI · Apr 106/10
🧠Researchers propose a composite architecture combining instruction-based refusal with a structural abstention gate to reduce hallucinations in large language models. The system uses a support deficit score derived from self-consistency, paraphrase stability, and citation coverage to block unreliable outputs, achieving better accuracy than either mechanism alone across multiple models.
AIBullisharXiv – CS AI · Mar 266/10
🧠Researchers developed HalluJudge, a reference-free system to detect hallucinations in AI-generated code review comments, addressing a key challenge in LLM adoption for software development. The system achieves 85% F1 score with 67% alignment to developer preferences at just $0.009 average cost, making it a practical safeguard for AI-assisted code reviews.
AINeutralarXiv – CS AI · Mar 126/10
🧠Researchers have developed the System Hallucination Scale (SHS), a human-centered tool for evaluating hallucination behavior in large language models. The instrument showed strong statistical validity in testing with 210 participants and provides a practical method for assessing AI model reliability from a user perspective.
AIBullisharXiv – CS AI · Mar 36/102
🧠Researchers developed a training-free method to detect AI hallucinations by reinterpreting LLM output as Energy-Based Models and tracking 'energy spills' during text generation. The approach successfully identifies factual errors and biases across multiple state-of-the-art models including LLaMA, Mistral, and Gemma without requiring additional training or probe classifiers.
AINeutralApple Machine Learning · Mar 35/103
🧠Researchers are developing new methods to detect hallucinations in large language models by identifying specific spans of unsupported content rather than making binary decisions. The study evaluates Chain-of-Thought reasoning approaches to improve the complex multi-step process of hallucination span detection in LLMs.