AI · Neutral · arXiv – CS AI · 3h ago · 6/10
Entropy Distribution as a Fingerprint for Hallucinations in Generative Models
Researchers propose the Calibrated Entropy Score (CES), a method for detecting hallucinations in large language models from the distribution of token-level entropy in a single forward pass. The technique achieves performance comparable to computationally expensive multi-sample methods while requiring only black-box access to token logits, and it comes with formal mathematical guarantees on detection accuracy.
🏢 Perplexity
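The summary does not spell out how CES is computed, so the following is only a rough sketch of the general idea it names: deriving a per-token entropy profile from the logits of one forward pass. It is not the paper's method. The calibration step and the formal guarantees are not reproduced here, and the function names (`token_entropy`, `entropy_profile`, `summarize`) are hypothetical, chosen for illustration.

```python
import math
from typing import Dict, List, Sequence


def token_entropy(logits: Sequence[float]) -> float:
    """Shannon entropy (in nats) of softmax(logits) for one generation step."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    probs = [e / z for e in exps]
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def entropy_profile(step_logits: Sequence[Sequence[float]]) -> List[float]:
    """Entropy at every generated position, from a single forward pass."""
    return [token_entropy(logits) for logits in step_logits]


def summarize(entropies: Sequence[float]) -> Dict[str, float]:
    """Simple descriptive statistics of the entropy distribution.

    Assumption: these statistics stand in for whatever features the
    actual score uses; the summary does not say which ones it relies on.
    """
    n = len(entropies)
    mean = sum(entropies) / n
    var = sum((h - mean) ** 2 for h in entropies) / n
    return {"mean": mean, "max": max(entropies), "std": math.sqrt(var)}


if __name__ == "__main__":
    # Toy logits over a 4-token vocabulary: one confident step, one uncertain step.
    toy = [[8.0, 0.0, 0.0, 0.0], [0.0, 0.0, 0.0, 0.0]]
    print(summarize(entropy_profile(toy)))
```

In a sketch like this, a downstream detector would compare the summary statistics against a threshold or feed them to a classifier; how the actual score is calibrated is left to the paper.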