Hallucination as an Anomaly: Dynamic Intervention via Probabilistic Circuits
Researchers introduce PCNET, a probabilistic circuit-based method that detects hallucinations in large language models as geometric anomalies in the factual manifold, achieving 99% detection accuracy. The approach uses PC-LDCD decoding to correct hallucinations selectively without corrupting originally correct outputs, demonstrating significant improvements across multiple benchmarks.