AIBullisharXiv – CS AI · Mar 277/10
🧠
Cross-Model Disagreement as a Label-Free Correctness Signal
Researchers introduce cross-model disagreement as a training-free method to detect when AI language models make confident errors without requiring ground truth labels. The approach uses Cross-Model Perplexity and Cross-Model Entropy to measure how surprised a second verifier model is when reading another model's answers, significantly outperforming existing uncertainty-based methods across multiple benchmarks.
🏢 Perplexity