AIBullisharXiv โ CS AI ยท 8h ago7/10
๐ง
Cross-Model Disagreement as a Label-Free Correctness Signal
Researchers introduce cross-model disagreement as a training-free method to detect when AI language models make confident errors without requiring ground truth labels. The approach uses Cross-Model Perplexity and Cross-Model Entropy to measure how surprised a second verifier model is when reading another model's answers, significantly outperforming existing uncertainty-based methods across multiple benchmarks.
๐ข Perplexity