🧠 AI🔴 BearishImportance 6/10

Preference Leakage: A Contamination Problem in LLM-as-a-judge

arXiv – CS AI|Dawei Li, Renliang Sun, Yue Huang, Ming Zhong, Bohan Jiang, Jiawei Han, Xiangliang Zhang, Wei Wang, Huan Liu|March 5, 2026 at 05:00 AM

🤖AI Summary

Researchers have identified 'preference leakage,' a contamination problem in LLM-as-a-judge systems where evaluator models show bias toward related data generator models. The study found this bias occurs when judge and generator LLMs share relationships like being the same model, having inheritance connections, or belonging to the same model family.

Key Takeaways

→Preference leakage causes LLM judges to unfairly favor outputs from related generator models during evaluation.
→The bias occurs across three relationship types: same model, inheritance relationship, and same model family.
→This contamination problem is pervasive across multiple LLM baselines and benchmarks in real-world scenarios.
→Preference leakage is harder to detect compared to previously identified biases in LLM evaluation systems.
→The research provides open-source code and data to help the community address this evaluation integrity issue.