AIBearisharXiv โ CS AI ยท 5h ago
๐ง
Preference Leakage: A Contamination Problem in LLM-as-a-judge
Researchers have identified 'preference leakage,' a contamination problem in LLM-as-a-judge systems where evaluator models show bias toward related data generator models. The study found this bias occurs when judge and generator LLMs share relationships like being the same model, having inheritance connections, or belonging to the same model family.