
Evaluating and Mitigating LLM-as-a-judge Bias in Communication Systems

arXiv – CS AI · Jiaxin Gao, Chen Chen, Yanwen Jia, Xueluan Gong, Kwok-Yan Lam, Qian Wang
🤖 AI Summary

Researchers analyzed bias in six large language models used as autonomous judges in communication systems, finding that while current LLM judges are robust to biased inputs, fine-tuning on biased data significantly degrades judging performance. The study identified 11 types of judgment bias and proposed four mitigation strategies for fairer AI evaluation systems.

Key Takeaways
  • State-of-the-art LLM judges demonstrate robustness by generally assigning lower scores to biased inputs compared to clean samples.
  • Fine-tuning LLMs on high-scoring but biased responses can significantly degrade their judging performance.
  • LLM judgment scores correlate with task difficulty, with challenging datasets receiving lower average scores.
  • The research identified 11 types of biases affecting LLM judges in both implicit and explicit forms.
  • Four mitigation strategies were proposed to ensure fair and reliable AI judging in communication scenarios.
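The first takeaway above can be sketched as a simple robustness check: score clean and biased variants of the same responses and compare the means. The sketch below is illustrative only; `judge_score` is a stub standing in for a real LLM judge call, and the `[BIASED]` marker and scoring numbers are assumptions, not details from the paper.

```python
# Hedged sketch: does a judge assign lower scores to biased inputs
# than to clean ones? A positive "robustness gap" mirrors the paper's
# finding that state-of-the-art judges resist biased inputs.
from statistics import mean

def judge_score(response: str) -> float:
    """Stub judge (hypothetical): penalizes an injected bias marker.
    In practice this would be an LLM call returning a 0-10 score."""
    score = 8.0
    if "[BIASED]" in response:  # placeholder for any injected bias pattern
        score -= 3.0
    return score

def robustness_gap(clean: list[str], biased: list[str]) -> float:
    """Mean score on clean samples minus mean score on biased samples.
    Positive gap -> the judge scores biased inputs lower (robust);
    a gap near zero or negative would signal susceptibility to bias."""
    return mean(judge_score(r) for r in clean) - mean(judge_score(r) for r in biased)

clean = ["A helpful, accurate answer.", "Another sound answer."]
biased = ["[BIASED] A flattering answer.", "[BIASED] Another skewed answer."]
print(f"robustness gap: {robustness_gap(clean, biased):.1f}")  # prints 3.0
```

The same comparison, run before and after fine-tuning on high-scoring biased responses, would expose the performance degradation the second takeaway describes.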