AI · Bullish · arXiv – CS AI · Mar 26 · 6/10
HalluJudge: A Reference-Free Hallucination Detection for Context Misalignment in Code Review Automation
Researchers developed HalluJudge, a reference-free system that detects hallucinations in AI-generated code review comments, addressing a key obstacle to LLM adoption in software development. The system achieves an F1 score of 85% and 67% alignment with developer preferences at an average cost of just $0.009, making it a practical safeguard for AI-assisted code review.