AIBullisharXiv โ CS AI ยท 5h ago1
๐ง
CollabEval: Enhancing LLM-as-a-Judge via Multi-Agent Collaboration
Researchers propose CollabEval, a new multi-agent framework for evaluating AI-generated content that uses collaborative judgment instead of single LLM evaluation. The system implements a three-phase process with multiple AI agents working together to provide more consistent and less biased evaluations than current approaches.