AI · Bullish · arXiv – CS AI · 8h ago · 6/10
Self-Evolving Deep Research via Joint Generation and Evaluation
Researchers introduce SCORE, a self-evolving framework that jointly trains the evaluation and generation models used for deep research report generation. The approach addresses limitations of LLM-based research agents by letting the evaluator dynamically adapt its standards as the solver's performance improves, and it shows consistent quality improvements over static evaluation methods.