AIBullisharXiv โ CS AI ยท 4d ago6/103
๐ง
ScholarEval: Research Idea Evaluation Grounded in Literature
Researchers introduce ScholarEval, a retrieval-augmented framework for evaluating AI-generated research ideas based on soundness and contribution metrics. The system outperformed OpenAI's o1-mini-deep-research baseline across multiple evaluation criteria in testing with 117 expert-annotated research ideas across four scientific disciplines.