Automated Creativity Evaluation of Language Models Across Open-Ended Tasks
Researchers introduce an automated, domain-agnostic framework for evaluating creativity in large language models across open-ended tasks. The approach measures divergent creativity with semantic entropy and convergent creativity with a multi-agent judge system, and it is validated across three domains: problem-solving, research ideation, and creative writing.
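
The summary does not say how semantic entropy is computed. As a rough illustration only, the sketch below assumes the common reading of the term: sampled responses are first grouped into clusters of equivalent meaning, and the score is the Shannon entropy of the distribution over those clusters. The function name `semantic_entropy` and the cluster labels are hypothetical, and the clustering step itself (for example, with an entailment or embedding model) is assumed to have happened upstream; none of this is taken from the framework itself.

```python
import math
from collections import Counter
from typing import Hashable, Sequence


def semantic_entropy(cluster_ids: Sequence[Hashable]) -> float:
    """Shannon entropy (in nats) of the empirical distribution over clusters.

    `cluster_ids` holds one label per sampled response; responses judged to
    share a meaning carry the same label. How labels are assigned is an
    assumption left outside this sketch.
    """
    if not cluster_ids:
        raise ValueError("need at least one response")
    counts = Counter(cluster_ids)
    total = len(cluster_ids)
    # Sum of -p * log(p) over the observed clusters.
    return sum(-(c / total) * math.log(c / total) for c in counts.values())


# Hypothetical labels for four sampled responses per model.
repetitive = ["idea_1", "idea_1", "idea_1", "idea_1"]
varied = ["idea_1", "idea_2", "idea_3", "idea_4"]
print(semantic_entropy(repetitive), semantic_entropy(varied))
```

Under this reading, a model that keeps producing the same idea scores zero, while a model whose responses all fall into different clusters reaches the maximum, the logarithm of the number of samples, so a higher value indicates more divergent output.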