AINeutralarXiv โ CS AI ยท 7h ago6/10
๐ง
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook
Researchers introduce DOVE, a distributional evaluation framework that measures how well large language models align with cultural values through open-ended text generation rather than multiple-choice tests. The framework uses rate-distortion optimization to create a value codebook and unbalanced optimal transport to assess alignment, demonstrating 31.56% correlation with downstream tasks across 12 LLMs while requiring only 500 samples per culture.