AI · Bearish · arXiv – CS AI · 5h ago · 7/10
CULTURESCORE: Evaluating Cultural Faithfulness in Video Generation Models
Researchers introduce CultureScore, an evaluation framework for assessing cultural faithfulness in video generation models. It reveals that leading AI systems such as Veo 3.1 and LTX-2 fail to accurately represent diverse global cultures. In tests spanning 10 countries, the best model achieves only 56.8% cultural accuracy, and human evaluators prioritize cultural representation over visual quality metrics.