AINeutralarXiv โ CS AI ยท 6h ago2
๐ง
MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models
Researchers introduce MOSAIC, the first comprehensive benchmark to evaluate moral, social, and individual characteristics of Large Language Models beyond traditional Moral Foundation Theory. The benchmark includes over 600 curated questions and scenarios from nine validated questionnaires and four platform-based games, providing empirical evidence that current evaluation methods are insufficient for assessing AI ethics comprehensively.