AINeutralarXiv โ CS AI ยท 17h ago7/10
๐ง
AdAEM: An Adaptively and Automated Extensible Measurement of LLMs' Value Difference
Researchers introduce AdAEM, a new evaluation algorithm that automatically generates test questions to better assess value differences and biases across Large Language Models. Unlike static benchmarks, AdAEM adaptively creates controversial topics that reveal more distinguishable insights about LLMs' underlying values and cultural alignment.