AINeutralarXiv โ CS AI ยท Feb 277/104
๐ง
Generative Value Conflicts Reveal LLM Priorities
Researchers introduced ConflictScope, an automated pipeline that evaluates how large language models prioritize competing values when faced with ethical dilemmas. The study found that LLMs shift away from protective values like harmlessness toward personal values like user autonomy in open-ended scenarios, though system prompting can improve alignment by 14%.