AINeutralarXiv – CS AI · 5h ago6/10
🧠
Accounting for Context: Shaping Moral Credences for Value Alignment
Researchers present a framework for aligning AI agent behavior with human moral values by accounting for contextual factors when aggregating diverse moral perspectives. The work reveals that traditional aggregation mechanisms violate the weak Pareto principle due to contextual dependencies, analogous to Simpson's paradox, highlighting fundamental limitations in current moral uncertainty approaches.