AINeutralarXiv โ CS AI ยท 6h ago6
๐ง
Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models
Researchers analyzed how large language models express moral judgments when prompted to role-play different personas. The study found that Claude models are most morally robust, while larger models within families tend to be more susceptible to moral shifts through persona conditioning.