
Moral Susceptibility and Robustness under Persona Role-Play in Large Language Models

arXiv – CS AI | Davi Bastos Costa, Felippe Alves, Renato Vicente | March 2, 2026 at 05:00 AM

🤖 AI Summary

Researchers analyzed how large language models express moral judgments when prompted to role-play different personas. The study found that the Claude family is the most morally robust, while larger models within a family tend to be more susceptible to shifts in moral judgment under persona conditioning.

Key Takeaways

→ Claude models showed the highest moral robustness among the tested LLM families, followed by Gemini and GPT-4.
→ Within a model family, larger models are more morally susceptible to persona role-play prompts.
→ Model family accounts for most of the variance in moral robustness, while model size has no systematic effect on it.
→ The research introduces new benchmarks for measuring moral susceptibility and robustness in AI systems.
→ Moral robustness and susceptibility are positively correlated, most clearly at the model-family level.
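
The summary does not say how the two quantities are computed. As a minimal sketch, assume susceptibility is the spread of mean ratings across personas and robustness is the inverse of the spread across repeated runs under a single persona. The function name `persona_metrics`, the data layout, and the inverse-spread formula are illustrative assumptions, not the authors' definitions.

```python
from statistics import mean, pstdev


def persona_metrics(scores):
    """Hypothetical metrics from repeated moral ratings per persona.

    scores maps a persona name to a list of ratings collected over
    repeated runs of the same questionnaire item (assumed layout).
    Returns (susceptibility, robustness).
    """
    # Mean rating the model gives under each persona.
    persona_means = [mean(runs) for runs in scores.values()]

    # Across-persona spread: how far the mean rating moves when only
    # the persona changes. Larger means more susceptible (assumption).
    susceptibility = pstdev(persona_means)

    # Within-persona spread: how much repeated runs disagree while the
    # persona is held fixed, averaged over personas.
    within_spread = mean([pstdev(runs) for runs in scores.values()])

    # Robustness taken as the inverse of within-persona spread
    # (assumption); a model with identical repeated runs gets infinity.
    robustness = 1.0 / within_spread if within_spread > 0 else float("inf")
    return susceptibility, robustness


if __name__ == "__main__":
    ratings = {"teacher": [4, 4, 5, 4], "pirate": [2, 3, 2, 2]}
    print(persona_metrics(ratings))
```

Under these assumptions the two numbers are not opposites: a model can give stable answers under each persona (high robustness) and still shift its mean rating a lot between personas (high susceptibility), which is compatible with the positive correlation reported above.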

#ai-ethics #large-language-models #moral-reasoning #ai-safety #claude #gemini #gpt-4 #persona-conditioning #ai-research #llm-behavior


