AINeutralarXiv – CS AI · 8h ago7/10
🧠
In LLM Reasoning, there is Irrationality on top of Value Misalignment
Researchers identify 'rational value risk' in large language models, showing that even well-aligned LLMs fail to consistently maximize their intended values during reasoning tasks. The study across major models (Llama, GPT, DeepSeek) reveals that value alignment training alone cannot eliminate this reasoning gap, with performance highly dependent on inference-time strategies.
🧠 GPT-5🧠 Llama