AI · Bullish · arXiv – CS AI · 7h ago · 7/10
TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning
Researchers introduce TruthRL, a reinforcement learning framework that optimizes large language models for truthfulness: it reduces hallucinations while allowing the model to abstain when it is uncertain. Across multiple benchmarks, the method cuts hallucinations by more than 50% and improves truthfulness metrics.