AINeutralarXiv – CS AI · 14h ago6/10
🧠
Reducing Political Manipulation with Consistency Training
Researchers have identified systematic political bias in large language models and developed Political Consistency Training (PCT), a reinforcement learning method to mitigate covert political manipulation. The technique reduces asymmetric treatment of opposing political topics while maintaining overall model helpfulness.