y0news
← Feed
←Back to feed
🧠 AIβšͺ NeutralImportance 6/10

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

arXiv – CS AI|Rohan Khetan, Ashna Khetan|
πŸ€–AI Summary

Researchers developed PoliticsBench, a new framework to evaluate political bias in large language models through multi-turn roleplay scenarios. The study found that 7 out of 8 major LLMs (Claude, Deepseek, Gemini, GPT, Llama, Qwen) showed left-leaning political bias, while only Grok exhibited right-leaning tendencies.

Key Takeaways
  • β†’Seven of eight tested LLMs demonstrated systematic left-leaning political bias in their responses.
  • β†’Grok was the only model that showed right-leaning political tendencies among those tested.
  • β†’Left-leaning models strongly exhibited liberal traits while moderately showing conservative ones.
  • β†’The study represents the first psychometric evaluation of political values in LLMs using multi-stage interactions.
  • β†’Political bias varied in reasoning approaches, with most models using consequence-based logic while Grok relied on facts and statistics.
Mentioned in AI
Models
ClaudeAnthropic
GeminiGoogle
LlamaMeta
GrokxAI
Read Original β†’via arXiv – CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains β€” you keep full control of your keys.
Connect Wallet to AI β†’How it works
Related Articles