y0news
← Feed
Back to feed
🧠 AI NeutralImportance 6/10

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

arXiv – CS AI|Rohan Khetan, Ashna Khetan|
🤖AI Summary

Researchers developed PoliticsBench, a new framework to evaluate political bias in large language models through multi-turn roleplay scenarios. The study found that 7 out of 8 major LLMs (Claude, Deepseek, Gemini, GPT, Llama, Qwen) showed left-leaning political bias, while only Grok exhibited right-leaning tendencies.

Key Takeaways
  • Seven of eight tested LLMs demonstrated systematic left-leaning political bias in their responses.
  • Grok was the only model that showed right-leaning political tendencies among those tested.
  • Left-leaning models strongly exhibited liberal traits while moderately showing conservative ones.
  • The study represents the first psychometric evaluation of political values in LLMs using multi-stage interactions.
  • Political bias varied in reasoning approaches, with most models using consequence-based logic while Grok relied on facts and statistics.
Mentioned in AI
Models
ClaudeAnthropic
GeminiGoogle
LlamaMeta
GrokxAI
Read Original →via arXiv – CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.
Connect Wallet to AI →How it works
Related Articles