Which English Do LLMs Prefer? Triangulating Structural Bias Towards American English in Foundation Models
🤖 AI Summary
A new study finds that major large language models exhibit systematic bias toward American English over British English across training data, tokenization, and model outputs. It introduces DiAlign, a training-free method for measuring dialectal alignment, and reports evidence of linguistic homogenization that could undermine global AI equity.
Key Takeaways
- Six major pretraining corpora show a systematic skew toward American English over British English varieties.
- LLM tokenizers impose higher segmentation costs on British English forms than on their American English counterparts.
- Generative models consistently prefer American English in their outputs despite the global diversity of English.
- The study introduces DiAlign, a training-free method for measuring dialectal bias in language models.
- The researchers warn of linguistic homogenization and epistemic injustice in global AI deployment.
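The "segmentation cost" finding above refers to how many subword tokens a tokenizer needs to encode a word: a dialect whose spellings are absent from the vocabulary gets split into more pieces. The sketch below illustrates this with a greedy longest-match tokenizer over a small hypothetical vocabulary (the vocabulary, word pairs, and function names are illustrative assumptions, not the paper's DiAlign method or any real model's tokenizer).

```python
def greedy_tokenize(word, vocab):
    """Segment a word with greedy longest-match (WordPiece-style) lookup.

    Unknown stretches fall back to single characters, so every word
    can be segmented; more pieces = higher segmentation cost.
    """
    tokens = []
    i = 0
    while i < len(word):
        # Try the longest remaining substring first, shrinking to 1 char.
        for j in range(len(word), i, -1):
            piece = word[i:j]
            if piece in vocab or j == i + 1:  # single chars always allowed
                tokens.append(piece)
                i = j
                break
    return tokens

# Hypothetical vocabulary: American spellings are whole tokens,
# British spellings are not and must be split into sub-pieces.
vocab = {"color", "analyze", "col", "our", "analy", "se"}

pairs = [("color", "colour"), ("analyze", "analyse")]  # (US, UK) spellings
for us, uk in pairs:
    cost_us = len(greedy_tokenize(us, vocab))
    cost_uk = len(greedy_tokenize(uk, vocab))
    print(f"{us}: {cost_us} token(s)  vs  {uk}: {cost_uk} token(s)")
```

Under this toy vocabulary each American form encodes as one token while its British counterpart costs two, mirroring the asymmetry the study measures at scale on real tokenizers.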
#llm-bias #english-variants #ai-ethics #linguistic-equity #model-training #dialectal-bias #postcolonial-ai #foundation-models
Read Original → via arXiv – CS AI