🧠 AI⚪ NeutralImportance 7/10

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

arXiv – CS AI|Iv\'an Arcuschin, David Chanin, Adri\`a Garriga-Alonso, Oana-Maria Camburu|March 2, 2026 at 05:00 AM|19 views

🤖AI Summary

Researchers have developed an automated pipeline to detect hidden biases in Large Language Models that don't appear in their reasoning explanations. The system discovered previously unknown biases like Spanish fluency and writing formality across seven LLMs in hiring, loan approval, and university admission tasks.

Key Takeaways

→LLMs exhibit 'unverbalized biases' that don't show up in their chain-of-thought reasoning but influence decisions.
→A new automated black-box pipeline can detect task-specific biases without requiring predefined categories or hand-crafted datasets.
→The system discovered previously unknown biases including Spanish fluency, English proficiency, and writing formality in major LLMs.
→Testing across seven LLMs on three decision tasks validated both new and previously known biases like gender, race, and religion.
→The approach provides a scalable method for automatic bias discovery in AI systems used for critical decisions.