#llm-fairness News & Analysis

3 articles tagged with #llm-fairness. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles

AIBullisharXiv – CS AI · Jun 17/10

🧠

COFT: Counterfactual-Conformal Decoding for Fair Chain-of-Thought Reasoning in Large Language Models

Researchers introduce COFT, a training-free decoding method that reduces bias in large language models' chain-of-thought reasoning by 30-55% through counterfactual prompting and conformal calibration. The approach preserves task performance while adding minimal computational overhead, offering a practical solution for deploying fairer AI systems without model retraining.

🏢 Meta

AIBearisharXiv – CS AI · May 127/10

🧠

Explanation Fairness in Large Language Models: An Empirical Analysis of Disparities in How LLMs Justify Decisions Across Demographic Groups

Researchers have identified systematic fairness disparities in how large language models explain their decisions across demographic groups, introducing the Explanation Fairness Taxonomy (EFT) to measure five dimensions of explanation inequality. Testing five major LLMs across hiring, medical, credit, and legal domains reveals statistically significant disparities in explanation quality, with stylistic inequalities appearing resistant to prompt-based fixes and likely embedded in model pre-training.

🧠 GPT-4🧠 Claude

AIBearisharXiv – CS AI · Jun 96/10

🧠

Neutrality Bites: Gender Representation in AI-Generated Animal Stories

Researchers analyzed gender representation in AI-generated animal stories across six leading LLMs and found that while models avoid gendering characters 19% of the time and use neutral pronouns 38% of the time, assigned genders show stark masculine bias with feminine characters appearing in only 2.2% of stories versus 40.6% masculine. The study argues that neutrality-focused bias mitigation strategies may paradoxically erase marginalized identities rather than promote genuine fairness.