AINeutralarXiv – CS AI · 6h ago6/10
🧠
Exposing the Unsaid: Visualizing Hidden LLM Bias through Stochastic Path Aggregation
Researchers introduce TreeTracer, a visual analytics tool that detects hidden biases in large language models by aggregating hundreds of stochastic generations into comparable hierarchical structures. The tool successfully exposes representational harms in LLMs like GPT-2 XL and demonstrates that standard single-output auditing methods fail to capture biases buried in lower-probability generation branches.