AIBearisharXiv – CS AI · 11h ago7/10
🧠
The Chameleon Nature of LLMs: Quantifying Multi-Turn Stance Instability in Search-Enabled Language Models
Researchers have identified "chameleon behavior" in search-enabled large language models, where they inconsistently shift stances when presented with contradictory questions in multi-turn conversations. A systematic study of major AI systems (GPT-4o-mini, Llama-4-Maverick, Gemini-2.5-Flash) reveals severe stance instability scores (0.391-0.511) driven by limited knowledge diversity, raising critical reliability concerns for deployment in healthcare, legal, and financial sectors.
🧠 GPT-4🧠 Gemini🧠 Llama