AIBearisharXiv – CS AI · 11h ago7/10
🧠
Do as I Say, Not as I Do: Instruction-Induction Conflict in LLMs
Researchers demonstrate that large language models exhibit brittle instruction-following when faced with competing behavioral patterns, with compliance rates ranging from 1% to 99% across 13 models. The study reveals that output diversity and format—rather than reasoning ability—are the primary determinants of robustness against induction pressure, highlighting fundamental vulnerabilities in current LLM training.