Do LLMs Know Tool Irrelevance? Demystifying Structural Alignment Bias in Tool Invocations
Researchers identify structural alignment bias, a mechanistic flaw where large language models invoke tools even when irrelevant to user queries, simply because query attributes match tool parameters. The study introduces SABEval dataset and a rebalancing strategy that effectively mitigates this bias without degrading general tool-use capabilities.