AINeutralarXiv – CS AI · 9h ago6/10
🧠
Same Signal, Opposite Meaning: Direction-Informed Adaptive Learning for LLM Agents
Researchers demonstrate that adaptive compute gates for LLM agents produce unstable and reversible signals across different environments and models, where the same confidence metric predicts both beneficial and harmful outcomes. They propose DIAL, a learned gating mechanism trained through counterfactual exploration, which outperforms fixed-direction baselines by accounting for task-specific utility directions.