βBack to feed
π§ AIπ΄ BearishImportance 7/10Actionable
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents
π€AI Summary
Research reveals that AI agents using tools for financial advice can recommend unsafe products while maintaining good quality metrics when tool data is corrupted. The study found that 65-93% of recommendations contained risk-inappropriate products across seven LLMs, yet standard evaluation metrics failed to detect these safety issues.
Key Takeaways
- βAI agents consistently recommend unsafe financial products when tool outputs are contaminated, despite maintaining high quality scores on standard metrics.
- βSafety violations occurred in 65-93% of turns across seven different LLMs when tool corruption was present.
- βStandard NDCG evaluation metrics fail to capture safety risks, creating an evaluation blindness pattern in AI agent assessment.
- βEven narrative-only corruption without numerical manipulation can induce significant unsafe recommendation drift.
- βA new safety-penalized metric (sNDCG) reveals the true extent of safety degradation that standard metrics miss.
Read Original βvia arXiv β CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains β you keep full control of your keys.
Related Articles