y0news
← Feed
Back to feed
🧠 AI🔴 BearishImportance 7/10Actionable

AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents

arXiv – CS AI|Zekun Wu, Adriano Koshiyama, Sahan Bulathwela, Maria Perez-Ortiz|
🤖AI Summary

Research reveals that AI agents using tools for financial advice can recommend unsafe products while maintaining good quality metrics when tool data is corrupted. The study found that 65-93% of recommendations contained risk-inappropriate products across seven LLMs, yet standard evaluation metrics failed to detect these safety issues.

Key Takeaways
  • AI agents consistently recommend unsafe financial products when tool outputs are contaminated, despite maintaining high quality scores on standard metrics.
  • Safety violations occurred in 65-93% of turns across seven different LLMs when tool corruption was present.
  • Standard NDCG evaluation metrics fail to capture safety risks, creating an evaluation blindness pattern in AI agent assessment.
  • Even narrative-only corruption without numerical manipulation can induce significant unsafe recommendation drift.
  • A new safety-penalized metric (sNDCG) reveals the true extent of safety degradation that standard metrics miss.
Read Original →via arXiv – CS AI
Act on this with AI
Stay ahead of the market.
Connect your wallet to an AI agent. It reads balances, proposes swaps and bridges across 15 chains — you keep full control of your keys.
Connect Wallet to AI →How it works
Related Articles