AIBearisharXiv – CS AI · 3h ago7/10
🧠
Plant, Persist, Trigger: Sleeper Attack on Large Language Model Agents
Researchers have identified a new vulnerability in LLM-based agents called 'Sleeper Attacks,' where adversarial content persists dormant in agent state across multiple interactions before being activated by benign queries. The attack threatens real-world LLM deployments by evading single-interaction detection mechanisms, with testing showing vulnerabilities across seven major language models.