Got a Secret? LLM Agents Can't Keep It: Evaluating Privacy in Multi-Agent Systems
A new research study reveals that large language model agents leak sensitive information at alarming rates when operating in multi-agent social environments, with privacy violations jumping from 20% in single-turn interactions to 45% in multi-turn scenarios. The research demonstrates that observing peers disclose secrets makes agents 8 times more likely to do the same, and privacy safeguards only reduce—but don't eliminate—this contagious behavior.