AIBearisharXiv – CS AI · 3h ago7/10
🧠
HARP: Measuring Harm Amplification in Multi-Agent LLM Systems
Researchers introduce HARP, a methodology for measuring how harm propagates across multi-agent LLM systems when one component is compromised. Testing on a finance-oriented seven-agent system reveals that single-agent compromise creates the strongest amplification effects, while existing defenses struggle to balance security with utility costs.