AIBearisharXiv – CS AI · Apr 107/10
🧠
SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
Researchers have identified SkillTrojan, a novel backdoor attack targeting skill-based agent systems by embedding malicious logic within reusable skills rather than model parameters. The attack leverages skill composition to execute attacker-defined payloads with up to 97.2% success rates while maintaining clean task performance, revealing critical security gaps in AI agent architectures.
🧠 GPT-5