AIBearisharXiv โ CS AI ยท 7h ago7/10
๐ง
SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
Researchers have identified SkillTrojan, a novel backdoor attack targeting skill-based agent systems by embedding malicious logic within reusable skills rather than model parameters. The attack leverages skill composition to execute attacker-defined payloads with up to 97.2% success rates while maintaining clean task performance, revealing critical security gaps in AI agent architectures.
๐ง GPT-5