#black-box-evasion News & Analysis

2 articles tagged with #black-box-evasion. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AIBearisharXiv – CS AI · Jun 57/10

🧠

Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning

Researchers demonstrate a reinforcement learning approach that enables AI agents to learn and execute adversarial attacks on machine learning models more efficiently than traditional methods. The RL-based system achieves 13.2% higher attack success rates and reduces queries needed per attack by 16.9%, while outperforming state-of-the-art adversarial methods by 17% on unseen inputs, revealing a significant new security vulnerability in deployed ML systems.

AIBearisharXiv – CS AI · May 287/10

🧠

LLM Watermark Evasion via Bias Inversion

Researchers demonstrate a practical attack called Bias-Inversion Rewriting Attack (BIRA) that defeats LLM watermarking schemes with over 99% success rate while maintaining semantic quality. The findings expose fundamental vulnerabilities in current watermarking detection methods, which are widely considered essential for identifying AI-generated content.