AINeutralarXiv – CS AI · Apr 146/10
🧠
Detecting RAG Extraction Attack via Dual-Path Runtime Integrity Game
Researchers propose CanaryRAG, a runtime defense mechanism that protects Retrieval-Augmented Generation systems from adversarial attacks that extract proprietary data from knowledge bases. The solution uses embedded canary tokens to detect leakage in real-time while maintaining normal system performance, offering a practical safeguard for organizations deploying RAG-based AI systems.