AI · Neutral · arXiv – CS AI · Mar 37/104
Trojans in Artificial Intelligence (TrojAI) Final Report
IARPA's TrojAI program investigated AI Trojans: malicious backdoors hidden in AI models that can cause system failures or allow unauthorized control. The multi-year initiative developed detection methods based on weight analysis and trigger inversion, and it identified open challenges in AI security that require continued research.