AINeutralarXiv โ CS AI ยท 6d ago7/104
๐ง
Trojans in Artificial Intelligence (TrojAI) Final Report
IARPA's TrojAI program investigated AI Trojans - malicious backdoors hidden in AI models that can cause system failures or allow unauthorized control. The multi-year initiative developed detection methods through weight analysis and trigger inversion, while identifying ongoing challenges in AI security that require continued research.