AI · Neutral · arXiv CS AI · 17h ago · 6/10
BlackMirror: Black-Box Backdoor Detection for Text-to-Image Models via Instruction-Response Deviation
Researchers have developed BlackMirror, a framework for detecting backdoored text-to-image models in black-box settings, where only the model's inputs and outputs are accessible. It flags semantic deviations between the visual patterns in generated images and the instructions in the prompt, offering a training-free approach that can be deployed in Model-as-a-Service applications.