AIBearisharXiv โ CS AI ยท 6h ago2
๐ง
MIDAS: Multi-Image Dispersion and Semantic Reconstruction for Jailbreaking MLLMs
Researchers have developed MIDAS, a new jailbreaking framework that successfully bypasses safety mechanisms in Multimodal Large Language Models by dispersing harmful content across multiple images. The technique achieved an 81.46% average attack success rate against four closed-source MLLMs by extending reasoning chains and reducing security attention.
$LINK