AI · Bearish · arXiv – CS AI · 9h ago · 7/10
SlotGCG: Exploiting the Positional Vulnerability in LLMs for Jailbreak Attacks
Researchers introduce SlotGCG, a jailbreak attack that exploits positional vulnerabilities in large language models by inserting adversarial tokens at optimal positions within a prompt rather than only appending them at the end. The approach achieves 14% higher success rates than existing GCG-based attacks and shows that an LLM's vulnerability depends significantly on where the tokens are inserted.