TukaBench: A Culturally Grounded Jailbreak Benchmark for African Languages
Researchers introduce TukaBench, a jailbreak safety benchmark covering seven African languages. It shows that LLMs are significantly more vulnerable to adversarial prompts posed in African languages than to the same kind of prompts in English, and that culturally adapted prompts are the most effective at bypassing safety measures. The study also identifies critical gaps in LLM safety evaluation for low-resource languages and shows that existing judging mechanisms fail to assess model responses accurately in these languages.
