Culture In a Frame: C$^3$B as a Comic-Based Benchmark for Multimodal Culturally Awareness
Researchers introduce C$^3$B (Comics Cross-Cultural Benchmark), a new benchmark for testing the cultural awareness of Multimodal Large Language Models (MLLMs), built from over 2,000 comic images and 18,000 QA pairs. Evaluation revealed significant gaps between the performance of current MLLMs and that of humans, highlighting the need for improved cultural understanding in AI systems.