AIBullisharXiv โ CS AI ยท 5h ago2
๐ง
MMCOMET: A Large-Scale Multimodal Commonsense Knowledge Graph for Contextual Reasoning
Researchers have released MMCOMET, the first large-scale multimodal commonsense knowledge graph that combines visual and textual information with over 900K multimodal triples. The system extends existing knowledge graphs to support complex AI reasoning tasks like image captioning and visual storytelling, demonstrating improved contextual understanding compared to text-only approaches.