arXiv, CS AI
MOON: Generative MLLM-based Multimodal Representation Learning for E-commerce Product Understanding
Researchers propose MOON, which they describe as the first generative multimodal large language model (MLLM) designed specifically for e-commerce product understanding. The model addresses key challenges in product representation learning through guided Mixture-of-Experts modules and semantic region detection, and the authors also introduce a new benchmark dataset for evaluation.