AIBullisharXiv – CS AI · 3h ago6/10
🧠
Utility-Aware Multimodal Contrastive Learning for Product Image Generation
Researchers propose a utility-aware multimodal contrastive learning framework that optimizes AI-generated product images for consumer demand rather than just semantic accuracy. The method, tested on Amazon and Airbnb data, outperforms existing generative AI models by shifting the learned image-text representation space toward demand-driven visual cues while maintaining image quality and text alignment.