AI · Neutral · arXiv – CS AI · 6h ago · 6/10
VCG: A Multimodal Retrieval Framework for E-Commerce Video Feeds under Extreme Cold-Start Conditions
Researchers present VCG, a multimodal retrieval system that addresses the cold-start problem in e-commerce video feeds by using vision-language models to match users and videos in a shared semantic space rather than relying on behavioral history. In A/B testing, the system achieved a 50% uplift in video completion rate. The results also indicate that CLIP-based discriminative embeddings outperform generative LLM approaches for retrieval tasks.
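To make the idea of matching in a shared semantic space concrete, here is a minimal sketch of embedding-based retrieval. It is not VCG's implementation: the function names, the video IDs, and the three-dimensional vectors are all hypothetical stand-ins for what a CLIP-style encoder would produce. The sketch only shows the general pattern of ranking videos by cosine similarity to a user vector, which needs no interaction history.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


def retrieve_top_k(user_embedding, video_embeddings, k=2):
    """Return the k (video_id, score) pairs closest to the user embedding."""
    scored = [
        (video_id, cosine_similarity(user_embedding, emb))
        for video_id, emb in video_embeddings.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy embeddings (assumed values); a real system would obtain these from
# a vision-language encoder rather than hand-written numbers.
video_embeddings = {
    "video_sneakers": [0.9, 0.1, 0.0],
    "video_blender": [0.0, 0.2, 0.9],
    "video_running_shorts": [0.7, 0.6, 0.1],
}

# Hypothetical user vector, e.g. derived from profile text or a search query
# instead of watch history, which is what makes it usable under cold start.
user_embedding = [1.0, 0.2, 0.0]

top = retrieve_top_k(user_embedding, video_embeddings, k=2)
for video_id, score in top:
    print(f"{video_id}: {score:.3f}")
```

Because both sides live in the same vector space, a brand-new video or user can be ranked as soon as it has an embedding. At production scale the exhaustive scan above would typically be replaced by an approximate nearest-neighbor index.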