AINeutralarXiv – CS AI · 11h ago6/10
🧠
Hierarchical Concept-to-Appearance Guidance for Multi-Subject Image Generation
Researchers propose Hierarchical Concept-to-Appearance Guidance (CAG), a novel framework for multi-subject image generation that improves identity consistency and compositional control by providing explicit supervision from semantic concepts to fine-grained visual details. The method combines VAE dropout training with correspondence-aware masked attention to better preserve multiple subject identities while following text prompts.