AINeutralarXiv – CS AI · 7h ago6/10
🧠
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?
DetailMaster introduces a comprehensive benchmark for evaluating text-to-image models on long, complex prompts averaging 285 tokens, revealing significant performance limitations in current T2I systems. The research identifies critical weaknesses in prompt encoding and attribute preservation, while demonstrating that high-quality generation requires both expanded prompt capacity and specialized long-prompt training.