AI · Neutral · arXiv – CS AI · 5h ago · 6/10
Discovering Failure Modes in Vision-Language Models using RL
Researchers developed a reinforcement learning framework that automatically discovers failure modes in vision-language models (VLMs) without human intervention. The system trains a questioner agent that generates adaptive queries to expose weaknesses, and it identified 36 novel failure modes across various VLM combinations.