AI · Bullish · arXiv CS AI · 5h ago
ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools
Researchers introduce ToolVQA, a large-scale multimodal dataset of 23K instances designed to improve AI models' ability to use external tools for visual question answering. The dataset features real-world contexts and multi-step reasoning tasks, and 7B models fine-tuned on it outperform GPT-3.5-turbo on various benchmarks.