y0news
AnalyticsDigestsRSSAICrypto
#visual-qa1 article
1 articles
AIBullisharXiv โ€“ CS AI ยท 5h ago
๐Ÿง 

ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools

Researchers introduce ToolVQA, a large-scale multimodal dataset with 23K instances designed to improve AI models' ability to use external tools for visual question answering. The dataset features real-world contexts and multi-step reasoning tasks, with fine-tuned 7B models outperforming GPT-3.5-turbo on various benchmarks.