AI · Bullish · arXiv – CS AI · 6h ago · 7/10
ChartAgent: A Multimodal Agent for Visually Grounded Reasoning in Complex Chart Question Answering
ChartAgent is a new multimodal AI framework for chart question answering that pairs language models with visual reasoning tools. It decomposes complex chart queries into visual subtasks and applies specialized actions such as annotation and cropping to interpret unannotated charts. It achieves state-of-the-art performance, with gains of up to 16% on benchmark datasets.
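To make the "decompose, then act on the image" idea concrete, here is a minimal Python sketch. It is not ChartAgent's implementation: the `State` class, the `crop` and `annotate` functions, the `ACTIONS` registry, and `run_plan` are all hypothetical names invented for illustration, and the "chart" is just a small grid of numbers standing in for pixels. It only shows how a query might be expressed as an ordered list of visual actions, each applied to an evolving view of the chart.

```python
from dataclasses import dataclass, field

# A toy "chart image": a 2D grid of pixel intensities (hypothetical stand-in).
Image = list[list[int]]
Box = tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive


@dataclass
class State:
    image: Image
    notes: list[str] = field(default_factory=list)


def crop(state: State, box: Box) -> State:
    """Keep only the region inside box, carrying earlier notes forward."""
    top, left, bottom, right = box
    region = [row[left:right] for row in state.image[top:bottom]]
    return State(region, list(state.notes))


def annotate(state: State, label: str) -> State:
    """Attach a text label describing the current view (image unchanged)."""
    return State(state.image, state.notes + [label])


# Hypothetical action registry; the real system's tool set is not specified here.
ACTIONS = {"crop": crop, "annotate": annotate}


def run_plan(image: Image, plan: list[tuple[str, tuple]]) -> State:
    """Apply each (action_name, args) step in order to the evolving state."""
    state = State(image)
    for name, args in plan:
        state = ACTIONS[name](state, *args)
    return state


chart = [
    [0, 0, 0, 0],
    [0, 5, 9, 0],
    [0, 3, 7, 0],
    [0, 0, 0, 0],
]
# An assumed two-step plan: isolate the plot area, then label it.
plan = [
    ("crop", ((1, 1, 3, 3),)),
    ("annotate", ("plot area",)),
]
result = run_plan(chart, plan)
```

In this sketch the plan is hard-coded. In a system like the one described, a language model would presumably choose the steps, but how it does so is not stated in the summary.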