y0news
AnalyticsDigestsSourcesRSSAICrypto
#table-text-qa1 article
1 articles
AINeutralarXiv โ€“ CS AI ยท Feb 276/107
๐Ÿง 

SPARTA: Scalable and Principled Benchmark of Tree-Structured Multi-hop QA over Text and Tables

Researchers introduce SPARTA, an automated framework for generating large-scale Table-Text question answering benchmarks that require complex multi-hop reasoning across structured and unstructured data. The benchmark exposes significant weaknesses in current AI models, with state-of-the-art systems experiencing over 30 F1 point performance drops compared to existing simpler datasets.