AI · Neutral · arXiv – CS AI · 10h ago · 6/10
VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning
Researchers introduce VT-Bench, the first comprehensive benchmark for visual-tabular multi-modal learning, aggregating 14 datasets with 756K samples across 9 domains. The benchmark evaluates 23 models and reveals significant gaps in current approaches for combining image and tabular data, particularly in high-stakes sectors like healthcare.