AINeutralarXiv – CS AI · 18h ago6/10
🧠
ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset
Researchers introduce ArtiFact, a large-scale multi-modal dataset containing 651,045 museum records from three major art institutions combined with images, text, and structured data. The dataset benchmarks AI systems on cross-modal error detection and semantic query processing tasks, revealing significant challenges in detecting domain-specific errors and handling culturally-nuanced information retrieval.