AIBullisharXiv โ CS AI ยท 7h ago7/10
๐ง
VeriTaS: The First Dynamic Benchmark for Multimodal Automated Fact-Checking
Researchers have introduced VeriTaS, a dynamic benchmark for evaluating automated fact-checking systems across 25,000 real-world claims in 54 languages and multiple media formats. Unlike static benchmarks vulnerable to data leakage from LLM pretraining, VeriTaS updates quarterly with claims from 104 professional fact-checkers, maintaining relevance as foundation models evolve.