AI · Neutral · arXiv – CS AI · 8h ago · 6/10
LCSHBench: A Multilingual, Consensus-Grounded Benchmark for Library of Congress Subject Heading Assignment
LCSHBench is the first large-scale public benchmark for Library of Congress Subject Heading (LCSH) assignment, comprising 22,346 multilingual books with consensus-validated labels from three major university libraries. The dataset shows that the libraries agree on conceptual topics 93% of the time but differ in exact heading assignments 39.4% of the time, a gap that allows more nuanced evaluation of automated cataloging systems.