Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing
Researchers introduce Dr. DocBench, a new benchmark dataset for evaluating document parsing systems on expert-level and difficult content. The dataset contains 4,514 annotated pages spanning 52 subject domains and includes specialized structures such as chemical formulas and complex tables. Results on the benchmark show that state-of-the-art systems struggle significantly with these challenging real-world scenarios.
