AINeutralarXiv – CS AI · 10h ago6/10
🧠
Key Coverage Matters: Semi-Structured Extraction of OCR Clinical Reports
Researchers developed a semi-structured extraction method for digitizing fragmented clinical reports using OCR and question-answering models, introducing 'key coverage' as a metric to measure data completeness. The approach achieves F1 scores above 0.83 on real-world hospital data from 20+ institutions using a lightweight BERT model, demonstrating that canonical key inventory completeness drives extraction performance.