AIBullisharXiv โ CS AI ยท 7h ago7/10
๐ง
Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding
Researchers introduce AcuLa, a post-training framework that aligns audio encoders with medical language models to enhance clinical understanding of auscultation sounds. The method leverages LLMs to generate synthetic clinical reports from audio metadata and achieves significant performance improvements across 18 cardio-respiratory tasks, including boosting COVID-19 cough detection from 55% to 89% accuracy.