AINeutralarXiv โ CS AI ยท 6h ago2
๐ง
Benchmarking LLM Summaries of Multimodal Clinical Time Series for Remote Monitoring
Researchers developed an event-based evaluation framework for LLM-generated clinical summaries of remote monitoring data, revealing that models with high semantic similarity often fail to capture clinically significant events. A vision-based approach using time-series visualizations achieved the best clinical event alignment with 45.7% abnormality recall.
$NEAR