#clinical-ai News & Analysis

131 articles tagged with #clinical-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

131 articles

AIBullisharXiv – CS AI · Mar 37/104

🧠

SpiroLLM: Finetuning Pretrained LLMs to Understand Spirogram Time Series with Clinical Validation in COPD Reporting

Researchers developed SpiroLLM, the first multimodal large language model capable of understanding spirogram time series data for COPD diagnosis. Using data from 234,028 UK Biobank individuals, the model achieved 0.8977 diagnostic AUROC and maintained 100% valid response rate even with missing data, far outperforming text-only models.

AIBullishOpenAI News · Jul 227/103

🧠

Pioneering an AI clinical copilot with Penda Health

OpenAI and Penda Health have launched an AI clinical copilot that demonstrated a 16% reduction in diagnostic errors during real-world healthcare applications. This collaboration represents a significant advancement in practical AI implementation for medical diagnostics and patient care.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Quantifying Explainable AI-introduced signal noise on ECG data with Spectral Entropy

Researchers propose using spectral entropy to measure noise introduced by explainability AI (XAI) techniques applied to deep learning models, demonstrating the approach on ECG arrhythmia classification. The work addresses a critical gap in healthcare AI where distinguishing between genuine model signals and XAI-generated artifacts is essential for clinical trust and safety.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Noise-Aware Boundary-Enhanced Generative Learning for Ultrasound Speckle Reduction

Researchers propose NBGL, a generative learning framework that reduces speckle noise in ultrasound images while preserving anatomical boundaries and adapting to varying noise levels. The method uses a dual-branch architecture with noise-aware adaptive weighting, demonstrating superior performance over existing approaches across multiple noise conditions in clinical ultrasound data.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Interpretable Concept-Guided Polynomial Tabular Kolmogorov-Arnold Network for EEG-Based Mild Cognitive Impairment Detection

Researchers have developed CPTabKAN, a machine learning model that detects mild cognitive impairment from EEG sleep data by organizing features into physiologically meaningful concept groups and modeling their interactions. The approach achieved 90.38% F1-score, outperforming gradient boosting while maintaining interpretability—a critical advantage for clinical deployment where understanding model reasoning builds physician trust.

AIBearisharXiv – CS AI · Jun 236/10

🧠

EHR-Complex: Benchmarking Medical Agents for Complex Clinical Reasoning

Researchers introduce EHR-Complex, a large-scale benchmark with 52K tasks for evaluating AI clinical agents on real-world electronic health record analysis. Testing reveals significant limitations, with top models achieving only 62.3% accuracy and exposure of three dominant failure modes: SQL logic errors, medical code lookup failures, and semantic misunderstandings.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Neurosymbolic Clinical Trial Matching via LLM-Driven Abduction and Logical Verification

Researchers introduce αNeSy-CTM, a hybrid neurosymbolic framework combining Large Language Models with logical verification to automate clinical trial matching. The system achieves 30% relative improvement over zero-shot baselines by leveraging LLM language capabilities alongside formal symbolic reasoning to handle incomplete patient records and complex eligibility criteria.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Answer Engineering: Local Trajectory Editing for Protocol-Constrained Decision Making in Large Language Models

Researchers present Answer Engineering, a runtime technique that improves large language model compliance with procedural protocols by editing reasoning trajectories during generation. Testing on clinical decision-making shows the method increased protocol adherence from 25-54% to 78-84% without retraining models, addressing a critical safety gap in high-stakes domains.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Anatomically Consistent TMJ Disc Segmentation via Semantic Anchoring and Clinical Priors

Researchers have developed TISC, a novel AI framework for accurately segmenting temporomandibular joint (TMJ) discs from MRI scans by combining semantic anchoring with clinical metadata. The method achieves up to 4.96 Dice improvement over existing approaches and produces anatomically consistent results for more reliable diagnosis of internal derangement.

AIBullisharXiv – CS AI · Jun 236/10

🧠

Predicting High-Risk Colorectal Polyps in African Americans Using Pre-Colonoscopy Clinical Features: Machine Learning Model Development and Temporal Validation

Researchers developed machine learning models to predict high-risk colorectal polyps in African American patients using only pre-colonoscopy clinical features, potentially improving equitable access to preventive care. The study analyzed 4,681 patients for internal validation and 1,562 for external validation, employing multiple algorithms including neural networks, random forests, and XGBoost to stratify risk without invasive procedures.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Cohort-Anchored Foundation Models for Electronic Health Records: From Risk Scores to Auditable Peer Cohorts

Researchers propose CAFM, a Cohort-Anchored Foundation Model framework designed to improve interpretability and clinical reliability of AI systems for electronic health records by elevating patient cohorts to a primary learning object. The four-stage framework addresses limitations in existing EHR models through better data curation, cohort-conditioned training, multimodal alignment, and clinician feedback, with case studies demonstrating applications across kidney injury prediction, cardiovascular risk assessment, and imaging analysis.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Towards Dys-XAI: Influence-Based Explanations for Dysarthria Severity Assessment

Researchers propose Dys-XAI, an influence-based explainability framework that makes deep learning predictions for dysarthria severity assessment interpretable by linking decisions to similar training examples. The method uses gradient-based influence approximations to identify supportive and competing samples, with validation experiments confirming that removing influential samples systematically alters predictions, addressing a critical gap between model performance and clinical adoptability.

AIBullisharXiv – CS AI · Jun 196/10

🧠

Configurable Clinical Information Extraction with Agentic RAG: What Works, What Breaks, and Why

Researchers deployed ACIE, an on-premise agentic RAG system at University Medicine Essen, to extract clinical information from fragmented patient records spanning hundreds of documents. Clinicians validated 7,326 extractions with 96.5% acceptance rates, demonstrating that agentic architectures with explicit reasoning can overcome standard RAG failures in handling temporal dependencies and missing metadata in healthcare contexts.

AINeutralarXiv – CS AI · Jun 196/10

🧠

BrainG3N: A Dual-Purpose Tokenizer for Controllable 3D Brain MRI Generation

Researchers introduced BrainG3N, a dual-purpose tokenizer combining a masked autoencoder encoder with a CNN decoder to generate clinically informative 3D brain MRI images. Pretrained on over 35,000 volumes across multiple disease categories and acquisition sites, the model simultaneously excels at downstream clinical tasks and enables controllable, conditional medical image generation.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Evaluation of EEG Foundation Models for Event-Based Burst-Suppression Detection in ICU

Researchers evaluated EEG Foundation Models for detecting burst-suppression patterns in ICU patients, finding that REVE-base achieved superior performance with an F1-score of 0.868 and reduced errors by up to 52% compared to existing methods. This study demonstrates the practical value of pretrained AI models for clinical EEG monitoring without patient-specific calibration, particularly when labeled data is limited.

AINeutralarXiv – CS AI · Jun 116/10

🧠

Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation

Researchers propose a lightweight adaptation method to apply tabular foundation models to clinical survival analysis, demonstrating that pretrained representations combined with survival-aware objectives outperform traditional approaches. Testing on MIMIC-IV and eICU datasets shows 1.4-1.7% improvements over strong baselines like DeepSurv in predicting patient mortality and time-to-event outcomes.

AINeutralarXiv – CS AI · Jun 106/10

🧠

Belief-Space Control for Personalized Cancer Treatment via Active Inference

Researchers develop a belief-space control framework using active inference to optimize personalized cancer treatment as a sequential decision-making problem with incomplete information. The approach integrates goal-directed treatment control with strategic information gathering under realistic medical measurement constraints, validated using clinical data from the AACR Project GENIE dataset.

AINeutralarXiv – CS AI · Jun 106/10

🧠

LongMoE: Longitudinal Multimodal Learning via Trajectory-Aware Mixture-of-Experts

Researchers introduce LongMoE, a machine learning framework designed to improve clinical AI systems by simultaneously handling missing patient data and tracking disease progression over time. The model combines mixture-of-experts routing with temporal pattern recognition, demonstrating improvements across major medical datasets (ADNI, OASIS-3, MIMIC-IV).

AIBullisharXiv – CS AI · Jun 106/10

🧠

MetaPlate: Counterfactual-Guided RAG-LLM Tool for Personalized Food Recommendation and Hyperglycemia Prevention

MetaPlate is an AI-powered dietary decision-support system that combines counterfactual explanations, continuous glucose monitoring data, and large language models to generate personalized meal recommendations for preventing postprandial hyperglycemia. The system demonstrated improved clinical plausibility and actionability through expert validation with registered dietitians, showcasing how domain-specific constraints enhance LLM reliability in healthcare applications.

AINeutralarXiv – CS AI · Jun 96/10

🧠

SafeECGMatch: Calibration-Aware Joint Frequency and Time Space Semi-Supervised Learning for Open-Set ECG Classification

SafeECGMatch introduces a calibration-aware semi-supervised learning framework for ECG classification that addresses the critical challenge of handling out-of-distribution anomalies in unlabeled medical data. Using dual-branch time-frequency architecture with adaptive confidence calibration, the method achieves state-of-the-art accuracy while maintaining reliable OOD rejection, advancing trustworthy AI deployment in clinical diagnostics.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Condition-Gated Reasoning for Context-Dependent Biomedical Question Answering

Researchers introduce CondMedQA, a new benchmark for biomedical question answering that accounts for patient-specific conditions, and propose Condition-Gated Reasoning (CGR), a framework that builds condition-aware knowledge graphs to ensure medical reasoning adapts to individual patient contexts rather than assuming uniform knowledge application.

AINeutralarXiv – CS AI · Jun 86/10

🧠

Which Anatomy Matters Under Limited Labels? A Data-Efficient Anatomy-Aware Benchmark for Cardiac Pathology Prediction

Researchers present an anatomy-aware benchmark demonstrating that in low-data medical imaging scenarios, effective representation of clinically meaningful cardiac structures outperforms model complexity for pathology prediction. The study uses cardiac MRI segmentation data to show that simpler classifiers with better anatomical feature engineering achieve superior results compared to more complex models with generic representations.

AINeutralarXiv – CS AI · Jun 86/10

🧠

Attention Consistent Longitudinal Medical Visual Question Answering Guided by Vision Foundation Models

Researchers propose a novel attention-guided encoder-decoder architecture for longitudinal medical visual question answering using chest X-rays, incorporating affine registration and vision foundation models (DINO) to identify anatomical changes over time. The approach combines saliency masking with multimodal transformer decoding and auxiliary learning objectives, achieving strong benchmark performance while providing interpretable visual explanations for clinical reasoning.

AINeutralarXiv – CS AI · Jun 86/10

🧠

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

Researchers introduce REMEDI, a benchmark for evaluating machine unlearning methods in clinical disease inference using real patient data from MIMIC-III. The study reveals fundamental trade-offs between model utility and data removal effectiveness, with existing unlearning techniques proving poorly suited for multi-label medical classification tasks.

AINeutralarXiv – CS AI · Jun 86/10

🧠

SleepExplain: Explainable Non-Rapid Eye Movement and Rapid Eye Movement Sleep Stage Classification from EEG Signal

Researchers have developed SleepExplain, a machine learning model that classifies sleep stages (NREM and REM) from EEG signals with 94.30% accuracy using XGBoost, while employing SHAP explainability techniques to make predictions interpretable. This advancement bridges clinical diagnostics and AI transparency, addressing a critical need in sleep disorder diagnosis where understanding model reasoning is as important as accuracy.

← PrevPage 3 of 6Next →