#medical-ai News & Analysis

The #medical-ai tag tracks 179 articles covering artificial intelligence applications in healthcare, with 23 pieces published in the last month. Recent coverage reflects mixed sentiment, with 39.1% of articles bullish, 26.1% neutral, and 34.8% bearish. Notably, bullish sentiment has softened by 27.6 percentage points compared to the previous quarter, signaling growing caution in how the field is being discussed. Most coverage comes from arXiv's computer science and AI sections, while discussions frequently center on major AI models including Gemini, GPT-5, and Claude. Related coverage often intersects with broader #healthcare, #healthcare-ai, #machine-learning, and #computer-vision conversations. Scan the articles below to explore current developments and perspectives on medical AI.

sentiment · last 30d (23 articles) · -27.6pp bullish vs prior 90d

Top sources:arXiv – CS AI · 158Crypto Briefing · 1MIT News – AI · 1Google DeepMind Blog · 1The Register – AI · 1

Often co-tagged with:#healthcare #healthcare-ai #machine-learning #computer-vision #llm #ai

Most-discussed entities:Gemini · 6GPT-5 · 4Claude · 3Meta · 3GPT-4 · 2

358 articles

AINeutralarXiv – CS AI · Mar 57/10

🧠

Bridging the Reproducibility Divide: Open Source Software's Role in Standardizing Healthcare AI

A study reveals that 74% of healthcare AI research papers still use private datasets or don't share code, creating reproducibility issues that undermine trust in medical AI applications. Papers that embrace open practices by sharing both public datasets and code receive 110% more citations on average, demonstrating clear benefits for scientific impact.

AIBullisharXiv – CS AI · Mar 56/10

🧠

IntroductionDMD-augmented Unpaired Neural Schr\"odinger Bridge for Ultra-Low Field MRI Enhancement

Researchers developed a new AI framework using Unpaired Neural Schrödinger Bridge to enhance ultra-low field MRI scans (64 mT) to match the quality of high-field 3T MRI scans. The method combines diffusion-guided distribution matching with anatomical structure preservation to improve medical imaging accessibility while maintaining diagnostic quality.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning

Researchers developed a new AI training method using knowledge graphs as reward models to improve compositional reasoning in specialized domains. The approach enables smaller 14B parameter models to outperform much larger frontier systems like GPT-5.2 and Gemini 3 Pro on complex multi-hop reasoning tasks in medicine.

🧠 Gemini

AINeutralarXiv – CS AI · Mar 57/10

🧠

ERDES: A Benchmark Video Dataset for Retinal Detachment and Macular Status Classification in Ocular Ultrasound

Researchers have released ERDES, the first open-access dataset of ocular ultrasound videos for detecting retinal detachment and macular status using machine learning. The dataset addresses a critical gap in automated medical diagnosis by enabling AI models to classify retinal detachment severity, which is essential for determining surgical urgency.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

Stanford researchers introduced Merlin, a 3D vision-language foundation model for analyzing abdominal CT scans that processes volumetric medical images alongside electronic health records and radiology reports. The model was trained on over 6 million images from 15,331 CT scans and demonstrated superior performance compared to existing 2D models across 752 individual medical tasks.

AIBullisharXiv – CS AI · Mar 56/10

🧠

RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation

Researchers introduce RANGER, a new AI framework using sparsely-gated Mixture-of-Experts architecture for generating pathology reports from medical images. The system achieves superior performance on standard benchmarks by enabling dynamic expert specialization and reducing noise through adaptive retrieval re-ranking.

AIBullisharXiv – CS AI · Mar 57/10

🧠

Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation

Researchers propose Volumetric Directional Diffusion (VDD), a new AI method for medical image segmentation that addresses uncertainty in 3D lesion analysis. VDD anchors generative models to consensus priors to maintain anatomical accuracy while capturing expert disagreements, achieving state-of-the-art uncertainty quantification on multiple medical datasets.

AIBullisharXiv – CS AI · Mar 56/10

🧠

Non-Invasive Reconstruction of Intracranial EEG Across the Deep Temporal Lobe from Scalp EEG based on Conditional Normalizing Flow

Researchers developed NeuroFlowNet, a novel AI framework using Conditional Normalizing Flow to reconstruct deep brain EEG signals from non-invasive scalp measurements. This breakthrough enables analysis of deep temporal lobe brain activity without requiring invasive electrode implantation, potentially transforming neuroscience research and clinical diagnosis.

AIBullisharXiv – CS AI · Mar 57/10

🧠

3D Wavelet-Based Structural Priors for Controlled Diffusion in Whole-Body Low-Dose PET Denoising

Researchers developed WCC-Net, a 3D wavelet-based diffusion model that significantly improves low-dose PET imaging denoising while reducing patient radiation exposure. The AI framework uses frequency-domain structural priors to maintain anatomical accuracy and outperforms existing CNN, GAN, and diffusion baselines across multiple dose levels.

AIBullisharXiv – CS AI · Mar 46/103

🧠

PRISM: Exploring Heterogeneous Pretrained EEG Foundation Model Transfer to Clinical Differential Diagnosis

Researchers introduce PRISM, an EEG foundation model that demonstrates how diverse pretraining data leads to better clinical performance than narrow-source datasets. The study shows that geographically diverse EEG data outperforms larger but homogeneous datasets in medical diagnosis tasks, particularly achieving 12.3% better accuracy in distinguishing epilepsy from similar conditions.

$COMP

AIBearisharXiv – CS AI · Mar 47/102

🧠

Silent Sabotage During Fine-Tuning: Few-Shot Rationale Poisoning of Compact Medical LLMs

Researchers discovered a new stealth poisoning attack method targeting medical AI language models during fine-tuning that degrades performance on specific medical topics without detection. The attack injects poisoned rationales into training data, proving more effective than traditional backdoor attacks or catastrophic forgetting methods.

AINeutralarXiv – CS AI · Mar 47/102

🧠

MedCalc-Bench Doesn't Measure What You Think: A Benchmark Audit and the Case for Open-Book Evaluation

Researchers audited the MedCalc-Bench benchmark for evaluating AI models on clinical calculator tasks, finding over 20 errors in the dataset and showing that simple 'open-book' prompting achieves 81-85% accuracy versus previous best of 74%. The study suggests the benchmark measures formula memorization rather than clinical reasoning, challenging how AI medical capabilities are evaluated.

AIBullisharXiv – CS AI · Mar 46/103

🧠

Contextual Invertible World Models: A Neuro-Symbolic Agentic Framework for Colorectal Cancer Drug Response

Researchers developed a Neuro-Symbolic Agentic Framework combining machine learning with LLM-based reasoning to predict colorectal cancer drug responses. The system achieved significant predictive accuracy (r=0.504) and introduces 'Inverse Reasoning' for simulating genomic edits to predict drug sensitivity changes.

AIBullisharXiv – CS AI · Mar 46/103

🧠

MedFeat: Model-Aware and Explainability-Driven Feature Engineering with LLMs for Clinical Tabular Prediction

Researchers introduce MedFeat, a new AI framework that uses Large Language Models for healthcare feature engineering in clinical tabular predictions. The system incorporates model awareness and domain knowledge to discover clinically meaningful features that outperform traditional approaches and demonstrate robustness across different hospital settings.

AIBullisharXiv – CS AI · Mar 47/103

🧠

MIRAGE: Knowledge Graph-Guided Cross-Cohort MRI Synthesis for Alzheimer's Disease Prediction

Researchers introduce MIRAGE, a novel AI framework that uses knowledge graphs and electronic health records to predict Alzheimer's disease when MRI scans are unavailable. The system improves AD classification rates by 13% compared to single-modality approaches by creating synthetic representations without expensive 3D brain scan reconstruction.

AIBullisharXiv – CS AI · Mar 47/103

🧠

ATPO: Adaptive Tree Policy Optimization for Multi-Turn Medical Dialogue

Researchers developed ATPO (Adaptive Tree Policy Optimization), a new AI algorithm for multi-turn medical dialogues that outperforms existing methods by better handling uncertainty in patient-doctor interactions. The algorithm enabled a smaller Qwen3-8B model to surpass GPT-4o's accuracy by 0.92% on medical dialogue benchmarks through improved value estimation and exploration strategies.

AIBullisharXiv – CS AI · Mar 47/103

🧠

Guideline-Grounded Evidence Accumulation for High-Stakes Agent Verification

Researchers developed GLEAN, a new AI verification framework that improves reliability of LLM-powered agents in high-stakes decisions like clinical diagnosis. The system uses expert guidelines and Bayesian logistic regression to better verify AI agent decisions, showing 12% improvement in accuracy and 50% better calibration in medical diagnosis tests.

AIBullisharXiv – CS AI · Mar 47/102

🧠

MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs

Researchers have released MedXIAOHE, a new medical vision-language AI foundation model that achieves state-of-the-art performance across medical benchmarks and surpasses leading closed-source systems. The model incorporates advanced features like entity-aware pretraining, reinforcement learning for medical reasoning, and evidence-grounded report generation to improve reliability in clinical applications.

AIBullisharXiv – CS AI · Mar 46/102

🧠

Intelligent Pathological Diagnosis of Gestational Trophoblastic Diseases via Visual-Language Deep Learning Model

Researchers developed GTDoctor, an AI model for diagnosing gestational trophoblastic disease that achieves over 91% precision in lesion detection. The system reduces diagnostic time from 56 to 16 seconds per case while maintaining 95.59% positive predictive value in clinical trials.

AIBullisharXiv – CS AI · Mar 46/103

🧠

Detecting Structural Heart Disease from Electrocardiograms via a Generalized Additive Model of Interpretable Foundation-Model Predictors

Researchers developed an interpretable AI framework for detecting structural heart disease from electrocardiograms, achieving better performance than existing deep-learning methods while providing clinical transparency. The model demonstrated improvements of nearly 1% across key metrics using the EchoNext benchmark of over 80,000 ECG-ECHO pairs.

AIBullisharXiv – CS AI · Mar 47/103

🧠

MedLA: A Logic-Driven Multi-Agent Framework for Complex Medical Reasoning with Large Language Models

Researchers have developed MedLA, a new logic-driven multi-agent AI framework that uses large language models for complex medical reasoning. The system employs multiple AI agents that organize their reasoning into explicit logical trees and engage in structured discussions to resolve inconsistencies and reach consensus on medical questions.

AIBullisharXiv – CS AI · Mar 37/103

🧠

OmniCT: Towards a Unified Slice-Volume LVLM for Comprehensive CT Analysis

Researchers have developed OmniCT, a unified AI model that combines slice-level and volumetric analysis for CT scan interpretation, addressing a major limitation in medical imaging AI. The model introduces spatial consistency enhancement and organ-level semantic features, outperforming existing methods across clinical tasks.

AIBullisharXiv – CS AI · Mar 37/103

🧠

Brain-IT: Image Reconstruction from fMRI via Brain-Interaction Transformer

Researchers developed Brain-IT, a new AI system using Brain Interaction Transformer technology to reconstruct images from fMRI brain recordings with significantly improved accuracy. The method requires only 1 hour of data versus 40 hours needed by current approaches while surpassing state-of-the-art results.

AIBullisharXiv – CS AI · Mar 37/103

🧠

A cross-species neural foundation model for end-to-end speech decoding

Researchers developed a new Brain-to-Text (BIT) framework that uses cross-species neural foundation models to decode speech from brain activity with significantly improved accuracy. The system reduces word error rates from 24.69% to 10.22% compared to previous methods and enables seamless translation of both attempted and imagined speech into text.

AIBullisharXiv – CS AI · Mar 37/104

🧠

Doctor-R1: Mastering Clinical Inquiry with Experiential Agentic Reinforcement Learning

Doctor-R1 is a new AI agent that combines accurate medical decision-making with strategic, empathetic patient consultation skills through reinforcement learning. The system outperforms existing open-source medical LLMs and proprietary models on clinical benchmarks while demonstrating superior communication quality and patient-centric performance.

← PrevPage 5 of 15Next →