#medical-imaging News & Analysis

119 articles tagged with #medical-imaging. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · arXiv – CS AI · Jun 25 · 7/10

Beyond Visual Forensics: Auditing Multimodal Robustness for Synthetic Medical Image Detection

Researchers have identified a critical multimodal vulnerability in vision-language models (VLMs) used for detecting synthetic medical images: when given both image and text data, these models can overweight textual context, causing identical images to receive different authenticity predictions based solely on accompanying metadata changes. The study introduces a benchmark to systematically audit this robustness gap, revealing risks for clinical deployment.

AI · Bullish · arXiv – CS AI · Jun 23 · 7/10

Retrieval-Augmented Anatomical Guidance for Text-to-CT Generation

Researchers propose a retrieval-augmented approach for generating CT scans from radiology reports that combines semantic control with anatomical consistency by retrieving structurally similar clinical cases and using their annotations as guidance. The method improves image fidelity and clinical consistency compared to text-only baselines while enabling spatial controllability without requiring ground-truth annotations at inference time.

AI · Bullish · arXiv – CS AI · Jun 23 · 7/10

Foundation Models for Epileptogenic Zone Identification in Drug-Resistant Epilepsy

Researchers developed EpiiSLM, a dual foundation model system that significantly improves identification of epileptogenic zones in drug-resistant epilepsy patients using stereo-electroencephalography data. The system achieved 97.8% contact-level accuracy and requires only one night of monitoring, potentially reducing invasive procedures and improving surgical outcomes where current seizure freedom rates remain below 50%.

AI · Bullish · arXiv – CS AI · Jun 23 · 7/10

Render-FM: Feedforward Model for Real-time Photorealistic Volumetric Rendering

Render-FM is a feedforward neural model that generates photorealistic 3D renderings of CT scans in 2.8 seconds, achieving a 500x speedup over traditional optimization methods. By directly predicting Gaussian Splatting parameters with anatomy-guided priors, the model enables real-time clinical visualization without per-scan training, making advanced volumetric rendering practical for hospital workflows.

AI · Bullish · arXiv – CS AI · Jun 23 · 7/10

EnTrust: Modeling Inter-Modal Conflict for Trustworthy Multimodal Medical Image Analysis

EnTrust is a new framework for multimodal medical image analysis that treats disagreement between imaging modalities as a direct source of predictive uncertainty rather than averaging it away. The approach combines feature decomposition, diffusion-based segmentation, and calibrated uncertainty estimation to help clinicians understand not just where predictions are uncertain, but why, achieving state-of-the-art accuracy across multiple medical imaging domains.

AI · Bullish · arXiv – CS AI · Jun 19 · 7/10

Scaling Generative Foundation Models for Chest Radiography with Rectified Flow Transformers

Researchers have developed the first billion-parameter generative foundation model specifically designed for chest radiograph synthesis, trained on 1.2M radiographs. The model can generate synthetic chest X-rays with clinical-expert-level fidelity while supporting controllable generation across demographics, imaging views, and pathologies, addressing a critical need for diverse medical imaging datasets.

AI · Bearish · arXiv – CS AI · Jun 19 · 7/10

A Controlled Benchmark of Quantum-Latent GAN Augmentation for Brain MRI

Researchers conducted a rigorous controlled benchmark comparing quantum and classical generative models for augmenting brain MRI datasets. The study found no statistically significant performance difference between quantum and classical generators, and neither provided meaningful benefits over real-data-only training across various data scarcity scenarios.

AI · Bullish · Decrypt – AI · Jun 18 · 7/10

Midjourney Pivots From AI Images to Medical Imaging, Aiming to Build a Better MRI Alternative

Midjourney, known for AI-generated imagery, is pivoting into medical imaging by developing a full-body ultrasound system enhanced with artificial intelligence. This strategic shift represents a major diversification from generative AI into healthcare technology, potentially positioning the system as an alternative to MRI.

🧠 Midjourney

AI · Bullish · arXiv – CS AI · Jun 9 · 7/10

A Multi-modal Agentic Co-pilot for Evidence Grounded Computational Pathology

PathPocket is a multimodal AI co-pilot system designed to assist pathologists by grounding diagnostic recommendations in verifiable medical evidence. Built on a comprehensive pathology knowledge base of 110,472 documents and 4.55 million entities, the system demonstrates significant improvements in diagnostic accuracy and pathologist confidence across 200,000+ real-world cases.

AI · Bullish · arXiv – CS AI · Jun 9 · 7/10

MedVision: Benchmarking Quantitative Medical Image Analysis

Researchers introduce MedVision, a large-scale benchmark dataset with 30.8 million image-annotation pairs designed to evaluate and improve vision-language models (VLMs) on quantitative medical image analysis tasks. The work demonstrates that current VLMs perform poorly on clinical quantitative reasoning—such as tumor measurement and joint angle assessment—but can be significantly improved through supervised and reinforcement fine-tuning.

AI · Bullish · arXiv – CS AI · Jun 8 · 7/10

ReclAIm: A Multi-Agent Framework for Monitoring and Correcting Performance Decline in Medical Imaging AI

Researchers introduced ReclAIm, a multi-agent AI framework using large language models to automatically detect and correct performance degradation in medical imaging classification models. The system successfully restored models experiencing up to 40.6% performance decline to within 2% of baseline values through automated fine-tuning, demonstrating practical viability for maintaining AI reliability in clinical settings.

AI · Bullish · arXiv – CS AI · Jun 8 · 7/10

STREAM: Stochastic Riemannian Flow Matching with Anisotropic Decoder for Digital Histopathology Image Generation

Researchers introduce STREAM, a novel framework applying Riemannian flow matching to synthetic histopathology image generation. The approach uses pretrained Vision Foundation Models as the latent space rather than as conditioning signals, addressing the "conditioning collapse" problem and achieving state-of-the-art results for medical image synthesis.

AI · Neutral · arXiv – CS AI · Jun 8 · 7/10

MMBU: A Massive Multi-modal Biomedical Understanding Benchmark to Probe the Perception Capabilities of Vision-Language Models

Researchers introduced MMBU, the largest biomedical vision-language benchmark covering 35 medical imaging modalities with structured metadata. Testing 15 open-weight and 2 frontier VLMs revealed that while medical adaptation helps some models, high reported accuracy on existing benchmarks masks significant deficiencies in visual perception and domain generalization.

AI · Bullish · arXiv – CS AI · Jun 8 · 7/10

DaX: Learning General Pathology Representations Across Scales

Researchers present DaX, a pathology vision foundation model that adapts self-supervised learning to whole-slide histopathology imaging. The model demonstrates strong performance across a standardized benchmark of 161 clinical tasks, establishing a reproducible evaluation framework for computational pathology applications.

AI · Bullish · arXiv – CS AI · Jun 2 · 7/10

CoilDrop-MRI: Self-supervised physics-guided MRI reconstruction with coil dropout

Researchers introduce CoilDrop-MRI, a self-supervised deep learning method that improves accelerated MRI reconstruction by strategically dropping data across receiver coils rather than only in k-space. Validated across multiple hospital sites and field strengths, the approach matches supervised methods' quality without requiring fully sampled training data, offering practical efficiency gains for medical imaging.

AI · Bullish · arXiv – CS AI · Jun 2 · 7/10

CRISP -- Clustering-Based Redundancy-Reduced Instance Sampling for Pathology Case Representation and Retrieval

CRISP is an unsupervised machine learning framework that automates the analysis of multiple whole-slide images (WSIs) in digital pathology by selectively sampling informative patches across all slides in a case rather than relying on a single pathologist-selected slide. The approach matches or exceeds current clinical practice for breast cancer diagnosis and retrieval while eliminating subjective slide selection and reducing computational burden.

AI · Bullish · arXiv – CS AI · Jun 2 · 7/10

SDR: Set-Distance Rewards for Radiology Report Generation

Researchers introduce Set-Distance Rewards (SDR), a novel reinforcement learning approach for chest X-ray report generation that treats medical reports as unordered sets rather than causal chains. The method achieves 4-8% improvements over supervised fine-tuning across multiple vision-language models and enables efficient test-time scaling by pruning low-quality candidates mid-generation.

🧠 GPT-4 · 🧠 Gemini

AI · Bullish · arXiv – CS AI · May 29 · 7/10

Pocket-Dentist: On-Device Dental Image Understanding via Efficient Multimodal Large Language Models

Pocket-Dentist presents an efficiency-aware benchmark for dental image analysis using compact multimodal vision-language models, demonstrating that smaller 2B-parameter models outperform larger counterparts while consuming significantly fewer computational resources. Successfully deployed on iPhone hardware, the approach enables privacy-preserving dental prescreening outside specialist centers within practical latency and memory constraints.

AI · Bullish · arXiv – CS AI · May 28 · 7/10

VITAL: Visual-Semantic Dual Supervision for Enhanced and Interpretable Latent Reasoning in Medical MLLMs

Researchers introduce VITAL, a latent-space reasoning framework for medical AI models that uses dual visual-semantic supervision to improve medical visual question answering while maintaining interpretability. The method addresses modality collapse and inference efficiency issues in existing approaches, achieving state-of-the-art results on 7 benchmarks using a newly constructed 61K medical imaging dataset.

AI · Bullish · arXiv – CS AI · May 28 · 7/10

Deep Learning Strain Estimation: Is Physics-Based Simulation the Solution?

Researchers propose a novel physics-based simulation strategy for training deep learning models to estimate myocardial strain from echocardiography videos, achieving accuracy superior to clinical standards. The method incorporates real speckle decorrelation patterns and iterative refinement, resulting in a publicly available dataset of 1,478 synthetic videos that enables more reliable regional strain detection for cardiac diagnosis.

AI · Bullish · arXiv – CS AI · May 27 · 7/10

MedVol-R1: Reward-Driven Evidence Grounding for Volumetric Reasoning Segmentation

MedVol-R1 introduces a reinforcement learning framework for volumetric reasoning segmentation in 3D medical imaging, decoupling evidence grounding from mask generation to improve interpretability and accuracy. The system uses an LVLM to identify key 2D evidence anchors before propagating them into coherent 3D segmentations, achieving state-of-the-art results on multiple medical imaging benchmarks without requiring expensive annotations.

AI · Bullish · arXiv – CS AI · May 11 · 7/10

Pan-FM: A Pan-Organ Foundation Model with Saliency-Guided Masking for Missing Robustness

Researchers introduce Pan-FM, a foundation model trained on multimodal medical imaging from seven organs that addresses the critical problem of missing data in real-world biomedical datasets. The model uses Saliency-Guided Masking to prevent bias toward dominant organs and demonstrates superior performance on disease prediction tasks across the UK Biobank.

AI · Bullish · arXiv – CS AI · May 7 · 7/10

Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

Researchers have identified local intrinsic dimension (LID) as the primary driver of hallucinations in diffusion models—the phenomenon where AI generates structurally impossible outputs like hands with extra fingers. They propose Intrinsic Quenching (IQ), a corrective mechanism that reduces these anomalies and shows particular promise for medical imaging applications.

AI · Bullish · arXiv – CS AI · May 1 · 7/10

RIHA: Report-Image Hierarchical Alignment for Radiology Report Generation

Researchers propose RIHA, a novel transformer-based framework that generates radiology reports from medical images by performing hierarchical alignment between visual and textual features across multiple levels. The method outperforms existing approaches on benchmark chest X-ray datasets by treating reports as structured documents rather than flat sequences, improving both clinical accuracy and natural language quality.

AI · Bullish · arXiv – CS AI · Apr 10 · 7/10

DosimeTron: Automating Personalized Monte Carlo Radiation Dosimetry in PET/CT with Agentic AI

DosimeTron, an agentic AI system powered by GPT-5.2, automates personalized Monte Carlo radiation dosimetry calculations for PET/CT medical imaging. Validated on 597 studies across 378 patients, the system achieved 99.6% correlation with reference dosimetry calculations while processing each case in approximately 32 minutes with zero execution failures.

🧠 GPT-5
