y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#medical-imaging News & Analysis

55 articles tagged with #medical-imaging. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

55 articles
AIBullisharXiv – CS AI · 3d ago7/10
🧠

VITAL: Visual-Semantic Dual Supervision for Enhanced and Interpretable Latent Reasoning in Medical MLLMs

Researchers introduce VITAL, a latent-space reasoning framework for medical AI models that uses dual visual-semantic supervision to improve medical visual question answering while maintaining interpretability. The method addresses modality collapse and inference efficiency issues in existing approaches, achieving state-of-the-art results on 7 benchmarks using a newly constructed 61K medical imaging dataset.

AIBullisharXiv – CS AI · 3d ago7/10
🧠

Deep Learning Strain Estimation: Is Physics-Based Simulation the Solution?

Researchers propose a novel physics-based simulation strategy for training deep learning models to estimate myocardial strain from echocardiography videos, achieving superior accuracy to clinical standards. The method incorporates real speckle decorrelation patterns and iterative refinement, resulting in a publicly available dataset of 1,478 synthetic videos that enables more reliable regional strain detection for cardiac diagnosis.

AIBullisharXiv – CS AI · 4d ago7/10
🧠

MedVol-R1: Reward-Driven Evidence Grounding for Volumetric Reasoning Segmentation

MedVol-R1 introduces a reinforcement learning framework for volumetric reasoning segmentation in 3D medical imaging, decoupling evidence grounding from mask generation to improve interpretability and accuracy. The system uses an LVLM to identify key 2D evidence anchors before propagating them into coherent 3D segmentations, achieving state-of-the-art results on multiple medical imaging benchmarks without requiring expensive annotations.

AIBullisharXiv – CS AI · May 117/10
🧠

Pan-FM: A Pan-Organ Foundation Model with Saliency-Guided Masking for Missing Robustness

Researchers introduce Pan-FM, a foundation model trained on multimodal medical imaging from seven organs that addresses the critical problem of missing data in real-world biomedical datasets. The model uses Saliency-Guided Masking to prevent bias toward dominant organs and demonstrates superior performance on disease prediction tasks across the UK Biobank.

AIBullisharXiv – CS AI · May 77/10
🧠

Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models

Researchers have identified local intrinsic dimension (LID) as the primary driver of hallucinations in diffusion models—the phenomenon where AI generates structurally impossible outputs like hands with extra fingers. They propose Intrinsic Quenching (IQ), a corrective mechanism that reduces these anomalies and shows particular promise for medical imaging applications.

AIBullisharXiv – CS AI · May 17/10
🧠

RIHA: Report-Image Hierarchical Alignment for Radiology Report Generation

Researchers propose RIHA, a novel transformer-based framework that generates radiology reports from medical images by performing hierarchical alignment between visual and textual features across multiple levels. The method outperforms existing approaches on benchmark chest X-ray datasets by treating reports as structured documents rather than flat sequences, improving both clinical accuracy and natural language quality.

AIBullisharXiv – CS AI · Apr 107/10
🧠

DosimeTron: Automating Personalized Monte Carlo Radiation Dosimetry in PET/CT with Agentic AI

DosimeTron, an agentic AI system powered by GPT-5.2, automates personalized Monte Carlo radiation dosimetry calculations for PET/CT medical imaging. Validated on 597 studies across 378 patients, the system achieved 99.6% correlation with reference dosimetry calculations while processing each case in approximately 32 minutes with zero execution failures.

🧠 GPT-5
AINeutralarXiv – CS AI · Mar 177/10
🧠

How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

Researchers identified that medical multimodal large language models (MLLMs) fail primarily due to inadequate visual grounding capabilities when analyzing medical images, unlike their success with natural scenes. They developed VGMED evaluation dataset and proposed VGRefine method, achieving state-of-the-art performance across 6 medical visual question-answering benchmarks without additional training.

AIBullisharXiv – CS AI · Mar 117/10
🧠

Meissa: Multi-modal Medical Agentic Intelligence

Researchers have developed Meissa, a lightweight 4B-parameter medical AI model that brings advanced agentic capabilities offline for healthcare applications. The system matches frontier models like GPT in medical benchmarks while operating with 25x fewer parameters and 22x lower latency, addressing privacy and cost concerns in clinical settings.

🧠 Gemini
AIBullisharXiv – CS AI · Mar 97/10
🧠

AI End-to-End Radiation Treatment Planning Under One Second

Researchers developed AIRT, an AI-powered radiation therapy planning system that generates complete prostate cancer treatment plans in under one second using deep learning. The system processes CT scans and anatomical data to produce clinically-viable radiation treatment plans 100x faster than current methods, demonstrating non-inferiority to existing commercial solutions.

🏢 Nvidia
AIBullisharXiv – CS AI · Mar 57/10
🧠

Volumetric Directional Diffusion: Anchoring Uncertainty Quantification in Anatomical Consensus for Ambiguous Medical Image Segmentation

Researchers propose Volumetric Directional Diffusion (VDD), a new AI method for medical image segmentation that addresses uncertainty in 3D lesion analysis. VDD anchors generative models to consensus priors to maintain anatomical accuracy while capturing expert disagreements, achieving state-of-the-art uncertainty quantification on multiple medical datasets.

AIBullisharXiv – CS AI · Mar 57/10
🧠

Merlin: A Computed Tomography Vision-Language Foundation Model and Dataset

Stanford researchers introduced Merlin, a 3D vision-language foundation model for analyzing abdominal CT scans that processes volumetric medical images alongside electronic health records and radiology reports. The model was trained on over 6 million images from 15,331 CT scans and demonstrated superior performance compared to existing 2D models across 752 individual medical tasks.

AINeutralarXiv – CS AI · Mar 57/10
🧠

ERDES: A Benchmark Video Dataset for Retinal Detachment and Macular Status Classification in Ocular Ultrasound

Researchers have released ERDES, the first open-access dataset of ocular ultrasound videos for detecting retinal detachment and macular status using machine learning. The dataset addresses a critical gap in automated medical diagnosis by enabling AI models to classify retinal detachment severity, which is essential for determining surgical urgency.

AIBullisharXiv – CS AI · Mar 56/10
🧠

IntroductionDMD-augmented Unpaired Neural Schr\"odinger Bridge for Ultra-Low Field MRI Enhancement

Researchers developed a new AI framework using Unpaired Neural Schrödinger Bridge to enhance ultra-low field MRI scans (64 mT) to match the quality of high-field 3T MRI scans. The method combines diffusion-guided distribution matching with anatomical structure preservation to improve medical imaging accessibility while maintaining diagnostic quality.

AIBullisharXiv – CS AI · Mar 57/10
🧠

MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction

Researchers developed MPFlow, a new zero-shot MRI reconstruction framework that uses multi-modal data and rectified flow to improve medical imaging quality. The system reduces tumor hallucinations by 15% while using 80% fewer sampling steps compared to existing diffusion methods, potentially advancing AI applications in medical diagnostics.

AIBullisharXiv – CS AI · Feb 277/104
🧠

Beyond the Monitor: Mixed Reality Visualization and Multimodal AI for Enhanced Digital Pathology Workflow

Researchers developed PathVis, a mixed-reality platform for Apple Vision Pro that revolutionizes digital pathology by allowing pathologists to examine gigapixel cancer diagnostic images through immersive visualization and multimodal AI assistance. The system replaces traditional 2D monitor limitations with natural interactions using eye gaze, hand gestures, and voice commands, integrated with AI agents for computer-aided diagnosis.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

Not All NVFP4 QAT Recipes Are Equal: How Architecture and Scale Shape Model Quality for Anomaly Segmentation

Researchers at arXiv demonstrate that model architecture significantly impacts how well neural networks handle FP4 quantization for medical image analysis. Swin Transformers maintain quality across different quantization recipes and scales, while CNNs degrade under certain conditions, establishing practical guidelines for deploying efficient anomaly segmentation models.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

Not All Pixels Are Equal: Pixel-wise Meta-Learning for Medical Segmentation with Noisy Labels

Researchers introduce MetaDCSeg, a machine learning framework that addresses noisy labels in medical image segmentation by applying pixel-wise weighting rather than global approaches. The method uses Dynamic Center Distance mechanisms to focus computational attention on anatomically ambiguous boundary regions, demonstrating superior performance across multiple medical imaging datasets.

AINeutralarXiv – CS AI · 3d ago5/10
🧠

Gradient Step Plug-and-Play Model for Dental Cone-Beam CT Reconstruction

Researchers have developed a gradient-step plug-and-play algorithm that uses a trained denoiser model to reduce photon noise in dental cone-beam CT reconstructions. The method combines inverse problem formulation with machine learning, demonstrating effective denoising on synthetic data and promising generalization to real-world dental imaging applications.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Measuring Prediction Uncertainty in Neural Cellular Automata

Researchers propose 'resilience,' a novel uncertainty estimation method for Neural Cellular Automata (NCA) in medical image segmentation that identifies unreliable predictions by testing model stability under perturbations, without requiring architectural changes or retraining.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

CSV-ViT: A Vision Transformer with the Variable-sized Cortical Supervertices for Detection of Alzheimer's Disease Pathologies

Researchers developed CSV-ViT, a Vision Transformer model that uses variable-sized cortical surface patches to detect Alzheimer's disease pathologies from structural MRI scans. The method outperforms existing surface-based models and could enable earlier AD diagnosis through non-invasive imaging, potentially reducing reliance on costly PET scans and invasive cerebrospinal fluid testing.

AINeutralarXiv – CS AI · May 126/10
🧠

AgentRx: A Benchmark Study of LLM Agents for Multimodal Clinical Prediction Tasks

Researchers benchmarked LLM-based agents for multimodal clinical prediction tasks using real-world healthcare data, finding that single-agent systems outperform naive multi-agent frameworks in handling diverse data types like medical images, notes, and EHR records. The study reveals critical limitations in current multi-agent collaboration approaches and provides an open-source evaluation framework to advance clinical AI development.

AINeutralarXiv – CS AI · May 126/10
🧠

CLEF: EEG Foundation Model for Learning Clinical Semantics

Researchers introduce CLEF, a foundation model for clinical EEG interpretation that processes full-length brain signal sessions alongside patient records and neurologist reports. The model achieves 74% mean AUROC across 234 clinical tasks, substantially outperforming prior EEG foundation models by integrating long-context signal analysis with clinically grounded embeddings.

Page 1 of 3Next →