133 articles tagged with #healthcare. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv โ CS AI ยท Mar 37/108
๐ง The MAMA-MIA Challenge introduced a large-scale benchmark for AI-powered breast cancer tumor segmentation and treatment response prediction using MRI data from 1,506 US patients for training and 574 European patients for testing. Results from 26 international teams revealed significant performance variability and trade-offs between accuracy and fairness across demographic subgroups when AI models were tested across different institutions and continents.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Researchers developed a detection-gated AI pipeline combining YOLOv8 and U-Net for accurate glottal segmentation in medical videoendoscopy. The system achieved state-of-the-art performance with zero-shot transfer capabilities across different clinical datasets, enabling real-time extraction of vocal function biomarkers at 35 frames per second.
AIBullisharXiv โ CS AI ยท Mar 36/104
๐ง Researchers introduce BoxMed-RL, a new AI framework that uses chain-of-thought reasoning and reinforcement learning to generate spatially verifiable radiology reports. The system mimics radiologist workflows by linking visual findings to precise anatomical locations, achieving 7% improvement over existing methods in key performance metrics.
$LINK
AINeutralarXiv โ CS AI ยท Mar 36/104
๐ง Researchers introduced EHR-ChatQA, a new benchmark for testing AI agents that interact with Electronic Health Record databases through natural language queries. The benchmark reveals significant reliability gaps in current state-of-the-art LLMs, with success rates dropping substantially when consistency across multiple trials is required.
AIBullisharXiv โ CS AI ยท Mar 36/108
๐ง Researchers have developed MED-COPILOT, an AI-powered clinical decision-support system that combines GraphRAG retrieval with similar patient case analysis to assist healthcare professionals. The system uses structured knowledge graphs from WHO and NICE guidelines along with a 36,000-case patient database to outperform standard AI models in clinical reasoning accuracy.
AIBullisharXiv โ CS AI ยท Mar 36/106
๐ง Researchers created OpenRad, a curated repository containing approximately 1,700 open-access AI models for radiology. The platform aggregates scattered radiology AI research into a standardized, searchable database that includes model weights, interactive applications, and spans all imaging modalities and radiology subspecialties.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers propose REMIND, a framework for medical multi-modal AI learning that addresses the challenge of missing data across multiple modalities. The solution uses a Mixture-of-Experts architecture to handle long-tail distributions of modality combinations and shows superior performance on real-world medical datasets.
AINeutralarXiv โ CS AI ยท Mar 36/107
๐ง A research study evaluated how four major large language models (GPT-5.2, Claude 4.5 Sonnet, Gemini 3 Pro, and DeepSeek-R1) respond to patient preferences in clinical decision-making scenarios. While all models acknowledged patient values, they showed modest actual recommendation shifting with value sensitivity indices ranging from 0.13 to 0.27, revealing gaps in how AI systems incorporate patient preferences into medical recommendations.
AIBullisharXiv โ CS AI ยท Mar 36/108
๐ง Researchers developed SurgFusion-Net, a multimodal AI system for assessing surgical skills in robotic-assisted surgery. The system introduces new clinical datasets and fusion techniques that outperform existing baselines, addressing the domain gap between simulation and real clinical environments.
AIBullisharXiv โ CS AI ยท Mar 37/107
๐ง Researchers have developed CT-Flow, an AI framework that mimics how radiologists actually work by using tools interactively to analyze 3D CT scans. The system achieved 41% better diagnostic accuracy than existing models and 95% success in autonomous tool use, potentially revolutionizing clinical radiology workflows.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers developed a spatiotemporal diffusion autoencoder using CT brain images to predict stroke outcomes and evolution. The AI model achieved best-in-class performance for predicting next-day severity and functional outcomes using a dataset of 5,824 CT images from 3,573 patients across two medical centers.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers propose TC-SSA, a token compression framework that enables large vision-language models to process gigapixel pathology images by reducing visual tokens to 1.7% of original size while maintaining diagnostic accuracy. The method achieves 78.34% overall accuracy on SlideBench and demonstrates strong performance across multiple cancer classification tasks.
AIBullisharXiv โ CS AI ยท Mar 26/1013
๐ง Researchers developed MedMAP, a Medical Modality-Aware Pretraining framework that enhances vision-language models for 3D MRI multi-organ abnormality detection. The framework addresses challenges in modality-specific alignment and cross-modal feature fusion, demonstrating superior performance on a curated dataset of 7,392 3D MRI volume-report pairs.
AIBullisharXiv โ CS AI ยท Mar 27/1012
๐ง Researchers developed a new framework for selecting optimal medical AI foundation models without costly fine-tuning, achieving 31% better performance than existing methods. The topology-driven approach evaluates manifold tractability rather than statistical overlap to better assess model transferability for medical image segmentation tasks.
AIBullisharXiv โ CS AI ยท Mar 27/1016
๐ง Researchers developed a neurosymbolic verification framework to audit logical consistency in AI-generated radiology reports, addressing issues where vision-language models produce diagnostic conclusions unsupported by their findings. The system uses formal verification methods to identify hallucinations and missing logical conclusions in medical AI outputs, improving diagnostic accuracy.
AIBullisharXiv โ CS AI ยท Mar 26/1011
๐ง Researchers developed TASOT, an unsupervised AI method for surgical phase recognition that combines visual and textual information without requiring expensive large-scale pre-training. The approach showed significant improvements over existing zero-shot methods across multiple surgical datasets, demonstrating that effective surgical AI can be achieved with more efficient training methods.
AIBullisharXiv โ CS AI ยท Mar 27/1015
๐ง Researchers developed MACD, a Multi-Agent Clinical Diagnosis framework that enables large language models to self-learn clinical knowledge and improve medical diagnosis accuracy. The system achieved up to 22.3% improvement over clinical guidelines and 16% improvement over physician-only diagnosis when tested on 4,390 real-world patient cases.
AIBullisharXiv โ CS AI ยท Mar 26/1012
๐ง Researchers have developed Radiologist Copilot, an AI agentic framework that orchestrates specialized tools to complete the entire radiology reporting workflow beyond simple report generation. The system integrates image localization, interpretation, template selection, report composition, and quality control to support radiologists throughout the comprehensive reporting process.
AIBullisharXiv โ CS AI ยท Mar 26/1014
๐ง Researchers have developed SleepLM, a family of AI foundation models that combine natural language processing with sleep analysis using polysomnography data. The system can interpret and describe sleep patterns in natural language, trained on over 100K hours of sleep data from 10,000+ individuals, enabling new capabilities like language-guided sleep event detection and zero-shot generalization to novel sleep analysis tasks.
AIBearisharXiv โ CS AI ยท Mar 27/1019
๐ง Researchers propose a new risk-sensitive framework for evaluating AI hallucinations in medical advice that considers potential harm rather than just factual accuracy. The study reveals that AI models with similar performance show vastly different risk profiles when generating medical recommendations, highlighting critical safety gaps in current evaluation methods.
AIBullisharXiv โ CS AI ยท Feb 276/105
๐ง Researchers developed MedSegLatDiff, a new AI framework combining variational autoencoders with diffusion models for medical image segmentation. The system operates in compressed latent space to reduce computational costs while generating multiple plausible segmentation masks, achieving state-of-the-art performance on skin lesion, polyp, and lung nodule datasets.
AIBearisharXiv โ CS AI ยท Feb 276/107
๐ง Researchers developed ClinDet-Bench, a new benchmark that reveals large language models fail to properly identify when they have sufficient information to make clinical decisions. The study shows LLMs make both premature judgments and excessive abstentions in medical scenarios, highlighting safety concerns for AI deployment in healthcare settings.
AIBullisharXiv โ CS AI ยท Feb 276/105
๐ง Researchers demonstrated that prompt optimization using Genetic-Pareto (GEPA) significantly improves language models' ability to detect errors in medical notes. The technique boosted accuracy from 0.669 to 0.785 with GPT-5 and from 0.578 to 0.690 with Qwen3-32B, achieving state-of-the-art performance on medical error detection benchmarks.
AIBullisharXiv โ CS AI ยท Feb 276/106
๐ง ColoDiff is a new AI framework that uses diffusion models to generate high-quality colonoscopy videos for medical training and diagnosis. The system addresses data scarcity in medical imaging by creating synthetic videos with temporal consistency and precise clinical attribute control, achieving 90% faster generation through optimized sampling.
AIBullisharXiv โ CS AI ยท Feb 276/107
๐ง Researchers developed a deep learning framework using Organ Focused Attention (OFA) to predict renal tumor malignancy from 3D CT scans without requiring manual segmentation. The system achieved AUC scores of 0.685-0.760 across datasets, outperforming traditional segmentation-based approaches while reducing labor and costs.