140 articles tagged with #healthcare-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · 1d ago · 7/10
🧠Researchers propose Schema-Adaptive Tabular Representation Learning, which uses LLMs to convert structured clinical data into semantic embeddings that transfer across different electronic health record schemas without retraining. When combined with imaging data for dementia diagnosis, the method achieves state-of-the-art results and outperforms board-certified neurologists on retrospective diagnostic tasks.
AI · Bearish · arXiv – CS AI · 2d ago · 7/10
🧠Researchers introduce VeriSim, an open-source framework that tests medical AI systems by injecting realistic patient communication barriers, such as memory gaps and health literacy limitations, into clinical simulations. Testing across seven LLMs reveals significant performance degradation (a 15-25% accuracy drop), with smaller models suffering a 40% greater decline than larger ones, exposing a critical gap between standardized benchmarks and real-world clinical robustness.
AI · Bearish · arXiv – CS AI · 2d ago · 7/10
🧠Researchers evaluated domain-specific fine-tuning of vision-language models (VLMs) on medical imaging tasks and found that performance degrades significantly with task complexity, with medical fine-tuning providing no consistent advantage. The study reveals that these models exhibit fragility and high sensitivity to prompt variations, questioning the reliability of VLMs for high-stakes medical applications.
🧠 GPT-5
AI · Neutral · arXiv – CS AI · 3d ago · 7/10
🧠Researchers present a comprehensive survey of medical reasoning in large language models, introducing MR-Bench, a clinical benchmark derived from real hospital data. The study reveals a significant performance gap between exam-style tasks and authentic clinical decision-making, highlighting that robust medical reasoning requires more than factual recall in safety-critical healthcare applications.
AI · Bullish · Blockonomi · 6d ago · 7/10
🧠Tempus AI's stock surged following an expanded partnership with Gilead Sciences for oncology data analytics. The company reported a 111% jump in diagnostics revenue with a contract backlog exceeding $1.1 billion, signaling strong commercial traction in the healthcare AI sector.
AI · Bearish · Wired – AI · 6d ago · 7/10
🧠Meta's Muse Spark AI model requests access to users' raw health data including lab results, raising significant privacy concerns while demonstrating poor medical judgment. The system exemplifies how large language models lack the expertise to provide reliable healthcare guidance despite their persuasive presentation.
AI · Neutral · arXiv – CS AI · 6d ago · 7/10
🧠Researchers demonstrate that standard LLM-as-a-judge methods achieve only 52% accuracy in detecting hallucinations and omissions in mental health chatbots, failing in high-risk healthcare contexts. A hybrid framework combining human domain expertise with machine learning features achieves significantly higher performance (0.717-0.849 F1 scores), suggesting that transparent, interpretable approaches outperform black-box LLM evaluation in safety-critical applications.
AI · Bullish · arXiv – CS AI · Apr 7 · 7/10
🧠A comprehensive research review examines the current applications of Large Language Models (LLMs) across various healthcare specialties including cancer care, dermatology, dental care, neurodegenerative disorders, and mental health. The study highlights LLMs' transformative impact on medical diagnostics and patient care while acknowledging existing challenges and limitations in healthcare integration.
AI · Bullish · arXiv – CS AI · Apr 6 · 7/10
🧠Researchers have developed ClinicalReTrial, a multi-agent AI system that can redesign clinical trial protocols to improve success rates. The system demonstrated an 83.3% improvement rate in trial protocols, with a mean 5.7% increase in success probability, at a minimal cost of $0.12 per trial.
AI · Bearish · arXiv – CS AI · Apr 6 · 7/10
🧠A research paper examines reliability issues in AI-assisted medication decision systems, finding that even systems with good aggregate performance can produce dangerous errors in real-world healthcare scenarios. The study emphasizes that single incorrect AI recommendations in medication management can cause severe patient harm, highlighting the need for human oversight and risk-aware evaluation approaches.
AI · Bearish · arXiv – CS AI · Mar 27 · 7/10
🧠Researchers introduced CPGBench, a benchmark evaluating how well Large Language Models detect and follow clinical practice guidelines in healthcare conversations. The study found that while LLMs can detect 71-90% of clinical recommendations, they only adhere to guidelines 22-63% of the time, revealing significant gaps for safe medical deployment.
AI · Bullish · arXiv – CS AI · Mar 26 · 7/10
🧠Alberta Health Services deployed Berta, an open-source AI scribe platform that reduces clinical documentation costs by 70-95% compared to commercial alternatives. The system was used by 198 emergency physicians across 105 facilities, generating over 22,000 clinical sessions while keeping all data within secure health system infrastructure.
AI · Bullish · arXiv – CS AI · Mar 17 · 7/10
🧠Researchers developed FairMed-XGB, a machine learning framework that reduces gender bias in healthcare AI models by 40-72% while maintaining predictive accuracy. The system uses Bayesian optimization and explainable AI to ensure equitable treatment decisions in critical care settings.
AI · Neutral · arXiv – CS AI · Mar 17 · 7/10
🧠Researchers convened a February 2025 workshop to explore how meta-research methodologies can enhance Trustworthy AI (TAI) implementation in healthcare. The study identifies key challenges including robustness, reproducibility, clinical integration, and transparency gaps, proposing a roadmap for interdisciplinary collaboration between TAI and meta-research fields.
AI · Bullish · arXiv – CS AI · Mar 11 · 7/10
🧠Researchers have developed a new framework that enables dataset condensation for non-differentiable clinical AI models like decision trees and Cox regression, using differential privacy to create synthetic medical datasets. This breakthrough allows healthcare institutions to share condensed synthetic data while preserving patient privacy and maintaining model utility across classification and survival prediction tasks.
AI · Bullish · TechCrunch – AI · Mar 10 · 7/10
🧠Amazon has launched a healthcare AI assistant on its website and mobile app that can answer health questions, explain medical records, manage prescription renewals, and book appointments. This represents Amazon's significant expansion into AI-powered healthcare services, potentially disrupting traditional healthcare delivery models.
AI · Bullish · arXiv – CS AI · Mar 9 · 7/10
🧠Researchers developed AIRT, an AI-powered radiation therapy planning system that uses deep learning to generate complete prostate cancer treatment plans in under one second. The system processes CT scans and anatomical data to produce clinically viable radiation treatment plans 100x faster than current methods, demonstrating non-inferiority to existing commercial solutions.
🏢 Nvidia
AI · Neutral · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers propose a Trustworthy Federated Learning (TFL) framework that treats trust as a continuously maintained system condition rather than a static property, addressing challenges in AI systems with autonomous decision-making. The framework introduces Trust Report 2.0 as a privacy-preserving coordination blueprint for multi-stakeholder governance in federated learning deployments.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers developed MPFlow, a new zero-shot MRI reconstruction framework that uses multi-modal data and rectified flow to improve medical imaging quality. The system reduces tumor hallucinations by 15% while using 80% fewer sampling steps compared to existing diffusion methods, potentially advancing AI applications in medical diagnostics.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers developed WCC-Net, a 3D wavelet-based diffusion model that significantly improves low-dose PET imaging denoising while reducing patient radiation exposure. The AI framework uses frequency-domain structural priors to maintain anatomical accuracy and outperforms existing CNN, GAN, and diffusion baselines across multiple dose levels.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers propose Volumetric Directional Diffusion (VDD), a new AI method for medical image segmentation that addresses uncertainty in 3D lesion analysis. VDD anchors generative models to consensus priors to maintain anatomical accuracy while capturing expert disagreements, achieving state-of-the-art uncertainty quantification on multiple medical datasets.
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers introduced PulseLM, a large-scale dataset combining PPG cardiovascular sensor data with natural language processing for multimodal AI models. The dataset contains 1.31 million PPG segments with 3.15 million question-answer pairs, designed to enable language-based physiological reasoning in healthcare AI applications.
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers propose MIND, a reinforcement learning framework that improves AI-powered psychiatric consultation by addressing key challenges in diagnostic accuracy and clinical reasoning. The system uses a Criteria-Grounded Psychiatric Reasoning Bank to provide better clinical support and reduce inquiry drift during multi-turn patient interactions.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠A study reveals that 74% of healthcare AI research papers still use private datasets or don't share code, creating reproducibility issues that undermine trust in medical AI applications. Papers that embrace open practices by sharing both public datasets and code receive 110% more citations on average, demonstrating clear benefits for scientific impact.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers propose a new goal-driven risk assessment framework for LLM-powered systems, specifically targeting healthcare applications. The approach uses attack trees to identify detailed threat vectors combining adversarial AI attacks with conventional cyber threats, addressing security gaps in LLM system design.