159 articles tagged with #medical-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv โ CS AI ยท Mar 36/107
๐ง Researchers developed an event-based evaluation framework for LLM-generated clinical summaries of remote monitoring data, revealing that models with high semantic similarity often fail to capture clinically significant events. A vision-based approach using time-series visualizations achieved the best clinical event alignment with 45.7% abnormality recall.
$NEAR
AIBullisharXiv โ CS AI ยท Mar 37/108
๐ง Researchers introduce CARE, an evidence-grounded agentic framework for medical AI that improves clinical accountability by decomposing tasks into specialized modules rather than using black-box models. The system achieves 10.9% better accuracy than state-of-the-art models by incorporating explicit visual evidence and coordinated reasoning that mimics clinical workflows.
AIBullisharXiv โ CS AI ยท Mar 36/106
๐ง Researchers created OpenRad, a curated repository containing approximately 1,700 open-access AI models for radiology. The platform aggregates scattered radiology AI research into a standardized, searchable database that includes model weights, interactive applications, and spans all imaging modalities and radiology subspecialties.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers propose REMIND, a framework for medical multi-modal AI learning that addresses the challenge of missing data across multiple modalities. The solution uses a Mixture-of-Experts architecture to handle long-tail distributions of modality combinations and shows superior performance on real-world medical datasets.
AIBullisharXiv โ CS AI ยท Mar 36/108
๐ง Researchers developed SurgFusion-Net, a multimodal AI system for assessing surgical skills in robotic-assisted surgery. The system introduces new clinical datasets and fusion techniques that outperform existing baselines, addressing the domain gap between simulation and real clinical environments.
AIBullisharXiv โ CS AI ยท Mar 37/107
๐ง Researchers have developed CT-Flow, an AI framework that mimics how radiologists actually work by using tools interactively to analyze 3D CT scans. The system achieved 41% better diagnostic accuracy than existing models and 95% success in autonomous tool use, potentially revolutionizing clinical radiology workflows.
AIBullisharXiv โ CS AI ยท Mar 37/106
๐ง Researchers developed M-Gaussian, a new AI framework that adapts 3D Gaussian Splatting for efficient multi-stack MRI reconstruction. The method achieves 40.31 dB PSNR while being 14 times faster than existing implicit neural representation methods, offering improved balance between quality and computational efficiency.
AINeutralarXiv โ CS AI ยท Mar 36/107
๐ง Researchers fine-tuned the Llama 2 7B model using real patient-doctor interaction transcripts to improve medical query responses, but found significant discrepancies between automatic similarity metrics and GPT-4 evaluations. The study highlights the challenges in evaluating AI medical models and recommends human medical expert review for proper validation.
AIBullisharXiv โ CS AI ยท Mar 37/107
๐ง Researchers developed InstructX2X, a new AI model for generating counterfactual medical images that provides interpretable explanations and prevents unintended modifications. The model achieves state-of-the-art performance in creating high-quality chest X-ray images with visual guidance maps for medical applications.
AIBullisharXiv โ CS AI ยท Mar 36/1010
๐ง Researchers propose ClinCoT, a new framework for medical AI that improves Visual Language Models by grounding reasoning in specific visual regions rather than just text. The approach reduces factual hallucinations in medical AI systems by using visual chain-of-thought reasoning with clinically relevant image regions.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers propose TC-SSA, a token compression framework that enables large vision-language models to process gigapixel pathology images by reducing visual tokens to 1.7% of original size while maintaining diagnostic accuracy. The method achieves 78.34% overall accuracy on SlideBench and demonstrates strong performance across multiple cancer classification tasks.
AIBullisharXiv โ CS AI ยท Mar 36/106
๐ง Researchers developed TARSE, a new AI system for clinical decision-making that retrieves relevant medical skills and experiences from curated libraries to improve reasoning accuracy. The system performs test-time adaptation to align language models with clinically valid logic, showing improvements over existing medical AI baselines in question-answering benchmarks.
AINeutralarXiv โ CS AI ยท Mar 37/108
๐ง The MAMA-MIA Challenge introduced a large-scale benchmark for AI-powered breast cancer tumor segmentation and treatment response prediction using MRI data from 1,506 US patients for training and 574 European patients for testing. Results from 26 international teams revealed significant performance variability and trade-offs between accuracy and fairness across demographic subgroups when AI models were tested across different institutions and continents.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers introduce AG-VAS, a new AI framework that uses large multimodal models for zero-shot visual anomaly segmentation. The system employs learnable semantic anchor tokens and achieves state-of-the-art performance on industrial and medical benchmarks without requiring training data for specific anomaly types.
AINeutralarXiv โ CS AI ยท Mar 37/107
๐ง Researchers identify a critical flaw in Vision-Language Model evaluation for radiology, where high benchmark scores mask models' failure to generate clinically specific terminology. They propose new metrics including Clinical Association Displacement (CAD) to measure bias and clinical signal loss across demographic groups.
AIBullisharXiv โ CS AI ยท Mar 36/107
๐ง Researchers have developed QIME, a new framework for creating interpretable medical text embeddings that uses ontology-grounded questions to represent biomedical text. Unlike black-box AI models, QIME provides clinically meaningful explanations while achieving performance close to traditional dense embeddings in medical text analysis tasks.
AIBullisharXiv โ CS AI ยท Mar 36/104
๐ง Researchers developed MAP-Diff, a multi-anchor guided diffusion framework that improves 3D whole-body PET scan denoising by using intermediate-dose scans as trajectory anchors. The method achieves significant improvements in image quality metrics, increasing PSNR from 42.48 dB to 43.71 dB while reducing radiation exposure for patients.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Researchers developed a detection-gated AI pipeline combining YOLOv8 and U-Net for accurate glottal segmentation in medical videoendoscopy. The system achieved state-of-the-art performance with zero-shot transfer capabilities across different clinical datasets, enabling real-time extraction of vocal function biomarkers at 35 frames per second.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Researchers propose Adaptive Confidence Regularization (ACR), a new framework for detecting failures in multimodal AI systems used in critical applications like autonomous vehicles and medical diagnostics. The approach uses confidence degradation detection and synthetic failure generation to improve reliability of AI predictions in high-stakes scenarios.
AIBullisharXiv โ CS AI ยท Mar 36/104
๐ง Researchers introduce BoxMed-RL, a new AI framework that uses chain-of-thought reasoning and reinforcement learning to generate spatially verifiable radiology reports. The system mimics radiologist workflows by linking visual findings to precise anatomical locations, achieving 7% improvement over existing methods in key performance metrics.
$LINK
AINeutralarXiv โ CS AI ยท Mar 36/104
๐ง Researchers introduced EHR-ChatQA, a new benchmark for testing AI agents that interact with Electronic Health Record databases through natural language queries. The benchmark reveals significant reliability gaps in current state-of-the-art LLMs, with success rates dropping substantially when consistency across multiple trials is required.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Researchers propose a new medical alignment paradigm for large language models that addresses the shortcomings of current reinforcement learning approaches in high-stakes medical question answering. The framework introduces a multi-dimensional alignment matrix and unified optimization mechanism to simultaneously optimize correctness, safety, and compliance in medical AI applications.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Research shows that predictive AI deployment during medical training significantly improves diagnostic accuracy for novices, with the greatest benefits occurring when AI is used in both training and practice phases. The study found that AI integration not only enhances individual performance but also affects error diversity across groups, impacting collective decision-making quality.
AIBullisharXiv โ CS AI ยท Mar 26/1015
๐ง Researchers developed HMKGN, a hierarchical multi-scale graph network for cancer survival prediction using whole-slide images. The AI model outperformed existing methods by 10.85% in concordance indices across four cancer datasets, demonstrating improved accuracy in predicting patient survival outcomes.
AIBearisharXiv โ CS AI ยท Mar 27/1019
๐ง Researchers propose a new risk-sensitive framework for evaluating AI hallucinations in medical advice that considers potential harm rather than just factual accuracy. The study reveals that AI models with similar performance show vastly different risk profiles when generating medical recommendations, highlighting critical safety gaps in current evaluation methods.