AIBullisharXiv – CS AI · May 117/10
🧠Researchers introduce MedExAgent, an AI system trained to perform clinical diagnosis through a POMDP framework that simulates real-world complexity including patient interaction, medical exams, and noisy data. The model uses supervised finetuning and reinforcement learning to balance diagnostic accuracy with cost-efficiency, achieving performance comparable to larger models while maintaining practical clinical constraints.
AIBullisharXiv – CS AI · May 117/10
🧠Researchers introduce MedAction, a new framework and dataset designed to improve how large language models perform clinical diagnosis by simulating real-world multi-turn diagnostic processes. The approach addresses fundamental limitations in current medical LLMs through a tree-structured distillation pipeline that generates high-quality diagnostic trajectories, achieving state-of-the-art performance among open-source models.
AIBullisharXiv – CS AI · Mar 46/103
🧠Researchers introduce PRISM, an EEG foundation model that demonstrates how diverse pretraining data leads to better clinical performance than narrow-source datasets. The study shows that geographically diverse EEG data outperforms larger but homogeneous datasets in medical diagnosis tasks, particularly achieving 12.3% better accuracy in distinguishing epilepsy from similar conditions.
$COMP
AIBullisharXiv – CS AI · Mar 37/105
🧠Researchers have developed DeepMedix-R1, a foundation model for chest X-ray interpretation that provides transparent, step-by-step reasoning alongside accurate diagnoses to address the black-box problem in medical AI. The model uses reinforcement learning to align diagnostic outputs with clinical plausibility and significantly outperforms existing models in report generation and visual question answering tasks.
AIBullisharXiv – CS AI · Mar 37/104
🧠Doctor-R1 is a new AI agent that combines accurate medical decision-making with strategic, empathetic patient consultation skills through reinforcement learning. The system outperforms existing open-source medical LLMs and proprietary models on clinical benchmarks while demonstrating superior communication quality and patient-centric performance.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce C-MIG, a retrieval-augmented generation framework that improves clinical diagnosis reasoning by using multi-view information gain instead of binary reward signals. The method outperforms existing RAG-RL approaches on medical benchmarks by better capturing semantically relevant information and addressing credit assignment challenges in healthcare AI systems.
AIBullisharXiv – CS AI · 3d ago6/10
🧠Researchers propose a case-aware medical image classification framework that leverages multimodal knowledge graphs to retrieve similar historical cases and integrate external clinical knowledge, improving diagnostic accuracy through interpretable evidence-based reasoning rather than relying solely on isolated visual analysis.
AINeutralarXiv – CS AI · May 16/10
🧠Researchers propose using large language models as graph structure refiners to improve EEG-based seizure detection by identifying and removing redundant connections in noisy neural signal data. A two-stage framework combining Transformer-based edge prediction with LLM validation demonstrates improved accuracy and more interpretable graph representations on the TUSZ dataset.
AIBullisharXiv – CS AI · Mar 66/10
🧠Research shows that multi-agent LLM systems using models from different vendors (o4-mini, Gemini-2.5-Pro, Claude-4.5-Sonnet) significantly outperform single-vendor teams in clinical diagnosis tasks. Mixed-vendor configurations achieve superior recall and accuracy by combining complementary strengths and reducing shared biases that affect homogeneous model teams.
🧠 Claude🧠 Gemini
AIBullisharXiv – CS AI · Mar 27/1015
🧠Researchers developed MACD, a Multi-Agent Clinical Diagnosis framework that enables large language models to self-learn clinical knowledge and improve medical diagnosis accuracy. The system achieved up to 22.3% improvement over clinical guidelines and 16% improvement over physician-only diagnosis when tested on 4,390 real-world patient cases.