#biomedical-ai News & Analysis

30 articles tagged with #biomedical-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

30 articles

AINeutralarXiv – CS AI · Jun 87/10

🧠

MMBU: A Massive Multi-modal Biomedical Understanding Benchmark to Probe the Perception Capabilities of Vision-Language Models

Researchers introduced MMBU, the largest biomedical vision-language benchmark covering 35 medical imaging modalities with structured metadata. Testing 15 open-weight and 2 frontier VLMs revealed that while medical adaptation helps some models, high reported accuracy on existing benchmarks masks significant deficiencies in visual perception and domain generalization.

AIBullisharXiv – CS AI · Jun 57/10

🧠

Towards World Models in Biomedical Research

Researchers propose biomedical world models as an AI paradigm that learns dynamic representations of biological systems to simulate future states and predict responses to interventions. These models could accelerate drug discovery, personalized medicine, and surgical planning by enabling simulation-based experimentation before real-world testing.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Ryze: Evidence-Enriched Data Synthesis from Biomedical Papers

Researchers introduce Ryze, an automated system that converts biomedical papers into evidence-enriched training datasets for specialized vision-language models. The resulting BioVLM-8B model achieves 48.0% accuracy on LAB-Bench, outperforming GPT-4V by 3.8 percentage points while costing under $200 to develop.

🧠 GPT-5

AIBullisharXiv – CS AI · Jun 27/10

🧠

EvoPool: Evolutionary Programmatic Annotation for Label-Efficient Specialized Supervision

EvoPool is an evolutionary multi-agent framework that generates specialized annotation code to label training data more efficiently than LLMs for domain-specific tasks. The system operates 4,500-31,000x faster than LLM annotation while achieving superior performance across biomedical, legal, and reasoning tasks, with improvements up to +0.301 macro-F1 on specialized benchmarks.

AIBullisharXiv – CS AI · May 287/10

🧠

CaMBRAIN: Real-time, Continuous EEG Inference with Causal State Space Models

Researchers introduce CaMBRAIN, a causal state space model based on Mamba architecture that enables real-time, continuous EEG signal processing with linear-time complexity. The model achieves state-of-the-art results across multiple datasets while processing signals >10x faster than existing attention-based methods, overcoming critical limitations in handling variable-length brain activity recordings.

AIBullisharXiv – CS AI · Mar 56/10

🧠

Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning

Researchers developed Uni-NTFM, a new foundation model for EEG signal analysis that incorporates biological neural mechanisms and achieved record-breaking 1.9 billion parameters. The model was pre-trained on 28,000 hours of EEG data and outperformed existing models across nine downstream tasks by aligning architecture with actual brain functionality.

AIBullisharXiv – CS AI · Jun 256/10

🧠

Privacy-preserving federated tensor decomposition of single-cell immune data: recovering multicellular programs across institutions

Researchers developed a federated tensor decomposition method that enables privacy-preserving analysis of single-cell immune data across multiple institutions without sharing raw patient data. The approach recovers multicellular immune programs—coordinated patterns of gene expression across cell types—while protecting patient privacy through secure aggregation, demonstrated on systemic lupus erythematosus and COVID-19 datasets.

AIBullisharXiv – CS AI · Jun 236/10

🧠

BioInsight: Multi-Agent Orchestration for Interactive Biomedical Knowledge Discovery

BioInsight is a multi-agent AI system that transforms static biomedical reports into interactive, evidence-centered interfaces for disease research. The system combines evidence retrieval, mechanistic reasoning, and citation normalization to help researchers inspect findings, assess uncertainty, and refine hypotheses more effectively than traditional text-based outputs.

AIBullisharXiv – CS AI · Jun 236/10

🧠

Contrastive and Adaptive Multi-modal Masked Autoencoder for Spatial Transcriptomics

Researchers propose CAMMST, a Masked Autoencoder framework that predicts gene expression from histology images by leveraging small amounts of spatial transcriptomics data as genetic anchors. The method combines visual and genetic modalities through contrastive learning, achieving superior performance with minimal transcriptomic coverage and addressing the cost limitations of spatial transcriptomics profiling.

AIBullisharXiv – CS AI · Jun 236/10

🧠

Rethinking the Adaptation of Vision Foundation Models for Efficient Cell Segmentation

Researchers introduce EffiCell-Seg, a framework that adapts Vision Foundation Models for cell segmentation without fine-tuning the visual encoder, achieving state-of-the-art performance with 130x fewer trainable parameters than conventional approaches. The method leverages pretrained model representations to extract structural priors for efficient cellular imaging analysis.

AINeutralarXiv – CS AI · Jun 235/10

🧠

Exploration of LLMs, EEG, and behavioral data to measure and support attention and sleep

Researchers explored using large language models to detect and improve attention and sleep by analyzing EEG and physical activity data. While LLMs successfully generated personalized sleep improvement suggestions based on behavioral text data, the study found that directly detecting attention states and sleep stages from EEG data requires additional training data and domain expertise.

AIBullisharXiv – CS AI · Jun 196/10

🧠

Ensembles of Large Language Models for Identifying EQ-5D Studies in PubMed Based on Their Abstracts

Researchers developed an ensemble machine learning approach using Google's Gemini and Gemma large language models to automatically identify EQ-5D health quality-of-life studies in PubMed abstracts. The combined model achieved 0.74 F1-score and accuracy, demonstrating that ensemble methods outperform individual LLMs for biomedical document classification tasks.

🧠 Gemini

AINeutralarXiv – CS AI · Jun 196/10

🧠

A Deep Generative Model for Resting-State EEG Synthesis and Transferable Representation Learning

REST-GAN introduces a generative adversarial network framework for synthesizing resting-state EEG signals while learning transferable representations without manual feature engineering. The model demonstrates strong performance in reproducing key EEG properties and outperforms direct raw-signal approaches on demographic classification tasks, offering a computationally efficient alternative to existing EEG analysis methods.

AINeutralarXiv – CS AI · Jun 115/10

🧠

Skill-Augmented AI Agents for Medical Research Analysis: An Exploratory Multi-Model Human Evaluation in an NSCLC Transcriptomic Biomarker Task

Researchers evaluated whether AI agents equipped with specialized medical research skills produce higher-quality outputs than native language models on transcriptomic biomarker analysis tasks. While skill-augmented AI showed directional improvements in expert-rated quality, the gains were modest and within the margin of expert-rating noise, suggesting larger, more rigorous studies are needed.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Correlation Is Not Enough: Embedding Human Metadata for Individual Causal Discovery

Researchers demonstrate that pretrained biomedical language models fail catastrophically at cross-domain discrimination, assigning high similarity scores (0.76-0.92) to unrelated concepts. They propose BODHI, a contrastive learning approach that improves domain separation 2.3x while maintaining correlation accuracy, and show that optimized inference achieves 133x latency reduction on specialized hardware.

AIBullisharXiv – CS AI · Jun 46/10

🧠

Beyond Prompt-Based Planning: MCP-Native Graph Planning-based Biomedical Agent System

Researchers introduce BioManus, an AI agent system that uses graph-based planning and standardized Model Context Protocol (MCP) servers to automate biomedical workflows. The system addresses scalability challenges by organizing bioinformatics tools into structured capability graphs rather than relying on flat prompt-based retrieval, achieving significant improvements in execution accuracy and context efficiency.

AIBullisharXiv – CS AI · Jun 26/10

🧠

A Novel Data Augmentation Strategy for Robust Deep Learning Classification of Biomedical Time-Series Data: Application to ECG and EEG Analysis

Researchers propose a unified deep learning framework combining ResNet-based CNNs with attention mechanisms and novel data augmentation techniques for analyzing biomedical time-series signals like ECG and EEG. The approach achieves near-perfect accuracy (99.78-100%) on benchmark datasets while remaining lightweight enough for wearable deployment, addressing critical gaps in multi-signal analysis and class imbalance handling.

AINeutralarXiv – CS AI · Jun 26/10

🧠

UF-AMA: A unified framework for cross-domain emotion recognition via adaptive multimodal alignment

Researchers introduce UF-AMA, a unified framework for cross-domain emotion recognition using multimodal physiological signals like EEG and eye-tracking data. The model employs adaptive alignment mechanisms and multi-level domain adaptation to achieve state-of-the-art performance in cross-subject and cross-session emotion recognition tasks.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Plausibility Is Not Prediction: Contrastive Evidence for LLM-Based Cellular Perturbation Reasoning

Researchers demonstrate that large language models fail to accurately predict gene expression changes in cellular perturbation experiments despite producing biologically plausible explanations. They introduce CORE, a contrastive learning method that significantly improves prediction accuracy by organizing evidence from related perturbations rather than evaluating them in isolation.

AINeutralarXiv – CS AI · Jun 16/10

🧠

HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs

HypoAgent is a new AI framework that uses multiple specialized agents to generate logical hypotheses from knowledge graphs through interactive dialogue. The system excels at understanding evolving user intent across multi-turn conversations and diagnosing why generated hypotheses fail, achieving state-of-the-art performance on both commonsense and biomedical knowledge graphs.

AINeutralarXiv – CS AI · May 286/10

🧠

Do Clinical Models Change Treatment Decisions?

Researchers introduce ClinPivot, a benchmark testing whether clinical AI models adjust treatment decisions when patient contexts change. The study reveals that strong medical QA performance does not correlate with sound clinical decision-making, with leading models often failing to modify treatment choices appropriately when clinical constraints shift.

AINeutralarXiv – CS AI · May 286/10

🧠

A Multi-dimensional Framework for Evaluating Generalization in EEG Foundation Models

Researchers propose a multi-dimensional evaluation framework for EEG foundation models that tests performance under realistic biomedical constraints like limited labeled data and reduced sensor coverage. Analysis of models including LaBraM, CSBrain, and CBraMod reveals foundation models excel at long-context tasks but struggle with short-window Brain-Computer Interface applications and channel constraints compared to supervised alternatives.

AINeutralarXiv – CS AI · May 286/10

🧠

BIRDNet: Mining and Encoding Boolean Implication Knowledge Graphs as Interpretable Deep Neural Networks

Researchers introduce BIRDNet, a neurosymbolic deep learning architecture that mines Boolean implication relationships from tabular data and encodes them as sparse, interpretable neural networks. The model achieves near-baseline performance on biomedical datasets while using 96× fewer active parameters and maintaining human-readable symbolic rules without external rule bases.

AINeutralarXiv – CS AI · May 276/10

🧠

Can Broad Biomedical Knowledge be Contextualized into Scenario-Grounded Propositions?

Researchers introduce SCENE, a multi-agent AI framework that transforms general biomedical knowledge into specific, evidence-supported hypotheses grounded in experimental data. The system successfully identifies patient subgroups with different treatment responses in clinical trials and context-specific biological responses in genomic studies, bridging the gap between broad theoretical knowledge and actionable dataset-specific insights.

AIBullisharXiv – CS AI · May 276/10

🧠

BioFormer: Rethinking Cross-Subject Generalization via Spectral Structural Alignment in Biomedical Time-Series

BioFormer, a new machine learning framework, addresses cross-subject generalization in biomedical time-series analysis by using spectral structural alignment to suppress individual variability. The model achieves 6% F1-score improvements over 12 baselines through frequency-band alignment and adaptive normalization techniques.

Page 1 of 2Next →