y0news
#multi-agent5 articles
5 articles
AIBullisharXiv โ€“ CS AI ยท 6h ago6
๐Ÿง 

MACD: Multi-Agent Clinical Diagnosis with Self-Learned Knowledge for LLM

Researchers developed MACD, a Multi-Agent Clinical Diagnosis framework that enables large language models to self-learn clinical knowledge and improve medical diagnosis accuracy. The system achieved up to 22.3% improvement over clinical guidelines and 16% improvement over physician-only diagnosis when tested on 4,390 real-world patient cases.

AIBullisharXiv โ€“ CS AI ยท 6h ago5
๐Ÿง 

OM2P: Offline Multi-Agent Mean-Flow Policy

Researchers propose OM2P, a new offline multi-agent reinforcement learning algorithm that achieves efficient one-step action sampling using mean-flow models. The approach delivers up to 3.8x reduction in GPU memory usage and 10.8x speed-up in training time compared to existing diffusion and flow-based models.

AIBullisharXiv โ€“ CS AI ยท 6h ago1
๐Ÿง 

ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation

Researchers developed ProductResearch, a multi-agent AI framework that creates synthetic training data to improve e-commerce shopping agents. The system uses multiple AI agents to generate comprehensive product research trajectories, with experiments showing a compact model fine-tuned on this synthetic data significantly outperforming base models in shopping assistance tasks.

AINeutralarXiv โ€“ CS AI ยท 6h ago1
๐Ÿง 

Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration

Researchers propose OVMSE, a new framework for Offline-to-Online Multi-Agent Reinforcement Learning that addresses key challenges in transitioning from offline training to online fine-tuning. The framework introduces Offline Value Function Memory and Sequential Exploration strategies to improve sample efficiency and performance in multi-agent environments.

AINeutralarXiv โ€“ CS AI ยท 6h ago1
๐Ÿง 

Heterogeneous Multi-Agent Reinforcement Learning with Attention for Cooperative and Scalable Feature Transformation

Researchers propose a new multi-agent reinforcement learning framework that uses three cooperative agents with attention mechanisms to automate feature transformation for machine learning models. The approach addresses key limitations in existing automated feature engineering methods, including dynamic feature expansion instability and insufficient agent cooperation.