AIBearisharXiv – CS AI · 3h ago7/10
🧠
Auditing medical multi-agent AI reveals risks of false consensus
Researchers introduced MedAgentAudit, a framework that reveals critical safety failures in medical multi-agent AI systems, finding that collaborative AI architectures frequently exhibit unsupported observations, evidence avoidance, and decision-making biases rather than genuine reasoning. The study across 14,400 cases and six AI architectures demonstrates that consensus-based medical AI systems are unreliable for clinical use without fundamental process-level improvements.