y0news
#multi-agent-systems8 articles
8 articles
AIBullisharXiv โ€“ CS AI ยท 4h ago9
๐Ÿง 

From Flat Logs to Causal Graphs: Hierarchical Failure Attribution for LLM-based Multi-Agent Systems

Researchers introduce CHIEF, a new framework that improves failure analysis in LLM-powered multi-agent systems by transforming execution logs into hierarchical causal graphs. The system uses oracle-guided backtracking and counterfactual attribution to better identify root causes of failures, outperforming existing methods on benchmark tests.

AIBullisharXiv โ€“ CS AI ยท 4h ago3
๐Ÿง 

RUMAD: Reinforcement-Unifying Multi-Agent Debate

Researchers introduce RUMAD, a reinforcement learning framework that optimizes multi-agent AI debate systems by dynamically controlling communication topology. The system achieves over 80% reduction in computational costs while improving reasoning accuracy across benchmark tests, with strong generalization capabilities across different task domains.

AIBullisharXiv โ€“ CS AI ยท 4h ago6
๐Ÿง 

CoMind: Towards Community-Driven Agents for Machine Learning Engineering

Researchers introduce CoMind, a multi-agent AI system that leverages community knowledge to automate machine learning engineering tasks. The system achieved a 36% medal rate on 75 past Kaggle competitions and outperformed 92.6% of human competitors in eight live competitions, establishing new state-of-the-art performance.

AINeutralarXiv โ€“ CS AI ยท 4h ago5
๐Ÿง 

Demystifying the Lifecycle of Failures in Platform-Orchestrated Agentic Workflows

Researchers present AgentFail, a dataset of 307 real-world failure cases from agentic workflow platforms, analyzing how multi-agent AI systems fail and can be repaired. The study reveals that failures in these low-code orchestrated AI workflows propagate differently than traditional software, making them harder to diagnose and fix.

AIBullisharXiv โ€“ CS AI ยท 4h ago6
๐Ÿง 

IntentCUA: Learning Intent-level Representations for Skill Abstraction and Multi-Agent Planning in Computer-Use Agents

Researchers introduced IntentCUA, a multi-agent framework for computer automation that achieved 74.83% task success rate through intent-aligned planning and memory systems. The system uses coordinated agents (Planner, Plan-Optimizer, and Critic) to reduce error accumulation and improve efficiency in long-horizon desktop automation tasks.

AINeutralarXiv โ€“ CS AI ยท 4h ago6
๐Ÿง 

LumiMAS: A Comprehensive Framework for Real-Time Monitoring and Enhanced Observability in Multi-Agent Systems

Researchers have developed LumiMAS, a comprehensive framework for monitoring and detecting failures in multi-agent systems that incorporate large language models. The framework features three layers: monitoring and logging, anomaly detection, and anomaly explanation with root cause analysis, addressing the unique challenges of observing entire multi-agent systems rather than individual agents.

AINeutralarXiv โ€“ CS AI ยท 4h ago0
๐Ÿง 

Integrating LLM in Agent-Based Social Simulation: Opportunities and Challenges

A research position paper examines the integration of Large Language Models (LLMs) in agent-based social simulations, highlighting both opportunities and limitations. The study proposes Hybrid Constitutional Architectures that combine classical agent-based models with small language models and LLMs to balance expressive flexibility with analytical transparency.