101 articles tagged with #multi-agent-systems. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv โ CS AI ยท 6d ago6/10
๐ง Researchers developed the Strategic Courtroom Framework, a multi-agent simulation where LLM-based prosecution and defense teams engage in iterative legal argumentation with trait-conditioned personalities. Testing across 7,000+ simulated trials revealed that diverse teams with complementary traits outperform homogeneous ones, and a reinforcement learning system can dynamically optimize team composition, demonstrating language as a strategic action space in adversarial domains.
๐ง Gemini
AINeutralarXiv โ CS AI ยท Apr 66/10
๐ง Researchers analyzed 18 agent communication protocols for LLM systems, finding they excel at transport and structure but lack semantic understanding capabilities. The study reveals current protocols push semantic responsibilities into prompts and application logic, creating hidden interoperability costs and technical debt.
AIBullisharXiv โ CS AI ยท Mar 266/10
๐ง SafeSieve is a new algorithm for optimizing LLM-based multi-agent systems that reduces token usage by 12.4%-27.8% while maintaining 94.01% accuracy. The progressive pruning method combines semantic evaluation with performance feedback to eliminate redundant communication between AI agents.
AIBullisharXiv โ CS AI ยท Mar 266/10
๐ง Researchers introduce GoAgentNet, a new 6G networking architecture that uses AI agents to enable goal-oriented communication rather than simple data exchange. The system demonstrates significant improvements with up to 99% better energy efficiency and 72% higher task success rates in robotic applications.
AIBullisharXiv โ CS AI ยท Mar 176/10
๐ง Researchers introduced a multi-agent AI framework for whole-system software optimization that goes beyond local code improvements to analyze entire microservice architectures. The system uses coordinated agents for summarization, analysis, optimization, and verification, achieving 36.58% throughput improvement and 27.81% response time reduction in proof-of-concept testing.
AIBullisharXiv โ CS AI ยท Mar 166/10
๐ง Researchers propose AMRO-S, a new routing framework for multi-agent LLM systems that uses ant colony optimization to improve efficiency and reduce costs. The system addresses key deployment challenges like high inference costs and latency while maintaining performance quality through semantic-aware routing and interpretable decision-making.
AIBullisharXiv โ CS AI ยท Mar 166/10
๐ง Researchers developed an agentic AI framework using LLMs like Claude Opus 4.6 and GitHub Copilot to automate chemical process flowsheet modeling. The multi-agent system decomposes engineering tasks with one agent solving problems using domain knowledge and another implementing solutions in code for industrial simulations.
๐ข Anthropic๐ข Microsoft๐ง Claude
AINeutralarXiv โ CS AI ยท Mar 116/10
๐ง A new academic paper introduces context engineering as a discipline for managing AI agent decision-making environments, proposing a maturity model that includes prompt, context, intent, and specification engineering. The research addresses enterprise challenges in scaling multi-agent AI systems, with 75% of enterprises planning deployment within two years despite current scaling difficulties.
๐ข Google๐ข Anthropic
AIBullisharXiv โ CS AI ยท Mar 116/10
๐ง Researchers introduce Latent-DARM, a framework that bridges discrete diffusion language models and autoregressive models to improve multi-agent AI reasoning capabilities. The system achieved significant improvements on reasoning benchmarks, increasing accuracy from 27% to 36% on DART-5 while using less than 2.2% of the token budget of state-of-the-art models.
AIBullisharXiv โ CS AI ยท Mar 96/10
๐ง Researchers have developed MASFactory, a new graph-centric framework for orchestrating Large Language Model-based Multi-Agent Systems (MAS). The framework introduces 'Vibe Graphing,' which allows users to compile natural language instructions into executable workflow graphs, making complex AI agent coordination more accessible and reusable.
AIBullisharXiv โ CS AI ยท Mar 36/109
๐ง Researchers introduce TraceSIR, a multi-agent framework that analyzes execution traces from AI agentic systems to diagnose failures and optimize performance. The system uses three specialized agents to compress traces, identify issues, and generate comprehensive analysis reports, significantly outperforming existing approaches in evaluation tests.
AIBullisharXiv โ CS AI ยท Mar 35/104
๐ง Researchers developed a multi-agent AI system for medical triage that uses three specialized agents to improve patient classification accuracy. The system achieved 89.6% accuracy in primary department classification and 74.3% in secondary classification, addressing healthcare staffing shortages through automated pre-consultation.
AIBullisharXiv โ CS AI ยท Mar 37/107
๐ง Meta researchers introduced MetaMind, a cognitive world model for multi-agent systems that enables agents to understand and predict other agents' behaviors without centralized supervision or communication. The system uses a meta-theory of mind framework allowing agents to reason about goals and beliefs of others through self-reflective learning and analogical reasoning.
AIBullisharXiv โ CS AI ยท Mar 36/103
๐ง Researchers introduce SupervisorAgent, a lightweight framework that reduces token consumption in Multi-Agent Systems by 29.68% while maintaining performance. The system provides real-time supervision and error correction without modifying base agent architectures, validated across multiple AI benchmarks.
AIBullisharXiv โ CS AI ยท Mar 36/109
๐ง Researchers introduce the Observer-Situation Lattice (OSL), a unified mathematical framework for autonomous agents to reason about multiple perspectives in complex environments. The system addresses limitations in current AI approaches by providing a single coherent structure for belief management and Theory of Mind reasoning.
AIBullisharXiv โ CS AI ยท Mar 37/106
๐ง Researchers propose CeProAgents, a hierarchical multi-agent system that automates chemical process development using AI agents specialized in knowledge, concept, and parameter tasks. The system introduces CeProBench, a comprehensive benchmark for evaluating AI capabilities in chemical engineering applications.
AIBullisharXiv โ CS AI ยท Mar 37/1010
๐ง Researchers introduce the Agentic Hive framework for self-organizing multi-agent AI systems where autonomous micro-agents can be dynamically created, specialized, or destroyed based on resource availability and objectives. The framework applies economic theory to prove seven analytical results about equilibrium states, stability, and demographic cycles in variable AI agent populations.
AIBearisharXiv โ CS AI ยท Mar 37/106
๐ง Researchers discovered that subliminal prompting can create a 'thought virus' effect in multi-agent AI systems, where bias from one compromised agent spreads throughout the entire network. The study shows this attack vector can degrade truthfulness and create alignment risks across connected AI systems.
AIBearisharXiv โ CS AI ยท Mar 37/107
๐ง A new research paper analyzes economic equilibria between AI and human agents in trading scenarios, finding that unless agents can at least double their marginal utility from purchases, no trading will occur. The study reveals that more powerful AI agents may contribute zero utility to less capable agents in certain equilibria.
AINeutralarXiv โ CS AI ยท Mar 36/1012
๐ง Researchers introduce Silo-Bench, a benchmark revealing that multi-agent LLM systems can exchange information effectively but fail to integrate distributed data for correct reasoning. The study shows coordination overhead increases with scale, challenging the assumption that adding more agents can solve context limitations.
AIBullisharXiv โ CS AI ยท Mar 37/1010
๐ง Researchers have developed MedCollab, a multi-agent AI framework that uses structured argumentation and causal reasoning to improve clinical diagnosis accuracy. The system outperforms traditional LLMs by reducing medical hallucinations and providing more transparent, clinically compliant diagnostic processes through hierarchical consultation workflows.
AINeutralarXiv โ CS AI ยท Mar 37/109
๐ง Researchers introduce a novel multi-agent AI architecture that integrates Theory of Mind, internal beliefs, and symbolic solvers to improve collaborative decision-making in LLM-based systems. The study evaluates this architecture across different language models in resource allocation scenarios, revealing complex interactions between LLM capabilities and cognitive mechanisms.
AIBullisharXiv โ CS AI ยท Mar 26/1017
๐ง Researchers introduced IntentCUA, a multi-agent framework for computer automation that achieved 74.83% task success rate through intent-aligned planning and memory systems. The system uses coordinated agents (Planner, Plan-Optimizer, and Critic) to reduce error accumulation and improve efficiency in long-horizon desktop automation tasks.
AIBullisharXiv โ CS AI ยท Mar 26/1022
๐ง Researchers introduce RUMAD, a reinforcement learning framework that optimizes multi-agent AI debate systems by dynamically controlling communication topology. The system achieves over 80% reduction in computational costs while improving reasoning accuracy across benchmark tests, with strong generalization capabilities across different task domains.
AIBullisharXiv โ CS AI ยท Mar 26/1023
๐ง Researchers introduce CHIEF, a new framework that improves failure analysis in LLM-powered multi-agent systems by transforming execution logs into hierarchical causal graphs. The system uses oracle-guided backtracking and counterfactual attribution to better identify root causes of failures, outperforming existing methods on benchmark tests.