AIBullisharXiv – CS AI · Jun 57/10
🧠Researchers propose PACT, a new protocol for multi-agent AI systems that compresses inter-agent communication into compact action-state records, reducing token usage by up to 50% while maintaining or improving task performance. The approach addresses a critical efficiency bottleneck in large language model-based multi-agent systems, with demonstrated improvements in production coding applications.
AIBullisharXiv – CS AI · May 297/10
🧠Researchers introduce Meta-Team, an experience-driven framework that enables multi-agent LLM systems to collaboratively self-evolve by learning from their own execution failures. The system coordinates post-task communication among agents to identify and implement improvements across individual behaviors, inter-agent coordination, and team-level organization, demonstrating consistent performance gains across six benchmarks.
AIBullisharXiv – CS AI · May 287/10
🧠Researchers introduce AutoScientists, a decentralized multi-agent AI system that autonomously conducts long-running scientific experiments by self-organizing teams, critiquing proposals, and sharing failures. The system outperforms single-agent approaches across biomedical machine learning, language model optimization, and protein prediction tasks, achieving significant improvements in speed and accuracy.
AINeutralarXiv – CS AI · May 17/10
🧠Researchers from arXiv demonstrate that multi-agent AI systems built on large language models achieve dramatically different performance levels based on their organizational structure, with governance topology showing a 57+ percentage point performance gap. The study translates seven historical political institutions into executable multi-agent architectures, revealing that optimal organizational design shifts systematically with model capability and task requirements.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers introduce REDEREF, a training-free controller that improves multi-agent LLM system efficiency by 28% token usage reduction and 17% fewer agent calls through probabilistic routing and belief-guided delegation. The system uses Thompson sampling and reflection-driven re-routing to optimize agent coordination without requiring model fine-tuning.
AIBullisharXiv – CS AI · Jun 116/10
🧠Researchers introduce OMAD, an online multi-agent reinforcement learning framework that integrates diffusion-based generative models for improved policy coordination. The method achieves 2.5-5x improvements in sample efficiency across benchmark tasks by using relaxed policy objectives and joint distributional value functions to enable effective exploration without requiring tractable likelihood calculations.
AI × CryptoBullishHugging Face Blog · Jun 56/10
🤖Thousand Token Wood announces the deployment of a multi-agent economy system operating on a 3-billion parameter language model, enabling autonomous agents to interact, trade, and coordinate within a tokenized ecosystem. This development represents a practical implementation of decentralized AI agents at scale, combining language models with blockchain incentive structures.
AINeutralarXiv – CS AI · Jun 46/10
🧠Researchers introduce SMAC-Talk, a benchmark environment that extends the StarCraft Multi-Agent Challenge to evaluate how large language models coordinate and communicate in cooperative multi-agent settings. The framework tests LLM agents under realistic constraints including partial observability, decentralized control, and adversarial deception, using Qwen models to examine how reasoning, memory, and scale impact agent coordination.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers propose Post-Deterministic Distributed Systems (PDDS), a new framework for coordinating infrastructure where autonomous agents, stochastic models, and deterministic code coexist—challenging decades-old assumptions in distributed computing that relied on predictable, deterministic participant behavior.
AIBullisharXiv – CS AI · Jun 26/10
🧠Researchers propose DySCo, a dynamic sparse communication mechanism for LLM-based multi-agent systems that reduces computational overhead by selectively routing messages between agents rather than using full broadcast. The approach maintains consensus quality while cutting token costs and latency that scale quadratically with agent count, addressing a key efficiency bottleneck in collaborative AI reasoning systems.
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers present MARFT (Multi-Agent Reinforcement Fine-Tuning), a framework for optimizing LLM-based multi-agent systems using reinforcement learning. The work introduces Flex-MG, a new Markov Game formulation, and addresses key challenges in applying traditional MARL to collaborative AI systems, providing open-source implementation for advancing adaptive agentic systems.
AINeutralarXiv – CS AI · May 276/10
🧠UnityMAS-O is a new reinforcement learning optimization framework that enables LLM-based multi-agent systems to be trained end-to-end rather than manually orchestrated. The framework treats entire agent workflows as optimization units and demonstrates performance improvements across QA, search, and code generation tasks, particularly benefiting smaller models.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers propose a critique-and-routing controller for multi-agent LLM systems that iteratively refines outputs through sequential decision-making rather than one-shot routing. The method uses reinforcement learning with agent-utilization constraints to achieve performance approaching the strongest agent while reducing computational calls by over 75%, advancing coordination efficiency in heterogeneous AI systems.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers present STAR, a failure-aware routing framework for multi-agent AI systems that handles spatiotemporal reasoning tasks by intelligently routing between specialist agents based on typed failure states rather than generic success/failure signals. The system learns recovery transitions from execution traces and demonstrates improved performance across multiple benchmarks, suggesting that explicit failure-aware routing is more effective than implicit language-based decision-making in complex reasoning tasks.
AINeutralarXiv – CS AI · May 116/10
🧠TeamBench is a new benchmark evaluating multi-agent AI coordination under enforced role separation, revealing that prompt-only instructions fail to prevent role violations and that agent teams often underperform single agents on well-solved tasks. The study demonstrates that passing rates can mask coordination failures and misaligned team dynamics.