#multi-agent News & Analysis

97 articles tagged with #multi-agent. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

97 articles

AIBullishAI News · Jun 197/10

🧠

SAP and Google Cloud deploy agentic commerce architecture

SAP and Google Cloud have launched an agentic commerce architecture designed to automate multi-agent marketing and retail operations at enterprise scale. The partnership addresses a critical gap where 78% of businesses view AI as essential for customer retention by 2026, yet fewer than 40% of companies effectively share customer data across CRM and customer experience platforms.

AIBullisharXiv – CS AI · Jun 197/10

🧠

MEAL: A Benchmark for Continual Multi-Agent Reinforcement Learning

Researchers introduce MEAL, the first benchmark for continual multi-agent reinforcement learning, which uses JAX and GPU acceleration to enable training on sequences of 100 tasks in hours rather than days. The work reveals that longer task sequences expose failure modes invisible in traditional small-scale benchmarks, addressing a critical gap in RL research where computational constraints have limited study to only 3-10 sequential tasks.

AIBullisharXiv – CS AI · Jun 37/10

🧠

AUDITFLOW: Executable Symbolic Environments for Structured Financial Reporting Verification

Researchers introduced AuditFlow, a multi-agent AI framework that combines language models with symbolic environments to verify structured financial reporting. The system achieved 82% accuracy in audit verification by separating adaptive search from deterministic symbolic checks, demonstrating that deterministic verification—not language models alone—drives reliable audit outcomes.

🧠 GPT-5

AINeutralarXiv – CS AI · May 127/10

🧠

EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents

Researchers introduce EnactToM, a benchmark testing whether AI agents can understand and act on others' beliefs in multi-agent embodied environments. Current frontier models achieve 0% on functional theory of mind tasks, revealing a critical gap in AI reasoning capabilities despite performing better on direct belief questions.

AI × CryptoNeutralarXiv – CS AI · Apr 77/10

🤖

Governance-Constrained Agentic AI: Blockchain-Enforced Human Oversight for Safety-Critical Wildfire Monitoring

Researchers propose a blockchain-based AI system for wildfire monitoring that requires mandatory human authorization before issuing alerts. The system uses smart contracts to enforce governance constraints on autonomous AI agents, combining UAV monitoring with cryptographic verification to prevent false alarms and ensure accountability.

AIBullisharXiv – CS AI · Apr 77/10

🧠

ROSClaw: A Hierarchical Semantic-Physical Framework for Heterogeneous Multi-Agent Collaboration

Researchers introduce ROSClaw, a new AI framework that integrates large language models with robotic systems to improve multi-agent collaboration and long-horizon task execution. The framework addresses critical gaps between semantic understanding and physical execution by using unified vision-language models and enabling real-time coordination between simulated and real-world robots.

AI × CryptoNeutralarXiv – CS AI · Apr 77/10

🤖

PolySwarm: A Multi-Agent Large Language Model Framework for Prediction Market Trading and Latency Arbitrage

PolySwarm is a new multi-agent AI framework that uses 50 diverse large language models to trade on prediction markets like Polymarket, combining swarm intelligence with arbitrage strategies. The system outperformed single-model baselines in probability calibration and includes latency arbitrage capabilities to exploit pricing inefficiencies across markets.