y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#marl News & Analysis

9 articles tagged with #marl. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles
AIBullisharXiv – CS AI · May 277/10
🧠

Decoupled Delay Compensation: Enhancing Pre-trained MARL Policies via Learned Dynamics Filtering

Researchers propose a modular state-estimation layer that enhances pre-trained multi-agent reinforcement learning (MARL) policies by compensating for communication delays and packet loss through learned dynamics filtering. The plug-and-play approach combines gated transition models with Kalman filtering to estimate current states from delayed observations, demonstrating significant robustness improvements without requiring retraining of original policies.

AIBullisharXiv – CS AI · Mar 57/10
🧠

HALyPO: Heterogeneous-Agent Lyapunov Policy Optimization for Human-Robot Collaboration

Researchers developed HALyPO (Heterogeneous-Agent Lyapunov Policy Optimization), a new approach to improve stability in human-robot collaboration through multi-agent reinforcement learning. The method addresses the 'rationality gap' between human and robot learning by using Lyapunov stability conditions to prevent policy oscillations and divergence during training.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning

Researchers propose LMAC, an LLM-driven communication protocol for multi-agent reinforcement learning that enables agents to reconstruct shared state information more accurately and uniformly. The approach iteratively refines communication strategies using explicit state-awareness criteria, demonstrating substantial performance improvements over existing communication baselines across multiple MARL benchmarks.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Interaction-Breaking Adversarial Learning Framework for Robust Multi-Agent Reinforcement Learning

Researchers propose IBAL, an adversarial learning framework that makes multi-agent reinforcement learning systems robust against attacks that disrupt agent coordination through observation and action perturbations. The method addresses a gap in existing defenses by focusing on interaction-breaking attacks rather than value-oriented ones, demonstrating improved resilience across multiple scenarios.

AINeutralarXiv – CS AI · May 276/10
🧠

TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning

Researchers introduce TABX, a high-throughput multi-agent reinforcement learning simulator built on JAX that enables GPU-accelerated testing of cooperative AI algorithms. The framework prioritizes modularity and customization, allowing systematic investigation of emergent agent behaviors across varying task complexities with significantly reduced computational overhead.

AINeutralarXiv – CS AI · May 126/10
🧠

Beyond Self-Play: Hierarchical Reasoning for Continuous Motion in Closed-Loop Traffic Simulation

Researchers propose a hierarchical reinforcement learning framework that combines multi-agent interaction reasoning with continuous motion control to improve behavioral realism in traffic simulations. The approach outperforms self-play methods by better capturing socially aware driving behaviors while maintaining safety and efficiency in closed-loop SUMO simulations.

AINeutralarXiv – CS AI · May 116/10
🧠

Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies

Researchers introduce a family of deterministic games designed to test Multi-Agent Reinforcement Learning (MARL) scalability for decentralized UAV swarm control tasked with relaying critical data. While baseline policies using Dijkstra's algorithm perform comparably to standard MARL algorithms for small agent counts, existing MARL approaches demonstrate significant scalability limitations as swarm size increases.