AIBullisharXiv – CS AI · May 277/10
🧠Researchers propose a modular state-estimation layer that enhances pre-trained multi-agent reinforcement learning (MARL) policies by compensating for communication delays and packet loss through learned dynamics filtering. The plug-and-play approach combines gated transition models with Kalman filtering to estimate current states from delayed observations, demonstrating significant robustness improvements without requiring retraining of original policies.
AIBullisharXiv – CS AI · Mar 57/10
🧠Researchers developed HALyPO (Heterogeneous-Agent Lyapunov Policy Optimization), a new approach to improve stability in human-robot collaboration through multi-agent reinforcement learning. The method addresses the 'rationality gap' between human and robot learning by using Lyapunov stability conditions to prevent policy oscillations and divergence during training.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers developed a method combining multi-agent deep reinforcement learning with explainable AI techniques to optimize drag reduction in turbulent flows, achieving 34.44% drag reduction with only 0.43% energy input—significantly outperforming traditional opposition control methods.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers propose LMAC, an LLM-driven communication protocol for multi-agent reinforcement learning that enables agents to reconstruct shared state information more accurately and uniformly. The approach iteratively refines communication strategies using explicit state-awareness criteria, demonstrating substantial performance improvements over existing communication baselines across multiple MARL benchmarks.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose IBAL, an adversarial learning framework that makes multi-agent reinforcement learning systems robust against attacks that disrupt agent coordination through observation and action perturbations. The method addresses a gap in existing defenses by focusing on interaction-breaking attacks rather than value-oriented ones, demonstrating improved resilience across multiple scenarios.
AINeutralarXiv – CS AI · May 276/10
🧠Researchers introduce TABX, a high-throughput multi-agent reinforcement learning simulator built on JAX that enables GPU-accelerated testing of cooperative AI algorithms. The framework prioritizes modularity and customization, allowing systematic investigation of emergent agent behaviors across varying task complexities with significantly reduced computational overhead.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers propose a hierarchical reinforcement learning framework that combines multi-agent interaction reasoning with continuous motion control to improve behavioral realism in traffic simulations. The approach outperforms self-play methods by better capturing socially aware driving behaviors while maintaining safety and efficiency in closed-loop SUMO simulations.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers introduce a family of deterministic games designed to test Multi-Agent Reinforcement Learning (MARL) scalability for decentralized UAV swarm control tasked with relaying critical data. While baseline policies using Dijkstra's algorithm perform comparably to standard MARL algorithms for small agent counts, existing MARL approaches demonstrate significant scalability limitations as swarm size increases.
AIBullisharXiv – CS AI · Mar 176/10
🧠Researchers propose MA-VLCM, a framework that uses pretrained vision-language models as centralized critics in multi-agent reinforcement learning instead of learning critics from scratch. This approach significantly improves sample efficiency and enables zero-shot generalization while producing compact policies suitable for resource-constrained robots.