y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#multi-agent-learning News & Analysis

8 articles tagged with #multi-agent-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles
AIBullisharXiv – CS AI · Mar 56/10
🧠

Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport

Researchers developed a new three-layer hierarchy called cognition-to-control (C2C) for human-robot collaboration that combines vision-language models with multi-agent reinforcement learning. The system enables sustained deliberation and planning while maintaining real-time control for collaborative manipulation tasks between humans and humanoid robots.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment

Researchers introduce TriAlign, a machine learning framework that addresses fairness issues in personalized large language models by ensuring universal truths remain consistent across different social groups. The method balances accuracy, fairness, and personalization through multi-agent reinforcement learning, reducing disparities in objective task performance while maintaining user preference adaptation.

AINeutralarXiv – CS AI · May 296/10
🧠

Recurrent Structural Policy Gradient for Partially Observable Mean Field Games

Researchers introduce Recurrent Structural Policy Gradient (RSPG), an algorithmic advancement for solving Mean Field Games with partial observability by combining policy gradient methods with structural knowledge of system dynamics. The method achieves significantly faster convergence than model-free approaches while enabling history-aware behavior, accompanied by MFAX, a new JAX-based research framework for MFG implementations.

AINeutralarXiv – CS AI · May 286/10
🧠

Personalized Observation Normalization for Federated Reinforcement Learning in Simulation Environments with Heterogeneity

Researchers propose a Personalized Observation Normalization (PON) method to address challenges in federated reinforcement learning across heterogeneous environments. The technique allows individual agents to maintain localized normalization statistics while collaborating on a shared policy, improving training efficiency and performance without compromising privacy.

AINeutralarXiv – CS AI · May 276/10
🧠

TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning

Researchers introduce TABX, a high-throughput multi-agent reinforcement learning simulator built on JAX that enables GPU-accelerated testing of cooperative AI algorithms. The framework prioritizes modularity and customization, allowing systematic investigation of emergent agent behaviors across varying task complexities with significantly reduced computational overhead.

AINeutralarXiv – CS AI · May 116/10
🧠

Experience Sharing in Mutual Reinforcement Learning for Heterogeneous Language Models

Researchers introduce Mutual Reinforcement Learning, a framework enabling heterogeneous language models to share training experiences while maintaining separate parameters and tokenizers. The system uses three mechanisms—Shared Experience Exchange, Multi-Worker Resource Allocation, and a Tokenizer Heterogeneity Layer—to coordinate reinforcement learning across incompatible model architectures, with outcome-level success transfer showing the best stability-support trade-off.

AINeutralarXiv – CS AI · May 96/10
🧠

Conversation for Non-verifiable Learning: Self-Evolving LLMs through Meta-Evaluation

Researchers introduce CoNL, a framework that enables large language models to improve themselves through multi-agent self-play without requiring ground-truth labels or external judges. The system uses critiques that successfully improve solutions as training signals, allowing models to jointly optimize both generation and evaluation capabilities for non-verifiable tasks like creative writing and ethical reasoning.

AIBullisharXiv – CS AI · Apr 146/10
🧠

Interactive Learning for LLM Reasoning

Researchers introduce ILR, a novel multi-agent learning framework that enables Large Language Models to enhance their independent reasoning through interactive training with other LLMs, then solve problems autonomously without re-executing the multi-agent system. The approach combines dynamic interaction strategies and perception calibration, delivering up to 5% performance improvements across mathematical, coding, and reasoning benchmarks.