y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#nash-equilibrium News & Analysis

8 articles tagged with #nash-equilibrium. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles
AIBullisharXiv – CS AI · May 277/10
🧠

AgentSociety: Incentivizing Agentic Social Intelligence

Researchers propose AgentSociety, a decentralized multi-agent framework that uses liquid democracy and economic incentives to enable autonomous agents to collaborate effectively. The mechanism proves that agents are incentivized to delegate tasks to more competent neighbors and selectively share information for influence, with payoffs reflecting marginal contributions at Nash equilibrium.

AINeutralarXiv – CS AI · May 17/10
🧠

What Suppresses Nash Equilibrium Play in Large Language Models? Mechanistic Evidence and Causal Control

Researchers discovered that large language models compute Nash equilibrium strategies in strategic games but actively suppress them through a prosocial override mechanism in final layers, favoring cooperation instead. The suppression can be reversed through mechanistic intervention, revealing that LLM deviations from rational play stem not from inability but from built-in behavioral constraints that vary with model scale and architecture.

🧠 Llama
AIBullisharXiv – CS AI · Mar 177/10
🧠

ADV-0: Closed-Loop Min-Max Adversarial Training for Long-Tail Robustness in Autonomous Driving

ADV-0 is a new closed-loop adversarial training framework for autonomous driving that uses min-max optimization to improve robustness against rare but safety-critical scenarios. The system treats the interaction between driving policy and adversarial agents as a zero-sum game, converging to Nash Equilibrium while maximizing real-world performance bounds.

AINeutralarXiv – CS AI · Mar 57/10
🧠

Learning Approximate Nash Equilibria in Cooperative Multi-Agent Reinforcement Learning via Mean-Field Subsampling

Researchers propose ALTERNATING-MARL, a new framework for cooperative multi-agent reinforcement learning that enables a global agent to learn with massive populations under communication constraints. The method achieves approximate Nash equilibrium convergence while only observing a subset of local agent states, with applications in multi-robot control and federated optimization.

$MKR
AINeutralarXiv – CS AI · 4d ago6/10
🧠

Efficient Exploration for Iterative Nash Preference Optimization

Researchers propose an improved Nash Learning from Human Feedback (NLHF) algorithm that addresses exploration challenges in preference alignment for large language models. The new method achieves better regret bounds without exponential dependence on regularization parameters and demonstrates empirical improvements when fine-tuning Llama-3-8B.

🧠 Llama
AINeutralarXiv – CS AI · May 286/10
🧠

Global Policy-Space Response Oracles for Two-Player Zero-Sum Games

Researchers introduce Global PSRO, an improved algorithm for computing Nash equilibria in two-player zero-sum games by using Population Exploitability metrics to guide strategy expansion more efficiently than existing methods. The approach reduces computational requirements while achieving better approximations of equilibrium solutions, advancing game-theoretic AI applications.

AINeutralarXiv – CS AI · May 116/10
🧠

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

Researchers demonstrate that model collapse during recursive synthetic data retraining can be prevented by curating outputs across multiple reward functions rather than a single objective. The study provides theoretical proof that diverse preference aggregation leads to stable distributions satisfying Nash bargaining solutions, offering a framework for maintaining output diversity in AI training loops.

AINeutralarXiv – CS AI · Mar 176/10
🧠

Contests with Spillovers: Incentivizing Content Creation with GenAI

Researchers propose the Content Creation with Spillovers (CCS) model to address how GenAI and LLMs create positive spillovers where creators' content can be reused by others, potentially undermining individual incentives. They introduce Provisional Allocation mechanisms to guarantee equilibrium existence and develop approximation algorithms to maximize social welfare in content creation ecosystems.