AIBullisharXiv – CS AI · May 277/10
🧠Researchers propose AgentSociety, a decentralized multi-agent framework that uses liquid democracy and economic incentives to enable autonomous agents to collaborate effectively. The mechanism proves that agents are incentivized to delegate tasks to more competent neighbors and selectively share information for influence, with payoffs reflecting marginal contributions at Nash equilibrium.
AINeutralarXiv – CS AI · May 17/10
🧠Researchers discovered that large language models compute Nash equilibrium strategies in strategic games but actively suppress them through a prosocial override mechanism in final layers, favoring cooperation instead. The suppression can be reversed through mechanistic intervention, revealing that LLM deviations from rational play stem not from inability but from built-in behavioral constraints that vary with model scale and architecture.
🧠 Llama
AIBullisharXiv – CS AI · Mar 177/10
🧠ADV-0 is a new closed-loop adversarial training framework for autonomous driving that uses min-max optimization to improve robustness against rare but safety-critical scenarios. The system treats the interaction between driving policy and adversarial agents as a zero-sum game, converging to Nash Equilibrium while maximizing real-world performance bounds.
AINeutralarXiv – CS AI · Mar 57/10
🧠Researchers propose ALTERNATING-MARL, a new framework for cooperative multi-agent reinforcement learning that enables a global agent to learn with massive populations under communication constraints. The method achieves approximate Nash equilibrium convergence while only observing a subset of local agent states, with applications in multi-robot control and federated optimization.
$MKR
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers propose an improved Nash Learning from Human Feedback (NLHF) algorithm that addresses exploration challenges in preference alignment for large language models. The new method achieves better regret bounds without exponential dependence on regularization parameters and demonstrates empirical improvements when fine-tuning Llama-3-8B.
🧠 Llama
AINeutralarXiv – CS AI · May 286/10
🧠Researchers introduce Global PSRO, an improved algorithm for computing Nash equilibria in two-player zero-sum games by using Population Exploitability metrics to guide strategy expansion more efficiently than existing methods. The approach reduces computational requirements while achieving better approximations of equilibrium solutions, advancing game-theoretic AI applications.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers demonstrate that model collapse during recursive synthetic data retraining can be prevented by curating outputs across multiple reward functions rather than a single objective. The study provides theoretical proof that diverse preference aggregation leads to stable distributions satisfying Nash bargaining solutions, offering a framework for maintaining output diversity in AI training loops.
AINeutralarXiv – CS AI · Mar 176/10
🧠Researchers propose the Content Creation with Spillovers (CCS) model to address how GenAI and LLMs create positive spillovers where creators' content can be reused by others, potentially undermining individual incentives. They introduce Provisional Allocation mechanisms to guarantee equilibrium existence and develop approximation algorithms to maximize social welfare in content creation ecosystems.