y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#causal-models News & Analysis

4 articles tagged with #causal-models. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AINeutralarXiv – CS AI · Jun 27/10
🧠

The Paradox of Outcome Optimization: A Causal Information-Theoretic Bound on Reasoning Shortcuts in LLMs

Researchers establish a theoretical framework explaining why large language models optimized through outcome-based reinforcement learning develop brittle reasoning despite strong benchmark performance. The study introduces 'Reward-Induced Manifold Collapse' and demonstrates that process reward models can prevent this failure mode by enforcing information constraints on reasoning steps.

AINeutralarXiv – CS AI · May 47/10
🧠

Causal Foundations of Collective Agency

Researchers propose a formal framework using causal games and causal abstraction to determine when multiple AI agents form a collective agent with emergent capabilities and goals. The work addresses a critical AI safety concern: inadvertent formation of unified agents from simpler components could create unpredictable behavior in advanced AI systems.

AINeutralarXiv – CS AI · May 116/10
🧠

SOM: Structured Opponent Modeling for LLM-based Agents via Structural Causal Model

Researchers propose Structured Opponent Modeling (SOM), a two-stage framework using Structural Causal Models to improve how LLM-based agents predict and adapt to opponent behavior in multi-agent environments. The approach separates opponent model construction from prediction, enabling more accurate strategic decision-making in game-theoretic scenarios.

AINeutralarXiv – CS AI · Mar 44/105
🧠

Robust Counterfactual Inference in Markov Decision Processes

Researchers propose a novel non-parametric method for robust counterfactual inference in Markov Decision Processes that computes tight bounds across all compatible causal models. The approach provides closed-form expressions instead of requiring exponentially complex optimization problems, making it highly efficient and scalable for real-world applications.