AIBullisharXiv – CS AI · 4d ago7/10
🧠Researchers introduce UserHarness, a framework that improves AI agents' Theory-of-Mind capabilities by explicitly reconstructing user mental states rather than modeling behavior indirectly. The approach achieves 95.94% accuracy across five benchmarks, demonstrating significant improvements over existing methods and offering a foundation for building more adaptive AI assistants.
AINeutralarXiv – CS AI · May 127/10
🧠Researchers introduce EnactToM, a benchmark testing whether AI agents can understand and act on others' beliefs in multi-agent embodied environments. Current frontier models achieve 0% on functional theory of mind tasks, revealing a critical gap in AI reasoning capabilities despite performing better on direct belief questions.
AIBullisharXiv – CS AI · Apr 137/10
🧠Researchers introduce a hybrid framework combining probabilistic models with large language models to improve social reasoning in AI agents, achieving a 67% win rate against human players in the game Avalon—a breakthrough in AI's ability to infer beliefs and intentions from incomplete information.
AIBullisharXiv – CS AI · Apr 77/10
🧠Research published on arXiv demonstrates that large language models playing poker can develop sophisticated Theory of Mind capabilities when equipped with persistent memory, progressing to advanced levels of opponent modeling and strategic deception. The study found memory is necessary and sufficient for this emergent behavior, while domain expertise enhances but doesn't gate ToM development.
🧠 GPT-4
AIBearisharXiv – CS AI · Feb 277/107
🧠New research reveals that GPT-4o and other large language models lack true Theory of Mind capabilities, despite appearing socially proficient. While LLMs can approximate human judgments in simple social tasks, they fail at logically equivalent challenges and show inconsistent mental state reasoning.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduced Mindgames, a multi-game arena platform for evaluating large language model agents' social and strategic reasoning across four game environments. A 2025 competition cycle tested 944 agents from 76 teams, revealing that top-performing LLMs rely heavily on explicit structural scaffolding and struggle with rule adherence, while some game environments conflate robustness to errors with genuine strategic ability.
AINeutralarXiv – CS AI · 5d ago6/10
🧠Researchers introduce OmniToM, a new benchmark for evaluating Theory of Mind capabilities in large language models by requiring explicit modeling of belief structures rather than just final answers. The benchmark reveals that current LLMs struggle with tracking actor-specific beliefs and understanding knowledge access, exposing fundamental limitations in social reasoning despite high performance on traditional end-point question answering tasks.
AIBullisharXiv – CS AI · Apr 146/10
🧠Researchers introduce CoSToM, a framework that uses causal tracing and activation steering to improve Theory of Mind alignment in large language models. The work addresses a critical gap between LLMs' internal knowledge and external behavior, demonstrating that targeted interventions in specific neural layers can enhance social reasoning capabilities and dialogue quality.
AINeutralarXiv – CS AI · Apr 146/10
🧠Researchers introduce ToM-SB, a novel challenge where AI defenders must use theory-of-mind reasoning to deceive attackers trying to extract sensitive information. Through reinforcement learning, trained models outperform frontier LLMs like GPT-4 and Gemini-Pro, revealing an emergent bidirectional relationship between belief modeling and deception capabilities.
🧠 GPT-5
AINeutralarXiv – CS AI · Mar 176/10
🧠Research reveals that Large Language Models struggle with dynamic Theory of Mind tasks, particularly tracking how others' beliefs change over time. While LLMs can infer current beliefs effectively, they fail to maintain and retrieve prior belief states after updates occur, showing patterns consistent with human cognitive biases.
AIBullisharXiv – CS AI · Mar 37/107
🧠Meta researchers introduced MetaMind, a cognitive world model for multi-agent systems that enables agents to understand and predict other agents' behaviors without centralized supervision or communication. The system uses a meta-theory of mind framework allowing agents to reason about goals and beliefs of others through self-reflective learning and analogical reasoning.
AIBullisharXiv – CS AI · Mar 36/109
🧠Researchers introduce the Observer-Situation Lattice (OSL), a unified mathematical framework for autonomous agents to reason about multiple perspectives in complex environments. The system addresses limitations in current AI approaches by providing a single coherent structure for belief management and Theory of Mind reasoning.
AINeutralarXiv – CS AI · Mar 37/109
🧠Researchers introduce a novel multi-agent AI architecture that integrates Theory of Mind, internal beliefs, and symbolic solvers to improve collaborative decision-making in LLM-based systems. The study evaluates this architecture across different language models in resource allocation scenarios, revealing complex interactions between LLM capabilities and cognitive mechanisms.
AINeutralarXiv – CS AI · Mar 36/104
🧠A research study of nine advanced Large Language Models reveals that Large Reasoning Models (LRMs) do not consistently outperform non-reasoning models on Theory of Mind tasks, which assess social cognition abilities. The study found that longer reasoning often hurts performance and models rely on shortcuts rather than genuine deduction, suggesting formal reasoning advances don't transfer to social reasoning tasks.
AIBearisharXiv – CS AI · Mar 36/104
🧠Researchers introduced SimpleToM, a benchmark revealing that state-of-the-art language models can infer mental states but struggle to apply that knowledge for behavior prediction and judgment. The study exposes a critical gap between explicit Theory of Mind inference and implicit application in real-world scenarios.