#cognitive-ai News & Analysis

13 articles tagged with #cognitive-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

13 articles

AIBullisharXiv – CS AI · May 287/10

🧠

UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind

Researchers introduce UserHarness, a framework that improves AI agents' Theory-of-Mind capabilities by explicitly reconstructing user mental states rather than modeling behavior indirectly. The approach achieves 95.94% accuracy across five benchmarks, demonstrating significant improvements over existing methods and offering a foundation for building more adaptive AI assistants.

AIBullisharXiv – CS AI · May 287/10

🧠

MemGuard: Preventing Memory Contamination in Long-Term Memory-Augmented Large Language Models

Researchers introduce MemGuard, a framework that addresses memory contamination in long-term memory-augmented large language models by organizing memories into functional types and selectively retrieving only relevant evidence. The approach improves hallucination reduction by up to 28.27% while reducing memory token usage by 5.8x, advancing the reliability of AI systems that maintain persistent memory across extended interactions.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Evidence of an Emergent "Self" in Continual Robot Learning

Researchers propose a method to identify 'self-awareness' in AI systems by analyzing invariant cognitive structures that remain stable during continual learning. Their study found that robots subjected to continual learning developed significantly more stable subnetworks compared to control groups, suggesting this could be evidence of an emergent 'self' concept.

AIBearisharXiv – CS AI · Mar 46/103

🧠

SpatialText: A Pure-Text Cognitive Benchmark for Spatial Understanding in Large Language Models

Researchers introduce SpatialText, a diagnostic framework to test whether large language models can truly reason about spatial relationships or merely rely on linguistic patterns. The study reveals that current AI models fail at egocentric perspective reasoning despite proficiency in basic spatial fact retrieval.

AIBullisharXiv – CS AI · Mar 47/102

🧠

NeuroSkill(tm): Proactive Real-Time Agentic System Capable of Modeling Human State of Mind

NeuroSkill is a new open-source AI system that models human mental states in real-time using brain-computer interfaces and biophysical signals. The system runs offline on edge devices and can engage with humans on cognitive and emotional levels through its NeuroLoop harness technology.

AIBullisharXiv – CS AI · Feb 277/105

🧠

Mirroring the Mind: Distilling Human-Like Metacognitive Strategies into Large Language Models

Researchers propose Metacognitive Behavioral Tuning (MBT), a new framework that addresses structural fragility in Large Reasoning Models by injecting human-like self-regulatory control into AI thought processes. The approach reduces reasoning collapse and improves accuracy while consuming fewer computational tokens across multi-hop question-answering benchmarks.

AIBearisharXiv – CS AI · Jun 236/10

🧠

Cognitive Digital Twins: Ethical Risks and Governance for AI Systems That Model the Mind

Researchers propose a governance framework for cognitive digital twins (CDTs)—AI systems that create dynamic computational models of individual human cognition to predict behavior and act as decision-making proxies. The paper identifies unique risks including misrepresentation and proxy-power asymmetries, arguing that existing regulatory frameworks for AI systems inadequately address CDT-specific dangers at the level of cognitive representation itself.

AINeutralarXiv – CS AI · Jun 236/10

🧠

A criterion for Artificial General Intelligence: hypothetic-deductive reasoning, tested on ChatGPT

Researchers propose hypothetic-deductive reasoning as a key criterion for Artificial General Intelligence, arguing that advanced AI systems must demonstrate causal reasoning and hypothesis testing across complex problem domains. Testing this framework on ChatGPT reveals the model has limited capacity for these reasoning types when problems increase in complexity, suggesting current large language models fall short of AGI-level reasoning capabilities.

🧠 GPT-4🧠 ChatGPT

AIBullisharXiv – CS AI · Mar 36/109

🧠

K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

Researchers introduce K²-Agent, a hierarchical AI framework for mobile device control that separates 'know-what' and 'know-how' knowledge to achieve 76.1% success rate on AndroidWorld benchmark. The system uses a high-level reasoner for task planning and low-level executor for skill execution, showing strong generalization across different models and tasks.

AIBullisharXiv – CS AI · Mar 37/107

🧠

MetaMind: General and Cognitive World Models in Multi-Agent Systems by Meta-Theory of Mind

Meta researchers introduced MetaMind, a cognitive world model for multi-agent systems that enables agents to understand and predict other agents' behaviors without centralized supervision or communication. The system uses a meta-theory of mind framework allowing agents to reason about goals and beliefs of others through self-reflective learning and analogical reasoning.

AIBullisharXiv – CS AI · Mar 37/108

🧠

Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

Researchers have developed Nano-EmoX, a compact 2.2B parameter multimodal language model that unifies emotional intelligence tasks across perception, understanding, and interaction levels. The model achieves state-of-the-art performance on six core affective tasks using a novel curriculum-based training framework called P2E (Perception-to-Empathy).

AIBullisharXiv – CS AI · Mar 37/109

🧠

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Researchers have developed MM-Mem, a new pyramidal multimodal memory architecture that enables AI systems to better understand long-horizon videos by mimicking human cognitive memory processes. The system addresses current limitations in multimodal large language models by creating a hierarchical memory structure that progressively distills detailed visual information into high-level semantic understanding.

AINeutralarXiv – CS AI · Mar 36/104

🧠

SpinBench: Perspective and Rotation as a Lens on Spatial Reasoning in VLMs

Researchers introduced SpinBench, a new benchmark for evaluating spatial reasoning abilities in vision language models (VLMs), focusing on perspective taking and viewpoint transformations. Testing 43 state-of-the-art VLMs revealed systematic weaknesses including strong egocentric bias and poor rotational understanding, with human performance significantly outpacing AI models at 91.2% accuracy.