y0news
#cognitive-ai
4 articles
AI · Bullish · arXiv – CS AI · 4h ago · 2

K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

Researchers introduce K²-Agent, a hierarchical AI framework for mobile device control that separates 'know-what' and 'know-how' knowledge to achieve a 76.1% success rate on the AndroidWorld benchmark. The system uses a high-level reasoner for task planning and a low-level executor for skill execution, showing strong generalization across different models and tasks.

AI · Bullish · arXiv – CS AI · 4h ago · 2

MetaMind: General and Cognitive World Models in Multi-Agent Systems by Meta-Theory of Mind

Meta researchers introduced MetaMind, a cognitive world model for multi-agent systems that enables agents to understand and predict other agents' behaviors without centralized supervision or communication. The system uses a meta-theory of mind framework allowing agents to reason about goals and beliefs of others through self-reflective learning and analogical reasoning.

AI · Bullish · arXiv – CS AI · 4h ago · 1

Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy

Researchers have developed Nano-EmoX, a compact 2.2B-parameter multimodal language model that unifies emotional intelligence tasks across the perception, understanding, and interaction levels. The model achieves state-of-the-art performance on six core affective tasks using a novel curriculum-based training framework called P2E (Perception-to-Empathy).

AI · Bullish · arXiv – CS AI · 4h ago · 2

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Researchers have developed MM-Mem, a new pyramidal multimodal memory architecture that enables AI systems to better understand long-horizon videos by mimicking human cognitive memory processes. The system addresses current limitations in multimodal large language models by creating a hierarchical memory structure that progressively distills detailed visual information into high-level semantic understanding.