AIBullisharXiv – CS AI · Feb 277/106
🧠Researchers propose Decision MetaMamba (DMM), a new AI model architecture that improves offline reinforcement learning by addressing information loss issues in Mamba-based models. The solution uses a dense layer-based sequence mixer and modified positional structure to achieve state-of-the-art performance with fewer parameters.
AIBullisharXiv – CS AI · Feb 277/109
🧠ArchAgent, an AI-driven system built on AlphaEvolve, has achieved breakthrough results in automated computer architecture discovery by designing state-of-the-art cache replacement policies. The system achieved 5.3% performance improvements in just 2 days and 0.9% improvements in 18 days, working 3-5x faster than human-developed solutions.
AIBullishOpenAI News · Nov 77/107
🧠Notion has rebuilt its AI architecture using GPT-5 to create autonomous agents capable of reasoning, acting, and adapting across workflows. This architectural shift represents a major upgrade in Notion 3.0, enabling smarter and more flexible productivity tools through agentic AI capabilities.
AIBullishOpenAI News · Aug 77/107
🧠OpenAI has released a GPT-5 system card detailing a unified model routing system that uses multiple specialized versions including gpt-5-main, gpt-5-thinking, and lightweight variants like gpt-5-thinking-nano. The system is designed to optimize performance across different tasks and developer use cases by routing queries to the most appropriate model variant.
AIBullishSynced Review · May 157/109
🧠DeepSeek has released a 14-page technical paper on their V3 model, focusing on scaling challenges and hardware-aware co-design for low-cost large model training. The paper, co-authored by DeepSeek CEO Wenfeng Liang, reveals insights into cost-effective AI architecture development.
AIBullishHugging Face Blog · Aug 127/104
🧠Falcon Mamba represents a breakthrough as the first strong 7B parameter language model that operates without attention mechanisms. This development challenges the dominance of transformer architectures and could lead to more efficient AI models with reduced computational requirements.
AIBullishHugging Face Blog · Dec 117/105
🧠Hugging Face introduces Mixtral, a state-of-the-art Mixture of Experts (MoE) model that represents a significant advancement in AI architecture. The model demonstrates improved efficiency and performance compared to traditional dense models by selectively activating subsets of parameters.
AINeutralarXiv – CS AI · 2d ago6/10
🧠Researchers introduce S3MEM, a structured memory framework that improves how AI agents retrieve and answer questions about long trajectory histories. The system outperforms standard retrieval-augmented generation by organizing trajectories into scene-event units and using anchor-sensitive retrieval, achieving better accuracy with fewer tokens across multiple interactive environments.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Researchers introduce SAME, a new approach for training Multimodal Large Language Models that can continuously learn new tasks without forgetting previous capabilities. The method addresses fundamental problems in continual learning by stabilizing how AI systems route tasks to specialized expert networks and preventing knowledge degradation over time.
AINeutralarXiv – CS AI · 3d ago6/10
🧠Apple has published research on foundation language models powering Apple Intelligence, including a 3 billion parameter on-device model and a larger server-based model for Private Cloud Compute. The announcement demonstrates Apple's commitment to developing efficient, responsible AI systems that balance performance with privacy.
AIBullisharXiv – CS AI · 3d ago6/10
🧠TCP-MCP introduces a co-evolution framework that simultaneously optimizes AI agent prompts and communication network topologies, achieving state-of-the-art accuracy on multiple benchmarks while reducing token consumption by up to 5.69x compared to existing multi-agent systems. The approach treats prompt design and communication structure as interdependent variables rather than independent parameters, offering a practical methodology for cost-efficient multi-agent AI system design.
AIBullisharXiv – CS AI · 3d ago6/10
🧠VidPrism introduces a heterogeneous Mixture-of-Experts framework that enhances Vision-Language Models for video understanding by deploying specialized experts rather than identical generalists. The approach uses dynamic multi-rate sampling and bidirectional fusion to achieve state-of-the-art performance on video recognition benchmarks.
AIBullisharXiv – CS AI · 4d ago6/10
🧠Researchers introduce CyberEvolver, an AI agent framework that autonomously improves its own architecture through iterative learning from failed cybersecurity tasks. The system demonstrates 13.6% average success rate improvements across CTF challenges and penetration testing, outperforming fixed human-designed alternatives and competing self-improvement methods.
AIBullishTechCrunch – AI · May 126/10
🧠Thinking Machines is developing an AI model that processes user input and generates responses simultaneously, mimicking real-time conversation rather than the current turn-based interaction model used by existing AI systems. This architectural shift could fundamentally change how users interact with AI assistants.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce TESSERA, a neuro-symbolic framework that combines Large Language Models with Monte Carlo Tree Search to extract multi-step explanations from knowledge graphs, specifically for drug-disease mechanism discovery. The system uses LLMs for local judgments rather than autonomous generation, enforcing structural constraints through knowledge graphs while employing MCTS for principled credit assignment across extended reasoning chains.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers demonstrate that unpredictability in language agents does not equate to effective control, finding that structured decision-making mechanisms significantly outperform stochastic sampling across 74,352 test cases. The study challenges assumptions about randomness and control in AI systems, with implications for agent reliability and interpretability.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers present a formal framework for recursive reasoning systems that addresses two critical design challenges: how to represent evolving reasoning states and when to terminate iteration. The paper introduces an epistemic state graph representation and proposes the 'order-gap' metric as a stopping criterion, with theoretical guarantees for when this criterion provides meaningful guidance.
AINeutralarXiv – CS AI · May 76/10
🧠Researchers introduce Memini, a system that applies biological multi-timescale memory dynamics to external memory in large language models. By organizing knowledge as a directed graph where edges follow coupled fast and slow variables inspired by synaptic consolidation, the system enables LLMs to continuously update their knowledge without explicit management, allowing new information to be immediately useful while less relevant associations gradually fade.
AINeutralarXiv – CS AI · May 46/10
🧠A research position paper argues that agentic AI systems should incorporate Bayesian decision theory at their orchestration layer to improve decision-making under uncertainty. Rather than making LLMs themselves Bayesian, the framework proposes applying Bayesian principles to the control systems that coordinate multiple LLMs and tools, enabling better belief maintenance and resource allocation.
AINeutralarXiv – CS AI · May 46/10
🧠MemRouter is a new memory management system for conversational AI agents that uses lightweight embedding-based routing instead of expensive LLM generation to decide which conversation turns to store. The approach achieves 52.0 F1 score versus 45.6 for LLM-based alternatives while reducing latency from 970ms to 58ms, suggesting memory admission can be effectively learned through supervised classification rather than generative models.
AINeutralarXiv – CS AI · May 16/10
🧠Researchers evaluated eight LLM agents across three interaction paradigms—domain-specific agents, computer-use agents, and general-purpose coding agents—on scientific visualization tasks. The study reveals fundamental tradeoffs: general-purpose agents excel at task completion but consume more computational resources, while domain-specific agents offer efficiency and stability at the cost of flexibility, with persistent memory improving performance across modalities.
AINeutralarXiv – CS AI · Apr 156/10
🧠A new research paper proposes a governance framework for personal AI memory systems designed to function as 'companion' knowledge wikis that mirror user knowledge while compensating for epistemic failures like entrenchment and evidence suppression. The work addresses an emerging 2026 landscape of memory architectures for large language models through five operational mechanisms (TRIAGE, DECAY, CONTEXTUALIZE, CONSOLIDATE, AUDIT) aimed at preventing user-coupled drift in single-user knowledge systems.
AIBullisharXiv – CS AI · Apr 156/10
🧠Researchers propose Sequential Navigation Guidance (SNG), a framework addressing a critical flaw in end-to-end autonomous driving systems that over-rely on local scene understanding while underutilizing global navigation information. The SNG framework combines navigation paths and turn-by-turn instructions with a new VQA dataset and efficient model to improve autonomous vehicle planning and navigation-following in complex scenarios.
AINeutralarXiv – CS AI · Apr 146/10
🧠Researchers identify a critical architectural gap in leading AI agent frameworks (CoALA and JEPA), which lack an explicit Knowledge layer with distinct persistence semantics. The paper proposes a four-layer decomposition model with fundamentally different update mechanics for knowledge, memory, wisdom, and intelligence, with working implementations demonstrating feasibility.
AINeutralarXiv – CS AI · Apr 136/10
🧠Researchers formalize how agents can use environmental artifacts as external memory to reduce computational requirements in reinforcement learning tasks. The study demonstrates that spatial observations can implicitly serve as memory substitutes, allowing agents to learn effective policies with less internal memory capacity than previously thought necessary.