y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#ai-architecture News & Analysis

72 articles tagged with #ai-architecture. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

72 articles
AIBullisharXiv – CS AI · Feb 277/106
🧠

Decision MetaMamba: Enhancing Selective SSM in Offline RL with Heterogeneous Sequence Mixing

Researchers propose Decision MetaMamba (DMM), a new AI model architecture that improves offline reinforcement learning by addressing information loss issues in Mamba-based models. The solution uses a dense layer-based sequence mixer and modified positional structure to achieve state-of-the-art performance with fewer parameters.

AIBullisharXiv – CS AI · Feb 277/109
🧠

ArchAgent: Agentic AI-driven Computer Architecture Discovery

ArchAgent, an AI-driven system built on AlphaEvolve, has achieved breakthrough results in automated computer architecture discovery by designing state-of-the-art cache replacement policies. The system achieved 5.3% performance improvements in just 2 days and 0.9% improvements in 18 days, working 3-5x faster than human-developed solutions.

AIBullishOpenAI News · Nov 77/107
🧠

Notion’s rebuild for agentic AI: How GPT‑5 helped unlock autonomous workflows

Notion has rebuilt its AI architecture using GPT-5 to create autonomous agents capable of reasoning, acting, and adapting across workflows. This architectural shift represents a major upgrade in Notion 3.0, enabling smarter and more flexible productivity tools through agentic AI capabilities.

AIBullishOpenAI News · Aug 77/107
🧠

GPT-5 System Card

OpenAI has released a GPT-5 system card detailing a unified model routing system that uses multiple specialized versions including gpt-5-main, gpt-5-thinking, and lightweight variants like gpt-5-thinking-nano. The system is designed to optimize performance across different tasks and developer use cases by routing queries to the most appropriate model variant.

AIBullishHugging Face Blog · Aug 127/104
🧠

Welcome Falcon Mamba: The first strong attention-free 7B model

Falcon Mamba represents a breakthrough as the first strong 7B parameter language model that operates without attention mechanisms. This development challenges the dominance of transformer architectures and could lead to more efficient AI models with reduced computational requirements.

AIBullishHugging Face Blog · Dec 117/105
🧠

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Hugging Face introduces Mixtral, a state-of-the-art Mixture of Experts (MoE) model that represents a significant advancement in AI architecture. The model demonstrates improved efficiency and performance compared to traditional dense models by selectively activating subsets of parameters.

AINeutralarXiv – CS AI · 2d ago6/10
🧠

S3Mem: Structured Spatiotemporal Scene-Event Memory for Long-Horizon Interactive Question Answering

Researchers introduce S3MEM, a structured memory framework that improves how AI agents retrieve and answer questions about long trajectory histories. The system outperforms standard retrieval-augmented generation by organizing trajectories into scene-event units and using anchor-sensitive retrieval, achieving better accuracy with fewer tokens across multiple interactive environments.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

SAME: Stabilized Mixture-of-Experts for Multimodal Continual Instruction Tuning

Researchers introduce SAME, a new approach for training Multimodal Large Language Models that can continuously learn new tasks without forgetting previous capabilities. The method addresses fundamental problems in continual learning by stabilizing how AI systems route tasks to specialized expert networks and preventing knowledge degradation over time.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

Apple Intelligence Foundation Language Models

Apple has published research on foundation language models powering Apple Intelligence, including a 3 billion parameter on-device model and a larger server-based model for Private Cloud Compute. The announcement demonstrates Apple's commitment to developing efficient, responsible AI systems that balance performance with privacy.

AIBullisharXiv – CS AI · 3d ago6/10
🧠

TCP-MCP: Landscape-Guided Co-Evolution of Prompts and Communication Topologies for Multi-Agent Systems

TCP-MCP introduces a co-evolution framework that simultaneously optimizes AI agent prompts and communication network topologies, achieving state-of-the-art accuracy on multiple benchmarks while reducing token consumption by up to 5.69x compared to existing multi-agent systems. The approach treats prompt design and communication structure as interdependent variables rather than independent parameters, offering a practical methodology for cost-efficient multi-agent AI system design.

AIBullisharXiv – CS AI · 3d ago6/10
🧠

VidPrism: Heterogeneous Mixture of Experts for Image-to-Video Transfer

VidPrism introduces a heterogeneous Mixture-of-Experts framework that enhances Vision-Language Models for video understanding by deploying specialized experts rather than identical generalists. The approach uses dynamic multi-rate sampling and bidirectional fusion to achieve state-of-the-art performance on video recognition benchmarks.

AIBullisharXiv – CS AI · 4d ago6/10
🧠

CyberEvolver: Structured Self-Evolution for Cybersecurity Agents On the Fly

Researchers introduce CyberEvolver, an AI agent framework that autonomously improves its own architecture through iterative learning from failed cybersecurity tasks. The system demonstrates 13.6% average success rate improvements across CTF challenges and penetration testing, outperforming fixed human-designed alternatives and competing self-improvement methods.

AIBullishTechCrunch – AI · May 126/10
🧠

Thinking Machines wants to build an AI that actually listens while it talks

Thinking Machines is developing an AI model that processes user input and generates responses simultaneously, mimicking real-time conversation rather than the current turn-based interaction model used by existing AI systems. This architectural shift could fundamentally change how users interact with AI assistants.

AINeutralarXiv – CS AI · May 126/10
🧠

LLM-Guided Monte Carlo Tree Search over Knowledge Graphs: Composing Mechanistic Explanations for Drug-Disease Pairs

Researchers introduce TESSERA, a neuro-symbolic framework that combines Large Language Models with Monte Carlo Tree Search to extract multi-step explanations from knowledge graphs, specifically for drug-disease mechanism discovery. The system uses LLMs for local judgments rather than autonomous generation, enforcing structural constraints through knowledge graphs while employing MCTS for principled credit assignment across extended reasoning chains.

AINeutralarXiv – CS AI · May 126/10
🧠

Unpredictability dissociates from structured control in language agents

Researchers demonstrate that unpredictability in language agents does not equate to effective control, finding that structured decision-making mechanisms significantly outperform stochastic sampling across 74,352 test cases. The study challenges assumptions about randomness and control in AI systems, with implications for agent reliability and interpretability.

AINeutralarXiv – CS AI · May 116/10
🧠

State Representation and Termination for Recursive Reasoning Systems

Researchers present a formal framework for recursive reasoning systems that addresses two critical design challenges: how to represent evolving reasoning states and when to terminate iteration. The paper introduces an epistemic state graph representation and proposes the 'order-gap' metric as a stopping criterion, with theoretical guarantees for when this criterion provides meaningful guidance.

AINeutralarXiv – CS AI · May 76/10
🧠

Continual Knowledge Updating in LLM Systems: Learning Through Multi-Timescale Memory Dynamics

Researchers introduce Memini, a system that applies biological multi-timescale memory dynamics to external memory in large language models. By organizing knowledge as a directed graph where edges follow coupled fast and slow variables inspired by synaptic consolidation, the system enables LLMs to continuously update their knowledge without explicit management, allowing new information to be immediately useful while less relevant associations gradually fade.

AINeutralarXiv – CS AI · May 46/10
🧠

Position: agentic AI orchestration should be Bayes-consistent

A research position paper argues that agentic AI systems should incorporate Bayesian decision theory at their orchestration layer to improve decision-making under uncertainty. Rather than making LLMs themselves Bayesian, the framework proposes applying Bayesian principles to the control systems that coordinate multiple LLMs and tools, enabling better belief maintenance and resource allocation.

AINeutralarXiv – CS AI · May 46/10
🧠

MemRouter: Memory-as-Embedding Routing for Long-Term Conversational Agents

MemRouter is a new memory management system for conversational AI agents that uses lightweight embedding-based routing instead of expensive LLM generation to decide which conversation turns to store. The approach achieves 52.0 F1 score versus 45.6 for LLM-based alternatives while reducing latency from 970ms to 58ms, suggesting memory admission can be effectively learned through supervised classification rather than generative models.

AINeutralarXiv – CS AI · May 16/10
🧠

Exploring Interaction Paradigms for LLM Agents in Scientific Visualization

Researchers evaluated eight LLM agents across three interaction paradigms—domain-specific agents, computer-use agents, and general-purpose coding agents—on scientific visualization tasks. The study reveals fundamental tradeoffs: general-purpose agents excel at task completion but consume more computational resources, while domain-specific agents offer efficiency and stability at the cost of flexibility, with persistent memory improving performance across modalities.

AINeutralarXiv – CS AI · Apr 156/10
🧠

Memory as Metabolism: A Design for Companion Knowledge Systems

A new research paper proposes a governance framework for personal AI memory systems designed to function as 'companion' knowledge wikis that mirror user knowledge while compensating for epistemic failures like entrenchment and evidence suppression. The work addresses an emerging 2026 landscape of memory architectures for large language models through five operational mechanisms (TRIAGE, DECAY, CONTEXTUALIZE, CONSOLIDATE, AUDIT) aimed at preventing user-coupled drift in single-user knowledge systems.

AIBullisharXiv – CS AI · Apr 156/10
🧠

Unveiling the Surprising Efficacy of Navigation Understanding in End-to-End Autonomous Driving

Researchers propose Sequential Navigation Guidance (SNG), a framework addressing a critical flaw in end-to-end autonomous driving systems that over-rely on local scene understanding while underutilizing global navigation information. The SNG framework combines navigation paths and turn-by-turn instructions with a new VQA dataset and efficient model to improve autonomous vehicle planning and navigation-following in complex scenarios.

AINeutralarXiv – CS AI · Apr 146/10
🧠

The Missing Knowledge Layer in Cognitive Architectures for AI Agents

Researchers identify a critical architectural gap in leading AI agent frameworks (CoALA and JEPA), which lack an explicit Knowledge layer with distinct persistence semantics. The paper proposes a four-layer decomposition model with fundamentally different update mechanics for knowledge, memory, wisdom, and intelligence, with working implementations demonstrating feasibility.

AINeutralarXiv – CS AI · Apr 136/10
🧠

Artifacts as Memory Beyond the Agent Boundary

Researchers formalize how agents can use environmental artifacts as external memory to reduce computational requirements in reinforcement learning tasks. The study demonstrates that spatial observations can implicitly serve as memory substitutes, allowing agents to learn effective policies with less internal memory capacity than previously thought necessary.

← PrevPage 2 of 3Next →