#ai-architecture News & Analysis

104 articles tagged with #ai-architecture. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

104 articles

AIBullisharXiv – CS AI · May 117/10

🧠

From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms

Researchers propose a unified evolutionary framework for LLM agent memory systems, categorizing development into three stages: Storage, Reflection, and Experience. The framework addresses fragmented research by synthesizing engineering and cognitive science perspectives, offering design principles for building more capable autonomous AI agents.

AIBullisharXiv – CS AI · May 77/10

🧠

A large language model-type architecture for high-dimensional molecular potential energy surfaces

Researchers have developed a neural network architecture inspired by large language models to predict high-dimensional molecular potential energy surfaces, successfully computing accurate predictions for a 186-dimensional system representing a protonated 21-water cluster—a significant advance in computational chemistry that could accelerate reaction rate predictions.

AINeutralarXiv – CS AI · May 17/10

🧠

When Agents Evolve, Institutions Follow

Researchers from arXiv demonstrate that multi-agent AI systems built on large language models achieve dramatically different performance levels based on their organizational structure, with governance topology showing a 57+ percentage point performance gap. The study translates seven historical political institutions into executable multi-agent architectures, revealing that optimal organizational design shifts systematically with model capability and task requirements.

AIBullisharXiv – CS AI · Apr 147/10

🧠

Escaping the Context Bottleneck: Active Context Curation for LLM Agents via Reinforcement Learning

Researchers introduce ContextCurator, a reinforcement learning-based framework that decouples context management from task execution in LLM agents, addressing the context bottleneck problem. The approach pairs a lightweight specialized policy model with a frozen foundation model, achieving significant improvements in success rates and token efficiency across benchmark tasks.

🧠 GPT-4🧠 Gemini

AIBearisharXiv – CS AI · Apr 137/10

🧠

Robust Reasoning Benchmark

Researchers have developed a 14-technique perturbation pipeline to test the robustness of large language models' reasoning capabilities on mathematical problems. Testing reveals that while frontier models maintain resilience, open-weight models experience catastrophic accuracy collapses up to 55%, and all tested models degrade when solving sequential problems in a single context window, suggesting fundamental architectural limitations in current reasoning systems.

🧠 Claude🧠 Opus

AIBullisharXiv – CS AI · Apr 107/10

🧠

Computer Environments Elicit General Agentic Intelligence in LLMs

Researchers introduce LLM-in-Sandbox, a minimal computer environment that significantly enhances large language models' capabilities across diverse tasks without additional training. The approach enables weaker models to internalize agent-like behaviors through specialized training, demonstrating that environmental interaction—not just model parameters—drives general intelligence in LLMs.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception

Researchers have developed Springdrift, a persistent runtime system for long-lived AI agents that maintains memory across sessions and provides auditable decision-making capabilities. The system was successfully deployed for 23 days, during which the AI agent autonomously diagnosed infrastructure problems and maintained context across multiple communication channels without explicit instructions.

AINeutralarXiv – CS AI · Apr 77/10

🧠

The Topology of Multimodal Fusion: Why Current Architectures Fail at Creative Cognition

Researchers identify a fundamental topological limitation in current multimodal AI architectures like CLIP and GPT-4V, proposing that their 'contact topology' structure prevents creative cognition. The paper introduces a philosophical framework combining Chinese epistemology with neuroscience to propose new architectures using Neural ODEs and topological regularization.

🧠 Gemini

AINeutralarXiv – CS AI · Mar 267/10

🧠

A Theory of LLM Information Susceptibility

Researchers propose a theory of LLM information susceptibility that identifies fundamental limits to how large language models can improve optimization in AI agent systems. The study shows that nested, co-scaling architectures may be necessary for open-ended AI self-improvement, providing predictive constraints for AI system design.

AIBullisharXiv – CS AI · Mar 267/10

🧠

Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

Researchers introduce Bottlenecked Transformers, a new architecture that improves AI reasoning by up to 6.6 percentage points through periodic memory consolidation inspired by brain processes. The system uses a Cache Processor to rewrite key-value cache entries at reasoning step boundaries, achieving better performance on math reasoning benchmarks compared to standard Transformers.

AIBullisharXiv – CS AI · Mar 177/10

🧠

EARCP: Self-Regulating Coherence-Aware Ensemble Architecture for Sequential Decision Making -- Ensemble Auto-Regule par Coherence et Performance

Researchers introduce EARCP, a new ensemble architecture for AI that dynamically weights different expert models based on performance and coherence. The system provides theoretical guarantees with sublinear regret bounds and has been tested on time series forecasting, activity recognition, and financial prediction tasks.

AIBullisharXiv – CS AI · Mar 177/10

🧠

Learning to Forget: Sleep-Inspired Memory Consolidation for Resolving Proactive Interference in Large Language Models

Researchers developed SleepGate, a biologically-inspired framework that significantly improves large language model memory by mimicking sleep-based consolidation to resolve proactive interference. The system achieved 99.5% retrieval accuracy compared to less than 18% for existing methods in experimental testing.

AIBullisharXiv – CS AI · Mar 167/10

🧠

Revisiting Model Stitching In the Foundation Model Era

Researchers introduce improved methods for stitching Vision Foundation Models (VFMs) like CLIP and DINOv2, enabling integration of different models' strengths. The study proposes VFM Stitch Tree (VST) technique that allows controllable accuracy-latency trade-offs for multimodal applications.

AINeutralarXiv – CS AI · Mar 127/10

🧠

Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead

Researchers propose treating multi-agent AI memory as a computer architecture problem, introducing a three-layer memory hierarchy and identifying critical protocol gaps. The paper highlights multi-agent memory consistency as the most pressing challenge for building scalable collaborative AI systems.

AINeutralarXiv – CS AI · Mar 57/10

🧠

Synthetic emotions and consciousness: exploring architectural boundaries

Researchers propose an architectural framework for implementing emotion-like AI systems while deliberately avoiding features associated with consciousness. The study introduces risk-reduction constraints and engineering principles to create sophisticated emotional AI without triggering consciousness-related safety concerns.

AIBullisharXiv – CS AI · Mar 46/104

🧠

REGAL: A Registry-Driven Architecture for Deterministic Grounding of Agentic AI in Enterprise Telemetry

Researchers present REGAL, a registry-driven architecture that enables AI agents to work deterministically with enterprise telemetry data from systems like CI/CD pipelines and observability platforms. The system addresses key challenges of grounding Large Language Models on private enterprise data through structured data processing and version-controlled action spaces.

AIBullisharXiv – CS AI · Mar 37/104

🧠

Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data

Researchers from Stanford introduce the Relational Transformer (RT), a new AI architecture that can work with relational databases without task-specific fine-tuning. The 22M parameter model achieves 93% performance of fully supervised models on binary classification tasks, significantly outperforming a 27B parameter LLM at 84%.

AIBullisharXiv – CS AI · Feb 277/106

🧠

Decision MetaMamba: Enhancing Selective SSM in Offline RL with Heterogeneous Sequence Mixing

Researchers propose Decision MetaMamba (DMM), a new AI model architecture that improves offline reinforcement learning by addressing information loss issues in Mamba-based models. The solution uses a dense layer-based sequence mixer and modified positional structure to achieve state-of-the-art performance with fewer parameters.

AIBullisharXiv – CS AI · Feb 277/109

🧠

ArchAgent: Agentic AI-driven Computer Architecture Discovery

ArchAgent, an AI-driven system built on AlphaEvolve, has achieved breakthrough results in automated computer architecture discovery by designing state-of-the-art cache replacement policies. The system achieved 5.3% performance improvements in just 2 days and 0.9% improvements in 18 days, working 3-5x faster than human-developed solutions.

AIBullishOpenAI News · Nov 77/107

🧠

Notion’s rebuild for agentic AI: How GPT‑5 helped unlock autonomous workflows

Notion has rebuilt its AI architecture using GPT-5 to create autonomous agents capable of reasoning, acting, and adapting across workflows. This architectural shift represents a major upgrade in Notion 3.0, enabling smarter and more flexible productivity tools through agentic AI capabilities.

AIBullishOpenAI News · Aug 77/107

🧠

GPT-5 System Card

OpenAI has released a GPT-5 system card detailing a unified model routing system that uses multiple specialized versions including gpt-5-main, gpt-5-thinking, and lightweight variants like gpt-5-thinking-nano. The system is designed to optimize performance across different tasks and developer use cases by routing queries to the most appropriate model variant.

AIBullishSynced Review · May 157/109

🧠

DeepSeek-V3 New Paper is coming! Unveiling the Secrets of Low-Cost Large Model Training through Hardware-Aware Co-design

DeepSeek has released a 14-page technical paper on their V3 model, focusing on scaling challenges and hardware-aware co-design for low-cost large model training. The paper, co-authored by DeepSeek CEO Wenfeng Liang, reveals insights into cost-effective AI architecture development.

AIBullishHugging Face Blog · Aug 127/104

🧠

Welcome Falcon Mamba: The first strong attention-free 7B model

Falcon Mamba represents a breakthrough as the first strong 7B parameter language model that operates without attention mechanisms. This development challenges the dominance of transformer architectures and could lead to more efficient AI models with reduced computational requirements.

AIBullishHugging Face Blog · Dec 117/105

🧠

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

Hugging Face introduces Mixtral, a state-of-the-art Mixture of Experts (MoE) model that represents a significant advancement in AI architecture. The model demonstrates improved efficiency and performance compared to traditional dense models by selectively activating subsets of parameters.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Is GraphRAG Needed? From Basic RAG to Graph-/Agentic Solutions with Context Optimization

Researchers present a comprehensive framework comparing RAG (Retrieval-Augmented Generation) variants—including GraphRAG, Modular RAG, and Agentic RAG—across 9 standardized scenarios. They introduce a novel context optimization method that reduces token usage by 19-53% while identifying a retrieval-generation gap suggesting advanced retrieval methods may not proportionally improve output quality.

← PrevPage 2 of 5Next →