AIBullisharXiv – CS AI · 4d ago7/10
🧠LedgerAgent is a new inference-time method that improves how AI agents handle customer-service tasks by maintaining explicit task states in a separate ledger rather than reconstructing context from prompts. The approach reduces policy violations and improves decision consistency across multiple trials by validating state-dependent constraints before executing tool calls.
AIBearisharXiv – CS AI · Jun 107/10
🧠A comprehensive review of 247 research papers reveals that LLM agents face escalating security threats beyond text generation, including prompt injection, tool hijacking, and state corruption. The study proposes a framework emphasizing trust boundaries, privilege control, and stateful risk evaluation to address fragmented defenses and inadequate benchmarking standards.
AIBearisharXiv – CS AI · Jun 17/10
🧠Researchers introduce LongDS, a benchmark revealing significant limitations in AI agents performing long-horizon data analysis tasks. Testing five state-of-the-art models shows best performance of only 48.45% accuracy with performance degrading by 47 points across task progression, indicating that maintaining analytical context over extended interactions remains a critical unsolved problem.
AIBullisharXiv – CS AI · May 97/10
🧠Researchers introduce execution lineage, a DAG-based execution model that makes AI-native workflows reproducible and maintainable by explicitly tracking dependencies and enabling identity-based replay. Tested against traditional loop-based approaches, the system demonstrated superior performance in preserving work integrity during updates while preventing unrelated context contamination.
AIBullisharXiv – CS AI · Apr 147/10
🧠Researchers propose MGA (Memory-Driven GUI Agent), a minimalist AI framework that improves GUI automation by decoupling long-horizon tasks into independent steps linked through structured state memory. The approach addresses critical limitations in current multimodal AI agents—context overload and architectural redundancy—while maintaining competitive performance with reduced complexity.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers introduce StatePlane, a model-agnostic cognitive state management system that enables AI systems to maintain coherent reasoning over long interaction horizons without expanding context windows or retraining models. The system uses episodic, semantic, and procedural memory mechanisms inspired by cognitive psychology to overcome current limitations in large language models.
CryptoBullishBankless · Feb 57/105
⛓️The article discusses Vitalik Buterin's proposed tiered state architecture for Ethereum and its potential implications for token management and scalability. This vision represents a significant technical evolution that could reshape how tokens operate within Ethereum's ecosystem.
$ETH
AINeutralarXiv – CS AI · Jun 26/10
🧠Researchers propose Post-Deterministic Distributed Systems (PDDS), a new framework for coordinating infrastructure where autonomous agents, stochastic models, and deterministic code coexist—challenging decades-old assumptions in distributed computing that relied on predictable, deterministic participant behavior.
AINeutralarXiv – CS AI · Jun 26/10
🧠Atomix is a new runtime system that enables LLM agents to execute multi-step workflows with transactional guarantees, preventing partial effects and state corruption from faults or concurrent execution. By explicitly tracking which effects must settle together and when conflicting work is exhausted, Atomix provides reliable settlement semantics for agentic systems with minimal overhead.
AINeutralarXiv – CS AI · May 276/10
🧠Researchers propose Governed Evolving Memory (GEM), a new paradigm for long-term AI agent memory that treats memory as a state-management workload rather than traditional database storage. The framework addresses four critical failure modes in current agent systems—unregulated growth, missing semantic revision, capacity-driven forgetting, and read-only retrieval—through four state-level operators and six correctness conditions that operate at the trajectory level rather than individual records.
AINeutralarXiv – CS AI · Apr 146/10
🧠ClawVM is a virtual memory management system designed for stateful LLM agents that addresses critical failures in current context window management. The system implements typed pages, multi-resolution representations, and validated writeback protocols to ensure deterministic state residency and durability, adding minimal computational overhead.
CryptoNeutralEthereum Foundation Blog · Dec 164/102
⛓️The Stateless Consensus team has published a proposal regarding the future of Ethereum's state management. This appears to be an internal Ethereum Foundation discussion document that may not represent consensus views across the organization.
$ETH