y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#autonomous-agents News & Analysis

83 articles tagged with #autonomous-agents. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

83 articles
AINeutralarXiv โ€“ CS AI ยท Mar 27/1010
๐Ÿง 

From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning

Researchers propose a dynamic agent-centric benchmarking system for evaluating large language models that replaces static datasets with autonomous agents that generate, validate, and solve problems iteratively. The protocol uses teacher, orchestrator, and student agents to create progressively challenging text anomaly detection tasks that expose reasoning errors missed by conventional benchmarks.

AIBullisharXiv โ€“ CS AI ยท Mar 27/1011
๐Ÿง 

Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments

Researchers propose a new framework for foundation world models that enables autonomous agents to learn, verify, and adapt reliably in dynamic environments. The approach combines reinforcement learning with formal verification and adaptive abstraction to create agents that can synthesize verifiable programs and maintain correctness while adapting to novel conditions.

AIBullisharXiv โ€“ CS AI ยท Feb 276/107
๐Ÿง 

AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications

Researchers introduce AMA-Bench, a new benchmark for evaluating long-horizon memory in AI agents deployed in real-world applications. The study reveals existing memory systems underperform due to lack of causality and objective information, while their proposed AMA-Agent system achieves 57.22% accuracy, surpassing baselines by 11.16%.

AIBullisharXiv โ€“ CS AI ยท Feb 276/106
๐Ÿง 

ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering

Researchers have introduced ESAA (Event Sourcing for Autonomous Agents), a new architecture that improves LLM-based autonomous agents by separating cognitive intention from state mutation using structured JSON events and deterministic orchestration. The system addresses key limitations like context degradation and execution reliability, with successful validation through multi-agent case studies using various LLMs including Claude Sonnet and GPT-5.

AINeutralHugging Face Blog ยท Jan 276/106
๐Ÿง 

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

The article discusses practical approaches to implementing Agentic Reinforcement Learning (RL) training for GPT-OSS, an open-source AI model. It provides a retrospective analysis of challenges and solutions encountered during the training process, focusing on technical implementation details and lessons learned.

AIBullishOpenAI News ยท Jul 235/106
๐Ÿง 

Model ML is helping financial firms rebuild with AI from the ground up

Model ML CEO Chaz Englander discusses how AI-native infrastructure and autonomous agents are transforming financial services workflows as part of an Executive Function series. The company is helping financial firms rebuild their operations using artificial intelligence from the ground up.

AIBullishLil'Log (Lilian Weng) ยท Jun 236/10
๐Ÿง 

LLM Powered Autonomous Agents

The article explores LLM-powered autonomous agents that use large language models as core controllers, going beyond text generation to serve as general problem solvers. Key systems like AutoGPT, GPT-Engineer, and BabyAGI demonstrate the potential of agents with planning, memory, and tool-use capabilities.

โ† PrevPage 4 of 4