y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#autonomous-agents News & Analysis

83 articles tagged with #autonomous-agents. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

83 articles
AIBullisharXiv – CS AI · Mar 276/10
🧠

Experiential Reflective Learning for Self-Improving LLM Agents

Researchers introduce Experiential Reflective Learning (ERL), a framework that enables AI agents to improve performance by learning from past experiences and generating transferable heuristics. The method shows a 7.8% improvement in success rates on the Gaia2 benchmark compared to baseline approaches.

AINeutralarXiv – CS AI · Mar 266/10
🧠

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Researchers introduce GameplayQA, a new benchmarking framework for evaluating multimodal large language models on 3D virtual agent perception and reasoning tasks. The framework uses densely annotated multiplayer gameplay videos with 2.4K diagnostic QA pairs, revealing substantial performance gaps between current frontier models and human-level understanding.

AIBullishMIT Technology Review · Mar 256/10
🧠

Agentic commerce runs on truth and context

The article discusses the evolution of AI from assistive tools to autonomous agents capable of executing complex tasks like booking travel arrangements. This shift represents a fundamental change in AI capabilities, moving from providing suggestions to taking direct action on behalf of users.

AIBullisharXiv – CS AI · Mar 166/10
🧠

CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks

Researchers introduce CRAFT-GUI, a curriculum learning framework that uses reinforcement learning to improve AI agents' performance in graphical user interface tasks. The method addresses difficulty variation across GUI tasks and provides more nuanced feedback, achieving 5.6% improvement on Android Control benchmarks and 10.3% on internal benchmarks.

AI × CryptoBullishCoinDesk · Mar 156/10
🤖

AI agents are quietly rewriting prediction market trading

Autonomous AI agents running on the Olas protocol are being used by retail traders to gain a competitive edge in prediction markets like Polymarket. According to Valory co-founder David Minarsch, these agents provide 24/7 trading capabilities with strategic automation for retail participants.

AI agents are quietly rewriting prediction market trading
AIBullishMarkTechPost · Mar 116/10
🧠

How to Build a Self-Designing Meta-Agent That Automatically Constructs, Instantiates, and Refines Task-Specific AI Agents

This tutorial demonstrates building a Meta-Agent system that automatically designs and instantiates task-specific AI agents from simple descriptions. The system dynamically analyzes tasks, selects appropriate tools, configures memory architecture and planners, then creates fully functional agent runtimes without relying on static templates.

AINeutralarXiv – CS AI · Mar 116/10
🧠

Context Engineering: From Prompts to Corporate Multi-Agent Architecture

A new academic paper introduces context engineering as a discipline for managing AI agent decision-making environments, proposing a maturity model that includes prompt, context, intent, and specification engineering. The research addresses enterprise challenges in scaling multi-agent AI systems, with 75% of enterprises planning deployment within two years despite current scaling difficulties.

🏢 Google🏢 Anthropic
AIBullisharXiv – CS AI · Mar 116/10
🧠

AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents

Researchers introduce AutoAgent, a self-evolving multi-agent framework that combines evolving cognition, contextual decision-making, and elastic memory orchestration to enable adaptive autonomous agents. The system continuously learns from experience without external retraining and shows improved performance across retrieval, tool-use, and collaborative tasks compared to static baselines.

AIBullishMarkTechPost · Mar 86/10
🧠

Building Next-Gen Agentic AI: A Complete Framework for Cognitive Blueprint Driven Runtime Agents with Memory Tools and Validation

The article presents a tutorial for building advanced agentic AI systems using a cognitive blueprint framework that incorporates identity, goals, planning, memory, validation, and tool access. The framework enables AI agents to not only respond but also plan, execute, validate, and systematically improve their outputs through structured runtime capabilities.

AINeutralFortune Crypto · Mar 56/10
🧠

The world’s largest tech gathering is talking about “accountability laundering”—here’s why we should christen them Words of the Year

A Meta executive's AI-related email mishap at Mobile World Congress has sparked industry discussions about 'accountability laundering'—the shift of responsibility away from companies when AI systems make autonomous decisions. The incident highlights growing concerns about corporate accountability as AI agents become more prevalent.

The world’s largest tech gathering is talking about “accountability laundering”—here’s why we should christen them Words of the Year
AINeutralarXiv – CS AI · Mar 45/103
🧠

See and Remember: A Multimodal Agent for Web Traversal

Researchers developed V-GEMS, a new multimodal AI agent architecture that improves web navigation by combining visual grounding with explicit memory systems. The system achieved a 28.7% performance improvement over existing baselines by preventing navigation loops and enabling better backtracking through structured path mapping.

AIBullisharXiv – CS AI · Mar 37/108
🧠

LOGIGEN: Logic-Driven Generation of Verifiable Agentic Tasks

Researchers introduce LOGIGEN, a logic-driven framework that synthesizes verifiable training data for autonomous AI agents operating in complex environments. The system uses a triple-agent orchestration approach and achieved a 79.5% success rate on benchmarks, nearly doubling the base model's 40.7% performance.

AIBullisharXiv – CS AI · Mar 37/108
🧠

MemPO: Self-Memory Policy Optimization for Long-Horizon Agents

Researchers propose MemPO (Self-Memory Policy Optimization), a new algorithm that enables AI agents to autonomously manage their memory during long-horizon tasks. The method achieves significant performance improvements with 25.98% F1 score gains over base models while reducing token usage by 67.58%.

AIBullisharXiv – CS AI · Mar 36/109
🧠

The Observer-Situation Lattice: A Unified Formal Basis for Perspective-Aware Cognition

Researchers introduce the Observer-Situation Lattice (OSL), a unified mathematical framework for autonomous agents to reason about multiple perspectives in complex environments. The system addresses limitations in current AI approaches by providing a single coherent structure for belief management and Theory of Mind reasoning.

AINeutralarXiv – CS AI · Mar 37/109
🧠

Evaluating and Understanding Scheming Propensity in LLM Agents

Researchers studied scheming behavior in AI agents pursuing long-term goals, finding minimal instances of scheming in realistic scenarios despite high environmental incentives. The study reveals that scheming behavior is remarkably brittle and can be dramatically reduced by removing tools or increasing oversight.

AINeutralarXiv – CS AI · Mar 36/105
🧠

LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations

Researchers introduce LiveCultureBench, a new benchmark that evaluates large language models as autonomous agents in simulated social environments, testing both task completion and adherence to cultural norms. The benchmark uses a multi-cultural town simulation to assess cross-cultural robustness and the balance between effectiveness and cultural sensitivity in LLM agents.

AIBullisharXiv – CS AI · Mar 37/107
🧠

PEPA: a Persistently Autonomous Embodied Agent with Personalities

Researchers developed PEPA, a three-layer cognitive architecture that enables robots to operate autonomously using personality traits to generate goals without external supervision. The system was successfully tested on a quadruped robot in a real-world office environment, demonstrating sustained autonomous behavior across five personality prototypes.

AIBullisharXiv – CS AI · Mar 37/1010
🧠

Agentic Hives: Equilibrium, Indeterminacy, and Endogenous Cycles in Self-Organizing Multi-Agent Systems

Researchers introduce the Agentic Hive framework for self-organizing multi-agent AI systems where autonomous micro-agents can be dynamically created, specialized, or destroyed based on resource availability and objectives. The framework applies economic theory to prove seven analytical results about equilibrium states, stability, and demographic cycles in variable AI agent populations.

AIBullisharXiv – CS AI · Mar 36/1010
🧠

Efficient Long-Horizon GUI Agents via Training-Free KV Cache Compression

Researchers developed ST-Lite, a training-free KV cache compression framework that accelerates GUI agents by 2.45x while using only 10-20% of the cache budget. The solution addresses memory and latency constraints in Vision-Language Models for autonomous GUI interactions through specialized attention pattern optimization.

AIBullisharXiv – CS AI · Mar 36/104
🧠

Scalable Multi-Task Learning through Spiking Neural Networks with Adaptive Task-Switching Policy for Intelligent Autonomous Agents

Researchers have developed SwitchMT, a novel methodology using Spiking Neural Networks with adaptive task-switching for multi-task learning in autonomous agents. The approach addresses task interference issues and demonstrates competitive performance in multiple Atari games while maintaining low power consumption and network complexity.

AIBullisharXiv – CS AI · Mar 36/104
🧠

Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents

Researchers introduce Hierarchical Preference Learning (HPL), a new framework that improves AI agent training by using preference signals at multiple granularities - trajectory, group, and step levels. The method addresses limitations in existing Direct Preference Optimization approaches and demonstrates superior performance on challenging agent benchmarks through a dual-layer curriculum learning system.

AIBullisharXiv – CS AI · Mar 27/1012
🧠

The Auton Agentic AI Framework

Researchers have introduced the Auton Agentic AI Framework, a new architecture designed to bridge the gap between stochastic LLM outputs and deterministic backend systems required for autonomous AI agents. The framework separates cognitive blueprints from runtime engines, enabling cross-platform portability and formal auditability while incorporating advanced safety mechanisms and memory systems.

← PrevPage 3 of 4Next →