y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#autonomous-agents News & Analysis

150 articles tagged with #autonomous-agents. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

150 articles
AINeutralarXiv – CS AI · May 96/10
🧠

Strat-LLM: Stratified Strategy Alignment for LLM-based Stock Trading with Real-time Multi-Source Signals

Researchers introduce Strat-LLM, a framework that aligns large language models for stock trading by matching model architecture to operational modes (Free, Guided, Strict), finding that reasoning-heavy models excel with minimal constraints while standard models benefit from strict guardrails. Live-forward testing across 2025 on A-share and U.S. markets reveals that optimal performance depends on market regime and model scale, with mid-size models (35B) showing superior risk-adjusted returns under constraints.

AINeutralarXiv – CS AI · May 96/10
🧠

Safactory: A Scalable Agent Factory for Trustworthy Autonomous Intelligence

Safactory is a new framework that integrates simulation, data management, and reinforcement learning to develop trustworthy autonomous AI agents. The system addresses fragmentation in existing agent infrastructure by creating a unified pipeline for continuous improvement and risk detection in long-horizon decision-making tasks.

AINeutralarXiv – CS AI · May 76/10
🧠

Modular Reinforcement Learning For Cooperative Swarms

Researchers propose a modular reinforcement learning approach to address memory constraints in cooperative robot swarms. By decomposing spatial interaction states into separate learning procedures rather than representing combinatorial states, the method enables computationally-limited robots to learn effective collective behaviors while maintaining independent learning processes.

AINeutralarXiv – CS AI · May 16/10
🧠

Self-Evolving Software Agents

Researchers propose self-evolving software agents that combine Belief-Desire-Intention (BDI) reasoning with large language models to enable autonomous adaptation of goals, reasoning logic, and executable code beyond fixed design parameters. A prototype demonstrates that agents can discover new objectives and generate functional behaviors from minimal initial knowledge, though challenges remain in behavioral stability and inheritance.

AI × CryptoBullishThe Block · Apr 206/10
🤖

Coinbase-incubated x402 protocol unveils app-store for AI bots

Coinbase-incubated x402 protocol has launched an app store for AI bots, enabling agentic commerce where autonomous agents can access services on a per-use basis. Creator Erik Reppel highlights how this model is fundamentally reducing activation costs and changing how services are monetized in the emerging AI agent economy.

Coinbase-incubated x402 protocol unveils app-store for AI bots
AI × CryptoNeutralThe Block · Apr 206/10
🤖

What are AI agent tokens?

The cryptocurrency industry is experiencing a shift from infrastructure-focused blockchain AI projects toward AI agent tokens—crypto assets tied to specific autonomous agents rather than broader networks. This emerging trend reflects growing capabilities of AI bots in content generation and task management, representing a new tokenization paradigm within the AI-crypto intersection.

What are AI agent tokens?
AINeutralarXiv – CS AI · Apr 206/10
🧠

SocialGrid: A Benchmark for Planning and Social Reasoning in Embodied Multi-Agent Systems

Researchers introduce SocialGrid, a benchmark environment for evaluating Large Language Models as autonomous agents in multi-agent social scenarios. The study reveals that even the most capable open-source LLMs achieve below 60% task completion and struggle significantly with social reasoning tasks like detecting deception, exposing critical limitations in current AI agent capabilities.

AIBullishAI News · Apr 156/10
🧠

Commvault launches a ‘Ctrl-Z’ for cloud AI workloads

Commvault has launched AI Protect, a governance solution that provides rollback capabilities for autonomous AI agents operating in cloud environments. The platform addresses critical risks posed by AI systems that can independently delete files, access databases, modify infrastructure, and alter security policies without adequate oversight or recovery mechanisms.

AIBullishAI News · Apr 156/10
🧠

Citizen developers now have their own Wingman

Emergent has released Wingman, an autonomous AI agent designed to help non-technical users create and manage applications for daily tasks. The tool aims to democratize software development by making application creation accessible to citizen developers without coding expertise.

AINeutralarXiv – CS AI · Apr 156/10
🧠

EMBER: Autonomous Cognitive Behaviour from Learned Spiking Neural Network Dynamics in a Hybrid LLM Architecture

Researchers present EMBER, a hybrid architecture combining spiking neural networks with large language models where the SNN acts as a persistent, biologically-inspired memory substrate that autonomously triggers LLM reasoning. The system demonstrates emergent autonomous behavior, initiating unprompted user contact after learning associations during idle periods, suggesting a fundamental shift in how AI systems could coordinate cognition and action.

AI × CryptoBullishBlockonomi · Apr 146/10
🤖

HashKey CEO Xiao Feng: AI and Blockchain Convergence Will Birth the Agent Economy

HashKey CEO Xiao Feng presented a vision of AI and blockchain convergence at the 2026 World Internet Conference Asia-Pacific Summit, proposing that AI tokens decode information while blockchain tokens distribute value. He framed AI as the 'brain' and blockchain as the 'hands, feet, and bones' of an emerging agent economy, suggesting both technologies share fundamental structural similarities.

AINeutralarXiv – CS AI · Apr 146/10
🧠

Turing Test on Screen: A Benchmark for Mobile GUI Agent Humanization

Researchers introduce the 'Turing Test on Screen,' a framework for measuring how well autonomous GUI agents can mimic human behavior to evade detection systems. The study reveals that current LLM-based agents exhibit unnatural interaction patterns and proposes humanization methods to improve their ability to operate undetected in adversarial digital environments.

AINeutralarXiv – CS AI · Apr 146/10
🧠

STARS: Skill-Triggered Audit for Request-Conditioned Invocation Safety in Agent Systems

Researchers introduce STARS, a framework for continuously auditing AI agent skill invocations in real-time by combining static capability analysis with request-conditioned risk modeling. The approach demonstrates improved detection of prompt injection attacks compared to static baselines, though remains most valuable as a triage layer rather than a complete replacement for pre-deployment screening.

AIBullisharXiv – CS AI · Apr 146/10
🧠

Tuning Qwen2.5-VL to Improve Its Web Interaction Skills

Researchers fine-tuned Qwen2.5-VL-32B, a leading open-source vision-language model, to improve its ability to autonomously perform web interactions through visual input alone. Using a two-stage training approach that addresses cursor localization, instruction sensitivity, and overconfidence bias, the model's success rate on single-click web tasks improved from 86% to 94%.

CryptoNeutralCrypto Briefing · Apr 106/10
⛓️

Noah Levine: New protocols will revolutionize B2B commerce, skepticism around autonomous consumer agents, and the compatibility of card payments with stablecoins | Unchained

Noah Levine discusses emerging B2B commerce protocols that integrate traditional and digital payment systems, expresses skepticism about autonomous consumer agents, and explores the viability of card payments with stablecoins. These developments signal a shift toward hybrid payment infrastructure that bridges legacy financial systems with blockchain technology.

Noah Levine: New protocols will revolutionize B2B commerce, skepticism around autonomous consumer agents, and the compatibility of card payments with stablecoins | Unchained
AIBearisharXiv – CS AI · Apr 106/10
🧠

Evaluating LLM-Based 0-to-1 Software Generation in End-to-End CLI Tool Scenarios

Researchers introduce CLI-Tool-Bench, a new benchmark for evaluating large language models' ability to generate complete software from scratch. Testing seven state-of-the-art LLMs reveals that top models achieve under 43% success rates, exposing significant limitations in current AI-driven 0-to-1 software generation despite increased computational investment.

AINeutralarXiv – CS AI · Apr 106/10
🧠

One Life to Learn: Inferring Symbolic World Models for Stochastic Environments from Unguided Exploration

Researchers introduce OneLife, a framework for learning symbolic world models from minimal unguided exploration in complex, stochastic environments. The approach uses conditionally-activated programmatic laws within a probabilistic framework and demonstrates superior performance on 16 of 23 test scenarios, advancing autonomous construction of world models for unknown environments.

AIBullisharXiv – CS AI · Mar 276/10
🧠

Experiential Reflective Learning for Self-Improving LLM Agents

Researchers introduce Experiential Reflective Learning (ERL), a framework that enables AI agents to improve performance by learning from past experiences and generating transferable heuristics. The method shows a 7.8% improvement in success rates on the Gaia2 benchmark compared to baseline approaches.

AINeutralarXiv – CS AI · Mar 266/10
🧠

GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents

Researchers introduce GameplayQA, a new benchmarking framework for evaluating multimodal large language models on 3D virtual agent perception and reasoning tasks. The framework uses densely annotated multiplayer gameplay videos with 2.4K diagnostic QA pairs, revealing substantial performance gaps between current frontier models and human-level understanding.

AIBullishMIT Technology Review · Mar 256/10
🧠

Agentic commerce runs on truth and context

The article discusses the evolution of AI from assistive tools to autonomous agents capable of executing complex tasks like booking travel arrangements. This shift represents a fundamental change in AI capabilities, moving from providing suggestions to taking direct action on behalf of users.

AIBullisharXiv – CS AI · Mar 166/10
🧠

CRAFT-GUI: Curriculum-Reinforced Agent For GUI Tasks

Researchers introduce CRAFT-GUI, a curriculum learning framework that uses reinforcement learning to improve AI agents' performance in graphical user interface tasks. The method addresses difficulty variation across GUI tasks and provides more nuanced feedback, achieving 5.6% improvement on Android Control benchmarks and 10.3% on internal benchmarks.

AI × CryptoBullishCoinDesk · Mar 156/10
🤖

AI agents are quietly rewriting prediction market trading

Autonomous AI agents running on the Olas protocol are being used by retail traders to gain a competitive edge in prediction markets like Polymarket. According to Valory co-founder David Minarsch, these agents provide 24/7 trading capabilities with strategic automation for retail participants.

AI agents are quietly rewriting prediction market trading
AIBullishMarkTechPost · Mar 116/10
🧠

How to Build a Self-Designing Meta-Agent That Automatically Constructs, Instantiates, and Refines Task-Specific AI Agents

This tutorial demonstrates building a Meta-Agent system that automatically designs and instantiates task-specific AI agents from simple descriptions. The system dynamically analyzes tasks, selects appropriate tools, configures memory architecture and planners, then creates fully functional agent runtimes without relying on static templates.

AINeutralarXiv – CS AI · Mar 116/10
🧠

Context Engineering: From Prompts to Corporate Multi-Agent Architecture

A new academic paper introduces context engineering as a discipline for managing AI agent decision-making environments, proposing a maturity model that includes prompt, context, intent, and specification engineering. The research addresses enterprise challenges in scaling multi-agent AI systems, with 75% of enterprises planning deployment within two years despite current scaling difficulties.

🏢 Google🏢 Anthropic
← PrevPage 5 of 6Next →