y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#ai-agents News & Analysis

Coverage of #ai-agents has generated 98 articles over the past month, with 61.2% maintaining a bullish sentiment. Discussion remains stable compared to the previous quarter, reflecting consistent interest rather than sudden shifts in outlook. The conversation centers on major AI models including GPT-5 and Claude, with substantial research contributions tracked through arXiv's computer science and AI channels alongside cryptocurrency-focused outlets. The topic frequently intersects with machine learning, large language models, and automation research, while also appearing alongside discussions of blockchain assets like Ethereum and Bitcoin. Scan the articles below to explore how #ai-agents are being developed, deployed, and analyzed across technical and financial perspectives.

sentiment · last 30d (98 articles)
Top sources:arXiv – CS AI · 243Crypto Briefing · 19CoinDesk · 18Fortune Crypto · 12TechCrunch – AI · 12
Most-discussed entities:GPT-5 · 13Claude · 13Anthropic · 10OpenAI · 9Opus · 6
676 articles
AIBullishCrypto Briefing · Apr 207/10
🧠

Aaron Levie: AI will create more lawyers in five years, workflows must be redesigned for AI agents, and the commercial race in AI is reshaping global dynamics | 20VC

Aaron Levie argues that AI-driven automation will expand the legal profession rather than contract it, creating new lawyer roles and job categories within five years. He emphasizes that organizational workflows must be fundamentally redesigned to effectively integrate AI agents, and notes that the commercial AI race is becoming a geopolitical competition reshaping global dynamics.

Aaron Levie: AI will create more lawyers in five years, workflows must be redesigned for AI agents, and the commercial race in AI is reshaping global dynamics | 20VC
AI × CryptoNeutralThe Block · Apr 206/10
🤖

What are AI agent tokens?

The cryptocurrency industry is experiencing a shift from infrastructure-focused blockchain AI projects toward AI agent tokens—crypto assets tied to specific autonomous agents rather than broader networks. This emerging trend reflects growing capabilities of AI bots in content generation and task management, representing a new tokenization paradigm within the AI-crypto intersection.

What are AI agent tokens?
AI × CryptoNeutralDecrypt – AI · Apr 206/10
🤖

Coinbase Tests AI Agents Modeled on ‘Legendary’ Former Execs

Coinbase is testing AI agents trained to replicate the decision-making approaches of co-founder Fred Ehrsam and former CTO Balaji Srinivasan. This initiative represents a growing trend of enterprises embedding institutional expertise into AI systems to enhance strategic decision-making and operational efficiency.

Coinbase Tests AI Agents Modeled on ‘Legendary’ Former Execs
AIBullishBlockonomi · Apr 206/10
🧠

Bernstein: Cloud Infrastructure Will Dominate as AI Agents Reshape Software Industry

Bernstein's latest 5-year AI forecast predicts cloud infrastructure providers will emerge as dominant winners as AI agents fundamentally reshape the software industry. While legacy systems face significant pressure, the research suggests the broader software sector will evolve rather than decline, with cloud platforms positioned as the critical backbone for AI control planes.

AI × CryptoNeutralcrypto.news · Apr 206/10
🤖

Coinbase tests AI agents on Slack, eyes fewer human workers

Coinbase is testing AI agents integrated into Slack and email platforms, signaling a strategic shift toward automation. CEO Brian Armstrong has publicly stated that AI agents may eventually outnumber human employees at the exchange, reflecting broader industry trends toward workforce optimization through artificial intelligence.

Coinbase tests AI agents on Slack, eyes fewer human workers
AINeutralarXiv – CS AI · Apr 206/10
🧠

The Semi-Executable Stack: Agentic Software Engineering and the Expanding Scope of SE

A research paper proposes that AI-driven software engineering doesn't threaten the field but rather expands its scope to include 'semi-executable' artifacts—combinations of natural language, tools, and workflows requiring human or probabilistic interpretation. The Semi-Executable Stack model provides a diagnostic framework across six layers to understand how software engineering practices evolve as AI agents handle routine tasks.

AIBullisharXiv – CS AI · Apr 206/10
🧠

"Excuse me, may I say something..." CoLabScience, A Proactive AI Assistant for Biomedical Discovery and LLM-Expert Collaborations

Researchers introduce CoLabScience, a proactive AI assistant designed to enhance biomedical research collaboration by intervening in scientific discussions at optimal moments. The system uses PULI, a reinforcement learning framework that learns when and how to contribute based on project context and conversation history, supported by a new benchmark dataset (BSDD) of simulated research dialogues.

AINeutralarXiv – CS AI · Apr 206/10
🧠

GTA-2: Benchmarking General Tool Agents from Atomic Tool-Use to Open-Ended Workflows

Researchers introduce GTA-2, a hierarchical benchmark that evaluates AI agents on both atomic tool-use tasks and complex, open-ended workflows using real user queries and deployed tools. The study reveals a significant capability cliff where frontier AI models achieve below 50% success on atomic tasks and only 14.39% on realistic workflows, highlighting that execution framework design matters as much as underlying model capacity.

AIBullishFortune Crypto · Apr 186/10
🧠

This founder was an AI layoff 9 months ago. Then he built an instantly profitable company with 2 partners and 12 agents

A former AI industry employee laid off 9 months ago co-founded a startup with two partners that achieved $300,000 in annualized recurring revenue within 2.5 months of launch, leveraging 12 AI agents. The case demonstrates how AI automation tools enable lean teams to build profitable businesses rapidly, reflecting broader market shifts toward AI-driven efficiency.

This founder was an AI layoff 9 months ago. Then he built an instantly profitable company with 2 partners and 12 agents
AIBullishTechCrunch – AI · Apr 156/10
🧠

Hightouch reaches $100M ARR fueled by marketing tools powered by AI

Hightouch, a data activation platform, has reached $100M ARR by adding AI-powered agent tools for marketers, achieving a $70M revenue increase in just 20 months. The rapid growth demonstrates strong market demand for AI-enhanced marketing automation solutions.

AIBullisharXiv – CS AI · Apr 156/10
🧠

Long-Horizon Plan Execution in Large Tool Spaces through Entropy-Guided Branching

Researchers introduce SLATE, a large-scale benchmark for evaluating AI agents using APIs, and propose Entropy-Guided Branching (EGB), a search algorithm that improves task success rates and computational efficiency. The work addresses critical limitations in deploying language models within complex tool environments by establishing rigorous evaluation frameworks and reducing the computational burden of exploring massive decision spaces.

AIBullishDecrypt · Apr 146/10
🧠

What Is Hermes? The Self-Improving AI Agent Coming for OpenClaw

Nous Research has unveiled Hermes, an open-source AI agent featuring a built-in learning loop that enables it to create and improve skills from experience autonomously. The agent operates on terminal infrastructure and represents a significant advancement in self-improving AI systems, positioning itself as a competitor to proprietary alternatives like OpenAI's tools.

What Is Hermes? The Self-Improving AI Agent Coming for OpenClaw
AINeutralarXiv – CS AI · Apr 146/10
🧠

COMPOSITE-Stem

Researchers introduced COMPOSITE-STEM, a new benchmark containing 70 expert-written scientific tasks across physics, biology, chemistry, and mathematics to evaluate AI agents. The top-performing model achieved only 21% accuracy, indicating the benchmark effectively measures capabilities beyond current AI reach and addresses the saturation of existing evaluation frameworks.

AINeutralarXiv – CS AI · Apr 146/10
🧠

HealthAdminBench: Evaluating Computer-Use Agents on Healthcare Administration Tasks

Researchers introduced HealthAdminBench, a new evaluation framework with 135 tasks across realistic healthcare administration workflows, revealing that current AI agents achieve only 36.3% end-to-end success despite strong individual subtask performance. The benchmark demonstrates a critical gap between AI capabilities and the reliability requirements for automating healthcare administrative processes worth over $1 trillion annually.

🧠 GPT-5🧠 Claude🧠 Opus
AINeutralarXiv – CS AI · Apr 146/10
🧠

Cooperation in Human and Machine Agents: Promise Theory Considerations

A theoretical research paper examines Promise Theory as a framework for understanding cooperation between human and machine agents in autonomous systems. The work revisits established principles of agent cooperation to address how diverse components—humans, hardware, software, and AI—maintain alignment with intended purposes through signaling, trust, and feedback mechanisms.

AINeutralarXiv – CS AI · Apr 146/10
🧠

Agent Mentor: Framing Agent Knowledge through Semantic Trajectory Analysis

Researchers introduce Agent Mentor, an open-source analytics pipeline that monitors and automatically improves AI agent behavior by analyzing execution logs and iteratively refining system prompts with corrective instructions. The framework addresses variability in large language model-based agent performance caused by ambiguous prompt formulations, demonstrating consistent accuracy improvements across multiple configurations.

AINeutralarXiv – CS AI · Apr 146/10
🧠

Do Agent Rules Shape or Distort? Guardrails Beat Guidance in Coding Agents

A large-scale empirical study of 679 GitHub instruction files shows that AI coding agent performance improves by 7-14 percentage points when rules are applied, but surprisingly, random rules work as well as expert-curated ones. The research reveals that negative constraints outperform positive directives, suggesting developers should focus on guardrails rather than prescriptive guidance.

AIBullisharXiv – CS AI · Apr 146/10
🧠

Tuning Qwen2.5-VL to Improve Its Web Interaction Skills

Researchers fine-tuned Qwen2.5-VL-32B, a leading open-source vision-language model, to improve its ability to autonomously perform web interactions through visual input alone. Using a two-stage training approach that addresses cursor localization, instruction sensitivity, and overconfidence bias, the model's success rate on single-click web tasks improved from 86% to 94%.

AINeutralFortune Crypto · Apr 136/10
🧠

AI agents are acting like employees, but company structures still treat them like software

AI agents are increasingly operating autonomously in corporate environments, making independent decisions without human oversight. However, organizational structures and legal frameworks have not evolved to accommodate this shift, creating a mismatch between how these systems function and how companies classify and manage them.

AI agents are acting like employees, but company structures still treat them like software
AINeutralarXiv – CS AI · Apr 136/10
🧠

SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment

Researchers introduce SEA-Eval, a new benchmark for evaluating self-evolving AI agents that go beyond single-task execution by measuring how agents improve across sequential tasks and accumulate experience over time. The benchmark reveals significant inefficiencies in current state-of-the-art frameworks, exposing up to 31.2x differences in token consumption despite identical success rates, highlighting a critical bottleneck in agent development.

AIBearishBlockonomi · Apr 116/10
🧠

Zoom (ZM) Stock Plunges 5.7% Amid AI Agent Disruption Concerns

Zoom's stock declined 5.7% on Thursday following concerns about AI agents from Anthropic and OpenAI potentially disrupting enterprise communication software. The sell-off reflects broader market anxiety about how advanced AI systems could reshape or disintermediate traditional collaboration platforms.

🏢 OpenAI🏢 Anthropic
AI × CryptoBullishCrypto Briefing · Apr 117/10
🤖

Gavriel Cohen: AI native service companies can achieve software-like margins, the rise of AI agents in marketing, and security risks of complex architectures | MLST

Gavriel Cohen discusses how AI-native service companies can achieve software-like profit margins through minimal, secure tool design, exemplified by NanoClaw's success. The article explores the emerging role of AI agents in marketing while highlighting security vulnerabilities inherent in complex AI architectures.

Gavriel Cohen: AI native service companies can achieve software-like margins, the rise of AI agents in marketing, and security risks of complex architectures | MLST
← PrevPage 18 of 28Next →