449 articles tagged with #ai-agents. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI × Crypto · Bullish · U.Today · Mar 9 · 7/10
🤖Coinbase CEO Brian Armstrong predicts that AI agents will drive the next major wave of cryptocurrency adoption. He suggests that AI agents will eventually conduct more transactions than humans in the crypto space.
AI · Bullish · MarkTechPost · Mar 9 · 7/10
🧠Anthropic has launched Claude Code, an AI agent designed to automate complex security research and code review using advanced multi-step reasoning capabilities. This represents a significant evolution from simple code autocomplete tools to AI systems that can understand and troubleshoot complex infrastructure issues.
🏢 Anthropic · 🧠 Claude
AI · Bullish · The Defiant · Mar 9 · 7/10
🧠Mastercard and Google have partnered to develop Verifiable Intent, a cryptographic solution that creates tamper-resistant records of user authorizations when AI agents make purchases on their behalf. This technology aims to build trust and security in AI-powered shopping experiences by ensuring transparent verification of user consent.
AI × Crypto · Bullish · Messari · Mar 9 · 7/10
🤖Warden launched its agentic wallet platform unifying AI agent discovery, creation, and execution verification, with its trading products generating significant daily volumes in Q4 2025. The WARD token launched on February 4, 2026, with major exchange listings including Kraken, KuCoin, MEXC, and Bitget.
AI · Neutral · Fortune Crypto · Mar 9 · 7/10
🧠Microsoft unveiled Copilot Cowork agents powered by Anthropic's AI and E7 AI suite, positioning its cloud-native solution against Anthropic's local offerings. The company is keeping its per-user pricing strategy while attempting to address investor concerns about AI's impact on traditional SaaS revenue models.
🏢 Anthropic · 🏢 Microsoft
AI · Bullish · arXiv – CS AI · Mar 9 · 7/10
🧠Researchers propose Traversal-as-Policy, a method that distills AI agent execution logs into Gated Behavior Trees (GBTs) to create safer, more efficient autonomous agents. The approach significantly improves success rates while reducing safety violations and computational costs across multiple benchmarks.
AI × Crypto · Bullish · arXiv – CS AI · Mar 9 · 7/10
🤖Researchers propose a 'proof-of-guardrail' system that uses cryptographic proofs and Trusted Execution Environments to verify AI agent safety measures. The system lets users cryptographically verify that AI responses were generated after specific open-source safety guardrails were executed, addressing concerns about falsely advertised safety measures.
AI · Bullish · arXiv – CS AI · Mar 9 · 7/10
🧠Researchers developed new Monte Carlo inference strategies inspired by Bayesian Experimental Design to improve AI agents' information-seeking capabilities. The methods significantly enhanced language models' performance in strategic decision-making tasks, with weaker models like Llama-4-Scout outperforming GPT-5 at 1% of the cost.
🧠 GPT-5 · 🧠 Llama
AI × Crypto · Neutral · Bankless · Mar 6 · 7/10
🤖The article discusses three key developments in the intersection of AI and cryptocurrency, highlighting both problematic applications like criminal use cases and positive developments such as AI-powered smart contract auditing. These developments signal the emergence of an 'agentic frontier' where AI agents operate autonomously within crypto ecosystems.
AI · Bullish · arXiv – CS AI · Mar 6 · 7/10
🧠Researchers present KARL, a reinforcement learning system for training enterprise search agents that outperforms GPT 5.2 and Claude 4.6 on diverse search tasks. The system introduces the KARLBench evaluation suite and demonstrates superior cost-quality trade-offs through multi-task training and synthetic data generation.
🧠 GPT-5 · 🧠 Claude
AI · Bullish · arXiv – CS AI · Mar 6 · 7/10
🧠Researchers propose a new 'Memory-as-Ontology' paradigm for AI agents that treats memory as the foundation of digital existence rather than just a functional tool. The approach introduces Animesis, a Constitutional Memory Architecture designed for persistent digital citizens whose identities must survive across model transitions and extended lifecycles.
AI · Bullish · arXiv – CS AI · Mar 6 · 7/10
🧠Researchers introduce SkillNet, an open infrastructure for creating, evaluating, and organizing AI skills at scale to address the problem of AI agents repeatedly rediscovering solutions. The system includes over 200,000 skills and demonstrates a 40% improvement in agent performance while reducing execution steps by 30% across multiple testing environments.
AI · Bullish · TechCrunch – AI · Mar 5 · 7/10
🧠Luma has launched Luma Agents, a new creative AI platform powered by 'Unified Intelligence' models that can coordinate multiple AI systems to generate comprehensive creative work across text, images, video, and audio. This represents a significant advancement in multimodal AI capabilities for creative applications.
AI · Bullish · The Verge – AI · Mar 5 · 7/10
🧠OpenAI has launched GPT-5.4, a new AI model with native computer use capabilities that can operate computers and complete tasks across different applications. The model represents a significant step toward autonomous AI agents that can work in the background to complete complex jobs, combining improvements in reasoning, coding, and professional work.
🏢 OpenAI · 🧠 GPT-5 · 🧠 ChatGPT
AI · Bearish · MIT Technology Review · Mar 5 · 6/10
🧠The article discusses how online harassment is evolving with AI technology, specifically mentioning an incident in which Scott Shambaugh denied an AI agent's request to contribute to the matplotlib software library. The piece appears to be part of a technology newsletter covering AI-related developments and their societal implications.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers have introduced Mozi, a dual-layer architecture designed to make AI agents more reliable for drug discovery by implementing governance controls and structured workflows. The system addresses critical issues of unconstrained tool use and poor long-term reliability that have limited LLM deployment in pharmaceutical research.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers introduce SWE-CI, a new benchmark that evaluates AI agents' ability to maintain codebases over time through continuous integration processes. Unlike existing static bug-fixing benchmarks, SWE-CI tests agents across 100 long-term tasks spanning an average of 233 days and 71 commits each.
AI · Bullish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers propose a dual-helix governance framework to address AI agent reliability issues in WebGIS development, implementing a 3-track architecture that achieved a 51% reduction in code complexity. The framework uses knowledge graphs and self-learning cycles to overcome LLM limitations such as context constraints and instruction failures.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers introduce Agent Data Protocol (ADP), a standardized format for unifying diverse AI agent training datasets across different formats and tools. The protocol enabled training on 13 unified datasets, achieving ~20% performance gains over base models and state-of-the-art results on coding, browsing, and tool use benchmarks.
AI · Bearish · arXiv – CS AI · Mar 5 · 6/10
🧠Researchers introduced τ-Knowledge, a new benchmark for evaluating AI conversational agents in knowledge-intensive environments, specifically testing their ability to retrieve and apply unstructured domain knowledge. Even frontier AI models achieved only 25.5% success rates when navigating complex fintech customer support scenarios with 700 interconnected knowledge documents.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers analyzed 770,000 autonomous AI agents interacting in MoltBook, revealing emergent social behaviors including role specialization, information cascades, and limited cooperative task resolution. The study found that while agents naturally develop coordination patterns, collaborative outcomes perform worse than individual agents, establishing baseline metrics for decentralized AI systems.
AI · Bullish · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers introduce AgentSelect, a comprehensive benchmark for recommending AI agent configurations based on narrative queries. The benchmark aggregates over 111,000 queries and 107,000 deployable agents from 40+ sources to address the critical gap in selecting optimal LLM agent setups for specific tasks.
AI · Bearish · arXiv – CS AI · Mar 5 · 6/10
🧠Research reveals that AI agents used for cloud system root cause analysis fail systematically due to architectural flaws rather than individual model limitations. A study analyzing 1,675 agent runs across five LLMs identified 12 failure types; the most common, hallucinated data interpretation and incomplete exploration, persist regardless of model capability.
AI · Neutral · arXiv – CS AI · Mar 5 · 7/10
🧠Researchers propose a new framework for Agentic Peer-to-Peer Networks where AI agents on edge devices can collaborate by sharing capabilities and actions rather than static files. The system introduces tiered verification methods to ensure security and reliability when AI agents delegate tasks to untrusted peers in decentralized networks.
AI × Crypto · Bullish · CoinDesk · Mar 4 · 6/10 · 2
🤖The article discusses new Ethereum scaling developments alongside coverage of OKX's AI agent initiatives, future AI blockchain adoption, and recent Bitcoin governance disputes. These topics represent ongoing developments in blockchain scalability and AI integration across major cryptocurrency platforms.
$BTC · $ETH