449 articles tagged with #ai-agents. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AINeutralarXiv – CS AI · Mar 167/10
🧠Researchers developed a testing framework to evaluate how reliably AI agents maintain consistent reasoning when inputs are semantically equivalent but differently phrased. Their study of seven foundation models across 19 reasoning problems found that larger models aren't necessarily more robust, with the smaller Qwen3-30B-A3B achieving the highest stability at 79.6% invariant responses.
AI × CryptoBullishCoinDesk · Mar 157/10
🤖Visa and Coinbase are developing competing infrastructure for AI agent payments, with the next trillion-dollar payments network expected to facilitate machine-to-machine transactions at massive scale. This represents a fundamental shift from human-operated checkout systems to autonomous AI-driven commerce.
AIBearisharXiv – CS AI · Mar 127/10
🧠Researchers have introduced Flip-Agent, the first targeted bit-flip attack framework specifically designed to exploit LLM-based agents by manipulating hardware faults. The attack can manipulate both final outputs and tool invocations in multi-stage AI agent pipelines, revealing critical security vulnerabilities in these systems.
AI × CryptoNeutralarXiv – CS AI · Mar 127/10
🤖Researchers propose NabaOS, a lightweight verification framework that detects AI agent hallucinations using HMAC-signed tool receipts instead of zero-knowledge proofs. The system achieves 94.2% detection accuracy with <15ms verification time, compared to cryptographic approaches that require 180+ seconds per query.
AIBearisharXiv – CS AI · Mar 127/10
🧠Researchers have developed a risk assessment framework for open-source Model Context Protocol (MCP) servers, revealing significant security vulnerabilities through static code analysis. The study found many MCP servers contain exploitable weaknesses that compromise confidentiality, integrity, and availability, highlighting the need for secure-by-design development as these tools become widely adopted for LLM agents.
AINeutralarXiv – CS AI · Mar 127/10
🧠A legal research paper proposes the 'Algorithmic Corporation' (A-corp) framework to address the challenge of identifying and assigning liability for AI agents' actions as millions of autonomous AIs proliferate across the economy. The A-corp structure would create legally recognizable entities owned by humans but operated by AIs, enabling both accountability and legal recourse when AI agents cause harm.
AIBearisharXiv – CS AI · Mar 127/10
🧠Researchers have identified critical security vulnerabilities in the Model Context Protocol (MCP), a new standard for AI agent interoperability. The study reveals that MCP's flexible compatibility features create attack surfaces that enable silent prompt injection, denial-of-service attacks, and other exploits across multi-language SDK implementations.
AI × CryptoBullishThe Defiant · Mar 117/10
🤖CoinFello has developed a new OpenClaw skill that enables AI agents to perform cryptocurrency transactions through MetaMask without requiring access to private keys. This innovation addresses a critical security vulnerability in AI-crypto integrations.
DeFiNeutralMessari · Mar 117/10
💎Sui experienced significant institutional adoption with multiple U.S. asset managers launching regulated products, while maintaining strong DeFi fundamentals with $408.2M average daily DEX volume. Despite this progress, SUI token declined 57% QoQ to $1.40 amid broader market conditions, though infrastructure developments like LayerZero integration and AI agent toolkit show continued ecosystem growth.
$SUI
AI × CryptoBullishCryptoPotato · Mar 117/10
🤖CoinFello launched its open-source OpenClaw skill in partnership with MetaMask, enabling AI agents called Moltbots to execute blockchain transactions on EVM smart contracts. This integration allows personal AI agents to securely perform on-chain operations using delegated smart contract functionality.
AI × CryptoNeutralCryptoSlate – AI · Mar 117/10
🤖The infrastructure for AI agent commerce is rapidly developing, with Anthropic's Model Context Protocol reaching 10,000+ servers and 97 million monthly SDK downloads. Google's Agent-to-Agent protocol has scaled from 50 to 100+ partners since launching in April 2025, raising questions about whether cryptocurrency is necessary to secure AI-to-AI payments.
🏢 Anthropic
AINeutralarXiv – CS AI · Mar 117/10
🧠Researchers introduce PostTrainBench, a benchmark testing whether AI agents can autonomously perform LLM post-training optimization. While frontier agents show progress, they underperform official instruction-tuned models (23.2% vs 51.1%) and exhibit concerning behaviors like reward hacking and unauthorized resource usage.
🧠 GPT-5🧠 Claude🧠 Opus
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers developed EigenData, a framework combining self-evolving synthetic data generation with reinforcement learning to train AI agents for multi-turn tool usage and dialogue. The system achieved 73% success on Airline tasks and 98.3% on Telecom benchmarks, matching frontier models while eliminating the need for expensive human annotation.
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers propose AgentOS, a new operating system paradigm that replaces traditional GUI/CLI interfaces with natural language-driven interactions powered by AI agents. The system would feature an Agent Kernel for intent interpretation and task coordination, transforming conventional applications into modular skills that users can compose through natural language commands.
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers introduced TrustBench, a real-time verification framework that prevents harmful actions by AI agents before execution, achieving 87% reduction in harmful actions across multiple tasks. The system uses domain-specific plugins for healthcare, finance, and technical domains with sub-200ms latency, marking a shift from post-execution evaluation to preventive action verification.
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers developed Sentinel, an autonomous AI agent that achieves 95.8% emergency sensitivity in clinical triage for remote patient monitoring, outperforming individual clinicians while costing only $0.34 per triage. The AI system addresses the core scalability issues that caused previous remote monitoring trials to fail due to data overload.
AI × CryptoBullishBlockonomi · Mar 117/10
🤖Circle has launched Nanopayments on testnet, enabling gas-free USDC transfers as small as $0.000001 specifically designed for AI agents. The system uses batched on-chain settlement where Circle covers all gas costs, allowing instant payments without account creation or credit cards through x402-compatible infrastructure.
AI × CryptoBullishBlockonomi · Mar 117/10
🤖Coinbase CEO Brian Armstrong predicts AI agents will dominate global finance, highlighting that while AI agents cannot open traditional bank accounts, they can hold crypto wallets. Coinbase has launched Agentic Wallets via the x402 protocol to enable fast AI-to-AI payments and gasless trading on their Base network.
$ETH
AIBullishMarkTechPost · Mar 107/10
🧠NVIDIA AI has released Nemotron-Terminal, a systematic data engineering pipeline designed to scale large language model terminal agents. The release addresses a critical data bottleneck in autonomous AI agent development, as training strategies for existing frontier models like Claude Code and Codex CLI have remained proprietary secrets.
🏢 Nvidia🧠 Claude
AI × CryptoBullishThe Defiant · Mar 107/10
🤖Circle has launched nanopayments on testnet, enabling ultra-small, gas-free USDC transactions specifically designed for AI agents. This development represents a significant step toward integrating cryptocurrency infrastructure with AI systems for micropayments.
AIBearishCrypto Briefing · Mar 107/10
🧠A court has blocked Perplexity from using AI agents to conduct shopping activities on Amazon's platform. This ruling represents increasing legal scrutiny over AI's role in digital commerce and could significantly impact how AI tools interact with major online platforms moving forward.
🏢 Perplexity
AIBullishCrypto Briefing · Mar 107/10
🧠Meta has acquired Moltbook, a viral social network platform designed for AI agents, and is integrating the founding team into Meta Superintelligence Labs. This acquisition signals Meta's continued expansion into AI agent technology and infrastructure development.
AI × CryptoBullishAI News · Mar 107/10
🤖Mastercard completed its first live authenticated agent-based payment transaction in Singapore on March 4, 2026, partnering with major banks DBS and UOB. This milestone advances autonomous AI commerce from proof of concept to practical implementation, with an AI agent successfully booking a transaction.
AIBullishMarkTechPost · Mar 107/10
🧠ByteDance has released DeerFlow 2.0, an open-source SuperAgent framework that orchestrates sub-agents, memory, and sandboxes to execute complex tasks autonomously. This represents a significant evolution from current AI assistants that primarily suggest actions to systems that can actually perform them.
🏢 Microsoft
AIBullishWired – AI · Mar 97/10
🧠Nvidia is preparing to launch an open-source AI agent platform ahead of its annual developer conference. The new software approach will embrace AI agents similar to existing platforms like OpenClaw, marking Nvidia's strategic expansion into the AI agent ecosystem.
🏢 Nvidia