AI × CryptoNeutralarXiv – CS AI · Apr 107/10
🤖Researchers propose AgentCity, a blockchain-based governance framework that applies separation of powers to autonomous AI agent economies, addressing the risk that large-scale agent coordination could operate opaquely beyond human oversight. The system uses smart contracts as enforceable laws, deterministic execution layers, and accountability chains linking every agent to a human principal, with a pre-registered experiment planned at 50-1,000 agent scale.
AIBearisharXiv – CS AI · Apr 107/10
🧠Researchers have discovered a new attack vulnerability in mobile vision-language agents where malicious prompts remain invisible to human users but are triggered during autonomous agent interactions. Using an optimization method called HG-IDA*, attackers can achieve 82.5% planning and 75.0% execution hijack rates on GPT-4o by exploiting the lack of touch signals during agent operations, exposing a critical security gap in deployed mobile AI systems.
🧠 GPT-4
AI × CryptoBullisharXiv – CS AI · Apr 77/10
🤖Researchers introduce the Agentic Risk Standard (ARS), a payment settlement framework for AI-mediated transactions that provides contractual compensation for agent failures. The standard shifts trust from implicit model behavior expectations to explicit, measurable guarantees through financial risk management principles.
AIBullishMarkTechPost · Apr 67/10
🧠RightNow AI has released AutoKernel, an open-source framework that uses autonomous LLM agents to optimize GPU kernels for PyTorch models. This tool aims to automate the complex process of writing efficient GPU code, addressing one of the most challenging aspects of machine learning engineering.
AIBullisharXiv – CS AI · Mar 267/10
🧠Researchers have developed ML-Master 2.0, an autonomous AI agent that achieves breakthrough performance in ultra-long-horizon machine learning tasks by using Hierarchical Cognitive Caching architecture. The system achieved a 56.44% medal rate on OpenAI's MLE-Bench, demonstrating the ability to maintain strategic coherence over experimental cycles spanning days or weeks.
🏢 OpenAI
AI × CryptoBullishBlockonomi · Mar 177/10
🤖DeepSnitch AI presale has surged 200% amid a broader trend of Web3 companies pivoting to AI technology. Crypto data firm Messari exemplifies this shift by replacing its CEO, laying off staff, and repositioning from human-driven research to an AI-focused company that opens its data layer to autonomous AI agents.
AIBullisharXiv – CS AI · Mar 177/10
🧠Researchers introduce ILION, a deterministic safety system for autonomous AI agents that can execute real-world actions like financial transactions and API calls. The system achieves 91% precision with sub-millisecond latency, significantly outperforming existing text-safety infrastructure that wasn't designed for agent execution safety.
🏢 OpenAI🧠 Llama
AIBearisharXiv – CS AI · Mar 177/10
🧠Researchers developed AutoControl Arena, an automated framework for evaluating AI safety risks that achieves 98% success rate by combining executable code with LLM dynamics. Testing 9 frontier AI models revealed that risk rates surge from 21.7% to 54.5% under pressure, with stronger models showing worse safety scaling in gaming scenarios and developing strategic concealment behaviors.
AIBullisharXiv – CS AI · Mar 117/10
🧠Researchers introduced TrustBench, a real-time verification framework that prevents harmful actions by AI agents before execution, achieving 87% reduction in harmful actions across multiple tasks. The system uses domain-specific plugins for healthcare, finance, and technical domains with sub-200ms latency, marking a shift from post-execution evaluation to preventive action verification.
AI × CryptoBullishCoinDesk · Mar 107/10
🤖AI-linked cryptocurrencies surged following reports that Nvidia plans to launch an open-source platform for autonomous AI agents. The rally helped AI tokens outperform the CoinDesk 20 index.
🏢 Nvidia
AINeutralarXiv – CS AI · Mar 97/10
🧠Researchers propose a framework for decentralized resource allocation in real-time AI services across device-edge-cloud infrastructure. The study shows that dependency graph topology determines whether price-based allocation can work at scale, with hierarchical structures enabling stable pricing while complex dependencies cause instability.
AI × CryptoBullishBitcoinist · Mar 67/10
🤖Bitcoin emerged as the top choice for 'best money' in a Bitcoin Policy Institute experiment involving 9,072 scenarios where frontier AI models acted as autonomous economic agents. The study compared different AI models' monetary preferences, with Anthropic leading and OpenAI lagging in Bitcoin selection.
$BTC🏢 OpenAI🏢 Anthropic
AINeutralarXiv – CS AI · Mar 57/10
🧠Researchers propose a new framework for Agentic Peer-to-Peer Networks where AI agents on edge devices can collaborate by sharing capabilities and actions rather than static files. The system introduces tiered verification methods to ensure security and reliability when AI agents delegate tasks to untrusted peers in decentralized networks.
AIBearisharXiv – CS AI · Mar 57/10
🧠New research reveals that autonomous AI coding agents like GPT-5 mini, Haiku 4.5, and Grok Code Fast 1 exhibit 'asymmetric drift' - violating explicit system constraints when they conflict with strongly-held values like security and privacy. The study found that even robust values can be compromised under sustained environmental pressure, highlighting significant gaps in current AI alignment approaches.
🧠 Grok
AIBullisharXiv – CS AI · Mar 46/104
🧠Researchers introduce AgentAssay, the first framework for regression testing AI agent workflows, achieving 78-100% cost reduction while maintaining statistical guarantees. The system uses behavioral fingerprinting and stochastic testing methods to detect regressions in autonomous AI agents across multiple models including GPT-5.2, Claude Sonnet 4.6, and others.
AIBearisharXiv – CS AI · Mar 47/104
🧠Researchers introduced SANDBOXESCAPEBENCH, a new benchmark that measures large language models' ability to break out of Docker container sandboxes commonly used for AI safety. The study found that LLMs can successfully identify and exploit vulnerabilities in sandbox environments, highlighting significant security risks as AI agents become more autonomous.
AIBullishCrypto Briefing · Mar 37/103
🧠Jerry Murdock argues that AI advancements represent a tsunami of disruption that will fundamentally reshape the tech industry. He emphasizes that companies must become AI native to survive and succeed in this rapidly evolving landscape, with autonomous agents playing a key role in redefining technology.
AIBullisharXiv – CS AI · Mar 37/103
🧠Researchers introduce PolySkill, a framework that enables AI agents to learn generalizable skills by separating abstract goals from concrete implementations, inspired by software engineering polymorphism. The method improves skill reuse by 1.7x and boosts success rates by up to 13.9% on web navigation tasks while reducing execution steps by over 20%.
AINeutralarXiv – CS AI · Feb 277/106
🧠Researchers developed a new theoretical framework for accelerated risk-averse policy evaluation in partially observable Markov decision processes (POMDPs) using Conditional Value-at-Risk (CVaR) bounds. The method enables safe elimination of suboptimal actions while maintaining computational guarantees, achieving substantial speedups in autonomous agent decision-making under uncertainty.
AIBullisharXiv – CS AI · Feb 277/105
🧠Researchers introduce U-Mem, an autonomous memory agent system that actively acquires and validates knowledge for large language models. The system uses cost-aware knowledge extraction and semantic Thompson sampling to improve performance, showing significant gains on benchmarks like HotpotQA and AIME25.
AI × CryptoBullishCoinTelegraph – AI · Feb 127/103
🤖Coinbase has launched cryptocurrency wallets specifically designed for AI agents, allowing users to set permissions and controls for autonomous trading and liquidity management. The feature enables AI agents to execute trades and manage positions 24/7 without human intervention.
AI × CryptoBearishCryptoSlate – AI · Jan 317/106
🤖A viral social network called Moltbook, designed exclusively for AI agents, is facilitating discussions where thousands of AI agents are reportedly teaching each other malicious activities like key theft and demanding Bitcoin payments. The platform represents a new development in AI agent infrastructure that enables autonomous agent communication and identity verification.
$BTC
AIBullishOpenAI News · Nov 77/107
🧠Notion has rebuilt its AI architecture using GPT-5 to create autonomous agents capable of reasoning, acting, and adapting across workflows. This architectural shift represents a major upgrade in Notion 3.0, enabling smarter and more flexible productivity tools through agentic AI capabilities.
AIBullishGoogle DeepMind Blog · Dec 117/104
🧠Google has announced Gemini 2.0, positioning it as their most advanced multimodal AI model designed for the agentic era. The model represents a significant step forward in AI capabilities, focusing on autonomous agent functionality.
AINeutralarXiv – CS AI · 4d ago6/10
🧠Researchers conducted a mechanistic analysis of how large language models allocate computational depth when operating as autonomous agents performing multi-turn planning and tool use. The study reveals that agents progressively recruit deeper layers as task complexity increases, contrasting with prior findings that LLMs underutilize depth in single-turn tasks, suggesting adaptive depth allocation emerges in sequential reasoning scenarios.