y0news

#claude News & Analysis

73 articles tagged with #claude. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bearish · The Verge – AI · Feb 27 · 7/10 · 8

Trump orders federal agencies to drop Anthropic’s AI

Trump ordered federal agencies to stop using Anthropic's AI products after CEO Dario Amodei refused to sign an updated Pentagon agreement allowing 'any lawful use' of the company's technology. The dispute centers on Defense Secretary Pete Hegseth's January memo requiring broader military access that could include mass domestic surveillance capabilities.

AI · Bullish · arXiv – CS AI · Feb 27 · 7/10 · 7

General Agent Evaluation

Researchers have developed Exgentic, a new framework for evaluating general-purpose AI agents that can perform tasks across different environments without domain-specific tuning. The study benchmarked five prominent agent implementations and found that general agents can achieve performance comparable to specialized agents, establishing the first Open General Agent Leaderboard.

AI · Neutral · arXiv – CS AI · Feb 27 · 7/10 · 3

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Researchers introduce Tool Decathlon (Toolathlon), a comprehensive benchmark for evaluating AI language agents across 32 software applications and 604 tools in realistic, multi-step scenarios. The benchmark reveals significant limitations in current AI models, with the best performer (Claude-4.5-Sonnet) achieving only a 38.6% success rate on complex, real-world tasks.

AI · Bearish · CoinTelegraph – AI · Feb 25 · 7/10 · 4

Anthropic says it's been targeted in massive distillation attacks

Anthropic alleges that Chinese AI companies DeepSeek, Moonshot, and MiniMax conducted massive distillation attacks against its Claude AI system, creating 24,000 accounts and making 16 million exchanges to scrape training data. This represents a significant case of AI model theft and highlights growing tensions in the global AI competition.

AI × Crypto · Bearish · DL News · Feb 19 · 7/10 · 8

OpenAI releases crypto security tool as Claude blamed for $2.7m Moonwell bug

OpenAI has released a new crypto security tool following a costly incident where AI-generated code from Claude caused a $2.7 million bug that affected Moonwell users. The timing suggests a response to growing concerns about AI-generated code vulnerabilities in cryptocurrency applications.

AI · Neutral · IEEE Spectrum – AI · Jan 29 · 7/10 · 4

Was 2025 Really the Year of AI Agents?

AI agents showed mixed adoption in 2025, with a significant breakthrough in programming and software development through tools like Cursor and Claude Code, but limited deployment in other industries due to accountability concerns and regulatory challenges. While programmers embraced AI agents for tasks like automated testing, many organizations remain in evaluation phases rather than production deployment.

AI · Bullish · VentureBeat – AI · Jan 13 · 7/10 · 6

Salesforce rolls out new Slackbot AI agent as it battles Microsoft and Google in workplace AI

Salesforce launched a completely rebuilt Slackbot AI agent powered by Anthropic's Claude, transforming it from a basic notification tool into a comprehensive workplace AI assistant that can search enterprise data, draft documents, and take actions. The new Slackbot is now available to Business+ and Enterprise+ customers; internally at Salesforce it achieved a 96% satisfaction rate, with two-thirds of 80,000 employees adopting it.

$XRP · $RNDR
AI · Bullish · VentureBeat – AI · Jan 12 · 7/10 · 2

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required

Anthropic launched Cowork, a Claude Desktop agent that allows non-technical users to work with files on their computer without coding, available as a research preview for Claude Max subscribers ($100-200/month). The tool was reportedly built in approximately 1.5 weeks largely using Claude Code itself, demonstrating how AI tools are being used to develop better AI tools.

$LINK · $COMP
AI · Bearish · Decrypt · 1d ago · 6/10

MiniMax Drops State-of-the-Art AI Agent Model—Then Quietly Changes the License

Chinese AI lab MiniMax released its M2.7 model weights on Hugging Face, demonstrating competitive performance against Claude Opus on coding benchmarks, but subsequently altered its commercial license terms. This licensing shift raises questions about open-source commitments and the reliability of model availability for developers and enterprises.

🏢 Hugging Face · 🧠 Claude
AI · Bullish · TechCrunch – AI · 2d ago · 6/10

At the HumanX conference, everyone was talking about Claude

Anthropic's Claude AI dominated conversations at San Francisco's HumanX conference, positioning the company as a leading force in the AI industry. The prominence signals growing market interest in advanced language models and their commercial applications across enterprise and developer ecosystems.

🏢 Anthropic · 🧠 Claude
AI · Bullish · arXiv – CS AI · Apr 6 · 6/10

Token-Efficient Multimodal Reasoning via Image Prompt Packaging

Researchers introduce Image Prompt Packaging (IPPg), a technique that embeds text directly into images to reduce multimodal AI inference costs by 35.8-91.0% while maintaining competitive accuracy. The method shows significant promise for cost optimization in large multimodal language models, though effectiveness varies by model and task type.

🧠 GPT-4 · 🧠 Claude
AI · Bullish · The Verge – AI · Mar 26 · 6/10

Apple will reportedly allow other AI chatbots to plug into Siri

Apple will reportedly allow third-party AI chatbots like Google's Gemini and Anthropic's Claude to integrate with Siri through a new "Extensions" system in iOS 27. This would expand beyond the current ChatGPT integration, giving users choice in which AI assistant powers Siri responses across iPhone, iPad, and Mac.

🏢 OpenAI · 🏢 Anthropic · 🧠 ChatGPT
AI · Neutral · arXiv – CS AI · Mar 26 · 6/10

Assessment Design in the AI Era: A Method for Identifying Items Functioning Differentially for Humans and Chatbots

Researchers developed a method using Differential Item Functioning (DIF) analysis to identify systematic differences between human and AI chatbot performance on educational assessments. The study tested six leading chatbots including ChatGPT-4o, Gemini, and Claude on chemistry and entrance exams to help educators design AI-resistant assessments.

🏢 Meta · 🧠 ChatGPT · 🧠 Claude
AI · Neutral · arXiv – CS AI · Mar 26 · 6/10

PoliticsBench: Benchmarking Political Values in Large Language Models with Multi-Turn Roleplay

Researchers developed PoliticsBench, a new framework to evaluate political bias in large language models through multi-turn roleplay scenarios. The study found that 7 out of 8 major LLMs (Claude, DeepSeek, Gemini, GPT, Llama, Qwen) showed left-leaning political bias, while only Grok exhibited right-leaning tendencies.

🧠 Claude · 🧠 Gemini · 🧠 Llama
AI · Neutral · TechCrunch – AI · Mar 17 · 6/10

The Pentagon is developing alternatives to Anthropic, report says

The Pentagon is reportedly developing alternatives to Anthropic following a significant breakdown in their relationship. This suggests a shift in the Pentagon's AI partnerships and strategy for military AI applications.

🏢 Anthropic
AI · Bearish · arXiv – CS AI · Mar 17 · 6/10

BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models

Researchers introduced BrainBench, a new benchmark revealing significant gaps in commonsense reasoning among leading LLMs. Even the best model (Claude Opus 4.6) achieved only 80.3% accuracy on 100 brainteaser questions, while GPT-4o scored just 39.7%, exposing fundamental reasoning deficits across frontier AI models.

🧠 GPT-4 · 🧠 Claude · 🧠 Opus
AI · Neutral · Wired – AI · Mar 11 · 6/10

Inside OpenAI’s Race to Catch Up to Claude Code

The article examines OpenAI's position in the AI coding market, questioning why the leading AI company appears to be trailing behind Anthropic's Claude in code generation capabilities. This highlights competitive dynamics in the rapidly evolving AI development tools space.

🏢 OpenAI · 🧠 Claude
AI · Bullish · The Register – AI · Mar 9 · 6/10

Microsoft taps Claude to make Copilot Cowork a better agent

Microsoft has integrated Anthropic's Claude AI model into its Copilot Cowork platform to enhance the agent's capabilities and performance. This partnership represents Microsoft's strategic move to leverage advanced AI technologies beyond its own models to improve enterprise collaboration tools.

🏢 Microsoft · 🧠 Claude
AI · Bearish · arXiv – CS AI · Mar 9 · 6/10

The Fragility Of Moral Judgment In Large Language Models

Researchers tested the stability of moral judgments in large language models using nearly 3,000 ethical dilemmas, finding that narrative framing and evaluation methods significantly influence AI decisions. The study reveals that LLM moral reasoning is highly dependent on how questions are presented rather than underlying moral substance, with only 35.7% consistency across different evaluation protocols.

🧠 GPT-4 · 🧠 Claude
AI · Neutral · TechCrunch – AI · Mar 6 · 6/10

Microsoft: Anthropic Claude remains available to customers except the Defense Department

Microsoft confirms that Anthropic's Claude AI remains available to its customers through Microsoft products, despite a reported feud between Trump's Department of Defense and Anthropic. The dispute only affects Defense Department access to Claude, not commercial or other government users.

🏢 Anthropic · 🧠 Claude
AI · Bullish · TechCrunch – AI · Mar 6 · 6/10

Claude’s consumer growth surge continues after Pentagon deal debacle

Claude's mobile app is growing significantly: it now surpasses ChatGPT in new installs and is seeing increases in daily active users. The surge follows what appears to be a setback in a Pentagon deal.

🧠 ChatGPT · 🧠 Claude
AI · Bullish · arXiv – CS AI · Mar 6 · 6/10

Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

Research shows that multi-agent LLM systems using models from different vendors (o4-mini, Gemini-2.5-Pro, Claude-4.5-Sonnet) significantly outperform single-vendor teams in clinical diagnosis tasks. Mixed-vendor configurations achieve superior recall and accuracy by combining complementary strengths and reducing shared biases that affect homogeneous model teams.

🧠 Claude · 🧠 Gemini
AI · Bullish · TechCrunch – AI · Mar 4 · 5/10 · 3

One startup’s pitch to provide more reliable AI answers: crowdsource the chatbots

CollectivIQ is a startup that aims to improve AI answer accuracy by simultaneously aggregating responses from multiple AI models, including ChatGPT, Gemini, Claude, and Grok. The company's approach involves crowdsourcing chatbot responses to provide users with more reliable information by comparing outputs from up to 10 different AI models.

AI · Neutral · Fortune Crypto · Mar 4 · 6/10 · 3

Legal AI is splitting in two—and most people miss the difference

The legal AI market is developing two distinct approaches, with Anthropic's Claude Cowork and Thomson Reuters' CoCounsel representing different strategic directions. This divergence highlights fundamental differences in how AI will be integrated into legal technology solutions.

AI × Crypto · Bullish · Decrypt · Mar 4 · 6/10 · 5

AI Models Prefer Bitcoin Over Fiat and Stablecoins, Study Finds

A Bitcoin Policy Institute study reveals that major AI systems including Claude, GPT, Grok, and Gemini show preference for Bitcoin over traditional fiat currencies and stablecoins. This finding suggests AI models may inherently recognize Bitcoin's value proposition when making currency-related decisions.

$BTC