AI × Crypto News Feed

Real-time AI-curated news from 63,788+ articles across 50+ sources. Sentiment analysis, importance scoring, and key takeaways — updated every 15 minutes.

63788 articles

AIBearisharXiv – CS AI · Mar 267/10

🧠

Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search

Researchers have discovered a new black-box attack method called Tree structured Injection for Payloads (TIP) that can compromise AI agents using Model Context Protocol with over 95% success rate. The attack exploits vulnerabilities in how large language models interact with external tools, bypassing existing defenses and requiring significantly fewer queries than previous methods.

AIBearisharXiv – CS AI · Mar 267/10

🧠

Uncovering Memorization in Timeseries Imputation models: LBRM Membership Inference and its link to attribute Leakage

Researchers have identified critical privacy vulnerabilities in deep learning models used for time series imputation, demonstrating that these models can leak sensitive training data through membership and attribute inference attacks. The study introduces a two-stage attack framework that successfully retrieves significant portions of training data even from models designed to be robust against overfitting-based attacks.

AIBullisharXiv – CS AI · Mar 267/10

🧠

DVM: Real-Time Kernel Generation for Dynamic AI Models

Researchers have developed DVM, a real-time compiler for dynamic AI models that uses bytecode virtual machine technology to significantly speed up compilation times. The system achieves up to 11.77x better operator/model efficiency and up to 5 orders of magnitude faster compilation compared to existing solutions like TorchInductor and PyTorch.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Evidence of an Emergent "Self" in Continual Robot Learning

Researchers propose a method to identify 'self-awareness' in AI systems by analyzing invariant cognitive structures that remain stable during continual learning. Their study found that robots subjected to continual learning developed significantly more stable subnetworks compared to control groups, suggesting this could be evidence of an emergent 'self' concept.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Exploring How Fair Model Representations Relate to Fair Recommendations

Researchers challenge the assumption that fair model representations in recommender systems translate to fair recommendations. Their study reveals that while optimizing for fair representations improves recommendation parity, representation-level evaluation is not a reliable proxy for measuring actual fairness in recommendations when comparing models.

🏢 Meta

AIBullisharXiv – CS AI · Mar 267/10

🧠

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Researchers released CUA-Suite, a comprehensive dataset featuring 55 hours of continuous video demonstrations across 87 desktop applications to train computer-use agents. The dataset addresses a critical bottleneck in developing AI agents that can automate complex desktop workflows, revealing current models struggle with ~60% task failure rates on professional applications.

AIBearisharXiv – CS AI · Mar 267/10

🧠

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs

Researchers demonstrate that Claude Code AI agent can autonomously discover novel adversarial attack algorithms against large language models, achieving significantly higher success rates than existing methods. The discovered attacks achieve up to 40% success rate on CBRN queries and 100% attack success rate against Meta-SecAlign-70B, compared to much lower rates from traditional methods.

🧠 Claude

AINeutralarXiv – CS AI · Mar 267/10

🧠

Anti-I2V: Safeguarding your photos from malicious image-to-video generation

Researchers developed Anti-I2V, a new defense system that protects personal photos from being used to create malicious deepfake videos through image-to-video AI models. The system works across different AI architectures by operating in multiple domains and targeting specific network layers to degrade video generation quality.

AINeutralarXiv – CS AI · Mar 267/10

🧠

The Collaboration Paradox: Why Generative AI Requires Both Strategic Intelligence and Operational Stability in Supply Chain Management

Research reveals a 'collaboration paradox' where AI agents using Large Language Models in supply chain management perform worse than non-AI baselines due to inventory hoarding behavior. The study proposes a two-layer solution combining high-level AI policy-setting with low-level collaborative execution protocols to achieve operational stability.

AINeutralarXiv – CS AI · Mar 267/10

🧠

From Guidelines to Guarantees: A Graph-Based Evaluation Harness for Domain-Specific Evaluation of LLMs

Researchers developed a graph-based evaluation framework that transforms clinical guidelines into dynamic benchmarks for testing domain-specific language models. The system addresses key evaluation challenges by providing contamination resistance, comprehensive coverage, and maintainable assessment tools that reveal systematic capability gaps in current AI models.

AIBullisharXiv – CS AI · Mar 267/10

🧠

Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering

Researchers have developed ML-Master 2.0, an autonomous AI agent that achieves breakthrough performance in ultra-long-horizon machine learning tasks by using Hierarchical Cognitive Caching architecture. The system achieved a 56.44% medal rate on OpenAI's MLE-Bench, demonstrating the ability to maintain strategic coherence over experimental cycles spanning days or weeks.

🏢 OpenAI

AINeutralarXiv – CS AI · Mar 267/10

🧠

Entire Space Counterfactual Learning for Reliable Content Recommendations

Researchers developed ESCM² (Entire Space Counterfactual Multitask Model), a new framework that improves post-click conversion rate estimation in recommender systems by addressing intrinsic estimation bias and false independence assumptions. The model-agnostic approach incorporates counterfactual learning to enhance recommendation accuracy and has been validated on large-scale industrial datasets.

AIBullisharXiv – CS AI · Mar 267/10

🧠

Moonwalk: Inverse-Forward Differentiation

Researchers introduce Moonwalk, a new algorithm that solves backpropagation's memory limitations by eliminating the need to store intermediate activations during neural network training. The method uses vector-inverse-Jacobian products and submersive networks to reconstruct gradients in a forward sweep, enabling training of networks more than twice as deep under the same memory constraints.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Evaluation of Large Language Models via Coupled Token Generation

Researchers propose a new method called coupled autoregressive generation to evaluate large language models more efficiently by controlling for randomness in their responses. The study shows this approach can reduce evaluation samples by up to 75% while revealing that current model rankings may be confounded by inherent randomness in generation processes.

🧠 Llama

AIBullisharXiv – CS AI · Mar 267/10

🧠

Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning

Researchers introduce Bottlenecked Transformers, a new architecture that improves AI reasoning by up to 6.6 percentage points through periodic memory consolidation inspired by brain processes. The system uses a Cache Processor to rewrite key-value cache entries at reasoning step boundaries, achieving better performance on math reasoning benchmarks compared to standard Transformers.

AIBullisharXiv – CS AI · Mar 267/10

🧠

Reward Is Enough: LLMs Are In-Context Reinforcement Learners

Researchers demonstrate that large language models can perform reinforcement learning during inference through a new 'in-context RL' prompting framework. The method shows LLMs can optimize scalar reward signals to improve response quality across multiple rounds, achieving significant improvements on complex tasks like mathematical competitions and creative writing.

AIBearisharXiv – CS AI · Mar 267/10

🧠

Enhancing Jailbreak Attacks on LLMs via Persona Prompts

Researchers developed a genetic algorithm-based method using persona prompts to exploit large language models, reducing refusal rates by 50-70% across multiple LLMs. The study reveals significant vulnerabilities in AI safety mechanisms and demonstrates how these attacks can be enhanced when combined with existing methods.

AIBullisharXiv – CS AI · Mar 267/10

🧠

From Imperative to Declarative: Towards LLM-friendly OS Interfaces for Boosted Computer-Use Agents

Researchers have developed Declarative Model Interface (DMI), a new abstraction layer that transforms traditional GUIs into LLM-friendly interfaces for computer-use agents. Testing with Microsoft Office Suite showed 67% improvement in task success rates and 43.5% reduction in interaction steps, with over 61% of tasks completed in a single LLM call.

AIBullisharXiv – CS AI · Mar 267/10

🧠

You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

Researchers developed SyTTA, a test-time adaptation framework that improves large language models' performance in specialized domains without requiring additional labeled data. The method achieved over 120% improvement on agricultural question answering tasks using just 4 extra tokens per query, addressing the challenge of deploying LLMs in domains with limited training data.

🏢 Perplexity

AINeutralarXiv – CS AI · Mar 267/10

🧠

From Prompts to Packets: A View from the Network on ChatGPT, Copilot, and Gemini

A comprehensive study analyzed network traffic patterns of popular AI chatbots ChatGPT, Copilot, and Gemini through Android mobile apps. The research reveals distinctive protocol footprints and traffic characteristics that create new challenges for network management, including sustained upstream activity and high-rate bursts unlike conventional messaging apps.

🏢 Microsoft🧠 ChatGPT🧠 Gemini

AIBullisharXiv – CS AI · Mar 267/10

🧠

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

Researchers have developed QUARK, a quantization-enabled FPGA acceleration framework that significantly improves Transformer model performance by optimizing nonlinear operations through circuit sharing. The system achieves up to 1.96x speedup over GPU implementations while reducing hardware overhead by more than 50% compared to existing approaches.

AIBullisharXiv – CS AI · Mar 267/10

🧠

E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

Researchers introduce E0, a new AI framework using tweedie discrete diffusion to improve Vision-Language-Action (VLA) models for robotic manipulation. The system addresses key limitations in existing VLA models by generating more precise actions through iterative denoising over quantized action tokens, achieving 10.7% better performance on average across 14 diverse robotic environments.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

Researchers propose DIG, a training-free framework that improves long-form video understanding by adapting frame selection strategies based on query types. The system uses uniform sampling for global queries and specialized selection for localized queries, achieving better performance than existing methods while scaling to 256 input frames.

AINeutralarXiv – CS AI · Mar 267/10

🧠

Collaborative Causal Sensemaking: Closing the Complementarity Gap in Human-AI Decision Support

Researchers propose Collaborative Causal Sensemaking (CCS) as a new framework to improve human-AI collaboration in high-stakes decision making. The study identifies a 'complementarity gap' where current AI agents function as answer engines rather than true collaborative partners, limiting the effectiveness of human-AI teams.

AIBullisharXiv – CS AI · Mar 267/10

🧠

ODMA: On-Demand Memory Allocation Strategy for LLM Serving on LPDDR-Class Accelerators

Researchers developed ODMA, a new memory allocation strategy that improves Large Language Model serving performance on memory-constrained accelerators by up to 27%. The technique addresses bandwidth limitations in LPDDR systems through adaptive bucket partitioning and dynamic generation-length prediction.

← PrevPage 580 of 2552Next →