AI × Crypto News Feed

Real-time AI-curated news from 58,780+ articles across 50+ sources. Sentiment analysis, importance scoring, and key takeaways — updated every 15 minutes.

58780 articles

AIBullisharXiv – CS AI · Apr 77/10

🧠

Evolutionary Search for Automated Design of Uncertainty Quantification Methods

Researchers developed an LLM-powered evolutionary search method to automatically design uncertainty quantification systems for large language models, achieving up to 6.7% improvement in performance over manual designs. The study found that different AI models employ distinct evolutionary strategies, with some favoring complex linear estimators while others prefer simpler positional weighting approaches.

🧠 Claude🧠 Sonnet🧠 Opus

AIBullisharXiv – CS AI · Apr 77/10

🧠

Zero-Shot Quantization via Weight-Space Arithmetic

Researchers have developed a zero-shot quantization method that transfers robustness between AI models through weight-space arithmetic, improving post-training quantization performance by up to 60% without requiring additional training. This breakthrough enables low-cost deployment of extremely low-bit models by extracting 'quantization vectors' from donor models to patch receiver models.

AIBullisharXiv – CS AI · Apr 77/10

🧠

V-Reflection: Transforming MLLMs from Passive Observers to Active Interrogators

Researchers introduce V-Reflection, a new framework that transforms Multimodal Large Language Models (MLLMs) from passive observers to active interrogators through a 'think-then-look' mechanism. The approach addresses perception-related hallucinations in fine-grained tasks by allowing models to dynamically re-examine visual details during reasoning, showing significant improvements across six perception-intensive benchmarks.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Customized User Plane Processing via Code Generating AI Agents for Next Generation Mobile Networks

Researchers propose using generative AI agents to create customized user plane processing blocks for 6G mobile networks based on text-based service requests. The study evaluates factors affecting AI code generation accuracy for network-specific tasks, finding that AI agents can successfully generate desired processing functions under suitable conditions.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Scaling Teams or Scaling Time? Memory Enabled Lifelong Learning in LLM Multi-Agent Systems

Researchers introduce LLMA-Mem, a memory framework for LLM multi-agent systems that balances team size with lifelong learning capabilities. The study reveals that larger agent teams don't always perform better long-term, and smaller teams with better memory design can outperform larger ones while reducing costs.

AIBullisharXiv – CS AI · Apr 77/10

🧠

SoLA: Leveraging Soft Activation Sparsity and Low-Rank Decomposition for Large Language Model Compression

Researchers propose SoLA, a training-free compression method for large language models that combines soft activation sparsity and low-rank decomposition. The method achieves significant compression while improving performance, demonstrating 30% compression on LLaMA-2-70B with reduced perplexity from 6.95 to 4.44 and 10% better downstream task accuracy.

🏢 Perplexity

AIBearisharXiv – CS AI · Apr 77/10

🧠

Artificial Intelligence and Systemic Risk: A Unified Model of Performative Prediction, Algorithmic Herding, and Cognitive Dependency in Financial Markets

A new unified model demonstrates that AI adoption in financial markets creates systemic risk through three channels: performative prediction, algorithmic herding, and cognitive dependency. Using SEC Form 13F data from 2013-2024, researchers found AI adoption generates superlinear growth in systemic risk and tail-loss amplification of 18-54%.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Robust LLM Performance Certification via Constrained Maximum Likelihood Estimation

Researchers propose a new constrained maximum likelihood estimation (MLE) method to accurately estimate failure rates of large language models by combining human-labeled data, automated judge annotations, and domain-specific constraints. The approach outperforms existing methods like Prediction-Powered Inference across various experimental conditions, providing a more reliable framework for LLM safety certification.

AINeutralarXiv – CS AI · Apr 77/10

🧠

Is your AI Model Accurate Enough? The Difficult Choices Behind Rigorous AI Development and the EU AI Act

A research paper challenges the common view of AI accuracy as purely technical, arguing it involves context-dependent normative decisions that determine error priorities and risk distribution. The study analyzes the EU AI Act's "appropriate accuracy" requirements and identifies four critical choices in performance evaluation that embed assumptions about acceptable trade-offs.

AIBullisharXiv – CS AI · Apr 77/10

🧠

MemMachine: A Ground-Truth-Preserving Memory System for Personalized AI Agents

MemMachine is an open-source memory system for AI agents that preserves conversational ground truth and achieves superior accuracy-efficiency tradeoffs compared to existing solutions. The system integrates short-term, long-term episodic, and profile memory while using 80% fewer input tokens than comparable systems like Mem0.

🧠 GPT-4🧠 GPT-5

AIBearisharXiv – CS AI · Apr 77/10

🧠

AI Assistance Reduces Persistence and Hurts Independent Performance

A new study of 1,222 participants found that AI assistance, while improving short-term performance, significantly reduces human persistence and impairs independent performance after only brief 10-minute interactions. The research suggests current AI systems act as short-sighted collaborators that condition users to expect immediate answers, potentially undermining long-term skill acquisition and learning.

AINeutralarXiv – CS AI · Apr 77/10

🧠

AI Trust OS -- A Continuous Governance Framework for Autonomous AI Observability and Zero-Trust Compliance in Enterprise Environments

Researchers propose AI Trust OS, a new governance framework that uses continuous telemetry and automated probes to discover and monitor AI systems across enterprise environments. The system addresses compliance gaps in AI governance by shifting from manual attestation to autonomous observability, automatically registering undocumented AI systems through telemetry analysis.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Springdrift: An Auditable Persistent Runtime for LLM Agents with Case-Based Memory, Normative Safety, and Ambient Self-Perception

Researchers have developed Springdrift, a persistent runtime system for long-lived AI agents that maintains memory across sessions and provides auditable decision-making capabilities. The system was successfully deployed for 23 days, during which the AI agent autonomously diagnosed infrastructure problems and maintained context across multiple communication channels without explicit instructions.

AIBearisharXiv – CS AI · Apr 77/10

🧠

Incompleteness of AI Safety Verification via Kolmogorov Complexity

Researchers prove a fundamental theoretical limit in AI safety verification using Kolmogorov complexity theory. They demonstrate that no finite formal verifier can certify all policy-compliant AI instances of arbitrarily high complexity, revealing intrinsic information-theoretic barriers beyond computational constraints.

AINeutralarXiv – CS AI · Apr 77/10

🧠

The Topology of Multimodal Fusion: Why Current Architectures Fail at Creative Cognition

Researchers identify a fundamental topological limitation in current multimodal AI architectures like CLIP and GPT-4V, proposing that their 'contact topology' structure prevents creative cognition. The paper introduces a philosophical framework combining Chinese epistemology with neuroscience to propose new architectures using Neural ODEs and topological regularization.

🧠 Gemini

AINeutralarXiv – CS AI · Apr 77/10

🧠

Gradual Cognitive Externalization: A Framework for Understanding How Ambient Intelligence Externalizes Human Cognition

Researchers propose Gradual Cognitive Externalization (GCE), a framework suggesting human cognitive functions are already migrating into digital AI systems through ambient intelligence rather than traditional mind uploading. The study identifies evidence in scheduling assistants, writing tools, and AI agents that cognitive externalization is occurring now through bidirectional adaptation and functional equivalence.

AINeutralarXiv – CS AI · Apr 77/10

🧠

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Researchers have identified a new class of supply-chain threats targeting AI agents through malicious third-party tools and MCP servers. They've created SC-Inject-Bench, a benchmark with over 10,000 malicious tools, and developed ShieldNet, a network-level security framework that achieves 99.5% detection accuracy with minimal false positives.

AIBullisharXiv – CS AI · Apr 77/10

🧠

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

Researchers developed QED-Nano, a 4B parameter AI model that achieves competitive performance on Olympiad-level mathematical proofs despite being much smaller than proprietary systems. The model uses a three-stage training approach including supervised fine-tuning, reinforcement learning, and reasoning cache expansion to match larger models at a fraction of the inference cost.

🧠 Gemini

AINeutralarXiv – CS AI · Apr 77/10

🧠

The Persuasion Paradox: When LLM Explanations Fail to Improve Human-AI Team Performance

Research reveals a 'Persuasion Paradox' where LLM explanations increase user confidence but don't reliably improve human-AI team performance, and can actually undermine task accuracy. The study found that explanation effectiveness varies significantly by task type, with visual reasoning tasks seeing decreased error recovery while logical reasoning tasks benefited from explanations.

AIBullisharXiv – CS AI · Apr 77/10

🧠

LLMs-Healthcare : Current Applications and Challenges of Large Language Models in various Medical Specialties

A comprehensive research review examines the current applications of Large Language Models (LLMs) across various healthcare specialties including cancer care, dermatology, dental care, neurodegenerative disorders, and mental health. The study highlights LLMs' transformative impact on medical diagnostics and patient care while acknowledging existing challenges and limitations in healthcare integration.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Combee: Scaling Prompt Learning for Self-Improving Language Model Agents

Researchers have developed Combee, a new framework that enables parallel prompt learning for AI language model agents, achieving up to 17x speedup over existing methods. The system allows multiple AI agents to learn simultaneously from their collective experiences without quality degradation, addressing scalability limitations in current single-agent approaches.

AI × CryptoBullisharXiv – CS AI · Apr 77/10

🤖

Quantifying Trust: Financial Risk Management for Trustworthy AI Agents

Researchers introduce the Agentic Risk Standard (ARS), a payment settlement framework for AI-mediated transactions that provides contractual compensation for agent failures. The standard shifts trust from implicit model behavior expectations to explicit, measurable guarantees through financial risk management principles.

AIBullisharXiv – CS AI · Apr 77/10

🧠

Readable Minds: Emergent Theory-of-Mind-Like Behavior in LLM Poker Agents

Research published on arXiv demonstrates that large language models playing poker can develop sophisticated Theory of Mind capabilities when equipped with persistent memory, progressing to advanced levels of opponent modeling and strategic deception. The study found memory is necessary and sufficient for this emergent behavior, while domain expertise enhances but doesn't gate ToM development.

🧠 GPT-4

AIBearisharXiv – CS AI · Apr 77/10

🧠

Comparative reversal learning reveals rigid adaptation in LLMs under non-stationary uncertainty

Research reveals that large language models like DeepSeek-V3.2, Gemini-3, and GPT-5.2 show rigid adaptation patterns when learning from changing environments, particularly struggling with loss-based learning compared to humans. The study found LLMs demonstrate asymmetric responses to positive versus negative feedback, with some models showing extreme perseveration after environmental changes.

🧠 GPT-5🧠 Gemini

AIBullisharXiv – CS AI · Apr 77/10

🧠

Beyond Retrieval: Modeling Confidence Decay and Deterministic Agentic Platforms in Generative Engine Optimization

Researchers propose a new approach to Generative Engine Optimization (GEO) that moves beyond current RAG-based systems to deterministic multi-agent platforms. The study introduces mathematical models for confidence decay in LLMs and demonstrates near-zero hallucination rates through specialized agent routing in industrial applications.

← PrevPage 503 of 2352Next →