y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#machine-learning News & Analysis

Coverage of #machine-learning spans 2,608 indexed articles, with 262 pieces published in the last month. Recent discussion shows 55.7% bullish sentiment, though this represents a 5.3 percentage point decline from the previous quarter, suggesting a modest cooling in tone. Research publications dominate the discourse, particularly through arXiv's computer science and AI sections, while conversations frequently center on models and platforms including Llama, Meta, and Gemini. Related coverage tends to intersect with #research, #ai-research, and #llm discussions. Scan the article list below to explore the latest developments and perspectives.

sentiment · last 30d (262 articles) · -5.3pp bullish vs prior 90d
Top sources:arXiv – CS AI · 1922Apple Machine Learning · 14Crypto Briefing · 10MarkTechPost · 8Hugging Face Blog · 6
Most-discussed entities:Llama · 23Meta · 17Gemini · 15GPT-4 · 14GPT-5 · 13
3129 articles
AIBullisharXiv – CS AI · May 127/10
🧠

Uncovering Intra-expert Activation Sparsity for Efficient Mixture-of-Expert Model Execution

Researchers demonstrate that Mixture of Experts (MoE) models contain substantial underutilized sparsity within individual experts that can be exploited without modifying model parameters. By implementing intra-expert activation sparsity in vLLM, they achieve up to 2.5x speedup in MoE layer execution, offering a practical optimization path for efficient large language model deployment.

AIBullisharXiv – CS AI · May 127/10
🧠

SkillEvolver: Skill Learning as a Meta-Skill

SkillEvolver introduces a meta-learning framework that automatically improves AI agent skills through iterative refinement based on real-world deployment failures, achieving 56.8% accuracy on benchmark tasks compared to 43.6% for manually curated skills. The system learns by modifying skill prose and code rather than model weights, enabling seamless integration with any compatible agent without retraining.

AIBearishDecrypt · May 117/10
🧠

Hackers Used AI to Build a Zero-Day Exploit That Bypasses Two-Factor Authentication: Google

Google's threat intelligence team confirmed that cybercriminals have successfully used AI models to discover and exploit a previously unknown zero-day vulnerability that bypasses two-factor authentication. This represents a significant escalation in attack sophistication, demonstrating how AI tools are being weaponized to automate vulnerability discovery and exploitation at scale.

Hackers Used AI to Build a Zero-Day Exploit That Bypasses Two-Factor Authentication: Google
AI × CryptoBullishThe Block · May 117/10
🤖

Binance says AI security systems prevented $10.5 billion in user losses from crypto scams, phishing

Binance reports that its AI-powered security systems prevented $10.5 billion in potential user losses from crypto scams and phishing attacks, with $1.98 billion protected in Q1 2026 alone across 22.9 million attempted fraud incidents. This announcement underscores the growing sophistication of security threats in cryptocurrency exchanges and the increasing reliance on machine learning to combat fraud at scale.

Binance says AI security systems prevented $10.5 billion in user losses from crypto scams, phishing
AI × CryptoBullishDecrypt · May 117/10
🤖

Binance Says AI Defenses Blocked $10.5 Billion in Crypto Fraud Over 15 Months

Binance has deployed over 100 AI models to combat a rising tide of AI-powered cryptocurrency scams, successfully blocking $10.5 billion in fraudulent activity over 15 months. This defensive measure highlights the escalating sophistication of attacks in crypto markets and the critical role of machine learning in protecting user assets at scale.

Binance Says AI Defenses Blocked $10.5 Billion in Crypto Fraud Over 15 Months
AI × CryptoBullishcrypto.news · May 117/10
🤖

Binance leans on 100+ AI models to block $10.53B in risky funds

Binance has deployed over 100 AI models across 24+ security initiatives to block $10.53B in risky funds between 2025 and Q1 2026, repositioning artificial intelligence as foundational security infrastructure rather than a supplementary feature. This integrated approach demonstrates how major exchanges are leveraging machine learning at scale to combat financial crime and protect user assets.

Binance leans on 100+ AI models to block $10.53B in risky funds
AIBullisharXiv – CS AI · May 117/10
🧠

ATHENA: Agentic Team for Hierarchical Evolutionary Numerical Algorithms

ATHENA is an autonomous AI framework that automates scientific computing and machine learning research by autonomously selecting mathematical approaches, generating code, and iteratively improving solutions through a contextual bandit learning process. The system achieves validation errors as low as 10^-14 and demonstrates performance surpassing traditional foundation models in solving complex multiphysics problems.

AIBullisharXiv – CS AI · May 117/10
🧠

Efficient Data Selection for Multimodal Models via Incremental Optimization Utility

Researchers introduce One-Step-Train (OST), a new data selection framework for Large Multimodal Models that uses incremental optimization to identify high-quality training samples. The method reduces computational costs by 43% while outperforming existing approaches like LLM-as-a-Judge, demonstrating significant efficiency gains in multimodal model training.

AIBullisharXiv – CS AI · May 117/10
🧠

Confidence-Aware Alignment Makes Reasoning LLMs More Reliable

Researchers introduce CASPO, a framework that improves reasoning reliability in large language models by aligning token-level confidence with step-wise logical correctness through preference optimization. The method achieves better performance than tree-search approaches without requiring separate reward models, while introducing CaT inference that dynamically prunes uncertain reasoning branches with minimal computational overhead.

AIBullisharXiv – CS AI · May 117/10
🧠

CASCADE: Case-Based Continual Adaptation for Large Language Models During Deployment

Researchers introduce CASCADE, a framework enabling large language models to continuously learn and improve during deployment without modifying parameters, using an episodic memory system formulated as a contextual bandit problem. The approach demonstrates 20.9% improvement over zero-shot prompting across 16 diverse tasks, addressing a fundamental limitation in current LLM lifecycles where learning stops after training ends.

AIBullisharXiv – CS AI · May 117/10
🧠

EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle

Researchers introduce EvolveR, a framework enabling LLM agents to self-improve through a closed-loop lifecycle combining offline strategy distillation with online task interaction. The system demonstrates superior performance on complex question-answering benchmarks by enabling agents to learn from their own experiences rather than relying solely on external knowledge.

AIBearisharXiv – CS AI · May 117/10
🧠

Direction for Detection: A Survey of Automated Vulnerability Detection and all of its Pain Points

A comprehensive survey of 87 machine learning vulnerability detection studies reveals that the field has stalled despite a decade of research, trapped in self-reinforcing feedback loops that optimize for narrow, artificial problems. Researchers identify twelve interconnected pain points spanning datasets, formulations, metrics, and evaluation approaches that perpetuate focus on binary C/C++ function-level classification while neglecting vulnerability type prediction, multilingual support, and broader detection granularities.

AIBullisharXiv – CS AI · May 117/10
🧠

Goal-Conditioned Decision Transformer for Multi-Goal Offline Reinforcement Learning

Researchers introduce a Goal-Conditioned Decision Transformer designed for offline reinforcement learning in robotics, enabling multi-goal task learning from pre-collected datasets. The method demonstrates superior performance compared to online baselines on complex robotic tasks while maintaining effectiveness in sparse-reward environments with limited expert data.

AIBullisharXiv – CS AI · May 117/10
🧠

Globally Optimal Training of Spiking Neural Networks via Parameter Reconstruction

Researchers propose a novel parameter reconstruction algorithm for training Spiking Neural Networks (SNNs) that addresses the long-standing problem of non-differentiable spike functions. The method extends convexification theory to recurrent networks and demonstrates consistent improvements over traditional surrogate gradient approaches, with potential applications in large-scale energy-efficient neural network training.

AIBullisharXiv – CS AI · May 117/10
🧠

MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Researchers introduce MatryoshkaLoRA, a novel training framework that improves upon Low-Rank Adaptation (LoRA) for efficient large language model fine-tuning by learning hierarchical low-rank representations through a strategically placed diagonal scaling matrix. The method enables dynamic rank selection with minimal accuracy loss and introduces AURAC, a new evaluation metric for hierarchical adapters, addressing a key limitation in current parameter-efficient fine-tuning approaches.

AIBullisharXiv – CS AI · May 117/10
🧠

Uncertainty Quantification for Prior-Data Fitted Networks using Martingale Posteriors

Researchers propose a novel uncertainty quantification method for Prior-Data Fitted Networks (PFNs), emerging foundation models for tabular data prediction, using martingale posteriors to provide calibrated confidence estimates. The technique is tuning-free, computationally efficient, and mathematically proven to converge, addressing a significant limitation in PFNs' practical applicability.

AIBullisharXiv – CS AI · May 117/10
🧠

Video Understanding Reward Modeling: A Robust Benchmark and Performant Reward Models

Researchers introduce Video Understanding Reward Bench (VURB), a comprehensive benchmark with 2,100 preference pairs for evaluating video reward models, alongside VUP-35K, a large-scale dataset of 35,000 preference examples. Two new models, VideoDRM and VideoGRM, achieve state-of-the-art performance on video understanding tasks, advancing multimodal AI capabilities beyond text and images.

AIBullisharXiv – CS AI · May 117/10
🧠

Text-to-CAD Evaluation with CADTests

Researchers introduce CADTestBench, the first test-based evaluation framework for Text-to-CAD systems that uses executable software tests to verify whether AI-generated CAD models meet geometric and topological requirements. The framework enables both comprehensive benchmarking of existing methods and improved model generation through test-guided approaches, addressing a significant gap in CAD model evaluation methodology.

🏢 Hugging Face
AIBearisharXiv – CS AI · May 117/10
🧠

An Embarrassingly Simple Graph Heuristic Reveals Shortcut-Solvable Benchmarks for Sequential Recommendation

Researchers demonstrate that a simple graph heuristic without machine learning matches or outperforms advanced generative recommendation systems on standard benchmarks, revealing that widely-used datasets contain structural shortcuts that don't require sophisticated modeling. The findings question whether current benchmark evaluations actually validate the advanced capabilities that modern recommendation systems claim to provide.

AIBearisharXiv – CS AI · May 117/10
🧠

GAD in the Wild: Benchmarking Graph Anomaly Detection under Realistic Deployment Challenges

Researchers have published a comprehensive benchmark for Graph Anomaly Detection (GAD) models that exposes critical gaps between academic performance and real-world deployment. The study reveals that leading GAD methods fail to scale to million-node graphs, collapse under realistic anomaly scarcity (0.1%), and struggle with missing data—challenges absent from typical laboratory benchmarks.

AIBearisharXiv – CS AI · May 117/10
🧠

On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics

Researchers demonstrate significant privacy vulnerabilities in tabular diffusion models (TDMs), which are increasingly used to generate synthetic data as privacy-preserving alternatives. Through membership inference attacks in both black-box and white-box settings, the study reveals that attackers can successfully breach these systems without perfect knowledge of training data or massive computational resources, while also exposing flaws in commonly-used privacy metrics.

AIBullisharXiv – CS AI · May 117/10
🧠

Toward Privileged Foundation Models:LUPI for Accelerated and Improved Learning

Researchers introduce PIQL, a framework that leverages privileged information to accelerate training and improve generalization in tabular foundation models. By incorporating dataset-level statistics and encodings of data-generating processes during training, the approach reduces computational requirements and convergence time while maintaining inference efficiency through reconstruction mechanisms.

AIBullisharXiv – CS AI · May 117/10
🧠

Memory-Efficient Looped Transformer: Decoupling Compute from Memory in Looped Language Models

Researchers introduce Memory-Efficient Looped Transformer (MELT), an architecture that decouples reasoning depth from memory consumption in recurrent language models. MELT replaces the standard approach of maintaining separate Key-Value caches per reasoning loop with a single shared cache per layer, updated via learnable gating, achieving constant-memory iterative reasoning comparable to standard LLMs while outperforming them on benchmarks.

AIBullisharXiv – CS AI · May 117/10
🧠

Enabling Unsupervised Training of Deep EEG Denoisers With Intelligent Partitioning

Researchers propose Intelligent Partitioning for Self-supervised Denoising (iPSD), a deep learning method that eliminates the need for artifact-free training data to denoise electroencephalogram (EEG) signals from wearable devices. The technique achieves state-of-the-art performance even in extremely noisy conditions by learning to partition noisy EEG segments into independent realizations sharing the same underlying neural signal.

AIBullisharXiv – CS AI · May 117/10
🧠

Toeplitz MLP Mixers are Low Complexity, Information-Rich Sequence Models

Researchers introduce Toeplitz MLP Mixer (TMM), a transformer alternative that replaces attention mechanisms with triangular-masked Toeplitz matrix multiplication, achieving O(dn log n) training complexity and O(dn) inference complexity. TMMs demonstrate superior training efficiency, information retention, and in-context learning performance compared to existing sub-quadratic architectures.

← PrevPage 3 of 126Next →