Real-time AI-curated news from 33,978+ articles across 50+ sources. Sentiment analysis, importance scoring, and key takeaways — updated every 15 minutes.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers demonstrate that large language models like Qwen2.5-Math achieve 95%+ accuracy on algorithmic number theory problems with optimal hints, and empirically verify a folklore conjecture that Dirichlet character moduli are uniquely determined by L-function zeros using machine learning ensemble methods.
AI · Bullish · arXiv – CS AI · 6h ago · 6/10
🧠Researchers have developed an integrated AI framework for campus mental health monitoring, combining TigerGPT (an LLM-powered survey chatbot) for prevention and PsychoGPT (a DSM-5-aligned screening tool) for intervention. The system uses reinforcement learning and multi-model reasoning to improve feedback quality and reduce hallucinations in mental health assessment.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers found that machine learning models trained on elite European football leagues lose interpretability and reliability when applied to university-level competition, suggesting that performance insights don't transfer across competition tiers. The study reveals that explanation stability and feature importance hierarchies are domain-dependent, challenging the assumption that ML-derived performance determinants are universally applicable.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce IRIS-14B, a 14-billion-parameter LLM fine-tuned to translate compiler intermediate representations between GCC's GIMPLE and LLVM IR, achieving up to 44 percentage points higher accuracy than existing state-of-the-art models. The approach demonstrates how LLMs can function as interoperability layers in hybrid compiler architectures, enabling cross-toolchain workflows without modifying existing compiler infrastructure.
AI · Bearish · arXiv – CS AI · 6h ago · 6/10
🧠A new benchmarking framework reveals that AI tools in academic research excel at exploration and summaries but fail at precision tasks requiring exact information extraction. The study demonstrates that explainable AI features are inadequate, forcing researchers to manually verify outputs, and literature review tools lack reproducibility and transparency for systematic research.
🏢 xAI
AI · Bullish · arXiv – CS AI · 6h ago · 6/10
🧠Researchers propose the Dynamic Tiered AgentRunner, an enterprise-grade framework that adds governance controls to autonomous AI agents through risk-adaptive resource allocation, separation of powers between independent agents, and resilience mechanisms. The framework addresses critical gaps in current LLM agent deployments by preventing unauthorized high-risk operations and supporting enterprise compliance requirements.
AI · Bullish · arXiv – CS AI · 6h ago · 6/10
🧠EmbodiSkill introduces a training-free framework enabling embodied AI agents to autonomously improve their skills through reflection on task execution trajectories. By distinguishing between skill deficiencies and execution lapses, the system allows frozen language models to achieve significantly higher task success rates, with a Qwen 3.5-27B model reaching 93.28% success on ALFWorld benchmarks.
🧠 GPT-5
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce FormalRewardBench, the first benchmark for evaluating reward models in formal theorem proving using Lean 4. The benchmark reveals that frontier LLMs like Claude Opus outperform specialized theorem provers at evaluating proof quality, suggesting that theorem proving ability does not transfer to proof evaluation tasks.
🧠 Claude Opus
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠CardiacNAS presents an evolutionary neural architecture search framework that optimizes cardiac MRI segmentation models for both accuracy and computational efficiency. The approach achieves a 93.22% Dice similarity coefficient with only 3.58M parameters, demonstrating how resource-aware AI design can enable deployment of medical imaging models in resource-constrained environments.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers propose Constraint-Aware Residual Modulation (CARM), a neural module that improves how AI solvers handle complex vehicle routing problems by maintaining a global observation of the problem state during constraint-aware decision-making. The approach demonstrates significant performance improvements and scaling capability across multiple routing problem variants.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers demonstrate that neural network solutions trained with specific optimizers such as AdamW and Muon form connected sets at large network widths, revealing optimizer-dependent structure in loss landscapes. In small networks, different optimizers converge to disconnected solutions with provable loss barriers, while in GPT-2 pretraining, same-optimizer paths empirically preserve model spectra differently than cross-optimizer paths do.
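Connectivity claims like this are typically probed by measuring the loss barrier along the straight-line interpolation between two trained weight vectors; a minimal sketch with a hypothetical toy loss (not the paper's actual setup):

```python
import numpy as np

def loss_barrier(loss_fn, w_a, w_b, steps=21):
    """Max loss along the linear path minus the endpoint average.

    A barrier near zero suggests the two solutions are linearly
    connected; a large barrier suggests they sit in separate basins.
    """
    alphas = np.linspace(0.0, 1.0, steps)
    path_losses = [loss_fn((1 - a) * w_a + a * w_b) for a in alphas]
    endpoint_avg = 0.5 * (path_losses[0] + path_losses[-1])
    return max(path_losses) - endpoint_avg

# Toy double-well loss with minima at w = -1 and w = +1: the straight
# path between them must cross the local maximum at w = 0.
toy_loss = lambda w: float((w**2 - 1.0) ** 2)
barrier = loss_barrier(toy_loss, np.array(-1.0), np.array(1.0))  # -> 1.0
```

In practice `loss_fn` would evaluate the training loss of a full network with interpolated parameters, which is where the optimizer-dependent structure shows up.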
AI · Bullish · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce CA-DSSL, a new self-supervised learning technique that enables efficient training of sub-500K-parameter AI models on microcontrollers. The method surpasses existing approaches by 18 percentage points on standard benchmarks while requiring significantly fewer parameters, achieving 94% of supervised-learning performance with models deployable in just 378 KB of memory.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers present STAR, a failure-aware routing framework for multi-agent AI systems that handles spatiotemporal reasoning tasks by intelligently routing between specialist agents based on typed failure states rather than generic success/failure signals. The system learns recovery transitions from execution traces and demonstrates improved performance across multiple benchmarks, suggesting that explicit failure-aware routing is more effective than implicit language-based decision-making in complex reasoning tasks.
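Routing on typed failure states rather than a bare success/failure bit can be sketched with a small dispatch table; the failure types and agent names below are invented for illustration and are not STAR's actual taxonomy:

```python
from enum import Enum, auto

class Failure(Enum):
    NONE = auto()
    SPATIAL_GROUNDING = auto()   # agent misread a location
    TEMPORAL_ORDERING = auto()   # agent confused event order
    TOOL_ERROR = auto()          # external tool call failed

# Hypothetical recovery table: each typed failure routes to the
# specialist best suited to repair it, instead of a generic retry.
RECOVERY = {
    Failure.SPATIAL_GROUNDING: "map_specialist",
    Failure.TEMPORAL_ORDERING: "timeline_specialist",
    Failure.TOOL_ERROR: "tool_retry_agent",
}

def route(failure: Failure, default: str = "planner") -> str:
    """Return the next agent to invoke given a typed failure state."""
    return RECOVERY.get(failure, default)
```

The paper's contribution is learning such recovery transitions from execution traces rather than hand-coding them as above.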
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce TRACE, a novel training method that improves AI model performance by selectively applying different optimization techniques to critical versus routine tokens in reasoning tasks. The approach addresses inefficiencies in standard self-distillation by concentrating training effort on important decision points, achieving 2.76 percentage point improvements over baseline methods while better preserving out-of-distribution generalization.
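Concentrating training signal on critical tokens can be illustrated by reweighting per-token losses; here high predictive entropy is used as a stand-in criticality criterion, which may differ from TRACE's actual selection rule:

```python
import numpy as np

def weighted_token_loss(token_losses, token_entropies,
                        entropy_threshold=1.0, critical_weight=4.0):
    """Upweight high-entropy ('critical') decision tokens.

    Routine tokens keep weight 1.0; tokens whose predictive entropy
    exceeds the threshold get a larger weight, so training effort
    concentrates on important decision points. Illustrative only.
    """
    weights = np.where(np.asarray(token_entropies) > entropy_threshold,
                       critical_weight, 1.0)
    losses = np.asarray(token_losses, dtype=float)
    return float((weights * losses).sum() / weights.sum())
```

With losses `[0.2, 1.0]` and entropies `[0.0, 2.0]`, the second token dominates the average because it is treated as a decision point.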
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers present a unified mathematical framework for Test-Time Adaptation (TTA) in autoregressive generative models, decomposing entropy minimization into token-level policy gradient and entropy losses. Validated on Whisper ASR across 20+ domains, the approach demonstrates consistent performance improvements and reconciles previously disparate adaptation methods under a single theoretical foundation.
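Entropy-minimization TTA adapts on unlabeled test data by lowering the model's own predictive entropy; a sketch of the per-token objective (the token-level gradient decomposition is the paper's contribution and is not reproduced here):

```python
import numpy as np

def token_entropy(logits):
    """Shannon entropy of one token's predictive distribution."""
    z = logits - logits.max()          # stabilized softmax
    p = np.exp(z) / np.exp(z).sum()
    return float(-(p * np.log(p + 1e-12)).sum())

def tta_objective(all_logits):
    """Mean per-token entropy over an utterance: the quantity that
    entropy-minimization TTA drives down via gradient steps on the
    unlabeled test sample."""
    return float(np.mean([token_entropy(l) for l in all_logits]))
```

A uniform two-way distribution has entropy ln 2; a confidently peaked one is near zero, so minimizing this objective pushes predictions toward confident tokens.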
AI · Neutral · arXiv – CS AI · 6h ago · 5/10
🧠Researchers have developed a crystal fractional graph neural network that combines graph neural networks with compositional embeddings to predict the energy of high-entropy alloys, achieving accuracy comparable to first-principles calculations on a dataset of over 1,000 crystal structures. The hybrid architecture addresses a key challenge in materials science by integrating local atomic interactions and global elemental composition, though scalability limitations for larger crystal systems remain.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce MarsTSC, a novel framework combining Vision Language Models with agentic reasoning for few-shot multimodal time series classification. The system uses collaborative AI roles—Generator, Reflector, and Modifier—to iteratively refine knowledge and improve classification accuracy across 12 benchmarks while providing interpretable explanations.
AI · Bullish · arXiv – CS AI · 6h ago · 6/10
🧠Researchers have identified why diffusion transformers (DiTs) degrade in quality during multi-turn image editing and proposed VAE-LFA, a training-free alignment method that operates in VAE latent space to suppress accumulated semantic drift. The solution works with both white-box and black-box models by aligning low-frequency components across editing rounds while preserving high-frequency details.
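Aligning low-frequency components across rounds while keeping high-frequency detail can be sketched in 1-D with an FFT split; the real method operates on multi-channel VAE latents, so treat this as a toy analogy:

```python
import numpy as np

def align_low_freq(latent, reference, cutoff=4):
    """Copy the lowest `cutoff` frequency bins of `reference` into
    `latent`, keeping latent's remaining high-frequency detail.

    This suppresses slow drift (e.g. a shifted mean) accumulated over
    editing rounds without touching fine structure.
    """
    L = np.fft.rfft(latent)
    R = np.fft.rfft(reference)
    L[:cutoff] = R[:cutoff]
    return np.fft.irfft(L, n=len(latent))

n = np.arange(16)
lat = 3.0 + np.cos(2 * np.pi * 6 * n / 16)  # drifted mean + fine detail
ref = np.ones(16)                            # earlier-round reference
out = align_low_freq(lat, ref, cutoff=4)     # mean snaps back to 1.0
```

The DC/low bins (the drift) come from the reference, while the frequency-6 detail survives untouched.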
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers developed an explainable machine learning framework that uses unsupervised and supervised learning to identify and interpret dietary patterns from UK nutrition survey data. The system discovered four distinct eating patterns and achieved high accuracy in reproducing classifications, with potential applications for dietitian-assisted clinical assessments and personalized nutrition counseling.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers have developed Bangla-WhisperDiar, a fine-tuned speech recognition and speaker diarization system that achieves a 24.41% word error rate for ASR and 23.92% diarization error rate. The work addresses critical gaps in Bangla language processing by combining OpenAI's Whisper model with PyAnnote's diarization framework, trained on custom datasets with extensive data augmentation techniques.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers introduce DUDE, a framework that teaches AI web agents to resist deceptive interface elements through hybrid-reward learning and experience summarization. On the accompanying RUC benchmark, the framework reduces susceptibility to deception by 53.8% while preserving task performance, addressing a critical vulnerability in autonomous GUI interaction systems.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠NoisyCoconut is an inference-time method that improves LLM reliability by injecting controlled noise into internal representations to generate diverse reasoning paths, enabling models to abstain when uncertain without requiring retraining. The technique reduces error rates from 40-70% to below 15% on mathematical reasoning tasks through unanimous agreement among noise-perturbed paths, offering practical reliability improvements compatible with existing models.
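The abstain-on-disagreement idea can be sketched independently of the model internals: run the same query under several perturbation seeds and answer only when every path agrees. The `solve(question, rng)` interface below is hypothetical, standing in for a forward pass whose hidden states are perturbed with rng-driven noise:

```python
import random

def answer_with_abstention(solve, question, n_paths=5, seed=0):
    """Answer only if all noise-perturbed reasoning paths agree.

    Returns the unanimous answer, or None to abstain when the
    perturbed paths disagree (the uncertainty signal).
    """
    answers = {solve(question, random.Random(seed + i))
               for i in range(n_paths)}
    return answers.pop() if len(answers) == 1 else None
```

A solver that is robust to the perturbation yields one answer and passes; a brittle one fragments across seeds and the wrapper abstains rather than guessing.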
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠ReplaySCM introduces a 1,300-item benchmark for evaluating how well language models can infer causal mechanisms from limited intervention data. The benchmark tests whether AI systems can output executable Boolean causal models that generalize to unseen intervention scenarios, revealing that frontier LLMs struggle significantly when structural information is hidden.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠Researchers present a modular, provenance-aware pipeline that converts handwritten archival tables into Knowledge Graphs while maintaining transparency through intermediate inspection points. The approach combines table structure recognition, handwriting recognition, and semantic interpretation while tracking data lineage to ensure all extracted information remains traceable to its source, addressing the opacity problem in end-to-end AI systems.
AI · Neutral · arXiv – CS AI · 6h ago · 6/10
🧠DOSER introduces a diffusion-model-based framework for offline reinforcement learning that improves out-of-distribution (OOD) action detection beyond traditional penalization methods. The approach uses single-step denoising reconstruction error to identify risky actions while selectively encouraging beneficial exploration, with theoretical guarantees of convergence and empirical superiority on suboptimal datasets.
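Using single-step denoising reconstruction error as an OOD score can be sketched with a stand-in denoiser; `denoise` below is a hypothetical interface for a learned one-step denoiser, here replaced by projection onto a toy "data manifold" (the unit ball):

```python
import numpy as np

def recon_error(action, denoise, noise_scale=0.1, seed=0):
    """Single-step denoising reconstruction error as an OOD score.

    In-distribution actions reconstruct well after one noise/denoise
    round-trip; OOD actions land far from the manifold and come back
    changed, giving a large error.
    """
    rng = np.random.default_rng(seed)
    noised = action + noise_scale * rng.standard_normal(action.shape)
    return float(np.linalg.norm(denoise(noised) - action))

def is_risky(action, denoise, threshold=0.5):
    """Flag actions whose reconstruction error exceeds the threshold."""
    return recon_error(action, denoise) > threshold
```

With the unit-ball projection as the toy denoiser, an action at the origin is kept while one at distance 5 from the manifold is flagged as risky.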