AI Pulse News

Models, papers, tools. 39,929 articles with AI-powered sentiment analysis and key takeaways.

39929 articles

AINeutralarXiv – CS AI · Jun 96/10

🧠

SAILS: Surrogate-based Analysis of Interactions via Local Effect Smooths

Researchers introduce SAILS, a model-agnostic framework that goes beyond detecting feature interactions in machine learning models to reveal their functional forms and characteristics. Using surrogate generalized additive models, SAILS categorizes interactions as linear, product-separable, or non-product-separable and provides tailored visualizations, advancing the field of explainable AI.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Can Data Work be Reparative?

An ethnographic study examines how a civic-tech initiative is attempting to reform data work practices by building online safety datasets collaboratively with communities most impacted by online harms, framing dataset production through a lens of reparative justice rather than extractive labor.

AINeutralarXiv – CS AI · Jun 96/10

🧠

AI Assurance in UK Defence: Challenges in Operationalising JSP 936

A new report examines implementation challenges in JSP 936, the UK Defence Ministry's AI assurance framework, identifying eight critical gaps between policy requirements and operational deployment. The analysis suggests that while the governance framework is sound, significant technical, organizational, and methodological barriers must be resolved before AI can be safely and responsibly integrated across British military systems.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Harness Engineering for Physical AI: Robot Middleware Is the Harness Layer

Researchers propose that robot middleware should function as a 'harness' layer for Physical AI systems, mediating between learned AI policies and robot hardware across control, computing, and communication domains. The framework introduces three enforcement functions—Projection, Isolation, and Transfer—to safely integrate vision-language-action models into deployed robots, with a suggested ROS 2 Harness Profile implementation.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Context-Aware Deep Learning for Defect Classification in Atomic-Resolution STEM

Researchers developed a context-aware deep learning framework that integrates image contrast with metadata (composition, beam energy, detector geometry) to classify defects in electron microscopy with 98% accuracy on simulations. The approach demonstrates that incorporating physical and experimental context transforms defect classification from an ambiguous image-only task into a well-posed, scientifically grounded problem.

AINeutralarXiv – CS AI · Jun 96/10

🧠

LargeMonitor: Monitoring Online Task-Free Continual Learning via Large Pretrained Models

LargeMonitor is a new framework that uses large pretrained foundation models to detect and diagnose distribution shifts in online task-free continual learning systems without requiring explicit task labels or training-coupled optimization. The approach decouples drift detection from adaptation strategy selection, enabling more precise responses to different types of data stream variations.

AINeutralarXiv – CS AI · Jun 96/10

🧠

A Finetuned SpeechLLM for Joint Multi-Granular L2 Assessment and Natural-Language Rationales

Researchers propose a fine-tuned speech language model that provides both multi-level L2 English proficiency assessment and natural-language explanations for its predictions. The model demonstrates competitive performance on standard benchmarks while offering improved interpretability, though generated rationales show lower reliability at granular word-level assessments.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Closing the Prior-Posterior Loop: Self-Reflective Molecular Design with Analysis-Driven LLM Iteration

Researchers demonstrate that large language models can design molecules with chemist-level precision by replacing simple numerical feedback with detailed physicochemical analysis. The approach couples retrieval-augmented generation with self-reflection modules that feed orbital energies and atomic charges back into design iterations, achieving near-perfect accuracy on HOMO-LUMO gap targets and 100% success rates on moderate molecular design tasks.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Emergence of Context Characteristics Sensitivity in Large Language Models

Researchers studied how large language models develop sensitivity to context characteristics during instruction fine-tuning across three stages: supervised fine-tuning, direct preference optimization, and reinforcement learning. The study found that models progressively learn to favor easily understandable contexts with high length and similarity to queries, with subsequent training stages either reinforcing or resolving these preferences based on dataset design.

AINeutralarXiv – CS AI · Jun 96/10

🧠

SecureClaw: Clawing Back Control of LLM Agents

SecureClaw introduces a dual-boundary security architecture designed to protect LLM agents from both unauthorized external actions and sensitive data exposure. The system uses opaque handles and a PREVIEW→COMMIT protocol to prevent language models from directly accessing secrets or executing unreviewed side effects, achieving zero attack success rates on major security benchmarks.

$COMMIT

AIBullisharXiv – CS AI · Jun 96/10

🧠

FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing

FuseFSS is a new compiler that streamlines secure LLM inference by consolidating fragmented protocol designs into a unified pipeline, achieving 1.24x-1.50x speedup and reducing communication overhead by 9-16% compared to existing function secret sharing approaches. The technology enables privacy-preserving queries to large language models without revealing user prompts, addressing a critical bottleneck in cryptographic systems for AI inference.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Safe-RULE: Safe Reinforcement UnLEarning

Researchers propose Safe-RULE, a new reinforcement unlearning framework designed to defend offline safe reinforcement learning systems against data poisoning attacks. The approach removes malicious data influence without requiring model retraining or access to original training environments, addressing a critical vulnerability in safety-critical applications like robotics.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Seeing the Hivemind: A Consensus-Aware Interaction Technique for Mitigating AI Homogenization

Researchers introduced the Semantic Repulsion Technique (SRT) to combat AI homogenization in creative writing tasks, demonstrating that the method increases semantic diversity by 85-167% while reducing consensus phrases by 43-95%. A user study with 16 participants showed SRT outputs received higher usefulness and coherence ratings, with 68.8% willing to adopt it versus 18.8% for baseline systems, suggesting AI tools can enhance creativity without sacrificing readability.

AIBearisharXiv – CS AI · Jun 96/10

🧠

I Was Scrolling and Then I Saw a Pregnant Strawberry

A research paper examines AI-generated "fruit dramas"—short videos featuring anthropomorphized characters distributed algorithmically on social media—arguing they embed problematic gendered and racialized narratives while using cute aesthetics to evade content moderation systems.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Closure-Validated Circuit Discovery in Attention Heads: Co-activation Proposes, Ablation Disposes

Researchers propose a methodology for validating attention-head circuits in large language models by combining co-activation clustering with causal ablation testing. Their findings reveal that while clustering signals identify circuit proposals, true circuit validation requires closure tests that measure functional impact through ablation—a distinction that challenges current interpretability approaches.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Shape Formation for the Cooperative Transportation of Arbitrary Objects Using Multi-Agent Reinforcement Learning

Researchers have developed a multi-agent reinforcement learning approach enabling robots to autonomously form balanced configurations beneath objects of arbitrary shape and mass distribution for cooperative transportation. The system addresses formation control, navigation, and collision avoidance simultaneously, demonstrating generalization across varied environments and complex geometries.

AIBullisharXiv – CS AI · Jun 96/10

🧠

AGENTSERVESIM: A Hardware-aware Simulator for Multi-Turn LLM Agent Serving

Researchers introduce AGENTSERVESIM, a hardware-aware simulator designed to evaluate serving policies for multi-turn LLM agents without requiring expensive accelerator deployments. The simulator accurately reproduces real-system performance within 6% error while running on standard CPUs, enabling scalable exploration of agent-serving policies across different hardware configurations and workload scenarios.

AINeutralarXiv – CS AI · Jun 96/10

🧠

ReCoVLA: VLM-Guided Reward Compilation for Failure Recovery in Vision-Language-Action Policies

ReCoVLA introduces a framework that enhances vision-language-action (VLA) policies by using external vision-language models to identify failures and guide residual policy training for recovery. The approach freezes pretrained VLA policies and compiles structured rewards for correction, achieving 66.7% success in simulation and 61.7% in zero-shot real-world deployment compared to 36.7% for baseline methods.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Do Video Foundation Models Understand Intuitive Physics? A Layerwise Probing Analysis

Researchers analyzed whether pretrained video foundation models encode intuitive physics understanding by probing three model types (V-JEPA, VideoMAE, and LTX-Video) across frozen representations. Results show physics knowledge emerges reliably in intermediate-to-late layers, with V-JEPA performing strongest and temporal information proving critical for understanding physical dynamics.

AINeutralarXiv – CS AI · Jun 96/10

🧠

ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset

Researchers introduce ArtiFact, a large-scale multi-modal dataset containing 651,045 museum records from three major art institutions combined with images, text, and structured data. The dataset benchmarks AI systems on cross-modal error detection and semantic query processing tasks, revealing significant challenges in detecting domain-specific errors and handling culturally-nuanced information retrieval.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Muon Learns More Robust and Transferable Features than Adam

Research demonstrates that Muon, an emerging optimizer for large language models and vision classifiers, produces more robust and transferable features than Adam and SGD across multiple architectures. The study shows Muon-learned features maintain superior performance on corrupted data and transfer more effectively to downstream tasks, with theoretical support provided through margin and effective rank analysis.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Visual Prompting Meets Feature Reconstruction-Based Anomaly Detection with Dual-Teacher Supervision

Researchers introduce a novel anomaly detection framework combining visual prompting, unfrozen teacher models, and diffusion-based data augmentation to address real-world limitations in industrial inspection systems. The approach achieves a 3.5 percentage point improvement on the challenging AeBAD dataset, demonstrating practical applicability beyond controlled laboratory conditions.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Transition-Based Digital Twin Modelling for Alzheimer's Disease under Sparse Longitudinal Data

Researchers have developed a personalized digital twin framework for predicting Alzheimer's disease progression using multimodal longitudinal data from the ADNI database. The model employs transition-based and sequence-based approaches to capture clinical changes across sparse, irregular patient visits, achieving higher accuracy with local transition modeling while enabling patient-specific what-if scenario analysis.

AINeutralarXiv – CS AI · Jun 96/10

🧠

MeCo: One-Step MeanFlow-based Corrector for Multi-Channel Speech Separation

Researchers propose MeCo, a MeanFlow-based generative corrector that improves multi-channel speech separation by refining discriminative model outputs in a single step. The method combines Data-Space Optimization with specialized loss functions to achieve state-of-the-art performance in both signal fidelity and human listening quality with minimal computational cost.

AINeutralarXiv – CS AI · Jun 96/10

🧠

An 84-Format Numeric Catalog with Bit-Exact Conformance Vectors: A Vendor-Neutral Reference for FP8, BF16, MXFP4, and Microscaling Formats

Researchers have published a vendor-neutral catalog of 84 numeric formats used in machine learning hardware, including FP8, BF16, and MXFP4, with bit-exact conformance test vectors to enable consistent model porting across different accelerators. This addresses a critical gap where silent numerical divergences occur when moving ML models between vendors without a shared reference standard.

← PrevPage 524 of 1598Next →