AI Pulse News

Models, papers, tools. 39,888 articles with AI-powered sentiment analysis and key takeaways.

39888 articles

AINeutralarXiv – CS AI · Jun 95/10

🧠

TimpaTeks: Automatic In-place Text Sequence Modification via Diffusion Language Model Steering

Researchers introduce TimpaTeks, a novel technique for modifying text in-place using diffusion language models through activation steering. The method enables concept changes (sentiment, arbitrary attributes) while maintaining sentence structure, reducing perplexity, and requiring less computational resources than prompt-based alternatives.

🏢 Perplexity

AINeutralarXiv – CS AI · Jun 95/10

🧠

Provably Efficient Personalized Multi-Objective Bandits with Proactive Conversational Queries

Researchers present MO-PQUCB, a novel algorithm for personalized multi-objective decision-making that combines conversational queries with bandit feedback to learn user preferences more efficiently. The method uses a Plackett-Luce choice model and shift-invariant regularization to overcome fundamental learning barriers, demonstrating improved regret scaling and robustness to corrupted preference signals compared to existing approaches.

AINeutralarXiv – CS AI · Jun 96/10

🧠

CoVEBench: Can Video Editing Models Handle Complex Instructions?

Researchers introduce CoVEBench, a comprehensive benchmark for evaluating video editing AI models on complex, multi-step editing tasks. The benchmark reveals that current video editing models struggle significantly with compositional instructions that require simultaneous modifications while preserving unrelated content, exposing a critical gap between simple isolated edits and real-world user workflows.

AIBearisharXiv – CS AI · Jun 96/10

🧠

The Confidence Trap: Calibration Attacks for Graph Neural Networks

Researchers have developed a Unified Graph Calibration Attack (UGCA) framework that exploits vulnerabilities in Graph Neural Networks' confidence calibration through adversarial structural perturbations. The study reveals that GNNs with higher accuracy or trained on complex datasets are more susceptible to calibration attacks, which increase prediction uncertainty while maintaining classification accuracy.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Adaptive Loss Balancing for Noise-Robust GRPO in Generative Recommendation

Researchers introduce AdaGRPO, a reinforcement learning framework that selectively applies reward signals in generative recommendation systems rather than uniformly, addressing the problem of noisy reward models trained on biased data. The approach combines supervised learning with adaptive gating mechanisms and demonstrates significant improvements in e-commerce recommendation metrics and production performance.

AINeutralarXiv – CS AI · Jun 96/10

🧠

PIPE-Cypher: Automatic Enterprise Benchmark Generation for Text-to-Cypher Systems

Researchers introduced PIPE-Cypher, an automated pipeline for generating Text-to-Cypher benchmarks tailored to enterprise property graphs. The system combines schema profiling, LLM generation, and validation to create deployment-relevant datasets that reflect real user queries, addressing the challenge that enterprise graphs have unique structures and evolving schemas that make standardized benchmarks inadequate.

AINeutralarXiv – CS AI · Jun 95/10

🧠

STELLAR: Spatio-Temporal Environmental Learning with Latent Alignment and Refinement for Long-Tailed Species Distribution Modeling

Researchers introduce STELLAR, a machine learning framework designed to improve species distribution modeling by jointly analyzing spatio-temporal environmental data and species interactions while addressing the challenge of rare species prediction. The approach combines graph-temporal encoding, latent space alignment, and specialized loss functions to outperform existing models on biodiversity datasets.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Seeing is Believing: Aligning Prompt Rewriting with Visual Anchors for Text-to-Image Generation

Researchers introduce FaithRewriter, a novel framework that enhances text-to-image generation by grounding prompt rewrites in actual visual outputs rather than linguistic improvements alone. The system uses multimodal AI to generate intermediate images from user prompts, then leverages this visual context to create more faithful augmentations that better align user intent with generated results.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Projecting the Emerging Mindset of SWE Agent by Launching a Wild Code Understanding Journey

Researchers introduce Ada, a systematic framework for observing how software engineering agents navigate real codebases through tool-mediated exploration. By analyzing 408 trajectories across multiple models and repositories, the study develops observation methods that reveal agent decision-making patterns—including navigation choices, evidence selection, and stopping criteria—without reducing behavior to raw metrics or speculation.

$ADA

AINeutralarXiv – CS AI · Jun 95/10

🧠

When Video Misreads: Closed-Loop Distillation of Reading Heuristics for Exploratory Manipulation Trace QA

Researchers introduce Closed-Loop Trace Distillation, a method to improve AI systems' ability to understand robotic manipulation failures and infer necessary action sequences. The approach uses distilled natural-language heuristics derived from training traces, enabling frozen vision-language models to achieve 38-47% accuracy improvements over baseline methods in predicting minimal-success action chains on both simulated and real robots.

AINeutralarXiv – CS AI · Jun 96/10

🧠

EinSort: Sorting is All We Need for Tensorizing LLM

Researchers propose EinSort, an adaptive tensorization method that uses index ordering to identify and compress low-rank structures in large language models, demonstrating improved results for weight and KV-cache compression compared to existing approaches.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Calibration of Structured Ignorance Certificates for Diagnosing Unknown Unknowns in Reasoning Models

Researchers introduce Structured Ignorance Certificates (SICs), a JSON-formatted output schema that trains language models to explicitly acknowledge knowledge gaps rather than hallucinate answers. The approach uses a novel 7,347-sample dataset of cross-domain questions and achieves 99.46% JSON validity with measurable improvements in epistemic awareness.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Auditable Graph-Guided Root Cause Analysis for Kubernetes Incidents

Researchers present Graph Traversal Agent, an LLM-based root cause analysis system for Kubernetes incidents that combines graph-guided reasoning with deterministic validation tools. The system demonstrates significant performance improvements on benchmarks but acknowledges limitations in production environments and benchmark-specific coupling.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Reinforcement Learning for Flow-Matching Policies with Density Transport

Researchers present RLDT, a reinforcement learning algorithm that fine-tunes flow-matching policies by treating policy improvement as density transport toward high-reward regions. The method addresses limitations in existing approaches by preserving multimodal modeling capacity while using Stein Variational Gradient Descent and expected-target estimation to stabilize training across continuous-control tasks.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Tyan-WP: A Wind Power Foundation Model for Ultra-Short-Term Probabilistic Forecasting

Researchers introduce Tyan-WP, a foundation model for wind power forecasting pretrained on 126,000 U.S. sites that achieves superior accuracy without site-specific training. The model addresses critical challenges in renewable energy deployment by enabling rapid turbine onboarding and probabilistic risk assessment for new wind farms.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Sample-Efficient LLM-Based Detection of Malicious Web Server Logs with Forensically Explainable Reasoning

Researchers introduce CEF-Log, an LLM-based method for detecting malicious web server logs that achieves 99% F1-score using only four examples while generating forensically explainable reasoning. The approach embeds investigative methodology through structured chain-of-thought prompting, addressing the critical need for both accuracy and legal-admissible explanations in cybersecurity forensics.

AINeutralarXiv – CS AI · Jun 95/10

🧠

Reconstructing Synthetic SDO/AIA 193 A EUV Images from He I 10830 A Observations with Diffusion Model Translator

Researchers developed a diffusion model-based framework called CH-aware DMT that reconstructs synthetic SDO/AIA 193 Å EUV solar images from historical He I 10830 Å observations, enabling coronal analysis extending back decades before modern EUV imaging became available. The model achieves high fidelity on test data (CC=0.92 for full-disk morphology) and demonstrates physical plausibility when validated against SOHO, Yohkoh, and long-term solar activity proxies spanning 1974-2015.

AIBullisharXiv – CS AI · Jun 96/10

🧠

FiberTune: Preserving Action-Fiber Visual Residuals in Vision-Language-Action Fine-Tuning

FiberTune is a new training methodology for vision-language-action (VLA) policies that prevents visual feature collapse during fine-tuning by preserving action-invariant visual information. The approach demonstrates consistent improvements across simulation benchmarks and physical robot tasks without adding computational overhead at inference time.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Latent Diffusion Policy: Shaping Latent Spaces for Diffusion-Based Robotic Manipulation

Researchers introduce Latent Diffusion Policy (LDP), a two-stage framework that simplifies robotic manipulation by separating scene understanding from trajectory generation using a shaped latent space. The method outperforms existing approaches on complex multi-arm coordination tasks and successfully transfers to real-world bimanual robots.

AINeutralarXiv – CS AI · Jun 96/10

🧠

BioVid: Autoregressive Video Generation with Biological Behavior Semantic Comprehension

BioVid introduces an autoregressive video generation framework that learns temporal structure from behavioral data rather than using fixed frame counts. The system uses a specialized tokenizer and transformer architecture to naturally determine when behavioral sequences end, matching real-world action duration distributions significantly better than existing methods.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Lost in the Flow with Code Talkers: Unveiling the Instruction-Tuning Tax of Large Language Models in Code Tasks

Researchers reveal a critical trade-off in instruction-tuned large language models for code generation: while these models excel at following natural-language commands, they sacrifice performance in code infilling tasks that require completing unfinished programs. This 'Instruction-Tuning Tax' suggests developers must choose between instruction-following capability and effective code completion assistance.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Agentic Search for Counterfactual Recourse under Fixed LLM Budgets

Researchers propose Comp-MCTS, an AI framework that efficiently generates multiple counterfactual explanations under limited LLM budget constraints by using tree-search algorithms to allocate queries toward novel intervention directions. The approach demonstrates superior performance in producing diverse, validated counterfactuals compared to existing single-candidate and multi-candidate baselines on real-world datasets.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Structuring agentic AI for HPC code modernization

Researchers successfully modernized NMAP-RKPM, a 60,000-line Fortran physics simulation engine, from single-threaded MPI to parallel C++ using a structured agentic AI approach. Rather than relying on LLMs alone, the team developed a 'hand-holding' methodology combining manual examples, continuous buildability checks, and scoped sessions that proved highly effective for legacy code transformation.

AINeutralarXiv – CS AI · Jun 96/10

🧠

SNR-ST-Mix: Sample-specific Neighborhood Regression Mixup for Augmented Spatial Transcriptomics Imputation with Deep Neural Network

Researchers introduce SNR-ST-Mix, a data augmentation framework designed specifically for spatial transcriptomics that uses geometry-aware and expression-aware mixing to improve deep neural network performance. The method constrains data interpolation to k-nearest spatial neighbors and weights coefficients by expression similarity, enabling more biologically plausible synthetic training samples that enhance prediction accuracy without architectural changes.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Hybrid Neural Network and Conventional Controller Approach for Robust Control of Highly Unstable Systems: Application to Tilt-Rotor Control

Researchers demonstrate that direct neural network approaches fail for controlling highly unstable tilt-rotor systems, but propose a hybrid solution combining sliding mode control with neural networks to predict system dynamics. The LSTM-based implementation outperforms traditional methods while reducing computational overhead, advancing autonomous aerial vehicle control capabilities.

← PrevPage 520 of 1596Next →