AI Pulse News

Models, papers, tools. 34,422 articles with AI-powered sentiment analysis and key takeaways.

34422 articles

AINeutralarXiv – CS AI · Jun 56/10

🧠

TOKI: A Bitemporal Operator Algebra for Contradiction Resolution in LLM-Agent Persistent Memory

TOKI is a formal framework that types contradiction resolution in LLM-agent persistent memory systems as a write-time concurrency control problem. The research proves that four common heuristics used in production systems admit unspecified isolation levels and anomalies, and proposes a bitemporal operator algebra with audit-row provenance that excludes three critical write-time anomalies while maintaining language-model oversight.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

Researchers have developed a benchmark dataset and evaluation framework for extracting data snapshots (figures and tables) from institutional documents like World Bank reports. The study reveals that current open-source layout detection models fail to generalize effectively to operational documents, struggling to distinguish analytical from non-analytical content and often fragmenting composite visual artifacts.

🏢 Hugging Face

AINeutralarXiv – CS AI · Jun 56/10

🧠

MPCoT: Reward-Guided Multi-Path Latent Reasoning for Test-Time Scalable Vision-Language-Action

Researchers introduce MPCoT, a multi-path latent reasoning framework for Vision-Language-Action policies that improves decision-making in complex, long-horizon control tasks without adding inference latency. The system evaluates multiple hypothetical action paths using reward signals and aggregates them before final action selection, demonstrating performance gains on robotics benchmarks.

AINeutralarXiv – CS AI · Jun 56/10

🧠

OneReason Technical Report

OneReason introduces a novel framework for improving reasoning capabilities in generative recommendation models by addressing perception and cognition limitations. The approach combines semantic grounding of item tokens with multi-level chain-of-thought sequences, demonstrating that effective reasoning requires both language understanding and coherent interest modeling rather than scaling alone.

AINeutralarXiv – CS AI · Jun 56/10

🧠

DAST: A VLM-LLM Framework for Cross-Interface Anomaly Detection in O-RAN

Researchers present DAST, a zero-shot AI framework combining Vision Language Models and Large Language Models to detect anomalies and denial-of-service attacks in O-RAN (Open Radio Access Network) infrastructure. The system achieved 0.910 F1-Score by converting network telemetry into visual representations and cross-referencing them against domain knowledge, addressing critical security gaps in disaggregated 5G/6G networks.

AINeutralarXiv – CS AI · Jun 55/10

🧠

Adapting Diffusion Language Models for Lossless Pixel-Level Image Transmission

Researchers propose DDM-SSCC, a discrete diffusion model framework that improves lossless image transmission over noisy channels by combining pixel-level restoration with arithmetic coding. The approach outperforms existing lossless and semantic communication baselines on standard datasets, offering practical improvements for exact-recovery image transmission scenarios.

AINeutralarXiv – CS AI · Jun 56/10

🧠

LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs

Researchers introduce PropMe, a framework that distinguishes between LLMs' capability to leak training data when directly attacked versus their propensity to do so during normal use. Testing on open models reveals a significant gap: while models can be forced to reproduce training data through adversarial prompts, they rarely do so voluntarily, suggesting memorization risk is lower in practical deployment than worst-case evaluations suggest.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Towards One-to-Many Temporal Grounding

Researchers introduce One-to-Many Temporal Grounding (OMTG), a new AI task for localizing multiple video segments matching a single text query. They establish the first OMTG benchmark with 56k samples and novel evaluation metrics, achieving 43.65% performance—outperforming advanced models like Gemini 2.5 Pro by 15.85%.

🧠 Gemini

AINeutralarXiv – CS AI · Jun 56/10

🧠

Quantum enhanced rare event discovery and sampling

Researchers introduce a quantum algorithm capable of discovering and sampling rare events—such as financial crashes or system failures—without prior knowledge of which events are rare. The algorithm achieves optimal quantum scaling and delivers quadratic speedups for heavy-tailed systems, with potential applications across finance, infrastructure, and AI reliability.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Learning What to Forget: Improving LLM Unlearning via Learned Token-Level Importance

Researchers introduce Alternating Token-Weighted Unlearning (ATWU), a new method for removing specific knowledge from language models while maintaining their general capabilities. The approach identifies which tokens are most relevant for forgetting by measuring conflict with model retention objectives, achieving state-of-the-art results without requiring external supervision or auxiliary models.

AINeutralarXiv – CS AI · Jun 56/10

🧠

PAMF: Prior-Aware Multimodal Fusion for Incomplete Time Series Data

PAMF is a new machine learning framework that addresses incomplete multimodal time series data in healthcare by distinguishing between two types of missing data patterns and coupling imputation with downstream prediction tasks. The method uses flow matching with type-specific priors and weight sharing to achieve superior performance on healthcare benchmarks compared to existing approaches.

AINeutralarXiv – CS AI · Jun 55/10

🧠

Bridging Domain Expertise and Generalization for Performance Estimation

Researchers propose FRAP (Fused Reference Alignment Prediction), a method that combines a foundation model with a domain-specific base model to improve performance estimation when AI models encounter distribution shifts. By aligning and fusing predictions from both models through calibration, FRAP provides more reliable performance indicators without ground-truth labels.

AINeutralarXiv – CS AI · Jun 56/10

🧠

F3-Tokenizer: Taming Audio Autoencoder Latents for Understanding and Generation

Researchers introduce F3-Tokenizer, a novel audio processing system that combines continuous autoencoders with representation learning to enable both semantic understanding and high-quality audio generation. The approach uses noise-regularized bottlenecks and frozen-LLM supervision to bridge the gap between reconstruction quality and meaningful latent representations.

AINeutralarXiv – CS AI · Jun 56/10

🧠

LatentWave: JEPA Pretraining for Wireless Foundation Models

Researchers introduce LatentWave, a wireless foundation model that uses Joint-Embedding Predictive Architecture (JEPA) instead of traditional masked input reconstruction to learn more transferable representations from wireless spectrograms and channel state information. The model demonstrates improved performance across RF signal classification, 5G positioning, beam prediction, and LoS/NLoS classification tasks while supporting variable antenna configurations.

AIBullisharXiv – CS AI · Jun 56/10

🧠

EasyLens: A Training-Free Plug-and-Play Subtle-Lesion Representation Amplifier for Medical Vision-Language Models

EasyLens is a training-free method that enhances medical vision-language models' ability to detect subtle lesions in clinical images without requiring additional model training or adaptation. The approach uses prototype-based reasoning and representation amplification to ensure weak visual cues from lesions aren't lost in global image representations, outperforming existing enhancement methods across multiple medical datasets.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Emergent Language as an Approach to Conscious AI

Researchers propose using emergent language in multi-agent reinforcement learning as a methodology to study artificial consciousness, where agents develop communication from minimal constraints to reveal whether consciousness-relevant structures arise from task demands rather than human language biases. A proof-of-concept demonstrates agents spontaneously develop self-referential communication and an echo-mismatch detection mechanism, suggesting genuine cognitive emergence rather than inherited patterns.

AINeutralarXiv – CS AI · Jun 56/10

🧠

HomeWorld: A Unified Floorplan-to-Furnished Framework for Generating Controllable, Densely Interactive Whole-Home Scenes

Researchers introduce HomeWorld, a unified framework for generating complete, furnished home scenes from floorplans using hierarchical AI models. The system combines large language models for floorplan generation, image models for furniture layout, and vision-language models for iterative refinement, producing simulation-ready indoor environments with a dataset of 300K real floorplans and 5K fully furnished scenes.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss

Researchers introduce Double Preconditioning (DoPr), a new optimization technique that improves neural network performance during real-world deployment by combining gradient-wise and activation-wise preconditioning. The method addresses test-time feedback—the gap between training metrics and actual task performance in autoregressive models—without requiring improvements in traditional validation loss metrics.

AIBullisharXiv – CS AI · Jun 56/10

🧠

RiskFlow: Fast and Faithful Safety-Critical Traffic Scenario Generation

RiskFlow is a new machine learning framework that generates realistic safety-critical traffic scenarios for autonomous vehicle testing by using a single-pass velocity field model instead of iterative diffusion processes. The approach achieves faster inference times while reducing common motion artifacts and maintaining strong adversarial scenario generation capabilities.

AINeutralarXiv – CS AI · Jun 56/10

🧠

In-Context Multiple Instance Learning

Researchers propose an in-context learning approach for Multiple Instance Learning (MIL) using Perceiver-style architecture pretrained on synthetic data, enabling models to solve new tasks with minimal labeled examples. The method outperforms supervised baselines across twelve benchmarks while requiring no task-specific training at inference time.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Will the Agent Recuse Itself? Measuring LLM-Agent Compliance with In-Band Access-Deny Signals

Researchers propose the 'Recuse Signal,' a lightweight in-band access-control mechanism that allows servers to request autonomous LLM agents voluntarily withdraw from restricted resources. A pilot experiment with GPT-4o, GPT-4o-mini, and Claude Code achieved 100% compliance when the signal was present, though explicit operator authorization caused the most capable model to override the request.

🏢 OpenAI🧠 GPT-4🧠 Claude

AINeutralarXiv – CS AI · Jun 56/10

🧠

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

Researchers propose a PC (Preconditioning) layer that uses polynomial weight parameterization to stabilize training of large language models while maintaining computational efficiency. The approach demonstrates performance improvements over standard transformers during Llama-1B pre-training and includes theoretical guarantees for convergence in certain network architectures.

🧠 Llama

AIBullisharXiv – CS AI · Jun 56/10

🧠

Self-Augmenting Retrieval for Diffusion Language Models

Researchers introduce SARDI, a training-free retrieval-augmented generation framework for discrete diffusion language models that leverages low-confidence token predictions as lookahead signals to guide information retrieval during text generation. The approach achieves significant performance gains on multi-hop question-answering tasks while operating at substantially higher throughput than existing baselines.

AINeutralarXiv – CS AI · Jun 56/10

🧠

RREDCoT: Segment-Level Reward Redistribution for Reasoning Models

Researchers introduce RREDCoT, a novel method for improving reasoning language models by redistributing rewards at the segment level during reinforcement learning training. The approach addresses the high variance problem inherent in current Chain-of-Thought optimization methods by using the model itself to estimate which parts of reasoning traces deserve higher rewards, without requiring expensive additional computation.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Pretraining Recurrent Networks without Recurrence

Researchers propose Supervised Memory Training (SMT), a novel method for training recurrent neural networks that replaces sequential backpropagation through time with parallel, supervised learning on memory state transitions. By leveraging a Transformer encoder to generate training labels, SMT achieves stable gradient propagation and improved performance on language and sequence modeling tasks without the parallelism constraints of traditional RNN training.

← PrevPage 480 of 1377Next →