AI Pulse News

Models, papers, tools. 40,023 articles with AI-powered sentiment analysis and key takeaways.

40023 articles

AINeutralarXiv – CS AI · Jun 96/10

🧠

Generation Properties of Stochastic Interpolation under Finite Training Set

Researchers derive closed-form expressions for optimal velocity fields in stochastic interpolation generative models trained on finite datasets, demonstrating that deterministic processes exactly recover training samples while stochastic processes add Gaussian noise. The work formalizes underfitting and overfitting for generative models, showing that estimation errors produce convex combinations of training samples with mixed noise corruption.

AINeutralarXiv – CS AI · Jun 96/10

🧠

VFEM: Visual Feature Empowered Multivariate Time Series Forecasting with Cross-Modal Fusion

Researchers present VFEM, a cross-modal forecasting model that combines pre-trained vision models with time series data to improve multivariate forecasting by capturing cross-channel dependencies. The approach transforms time series into visual representations and uses cross-modal attention fusion, achieving competitive performance while training only 7.45% of total parameters.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Projection and Quantisation: A Unifying View of Learning to Hash, from Random Projections to the RAG Era

Researchers present a unified framework (PQO) that unifies diverse approximate nearest neighbor search methods under three design choices: projection placement, quantization thresholds, and code organization. The framework demonstrates that one-bit codes achieve 32x compression over floats while maintaining quality through re-ranking, with supervised eight-byte codes doubling the performance of two-kilobyte embeddings.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Large Language Models for Imbalanced Classification: Diversity makes the difference

Researchers have developed a novel LLM-based oversampling method to address imbalanced classification in machine learning, focusing on generating diverse synthetic minority samples. The approach outperforms existing methods like SMOTE by preserving categorical information and introducing enhanced diversity through novel sampling and fine-tuning strategies.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Efficient Onboard Vision-Language Inference in UAV-Enabled Low-Altitude Economy Networks via LLM-Enhanced Optimization

Researchers propose an optimized system for running vision-language models on UAVs in low-altitude networks, combining resource allocation algorithms with LLM-enhanced reinforcement learning to minimize latency and power consumption while maintaining inference accuracy. The framework addresses a critical challenge in aerial IoT applications where onboard computational constraints and dynamic network conditions limit real-time multimodal data processing.

AINeutralarXiv – CS AI · Jun 95/10

🧠

SmartMixed: A Two-Phase Training Strategy for Adaptive Activation Function Learning in Neural Networks

SmartMixed introduces a two-phase training strategy enabling neural networks to learn optimal per-neuron activation functions dynamically, then fix them for efficient inference. The approach allows different neurons to select from six candidate activation functions based on learned preferences, demonstrating that layer-specific activation choices improve network performance compared to uniform activation function architectures.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Learning Quantized Continuous Controllers for Integer Hardware

Researchers demonstrate quantization-aware training techniques that compress reinforcement learning policies to 2-3 bits per weight while maintaining performance comparable to full-precision models, enabling efficient deployment on resource-constrained FPGA hardware with microsecond-level inference latency.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Correcting Mean Bias in Text Embeddings: A Refined Renormalization with Training-Free Improvements on MMTEB

Researchers identify a systematic mean bias in sentence-embedding models where all embeddings share a near-identical mean component. They propose two training-free corrections, with the projection-based method (R2) demonstrating consistent improvements across 38 models on MMTEB benchmarks by better canceling mean-estimation errors than direct subtraction.

AINeutralarXiv – CS AI · Jun 95/10

🧠

SMART: Shot-Aware Multimodal Video Moment Retrieval with Audio-Enhanced MLLM

Researchers introduce SMART, a new multimodal AI framework for video moment retrieval that combines audio and visual features with shot-aware token compression to locate specific temporal segments in untrimmed videos. The method demonstrates significant performance improvements on benchmark datasets, achieving 1.61% and 2.59% gains in key metrics over previous state-of-the-art approaches.

AINeutralarXiv – CS AI · Jun 96/10

🧠

SVRG and Beyond via Posterior Correction

Researchers have established a fundamental connection between Stochastic Variance Reduced Gradient (SVRG), a decade-old optimization method, and Bayesian posterior correction techniques. This theoretical breakthrough enables the derivation of novel SVRG extensions using flexible exponential-family posteriors, including Newton-like and Adam-like variants that improve training efficiency.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Developing Distance-Aware Physics-Constrained Probabilistic Frameworks for Industrial Prognostics

Researchers present two physics-constrained probabilistic frameworks (PC-SNGP and PC-SNER) for industrial prognostics that improve prediction accuracy and uncertainty quantification by maintaining awareness of input distance from training data. The methods use spectral normalization to preserve distance representations and dynamic weighting strategies, demonstrating improved performance on bearing failure prediction benchmarks while maintaining robustness under distributional shifts.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Unambiguous Representations in Neural Networks: An Information-Theoretic Approach to Intentionality

Researchers introduce an information-theoretic framework to measure representational ambiguity in neural networks, demonstrating that network connectivity structures can encode unambiguous content independent of behavioral performance. Using MNIST classification experiments, they achieve 100% accuracy in identifying output neuron class identity from relational structure alone in dropout-trained networks, suggesting neural systems can exhibit the low-ambiguity representations theorized as necessary for consciousness.

AINeutralarXiv – CS AI · Jun 96/10

🧠

FADTI: Fourier and Attention Driven Diffusion for Multivariate Time Series Imputation

Researchers introduce FADTI, a diffusion-based framework for multivariate time series imputation that combines Fourier frequency analysis with attention mechanisms to handle missing data in healthcare, traffic, and biological systems. The model demonstrates superior performance over existing methods, particularly when dealing with high missing data rates and distribution shifts.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Collaborative Edge-to-Server Inference for Vision-Language Models

Researchers propose a collaborative edge-to-server inference framework for vision-language models that reduces communication costs by selectively transmitting only high-entropy regions of interest rather than full-resolution images. The two-stage approach maintains inference accuracy while substantially decreasing bandwidth requirements across visual question-answering tasks.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Exploring the Effect of Basis Rotation on NQS Performance

Researchers demonstrate that basis rotations in Neural Quantum States (NQS) alter the optimization landscape geometry without changing the underlying physics, causing optimization algorithms to converge toward saddle points rather than true ground states. This finding reveals a fundamental geometric mechanism explaining why NQS performance depends on basis choice, with implications for quantum computing and variational algorithms.

AINeutralarXiv – CS AI · Jun 96/10

🧠

GenTSE: Enhancing Target Speaker Extraction via a Coarse-to-Fine Generative Language Model

GenTSE introduces a two-stage generative language model for target speaker extraction that separates semantic and acoustic token generation, demonstrating improved speech quality and speaker consistency over previous LM-based approaches. The system employs novel training strategies including Frozen-LM Conditioning and Direct Preference Optimization to reduce exposure bias and align outputs with human perceptual preferences.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Adversarial Instance Generation and Robust Training for Neural Combinatorial Optimization with Multiple Objectives

Researchers propose a framework for improving the robustness of deep reinforcement learning solvers for multi-objective combinatorial optimization problems by generating adversarial instances that expose weaknesses and training defenses using hardness-aware preference selection. The method demonstrates significant improvements in solver generalizability across traveling salesman, vehicle routing, and knapsack problems.

AINeutralarXiv – CS AI · Jun 95/10

🧠

One if by Land, Two if by Sea, Three if by Four Seas, and More to Come -- Values of Perception, Prediction, Communication, and Common Sense in Decision Making

Researchers have developed a formal decision-theoretic framework that quantifies the value of perception, prediction, communication, and common sense in autonomous decision-making systems. The work reveals that perception alone can have negative value, while combined perception-prediction and standalone prediction always yield non-negative returns, with applications to autonomous systems design and cognitive science.

AINeutralarXiv – CS AI · Jun 96/10

🧠

How Context Shapes Truth: Geometric Transformations of Statement-level Truth Representations in LLMs

Researchers demonstrate that Large Language Models encode truth as geometric vectors in their activation space, and these vectors undergo predictable transformations when contextual information is introduced. The study reveals that larger models rely on directional changes to distinguish relevant context while smaller models use magnitude shifts, with conflicting context producing larger geometric shifts than aligned context.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Revisiting Training Scale: An Empirical Study of Token Count, Power Consumption, and Parameter Efficiency

A new empirical study challenges the assumption that scaling training token counts linearly improves large language model performance, revealing instead that increased token counts lead to strictly declining training efficiency when energy consumption and execution duration are measured alongside traditional metrics.

AIBullisharXiv – CS AI · Jun 96/10

🧠

DYCP: Dynamic Context Pruning for Long-Form Dialogue with LLMs

Researchers introduce DyCP, a lightweight context management system that dynamically selects relevant dialogue segments for long-form conversations with large language models, improving inference efficiency without offline preprocessing. The method demonstrates competitive performance across multiple LLM benchmarks while reducing computational costs and latency in real-world dialogue applications.

AIBullisharXiv – CS AI · Jun 96/10

🧠

A Comparative Study of Student Perspectives on Technical Writing Feedback Quality: Evaluating LLMs, SLMs, and Humans in Computer Science Topics

A research study compares feedback quality from locally-hosted small language models (SLMs), commercial LLMs like GPT-4, and human instructors across computer science courses. The findings show that quantized Llama-3.1 matched commercial LLM performance while offering privacy and cost advantages, though human feedback remained superior for specialized writing tasks.

🧠 GPT-4🧠 Llama

AINeutralarXiv – CS AI · Jun 96/10

🧠

XCR-Bench: Benchmarking Cross-Cultural Reasoning in LLMs via Culture-Specific Items and Hall's Triad

Researchers introduce XCR-Bench, a benchmark dataset for evaluating cross-cultural reasoning in large language models, containing 4,100 parallel sentences and 1,098 culture-specific items across three reasoning tasks. The study reveals that state-of-the-art multilingual LLMs consistently fail to properly identify and adapt culturally sensitive content, exposing systematic biases and gaps in cultural competency.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Comparative evaluation of training strategies using partially labelled datasets for segmentation of white matter hyperintensities and stroke lesions in FLAIR MRI

Researchers developed and evaluated six training strategies for deep learning models to segment white matter hyperintensities and stroke lesions in MRI scans using partially labeled datasets. Pseudolabeling emerged as the most effective approach, successfully leveraging 2,052 MRI volumes with incomplete annotations to create reliable automated segmentation tools for cerebral small vessel disease monitoring.

AINeutralarXiv – CS AI · Jun 96/10

🧠

UA-DCM: Uncertainty-aware Causal Decision Making via Effect Bound Decomposition

Researchers introduce UA-DCM, a framework that distinguishes between causal effect uncertainty that can be resolved with more data versus uncertainty inherent to unobserved confounding. By decomposing effect bounds through max-min optimization, the method helps practitioners determine whether additional sampling will improve decision-making or if alternative approaches like randomized trials are necessary.

← PrevPage 530 of 1601Next →