#distribution-shift News & Analysis

18 articles tagged with #distribution-shift. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

18 articles

AIBearisharXiv – CS AI · 2d ago7/10

🧠

Do Physics Foundation Models Learn Generalizable Physics? A Bias-Aware Benchmark Across Physical Regimes and Distribution Shifts

Researchers benchmarked five physics foundation models across 8 physical dynamics and 25 test regimes, revealing that current models function as conditional rather than universal generalists. The study demonstrates that model performance heavily depends on physical regime, temporal scale, and distribution shifts, with pretraining and scaling unable to reliably overcome these limitations.

AIBullisharXiv – CS AI · May 127/10

🧠

Do Linear Probes Generalize Better in Persona Coordinates?

Researchers propose using 'persona coordinates'—low-dimensional subspaces derived from contrasting harmful and harmless model behaviors—to improve the generalization of linear probes that monitor language models for deception and harmful outputs. Testing across 10 datasets shows that probes trained on persona-derived directions significantly outperform those trained on raw model activations, addressing a critical gap in AI safety monitoring.

AIBullisharXiv – CS AI · Mar 177/10

🧠

OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions

Researchers propose OrthoFormer, a new Transformer architecture that addresses causal learning limitations by embedding instrumental variable estimation directly into neural networks. The framework aims to distinguish between spurious correlations and true causal mechanisms, potentially improving AI model robustness and reliability under distribution shifts.

AINeutralarXiv – CS AI · Mar 37/104

🧠

The Information-Theoretic Imperative: Compression and the Epistemic Foundations of Intelligence

Researchers propose the Compression Efficiency Principle (CEP) to explain why artificial neural networks and biological brains develop similar representations despite different substrates. The theory suggests both systems converge on efficient compression strategies that encode stable invariants rather than unstable correlations, providing a unified framework for understanding intelligence across biological and artificial systems.

AINeutralarXiv – CS AI · 2d ago6/10

🧠

Entropy-KL Divergence-based Token Masking: A Novel Approach for Selective Fine-tuning of Large Language Models

Researchers propose EKSFT, a novel fine-tuning method that selectively masks high-entropy and high-KL divergence tokens during supervised fine-tuning of large language models. The approach aims to preserve pre-trained model distributions while efficiently activating task-relevant capabilities in low-data regimes, demonstrating improved performance on mathematical reasoning benchmarks.

AINeutralarXiv – CS AI · 3d ago6/10

🧠

On the Learnability of Test-Time Adaptation: A Recovery Complexity Perspective

Researchers introduce the first theoretical framework for analyzing test-time adaptation (TTA) in machine learning, establishing recovery complexity bounds that reveal fundamental limits on how quickly models can adapt to non-stationary data streams without labeled data. The work provides mathematical guarantees for TTA learnability and identifies an intrinsic trade-off between adaptivity and information constraints.

AINeutralarXiv – CS AI · 4d ago6/10

🧠

From Static Context to Calibrated Interactive RL: Mitigating Distribution Shift in Multi-turn Dialogue with Aligned Simulator

Researchers propose Calibrated Interactive RL, a framework addressing distribution shift problems in multi-turn dialogue systems by combining interactive reinforcement learning with simulator alignment. The approach theoretically and empirically demonstrates that aligning simulators with human interaction patterns significantly improves LLM-based dialogue agent performance compared to static context and unaligned interactive methods.

AINeutralarXiv – CS AI · 4d ago6/10

🧠

SL-BiLEM: Structured Learnable Behavior-in-the-Loop Epidemic Modeling for Forecasting and Policy Evaluation

Researchers introduce SL-BiLEM, a machine learning framework that improves epidemic forecasting by accounting for how human behavior changes in response to disease spread and policy interventions. The model uses physical constraints to maintain accuracy even when facing novel policy scenarios, demonstrating 76% improvement over existing neural baselines and potential applications for public health decision-making.

AINeutralarXiv – CS AI · May 126/10

🧠

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge

Researchers demonstrate that reasoning-capable LLMs improve judgment accuracy significantly on complex tasks like math and coding, but offer minimal or negative benefits on simpler evaluations while consuming substantially more computational resources. They introduce RACER, an adaptive routing algorithm that dynamically selects between reasoning and non-reasoning judges under budget constraints while accounting for distribution shifts.

AINeutralarXiv – CS AI · May 126/10

🧠

Rethinking Entropy Minimization in Test-Time Adaptation for Autoregressive Models

Researchers present a unified mathematical framework for Test-Time Adaptation (TTA) in autoregressive generative models, decomposing entropy minimization into token-level policy gradient and entropy losses. Validated on Whisper ASR across 20+ domains, the approach demonstrates consistent performance improvements and reconciles previously disparate adaptation methods under a single theoretical foundation.

AINeutralarXiv – CS AI · May 126/10

🧠

Normalization Equivariance for Arbitrary Backbones, with Application to Image Denoising

Researchers present a parameter-free wrapper method (WNE) that enforces Normalization Equivariance—robustness to brightness and contrast shifts—around any neural network backbone without architectural constraints. The approach characterizes NE as a normalize-process-denormalize factorization, enabling compatibility with modern components like transformers and attention mechanisms while avoiding the 1.6x computational overhead of existing methods.

AINeutralarXiv – CS AI · May 46/10

🧠

TimeRFT: Stimulating Generalizable Time Series Forecasting for TSFMs via Reinforcement Finetuning

Researchers introduce TimeRFT, a reinforcement learning-based fine-tuning method for Time Series Foundation Models that improves forecasting accuracy and generalization. By implementing temporal reward mechanisms and intelligent data selection, TimeRFT outperforms traditional supervised fine-tuning approaches across diverse forecasting tasks and data conditions.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Beyond Surface Statistics: Robust Conformal Prediction for LLMs via Internal Representations

Researchers propose a conformal prediction framework for large language models that uses internal neural representations rather than surface-level outputs to assess reliability and uncertainty. The Layer-Wise Information scoring method improves prediction validity under distribution shift while maintaining competitive performance, addressing a critical challenge in deploying LLMs where traditional uncertainty signals become unreliable.

AINeutralarXiv – CS AI · Apr 146/10

🧠

When Valid Signals Fail: Regime Boundaries Between LLM Features and RL Trading Policies

Researchers demonstrate that large language models can extract predictive features from financial news with valid intermediate signals (Information Coefficient >0.15), yet these features fail to improve reinforcement learning trading agents during macroeconomic shocks. The findings reveal a critical gap between feature-level validity and downstream policy robustness, suggesting that valid signals alone cannot guarantee trading performance under distribution shifts.

AINeutralarXiv – CS AI · Apr 146/10

🧠

Understanding Generalization in Role-Playing Models via Information Theory

Researchers introduce R-EMID, an information-theoretic metric to diagnose how distribution shifts degrade role-playing model performance in real-world deployments. The framework reveals that user shifts pose the greatest generalization risk, while co-evolving reinforcement learning provides the most effective mitigation strategy.

AINeutralarXiv – CS AI · Mar 35/104

🧠

Spurious Correlation-Aware Embedding Regularization for Worst-Group Robustness

Researchers propose SCER (Spurious Correlation-Aware Embedding Regularization), a new deep learning approach that improves AI model robustness by regularizing feature representations to suppress spurious correlations. The method demonstrates superior performance in worst-group accuracy across vision and language tasks compared to existing state-of-the-art approaches.

AIBullisharXiv – CS AI · Mar 26/109

🧠

ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

Researchers propose ProtoDCS, a new framework for robust test-time adaptation of Vision-Language Models in open-set scenarios. The method uses Gaussian Mixture Model verification and uncertainty-aware learning to better handle distribution shifts while maintaining computational efficiency.

AINeutralarXiv – CS AI · Mar 54/10

🧠

BD-Merging: Bias-Aware Dynamic Model Merging with Evidence-Guided Contrastive Learning

Researchers introduce BD-Merging, a new AI framework that improves model merging for multi-task learning by addressing bias and distribution shift issues. The method uses uncertainty modeling and contrastive learning to create more reliable AI systems that can better handle real-world data variations.