y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#distributional-shift News & Analysis

3 articles tagged with #distributional-shift. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

3 articles
AIBullisharXiv – CS AI · Jun 27/10
🧠

Zero-Shot Off-Policy Learning

Researchers present a novel off-policy learning method that addresses distributional shift and value overestimation in zero-shot reinforcement learning by establishing a theoretical connection between successor measures and stationary density ratios. The approach enables agents to adapt to new tasks without additional training by inferring optimal importance sampling ratios on-the-fly, with successful benchmarks across motion tracking, continuous control, and long-horizon tasks.

AINeutralarXiv – CS AI · May 287/10
🧠

Pressure-Testing Deception Probes in LLMs: Scaling, Robustness, and the Geometry of Deceptive Representations

Researchers systematically tested linear probes used to detect deception in large language models, finding they achieve near-perfect accuracy on clean data but fail dramatically under distributional shifts. The study reveals deception is encoded through distributed multi-dimensional features rather than a single direction, and probe robustness can be recovered through style augmentation, indicating failures stem from narrow training distributions rather than fundamental architectural limitations.

AINeutralarXiv – CS AI · Mar 177/10
🧠

Right for the Wrong Reasons: Epistemic Regret Minimization for Causal Rung Collapse in LLMs

Researchers identify a fundamental flaw in large language models called 'Rung Collapse' where AI systems achieve correct answers through flawed causal reasoning that fails under distribution shifts. They propose Epistemic Regret Minimization (ERM) as a solution that penalizes incorrect reasoning processes independently of task success, showing 53-59% recovery of reasoning errors in experiments across six frontier LLMs.

🧠 GPT-5