#latent-representations News & Analysis

9 articles tagged with #latent-representations. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

9 articles

AIBearisharXiv – CS AI · Jun 237/10

🧠

Attacking the Trusted Imagination: Oracle-Level Integrity Attacks on Imagine-then-Act World Models

Researchers demonstrate a novel attack vector against vision-language-action (VLA) policies that exploit the 'trusted imagination' component of world-action models rather than targeting reactive policies directly. By perturbing observations to corrupt latent trajectory predictions, attackers can fool downstream systems like safety gates and MPC planners while leaving the base policy unaffected, revealing a critical asymmetry in AI system robustness.

AIBullisharXiv – CS AI · Jun 97/10

🧠

ATM: Action-Consistency Transfer Matrix for Diagnosing and Improving Latent World Models

Researchers introduce ATM (Action-Consistency Transfer Matrix), a diagnostic tool that evaluates latent world models used in AI planning by analyzing whether learned representations preserve action semantics. The method reduces evaluation time from hours to seconds while providing interpretable insights into model quality, achieving over 100x speedup compared to traditional simulator-based approaches.

AIBullisharXiv – CS AI · Jun 57/10

🧠

PLAN-S: Bridging Planning with Latent Style Dynamics for Autonomous Driving World Models

Researchers introduce PLAN-S, a new neural architecture that improves autonomous driving by creating interpretable cost maps from latent world models, enabling better control over driving style dynamics. The method demonstrates significant safety improvements on benchmark datasets, reducing collision rates by 42% on nuScenes while maintaining frozen backbone models.

AIBullisharXiv – CS AI · Jun 47/10

🧠

MIRAGE: Mobile Agents with Implicit Reasoning and Generative World Models

MIRAGE is a new AI framework that enables mobile agents to reason internally using compressed latent representations instead of generating verbose reasoning chains. By aligning hidden states with future interface screenshots, the system achieves comparable performance to explicit chain-of-thought approaches while reducing token generation by 3-5x, offering significant efficiency gains for AI-powered mobile automation.

AIBullisharXiv – CS AI · Jun 27/10

🧠

Continuous Reasoning for Vision-Language-Action

Researchers propose Continuous Reasoning for Vision-Language-Action (VLA), a framework that uses shared Gaussian latent representations instead of discrete tokens to enable robotic control. The approach achieves 40.4% improvement on robotic manipulation tasks, suggesting that effective AI reasoning for physical control requires verifiable, shareable internal representations rather than explicit language.

AIBullisharXiv – CS AI · Mar 46/102

🧠

Chain of World: World Model Thinking in Latent Motion

Researchers introduce CoWVLA (Chain-of-World VLA), a new Vision-Language-Action model paradigm that combines world-model temporal reasoning with latent motion representation for embodied AI. The approach outperforms existing methods in robotic simulation benchmarks while maintaining computational efficiency through a unified autoregressive decoder that models both keyframes and action sequences.

AINeutralarXiv – CS AI · Jun 236/10

🧠

PoLAR: Factorizing Extent and Mode in Latent Actions for Robot Policy Learning

Researchers introduce PoLAR, a novel latent action representation framework that uses radial-direction structure in hyperbolic space to separately encode transition extent and mode for robot policy learning. The method improves downstream performance across simulation and real-world experiments by leveraging temporal gaps as a proxy for transition magnitude, outperforming existing latent action baselines and vision-language models.

AINeutralarXiv – CS AI · Jun 46/10

🧠

Unifying Model-Free Efficiency and Model-Based Representations via Latent Dynamics

Researchers introduce Unified Latent Dynamics (ULD), a reinforcement learning algorithm that combines the sample efficiency of model-free methods with the representational advantages of model-based approaches without requiring planning overhead. The method achieves competitive performance across 80 diverse environments including continuous control, visual tasks, and Atari games with minimal hyperparameter tuning.

🏢 Google

AINeutralarXiv – CS AI · Jun 16/10

🧠

Dreaming Of Others: Latent Teammate Modeling In World Models For Multi-Agent Reinforcement Learning

Researchers propose a novel architecture for multi-agent reinforcement learning that models teammates as learnable components within a world model, using a Theory-of-Mind head to infer partner behavior and enable zero-shot coordination. This approach extends Dreamer-style models beyond single-agent settings by factorizing latent states into environment and teammate representations, potentially advancing cooperative AI systems.