y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#jepa News & Analysis

8 articles tagged with #jepa. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

8 articles
AINeutralarXiv – CS AI · 3d ago6/10
🧠

Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning

Researchers propose a hybrid pre-training approach for language models that combines masked language modeling with a JEPA-style latent-space prediction objective, creating more semantically-aligned embeddings with better geometric properties than traditional MLM-only approaches despite achieving similar downstream accuracy.

🏢 Nvidia
AINeutralarXiv – CS AI · 3d ago6/10
🧠

LatentWave: JEPA Pretraining for Wireless Foundation Models

Researchers introduce LatentWave, a wireless foundation model that uses Joint-Embedding Predictive Architecture (JEPA) instead of traditional masked input reconstruction to learn more transferable representations from wireless spectrograms and channel state information. The model demonstrates improved performance across RF signal classification, 5G positioning, beam prediction, and LoS/NLoS classification tasks while supporting variable antenna configurations.

AINeutralarXiv – CS AI · 6d ago6/10
🧠

BRo-JEPA: Learning Modular Arithmetic in Latent Space

Researchers introduce BRo-JEPA, a neural network architecture that learns modular arithmetic rules by imposing circular structure in latent space, achieving 99.46% zero-shot generalization on unseen operations. The work demonstrates that neural networks can learn abstract algebraic rules rather than merely memorizing patterns when architecture aligns with problem structure.

AIBullisharXiv – CS AI · 6d ago6/10
🧠

UR-JEPA: Uniform Rectifiability as a Regularizer for Joint-Embedding Predictive Architectures

Researchers introduce UR-JEPA, a novel regularization technique for Joint-Embedding Predictive Architectures that addresses representation collapse by targeting uniformly rectifiable measures rather than isotropic Gaussians. The method demonstrates superior performance on Inet10 with an 0.83 percentage-point gain over existing approaches and produces geometrically distinct embeddings with sharper spectral drops, suggesting more structured learned representations.

AINeutralarXiv – CS AI · 6d ago6/10
🧠

Echo: A Joint-Embedding Predictive Architecture for Speaker Diarization and Speech Recognition in a Shared Latent Space

Echo is a proof-of-concept audio system that unifies speaker diarization, speech recognition, and source separation on a single 25M-parameter ViT encoder pretrained with joint-embedding predictive architecture (JEPA). The system demonstrates competitive performance across three tasks simultaneously without per-task fine-tuning, though it represents a design exploration rather than state-of-the-art on individual metrics.

AINeutralarXiv – CS AI · May 125/10
🧠

Sub-JEPA: Subspace Gaussian Regularization for Stable End-to-End World Models

Researchers propose Sub-JEPA, an improved approach to training world models that addresses stability issues in Joint-Embedding Predictive Architectures by applying Gaussian constraints across random subspaces rather than the full embedding space. The method achieves better performance than the existing LeWorldModel baseline while maintaining training stability and representation flexibility.

AIBullisharXiv – CS AI · May 116/10
🧠

ProteinJEPA: Latent prediction complements protein language models

Researchers demonstrate that ProteinJEPA, a latent-space prediction technique, can complement traditional masked language modeling (MLM) in protein language models, achieving better downstream task performance when combined strategically. The optimal approach—masked-position MLM+JEPA—wins 10 out of 16 evaluation tasks against MLM-only baselines while maintaining computational efficiency.

AINeutralarXiv – CS AI · Mar 44/103
🧠

Improving Diffusion Planners by Self-Supervised Action Gating with Energies

Researchers propose SAGE (Self-supervised Action Gating with Energies), a new method to improve diffusion planners in offline reinforcement learning by filtering out dynamically inconsistent trajectories. The approach uses a latent consistency signal to re-rank candidate actions at inference time, improving performance across locomotion, navigation, and manipulation tasks without requiring environment rollouts or policy retraining.