y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#self-supervised-learning News & Analysis

44 articles tagged with #self-supervised-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

44 articles
AIBullisharXiv – CS AI · 10h ago7/10
🧠

RayDer: Scalable Self-Supervised Novel View Synthesis from Real-World Video

RayDer introduces a unified transformer architecture that consolidates camera estimation, scene reconstruction, and rendering into a single model for self-supervised novel view synthesis from real-world video. The system achieves clean power-law scaling with data and compute while maintaining competitive performance with supervised approaches, addressing a key scalability challenge in 3D vision.

AIBullisharXiv – CS AI · May 127/10
🧠

Event Fields: Learning Latent Event Structure for Waveform Foundation Models

Researchers introduce a novel waveform foundation model that represents physiological signals as latent event processes rather than sequential tokens, using self-supervised learning to capture clinically meaningful structure. The approach demonstrates improved performance on medical benchmarks including arrhythmia classification and hemodynamic prediction, suggesting event-centric representations may be more suitable for healthcare AI than traditional sequence-based methods.

AIBullisharXiv – CS AI · May 127/10
🧠

Team-Based Self-Play With Dual Adaptive Weighting for Fine-Tuning LLMs

Researchers propose TPAW, a self-play algorithm that improves LLM alignment without human-labeled data by having models collaborate and compete against historical checkpoints while using adaptive weighting mechanisms. The approach addresses instability and diminishing optimization gains in existing self-training methods, demonstrating consistent improvements across multiple benchmarks.

AIBullisharXiv – CS AI · May 117/10
🧠

When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining

Researchers propose a gradient-based bilevel optimization method that automatically learns composite loss weights during pretraining by aligning gradients with downstream objectives. The approach reduces hyperparameter tuning overhead to ~30% above baseline training cost while matching or exceeding manually tuned baselines across event-sequence and computer vision tasks.

AIBullisharXiv – CS AI · May 117/10
🧠

Pan-FM: A Pan-Organ Foundation Model with Saliency-Guided Masking for Missing Robustness

Researchers introduce Pan-FM, a foundation model trained on multimodal medical imaging from seven organs that addresses the critical problem of missing data in real-world biomedical datasets. The model uses Saliency-Guided Masking to prevent bias toward dominant organs and demonstrates superior performance on disease prediction tasks across the UK Biobank.

AIBullisharXiv – CS AI · May 117/10
🧠

Enabling Unsupervised Training of Deep EEG Denoisers With Intelligent Partitioning

Researchers propose Intelligent Partitioning for Self-supervised Denoising (iPSD), a deep learning method that eliminates the need for artifact-free training data to denoise electroencephalogram (EEG) signals from wearable devices. The technique achieves state-of-the-art performance even in extremely noisy conditions by learning to partition noisy EEG segments into independent realizations sharing the same underlying neural signal.

AIBullisharXiv – CS AI · Apr 157/10
🧠

Instructions are all you need: Self-supervised Reinforcement Learning for Instruction Following

Researchers propose a label-free self-supervised reinforcement learning framework that enables language models to follow complex multi-constraint instructions without external supervision. The approach derives reward signals directly from instructions and uses constraint decomposition strategies to address sparse reward challenges, demonstrating strong performance across both in-domain and out-of-domain instruction-following tasks.

AIBullisharXiv – CS AI · Apr 147/10
🧠

TimeRewarder: Learning Dense Reward from Passive Videos via Frame-wise Temporal Distance

TimeRewarder is a new machine learning method that learns dense reward signals from passive videos to improve reinforcement learning in robotics. By modeling temporal distances between video frames, the approach achieves 90% success rates on Meta-World tasks using significantly fewer environment interactions than prior methods, while also leveraging human videos for scalable reward learning.

AINeutralarXiv – CS AI · 10h ago6/10
🧠

AnchorSteer: Self-Discovered Concept Injection for Structure-Preserving Music Editing

AnchorSteer is a new AI framework for music editing that maintains rhythmic and melodic structure while allowing semantic modifications through self-discovered concept vectors injected into diffusion models. The approach addresses a core tension in music AI: steering methods that enable high-level edits typically degrade structural integrity, while protective mechanisms suppress semantic control.

AINeutralarXiv – CS AI · 10h ago6/10
🧠

STEP: Learning STructured Embeddings for Progressive Time Series

Researchers introduce STEP, a self-supervised learning method that creates interpretable representations of time series data showing irreversible state transitions like equipment degradation or task completion. The approach encodes progression information in geometric coordinates (polar angles and radius) without requiring labeled data, matching or exceeding black-box models while providing transparency into underlying mechanisms.

AINeutralarXiv – CS AI · 10h ago6/10
🧠

Learning Cardiac Latent Representations in Vectorcardiogram Space

Researchers introduce LVCG, a self-supervised learning framework that represents cardiac electrical activity in vectorcardiogram (VCG) space rather than traditional ECG signal space. By learning unified latent representations instead of lead-specific artifacts, the method reduces redundancy, minimizes spurious correlations, and demonstrates improved generalization across cardiac assessment tasks.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting

Researchers propose FHRFormer, a masked transformer-based autoencoder that reconstructs missing fetal heart rate data from wearable monitors using self-supervised learning. The method addresses signal dropout caused by sensor displacement and positional changes, preserving spectral characteristics better than traditional interpolation while enabling both data inpainting and forecasting for improved fetal risk assessment.

AINeutralarXiv – CS AI · 3d ago6/10
🧠

The Impact of Semantic Pairs on Self-Supervised Representation Learning

Researchers demonstrate that training self-supervised learning models with semantic positive pairs (different images of the same class) outperforms traditional augmented-pair methods across multiple benchmarks. The controlled study isolates semantic pairing's effectiveness and shows contrastive methods like SimCLR benefit most strongly, providing guidance for designing more generalizable representation learning frameworks.

AINeutralCrypto Briefing · 3d ago6/10
🧠

Yann LeCun’s paper reveals conditions for LeJEPA to learn world models

Yann LeCun's research paper outlines the specific conditions necessary for LeJEPA (Joint-Embedding Predictive Architecture) to effectively learn world models, potentially advancing AI's ability to understand complex systems. However, practical implementation faces significant hurdles due to environmental variability and real-world complexity.

Yann LeCun’s paper reveals conditions for LeJEPA to learn world models
AIBullisharXiv – CS AI · 4d ago6/10
🧠

Bayesian Gated Non-Negative Contrastive Learning

Researchers propose BayesNCL, a new machine learning approach that improves the interpretability of self-supervised learning models by using probabilistic gating to filter out task-irrelevant features. The method achieves a 142.1% improvement in semantic consistency on ImageNet-100 while maintaining downstream task performance, addressing a fundamental limitation in how contrastive learning models process information.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

Revisiting Graph Autoencoders as Implicit Contrastive Learners

Researchers demonstrate that graph autoencoders (GAEs), traditionally viewed as distinct from graph contrastive learning approaches, actually function as implicit contrastive learners. By unifying these paradigms and introducing asymmetric contrastive views as a design principle, the work provides a clearer framework for understanding and building more effective graph neural networks for self-supervised learning tasks.

AINeutralarXiv – CS AI · 4d ago6/10
🧠

SelfJudge: Faster Speculative Decoding via Self-Supervised Judge Verification

Researchers propose SelfJudge, a new method for accelerating large language model inference through self-supervised judge verification that eliminates the need for human annotations. The approach trains verifiers to assess whether token substitutions preserve semantic meaning, enabling faster inference without sacrificing accuracy across diverse NLP tasks.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

Geometry-Aware Contrastive Learning for Few-Shot Automatic Modulation Recognition

Researchers propose Dynamic-Consistency Contrastive Learning (DyCo-CL), a machine learning framework that improves automatic modulation recognition in wireless signal processing by combining virtual adversarial augmentation with semantic consistency loss. The method achieves a 6.27% accuracy improvement in few-shot learning scenarios on standard benchmarks, addressing key challenges in self-supervised learning for signal classification.

AINeutralarXiv – CS AI · 5d ago6/10
🧠

FoundObj: Self-supervised Foundation Models as Rewards for Label-free 3D Object Segmentation

FoundObj introduces a self-supervised framework for 3D object segmentation in point clouds without manual scene-level annotations, using reinforcement learning guided by semantic and geometric reward modules from foundation models. The approach demonstrates strong performance across benchmarks and shows particular promise in zero-shot and long-tail scenarios, advancing label-free computer vision capabilities.

AIBullisharXiv – CS AI · 5d ago6/10
🧠

Olaf-World: Orienting Latent Actions for Video World Modeling

Researchers introduce Olaf-World, a new approach to training action-controllable video world models that solves the problem of action latents failing to transfer across different contexts. By anchoring latent actions to observable semantic effects rather than relying on scarce labeled data, the method achieves stronger zero-shot transfer and more efficient adaptation to new control interfaces.

AINeutralarXiv – CS AI · May 125/10
🧠

Multi-Level Graph Attention Network Contrastive Learning for Knowledge-Aware Recommendation

Researchers propose a multi-level graph attention network framework that uses contrastive learning to improve knowledge-graph-based recommendation systems. The approach addresses limitations in existing methods by leveraging multi-view learning and self-supervised techniques to better model user preferences and item representations.

AINeutralarXiv – CS AI · May 126/10
🧠

WavesFM: Hierarchical Representation Learning for Longitudinal Wearable Sensor Waveforms

Researchers introduce WavesFM, a foundation model using hierarchical self-supervised learning to extract health insights from continuous wearable sensor data. Trained on 6.8M hours of physiological recordings from 324k individuals, the model captures both local waveform patterns and long-term behavioral dynamics, demonstrating strong performance across 58 health-related prediction tasks.

AINeutralarXiv – CS AI · May 126/10
🧠

Rethinking Temporal Consistency in Video Object-Centric Learning: From Prediction to Correspondence

Researchers propose Grounded Correspondence, a new framework for video object tracking that replaces learned prediction models with deterministic bipartite matching. By leveraging existing vision backbone features, the approach achieves competitive results without learnable temporal parameters, challenging the conventional approach of using dynamics modules for temporal consistency.

AIBullisharXiv – CS AI · May 126/10
🧠

TinySSL: Distilled Self-Supervised Pretraining for Sub-Megabyte MCU Models

Researchers introduce CA-DSSL, a new self-supervised learning technique that enables efficient AI model training on microcontrollers with under 500K parameters. The method surpasses existing approaches by 18 percentage points on standard benchmarks while requiring significantly fewer parameters, achieving 94% of supervised learning performance with models deployable in just 378 KB of memory.

AIBullisharXiv – CS AI · May 116/10
🧠

ProteinJEPA: Latent prediction complements protein language models

Researchers demonstrate that ProteinJEPA, a latent-space prediction technique, can complement traditional masked language modeling (MLM) in protein language models, achieving better downstream task performance when combined strategically. The optimal approach—masked-position MLM+JEPA—wins 10 out of 16 evaluation tasks against MLM-only baselines while maintaining computational efficiency.

Page 1 of 2Next →