y0news

#self-supervised-learning News & Analysis

44 articles tagged with #self-supervised-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · May 11 · 6/10

Miner: Mining Intrinsic Mastery for Data-Efficient RL in Large Reasoning Models

Researchers introduce Miner, a reinforcement learning method that uses a model's intrinsic uncertainty as a self-supervised reward signal to improve training efficiency for large reasoning models. The approach achieves state-of-the-art results on reasoning benchmarks, with gains of up to 4.58 points in Pass@1 over existing methods, addressing a critical inefficiency in current critic-free RL training.

AI · Neutral · arXiv – CS AI · May 1 · 6/10

Why Self-Supervised Encoders Want to Be Normal

Researchers develop a theoretical framework connecting Information Bottleneck principles to encoder-decoder learning through rate-distortion analysis, showing optimal representations form soft clusters on probability manifolds. The work introduces Sketched Isotropic Gaussian Regularization (SIGReg) as a principled regularizer for self-supervised, semi-supervised, and supervised learning without requiring variational bounds.

AI · Bullish · arXiv – CS AI · Apr 20 · 6/10

SSMamba: A Self-Supervised Hybrid State Space Model for Pathological Image Classification

SSMamba introduces a self-supervised hybrid state space model designed to improve pathological image classification by addressing domain shift, local-global relationship modeling, and fine-grained feature detection. The framework outperforms 11 state-of-the-art pathological foundation models on multiple public datasets without requiring large external training datasets.

AI · Neutral · arXiv – CS AI · Apr 14 · 6/10

A Minimal Model of Representation Collapse: Frustration, Stop-Gradient, and Dynamics

Researchers present a minimal mathematical model demonstrating how representation collapse occurs in self-supervised learning when frustrated (misclassified) samples exist, and show that stop-gradient techniques prevent this failure mode. The work provides closed-form analysis of gradient-flow dynamics and fixed points, offering theoretical insights into why modern embedding-based learning systems sometimes lose discriminative power.

AI · Neutral · arXiv – CS AI · Apr 10 · 6/10

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

Facebook Research releases EB-JEPA, an open-source library for learning representations through Joint-Embedding Predictive Architectures that predict in representation space rather than pixel space. The framework demonstrates strong performance across image classification (91% on CIFAR-10), video prediction, and action-conditioned world models, making self-supervised learning more accessible for research and practical applications.

AI · Bullish · arXiv – CS AI · Mar 17 · 6/10

Pixel-level Scene Understanding in One Token: Visual States Need What-is-Where Composition

Researchers propose CroBo, a new visual state representation learning framework that helps robotic agents better understand dynamic environments by encoding both semantic identities and spatial locations of scene elements. The framework uses a global-to-local reconstruction method that compresses observations into compact tokens, achieving state-of-the-art performance on robot policy learning benchmarks.

AI · Bullish · arXiv – CS AI · Mar 3 · 7/10 · 7

MetaMind: General and Cognitive World Models in Multi-Agent Systems by Meta-Theory of Mind

Meta researchers introduced MetaMind, a cognitive world model for multi-agent systems that enables agents to understand and predict other agents' behaviors without centralized supervision or communication. The system uses a meta-theory of mind framework allowing agents to reason about goals and beliefs of others through self-reflective learning and analogical reasoning.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 6

DINOv3 Meets YOLO26 for Weed Detection in Vegetable Crops

Researchers developed a foundational crop-weed detection model that combines the DINOv3 vision transformer with the YOLO26 architecture, achieving significant improvements in precision agriculture applications. The model performed up to 14% better on cross-domain datasets while maintaining real-time processing at 28.5 fps despite increased computational requirements.

AI · Neutral · arXiv – CS AI · Mar 3 · 6/10 · 5

A SUPERB-Style Benchmark of Self-Supervised Speech Models for Audio Deepfake Detection

Researchers introduced Spoof-SUPERB, a new benchmark for evaluating self-supervised learning models' ability to detect audio deepfakes. The study tested 20 SSL models and found that large-scale discriminative models like XLS-R and WavLM Large consistently outperformed others, especially under acoustic degradations.

AI · Bullish · arXiv – CS AI · Mar 3 · 6/10 · 3

Latent Diffusion Model without Variational Autoencoder

Researchers introduce SVG, a new latent diffusion model that eliminates the need for variational autoencoders by using self-supervised representations. The approach leverages frozen DINO features to create semantically structured latent spaces, enabling faster training, fewer sampling steps, and better generative quality while maintaining semantic capabilities.

AI · Bullish · arXiv – CS AI · Feb 27 · 5/10 · 7

Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

Researchers have developed a self-supervised learning method that can reconstruct audio and images from clipped/saturated measurements without requiring ground truth training data. The approach extends self-supervised learning to non-linear inverse problems and performs nearly as well as fully supervised methods while using only clipped measurements for training.

AI · Bullish · arXiv – CS AI · Mar 17 · 5/10

Human-like Object Grouping in Self-supervised Vision Transformers

Researchers developed a behavioral benchmark showing that self-supervised vision transformers, particularly those trained with DINO objectives, align closely with human object perception and segmentation behavior. The study found that models with stronger object-centric representations better predict human visual judgments, with Gram matrix structure playing a key role in perceptual alignment.

AI · Neutral · arXiv – CS AI · Mar 5 · 4/10

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

Researchers propose directional CDNV (decision-axis variance) as a key geometric quantity explaining why self-supervised learning representations transfer well with few labels. The study shows that small variability along class-separating directions enables strong few-shot transfer and low interference across multiple tasks.

AI · Neutral · arXiv – CS AI · Mar 3 · 5/10 · 7

SIGMAS: Second-Order Interaction-based Grouping for Overlapping Multi-Agent Swarms

Researchers introduce SIGMAS, a self-supervised AI framework for identifying group structures in multi-agent swarms like drone fleets without ground-truth supervision. The system uses second-order interactions to infer latent group memberships from agent trajectories, demonstrating robust performance across diverse synthetic swarm scenarios.

AI · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 5

FM-RME: Foundation Model Empowered Radio Map Estimation

Researchers introduce FM-RME, a foundation model for radio map estimation that combines geometry-aware feature extraction with attention-based neural networks. The model uses self-supervised pre-training to enable zero-shot generalization across spatial, temporal, and spectral domains without scenario-specific retraining.

AI · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 3

DyGnROLE: Modeling Asymmetry in Dynamic Graphs with Node-Role-Oriented Latent Encoding

Researchers introduce DyGnROLE, a new AI architecture that better models directed dynamic graphs by treating source and destination nodes differently. The system uses role-specific embeddings and a self-supervised learning approach called Temporal Contrastive Link Prediction to achieve superior performance on future edge classification tasks.

AI · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 7

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys

Researchers developed a semi-supervised machine learning pipeline using vision transformers and k-Nearest Neighbor classifiers to automatically detect poor-quality exposures in astronomical imaging surveys. The method was successfully applied to the DECam Legacy Survey, identifying 780 problematic exposures that were verified through visual inspection.

AI · Neutral · arXiv – CS AI · Feb 27 · 4/10 · 5

TokEye: Fast Signal Extraction for Fluctuating Time Series via Offline Self-Supervised Learning From Fusion Diagnostics to Bioacoustics

Researchers developed TokEye, a self-supervised AI framework, initially designed for fusion reactor diagnostics, that can extract coherent signals from noisy time-series data in 0.5 seconds. The system demonstrates applications beyond fusion research, including bioacoustics, suggesting broader potential for real-time signal processing across industries.
