y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#transfer-learning News & Analysis

54 articles tagged with #transfer-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

54 articles
AIBullisharXiv – CS AI · May 126/10
🧠

CORTEG: Foundation Models Enable Cross-Modality Representation Transfer from Scalp to Intracranial Brain Recordings

Researchers introduce CORTEG, a framework that adapts pretrained scalp-EEG foundation models to intracranial ECoG recordings, enabling brain-computer interfaces to learn across patients with minimal calibration time. The approach demonstrates competitive or superior performance on finger trajectory and audio envelope decoding tasks while reducing per-patient training requirements to 10-30 minutes.

AINeutralarXiv – CS AI · May 126/10
🧠

Digital Image Forgery Detection Using Transfer Learning

Researchers present a transfer learning framework for detecting digitally forged images by combining RGB data with compression-difference features and optimized thresholds. Testing across multiple CNN architectures on the CASIA v2.0 dataset shows DenseNet121 achieves highest accuracy while ResNet50 provides most reliable predictions, addressing critical forensic security needs.

AINeutralarXiv – CS AI · May 126/10
🧠

What Cohort INRs Encode and Where to Freeze Them

Researchers demonstrate that early layers of cohort-trained Implicit Neural Representations (INRs) encode transferable features for signal fitting, identifying optimal freezing points through weight stable rank analysis. Using sparse autoencoders for mechanistic interpretability, they reveal that SIREN and Fourier-feature MLPs learn fundamentally different dictionary representations despite comparable performance, with implications for designing more generalizable neural architectures.

AINeutralarXiv – CS AI · May 126/10
🧠

Decoupling Endpoint and Semantic Transition Learning for Zero-Shot Composed Image Retrieval

Researchers propose DeCIR, a new approach to zero-shot composed image retrieval that separates endpoint matching from semantic transition learning to overcome limitations in projection-based methods. The technique uses decoupled text adapters and low-rank directional merging to improve performance on image retrieval tasks without increasing computational complexity at inference time.

AINeutralarXiv – CS AI · May 126/10
🧠

Structure-Centric Graph Foundation Model via Geometric Bases

Researchers propose Structure-Centric Graph Foundation Models (SCGFM), a novel approach that treats graph topology as the primary source of transferable knowledge using geometric bases and Gromov-Wasserstein distances. The method addresses key limitations in existing graph foundation models by handling structural heterogeneity and incompatible node feature spaces, demonstrating improved generalization across both in-domain and cross-domain graph tasks.

AINeutralarXiv – CS AI · May 126/10
🧠

From Traditional Taggers to LLMs: A Comparative Study of POS Tagging for Medieval Romance Languages

Researchers conducted a systematic evaluation of large language models for part-of-speech tagging in Medieval Romance languages, comparing them against traditional taggers. The study demonstrates that LLM-based approaches with fine-tuning and cross-lingual transfer learning significantly outperform conventional methods, offering practical applications for digital humanities research on historical texts.

AINeutralarXiv – CS AI · May 116/10
🧠

Closed-Form Linear-Probe Dataset Distillation for Pre-trained Vision Models

Researchers introduce CLP-DD, a novel dataset distillation method optimized for frozen pre-trained vision models using closed-form linear probing. The technique achieves comparable or superior performance to existing methods while running 14x faster and using 87.5% less GPU memory on ImageNet-1K.

AINeutralarXiv – CS AI · May 116/10
🧠

Excluding the Target Domain Improves Extrapolation: Deconfounded Hierarchical Physics Constraints

Researchers propose Deconfounded Hierarchical Gate (DHG), a novel approach to improve physics-constrained deep generative models' ability to extrapolate beyond training conditions. The method counterintuitively finds that excluding target-domain data during pretraining improves extrapolation performance by 39%, achieving 46% better results on lithium-ion battery temperature prediction benchmarks.

AINeutralarXiv – CS AI · May 96/10
🧠

Graphlets as Building Blocks for Structural Vocabulary in Knowledge Graph Foundation Models

Researchers propose using graphlets—small recurring subgraph patterns—as structural tokens for Knowledge Graph Foundation Models (KGFMs), enabling better transfer learning across diverse knowledge graphs. Testing on 51 knowledge graphs demonstrates that this approach outperforms existing KGFMs for zero-shot link prediction tasks.

AINeutralarXiv – CS AI · Apr 156/10
🧠

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Researchers investigate on-policy distillation (OPD) dynamics in large language model training, identifying two critical success conditions: compatible thinking patterns between student and teacher models, and genuine new capabilities from the teacher. The study reveals that successful OPD relies on token-level alignment and proposes recovery strategies for failing distillation scenarios.

AINeutralarXiv – CS AI · Apr 136/10
🧠

WOMBET: World Model-based Experience Transfer for Robust and Sample-efficient Reinforcement Learning

Researchers introduce WOMBET, a framework that improves reinforcement learning efficiency in robotics by generating synthetic training data from a world model in source tasks and selectively transferring it to target tasks. The approach combines offline-to-online learning with uncertainty-aware planning to reduce data collection costs while maintaining robustness.

AINeutralarXiv – CS AI · Apr 136/10
🧠

ASPECT:Analogical Semantic Policy Execution via Language Conditioned Transfer

Researchers introduce ASPECT, a novel reinforcement learning framework that uses large language models as semantic operators to enable zero-shot transfer learning across novel tasks. By conditioning a text-based VAE on LLM-generated task descriptions, the approach allows agents to reuse policies on structurally similar but previously unseen tasks without discrete category constraints.

AIBullisharXiv – CS AI · Mar 266/10
🧠

ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents

Researchers introduce ELITE, a new framework that enables AI embodied agents to learn from their own experiences and transfer knowledge to similar tasks. The system addresses failures in vision-language models when performing complex physical tasks by using self-reflective knowledge construction and intent-aware retrieval mechanisms.

AIBullisharXiv – CS AI · Mar 176/10
🧠

AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers

AdapterTune introduces a new method for efficiently fine-tuning Vision Transformers by using zero-initialized low-rank adapters that start at the pretrained function to prevent optimization instability. The technique achieves +14.9 point accuracy improvement over head-only transfer while using only 0.92% of parameters needed for full fine-tuning.

AINeutralarXiv – CS AI · Mar 37/108
🧠

Diagnosing Generalization Failures from Representational Geometry Markers

Researchers propose a new approach to predict AI model failures by analyzing geometric properties of data representations rather than reverse-engineering internal mechanisms. They found that reduced manifold dimensionality and utility in training data consistently predict poor performance on out-of-distribution tasks across different architectures and datasets.

AIBullisharXiv – CS AI · Mar 27/1016
🧠

SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer

Researchers developed Score Matched Actor-Critic (SMAC), a new offline reinforcement learning method that enables smooth transition to online RL algorithms without performance drops. SMAC achieved successful transfer in all 6 D4RL tasks tested and reduced regret by 34-58% in 4 of 6 environments compared to best baselines.

AIBullishLil'Log (Lilian Weng) · Jan 316/10
🧠

Generalized Language Models

This article discusses the evolution of generalized language models including BERT, GPT, and other major pre-trained models that achieved state-of-the-art results on various NLP tasks. The piece covers the breakthrough progress in 2018 with large-scale unsupervised pre-training approaches that don't require labeled data, similar to how ImageNet helped computer vision.

🏢 OpenAI
AIBullisharXiv – CS AI · Mar 275/10
🧠

Neural Network Conversion of Machine Learning Pipelines

Researchers developed a method to transfer knowledge from traditional machine learning pipelines to neural networks, specifically converting random forest classifiers into student neural networks. Testing on 100 OpenML tasks showed that neural networks can successfully mimic random forest performance when proper hyperparameters are selected.

AINeutralarXiv – CS AI · Mar 174/10
🧠

Evolutionary Transfer Learning for Dragonchess

Researchers developed an evolutionary transfer learning approach to adapt chess AI heuristics for Dragonchess, a 3D chess variant. While direct transfers from Stockfish failed, evolutionary optimization using CMA-ES significantly improved AI performance in this complex multi-layer game environment.

AINeutralarXiv – CS AI · Mar 114/10
🧠

Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning

Researchers have developed a comprehensive multi-model approach for autonomous driving that integrates deep learning and computer vision techniques for traffic sign classification, vehicle detection, lane detection, and behavioral cloning. The study utilizes pre-trained and custom neural networks with data augmentation and transfer learning techniques, testing on datasets including the German Traffic Sign Recognition Benchmark and Udacity simulator data.

AINeutralarXiv – CS AI · Mar 54/10
🧠

The Influence of Iconicity in Transfer Learning for Sign Language Recognition

Researchers examined transfer learning effectiveness for sign language recognition by comparing iconic signs between different language pairs (Chinese to Arabic and Greek to Flemish). The study achieved modest improvements of 7.02% for Arabic and 1.07% for Flemish using Google Mediapipe for feature extraction and neural network architectures.

AINeutralarXiv – CS AI · Mar 54/10
🧠

Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning

Researchers propose directional CDNV (decision-axis variance) as a key geometric quantity explaining why self-supervised learning representations transfer well with few labels. The study shows that small variability along class-separating directions enables strong few-shot transfer and low interference across multiple tasks.

AINeutralarXiv – CS AI · Mar 44/102
🧠

Deep Learning Based Wildfire Detection for Peatland Fires Using Transfer Learning

Researchers developed a transfer learning approach for detecting peatland fires using deep learning models adapted from conventional wildfire detection systems. The method addresses the unique challenges of peatland fires, which have distinct characteristics like low flame intensity and persistent smoke that make them difficult to detect with standard wildfire detection models.

AINeutralarXiv – CS AI · Mar 25/106
🧠

General vs Domain-Specific CNNs: Understanding Pretraining Effects on Brain MRI Tumor Classification

Research comparing CNN architectures for brain tumor classification found that general-purpose models like ConvNeXt-Tiny (93% accuracy) outperformed domain-specific medical pre-trained models like RadImageNet DenseNet121 (68% accuracy). The study suggests that contemporary general-purpose CNNs with diverse pre-training may be more effective for medical imaging tasks in data-scarce scenarios.

← PrevPage 2 of 3Next →