26 articles tagged with #transfer-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.
AI · Bullish · arXiv – CS AI · 2d ago · 7/10
🧠 Researchers propose a method to adapt 2D multimodal large language models for 3D medical imaging analysis, introducing a Text-Guided Hierarchical Mixture of Experts framework that enables task-specific feature extraction. The approach demonstrates improved performance on medical report generation and visual question answering tasks while reusing pre-trained parameters from 2D models.
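A minimal sketch of the text-gated mixture-of-experts idea described above, assuming a pooled text embedding drives the expert gate; the module names, dimensions, and expert count are illustrative, not the paper's implementation:

```python
import torch
import torch.nn as nn

class TextGuidedMoE(nn.Module):
    """Toy mixture-of-experts layer whose gate is driven by a text embedding.

    Each expert is a small MLP over image features; a pooled text embedding
    (e.g. from the task prompt) decides how to weight the experts.
    """

    def __init__(self, img_dim: int, text_dim: int, num_experts: int = 4):
        super().__init__()
        self.experts = nn.ModuleList(
            [nn.Sequential(nn.Linear(img_dim, img_dim), nn.GELU(), nn.Linear(img_dim, img_dim))
             for _ in range(num_experts)]
        )
        self.gate = nn.Linear(text_dim, num_experts)  # text embedding -> expert weights

    def forward(self, img_feats: torch.Tensor, text_emb: torch.Tensor) -> torch.Tensor:
        # img_feats: (batch, img_dim), text_emb: (batch, text_dim)
        weights = torch.softmax(self.gate(text_emb), dim=-1)         # (batch, E)
        outputs = torch.stack([e(img_feats) for e in self.experts])  # (E, batch, img_dim)
        return torch.einsum("be,ebd->bd", weights, outputs)

# Example: route hypothetical volume features with a task-prompt embedding (shapes are made up).
moe = TextGuidedMoE(img_dim=256, text_dim=128)
out = moe(torch.randn(2, 256), torch.randn(2, 128))
print(out.shape)  # torch.Size([2, 256])
```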
AI · Bullish · arXiv – CS AI · Mar 4 · 6/10
🧠 Researchers have developed EvoSkill, an automated framework that enables AI agents to discover and refine domain-specific skills through iterative failure analysis. The system demonstrated significant performance improvements on specialized tasks, with accuracy gains of 7.3% on financial data analysis and 12.1% on search-augmented QA, while showing transferable capabilities across different domains.
AI · Bullish · arXiv – CS AI · Mar 4 · 7/10
🧠 Researchers developed a new neural solver model using GCON modules and energy-based loss functions that achieves state-of-the-art performance across multiple graph combinatorial optimization tasks. The study demonstrates effective transfer learning between related optimization problems through computational reducibility-informed pretraining strategies, representing progress toward foundational AI models for combinatorial optimization.
AI · Bullish · arXiv – CS AI · Mar 4 · 7/10
🧠 Researchers developed D2E (Desktop to Embodied AI), a framework that uses desktop gaming data to pretrain AI models for robotics tasks. Their 1B-parameter model achieved 96.6% success on manipulation tasks and 83.3% on navigation, matching the performance of models up to 7 times larger while using scalable desktop data instead of expensive physical robot training data.
AI · Bullish · arXiv – CS AI · Mar 4 · 7/10
🧠 Researchers propose a new IMPRINT framework for transfer learning that improves foundation model adaptation to new tasks without parameter optimization. The framework identifies three key components and introduces a clustering-based variant that outperforms existing methods by 4%.
AI · Neutral · arXiv – CS AI · Mar 3 · 7/10
🧠 Researchers identified that fine-tuning non-robust pretrained AI models with robust objectives can lead to poor performance, termed 'suboptimal transfer.' They propose Epsilon-Scheduling, a novel training technique that adjusts perturbation strength during training to improve both task adaptation and adversarial robustness.
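A rough illustration of the perturbation-strength scheduling idea: the adversarial budget ramps up over fine-tuning rather than staying fixed. The linear ramp, FGSM-style attack, and all hyperparameters below are assumptions for the sketch, not the authors' recipe:

```python
import torch
import torch.nn.functional as F

def epsilon_at(step: int, total_steps: int, eps_max: float = 8 / 255) -> float:
    """Linearly ramp the adversarial perturbation budget from 0 to eps_max."""
    return eps_max * min(1.0, step / max(1, total_steps))

def robust_finetune_step(model, x, y, optimizer, step, total_steps):
    eps = epsilon_at(step, total_steps)
    # Single-step (FGSM-like) perturbation under the current budget.
    x_adv = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x_adv), y)
    grad, = torch.autograd.grad(loss, x_adv)
    x_adv = (x_adv + eps * grad.sign()).clamp(0, 1).detach()

    # Ordinary supervised update on the perturbed batch.
    optimizer.zero_grad()
    F.cross_entropy(model(x_adv), y).backward()
    optimizer.step()
```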
AI · Neutral · arXiv – CS AI · 1d ago · 6/10
🧠 Researchers investigate on-policy distillation (OPD) dynamics in large language model training, identifying two critical success conditions: compatible thinking patterns between student and teacher models, and genuine new capabilities from the teacher. The study reveals that successful OPD relies on token-level alignment and proposes recovery strategies for failing distillation scenarios.
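A condensed sketch of the token-level alignment signal behind on-policy distillation: the student samples a continuation and is penalized for diverging from the teacher on its own tokens. The Hugging Face-style `generate`/`logits` interface and the choice of reverse KL are assumptions for illustration, not necessarily the paper's formulation:

```python
import torch
import torch.nn.functional as F

def on_policy_distill_loss(student, teacher, prompt_ids, max_new_tokens=64):
    """Sample from the student, then penalize per-token divergence from the teacher."""
    # 1) Student generates its own continuation (on-policy rollout).
    with torch.no_grad():
        seq = student.generate(prompt_ids, do_sample=True, max_new_tokens=max_new_tokens)

    # 2) Score the same sequence with both models (position t predicts token t+1).
    student_logits = student(seq).logits[:, :-1]
    with torch.no_grad():
        teacher_logits = teacher(seq).logits[:, :-1]

    gen_start = prompt_ids.shape[1] - 1  # restrict to the generated region
    s_logp = F.log_softmax(student_logits[:, gen_start:], dim=-1)
    t_logp = F.log_softmax(teacher_logits[:, gen_start:], dim=-1)

    # 3) Token-level reverse KL(student || teacher), averaged over positions.
    rev_kl = (s_logp.exp() * (s_logp - t_logp)).sum(-1)
    return rev_kl.mean()
```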
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠 Researchers introduce WOMBET, a framework that improves reinforcement learning efficiency in robotics by generating synthetic training data from a world model in source tasks and selectively transferring it to target tasks. The approach combines offline-to-online learning with uncertainty-aware planning to reduce data collection costs while maintaining robustness.
AI · Neutral · arXiv – CS AI · 3d ago · 6/10
🧠 Researchers introduce ASPECT, a novel reinforcement learning framework that uses large language models as semantic operators to enable zero-shot transfer learning across novel tasks. By conditioning a text-based VAE on LLM-generated task descriptions, the approach allows agents to reuse policies on structurally similar but previously unseen tasks without discrete category constraints.
AI · Bullish · arXiv – CS AI · Mar 26 · 6/10
🧠 Researchers introduce ELITE, a new framework that enables embodied AI agents to learn from their own experiences and transfer knowledge to similar tasks. The system addresses failures in vision-language models when performing complex physical tasks by using self-reflective knowledge construction and intent-aware retrieval mechanisms.
AI · Bullish · arXiv – CS AI · Mar 17 · 6/10
🧠 AdapterTune introduces a new method for efficiently fine-tuning Vision Transformers by using zero-initialized low-rank adapters that start at the pretrained function to prevent optimization instability. The technique achieves a +14.9 point accuracy improvement over head-only transfer while using only 0.92% of the parameters needed for full fine-tuning.
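The zero-initialization trick described above can be illustrated with a LoRA-style wrapper: because one of the two low-rank factors starts at zero, the adapted layer initially computes exactly the pretrained function. Layer names, rank, and scaling below are illustrative, not the paper's settings:

```python
import torch
import torch.nn as nn

class LowRankAdapter(nn.Module):
    """Wraps a frozen pretrained linear layer with a zero-initialized low-rank update."""

    def __init__(self, pretrained: nn.Linear, rank: int = 8, scale: float = 1.0):
        super().__init__()
        self.base = pretrained
        self.base.weight.requires_grad_(False)     # keep pretrained weights frozen
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        self.down = nn.Linear(pretrained.in_features, rank, bias=False)
        self.up = nn.Linear(rank, pretrained.out_features, bias=False)
        nn.init.normal_(self.down.weight, std=0.02)
        nn.init.zeros_(self.up.weight)              # zero init: adapter contributes 0 at start
        self.scale = scale

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # At initialization this equals the pretrained layer exactly, so training
        # starts from the pretrained function and drifts only as the adapter learns.
        return self.base(x) + self.scale * self.up(self.down(x))
```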
AI · Neutral · arXiv – CS AI · Mar 3 · 7/10
🧠 Researchers propose a new approach to predict AI model failures by analyzing geometric properties of data representations rather than reverse-engineering internal mechanisms. They found that reduced manifold dimensionality and utility in training data consistently predict poor performance on out-of-distribution tasks across different architectures and datasets.
AI · Bullish · arXiv – CS AI · Mar 2 · 7/10
🧠 Researchers developed Score Matched Actor-Critic (SMAC), a new offline reinforcement learning method that enables a smooth transition to online RL algorithms without performance drops. SMAC achieved successful transfer in all 6 D4RL tasks tested and reduced regret by 34-58% in 4 of 6 environments compared to the best baselines.
AI · Bullish · Lil'Log (Lilian Weng) · Jan 31 · 6/10
🧠 This article discusses the evolution of generalized language models including BERT, GPT, and other major pre-trained models that achieved state-of-the-art results on various NLP tasks. The piece covers the breakthrough progress in 2018 with large-scale unsupervised pre-training approaches that don't require labeled data, similar to how ImageNet helped computer vision.
🏢 OpenAI
AI · Bullish · arXiv – CS AI · Mar 27 · 5/10
🧠 Researchers developed a method to transfer knowledge from traditional machine learning pipelines to neural networks, specifically converting random forest classifiers into student neural networks. Testing on 100 OpenML tasks showed that neural networks can successfully mimic random forest performance when proper hyperparameters are selected.
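A toy version of the random-forest-to-network distillation described above: the forest's class probabilities serve as soft targets for a small student network. The synthetic dataset, architecture, and training loop are assumptions for the sketch; the paper's pipeline and hyperparameter selection are more involved:

```python
import torch
import torch.nn as nn
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Teacher: an ordinary random forest on a synthetic task.
X, y = make_classification(n_samples=2000, n_features=20, n_classes=3,
                           n_informative=8, random_state=0)
forest = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)
soft_targets = torch.tensor(forest.predict_proba(X), dtype=torch.float32)  # (N, 3)

# Student: a small MLP trained to match the forest's class probabilities.
student = nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 3))
opt = torch.optim.Adam(student.parameters(), lr=1e-3)
X_t = torch.tensor(X, dtype=torch.float32)

for epoch in range(200):
    opt.zero_grad()
    log_probs = torch.log_softmax(student(X_t), dim=-1)
    loss = -(soft_targets * log_probs).sum(-1).mean()  # cross-entropy against soft labels
    loss.backward()
    opt.step()

with torch.no_grad():
    agreement = (student(X_t).argmax(-1).numpy() == forest.predict(X)).mean()
print(f"student/teacher agreement: {agreement:.3f}")
```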
AI · Neutral · arXiv – CS AI · Mar 17 · 4/10
🧠 Researchers developed an evolutionary transfer learning approach to adapt chess AI heuristics for Dragonchess, a 3D chess variant. While direct transfers from Stockfish failed, evolutionary optimization using CMA-ES significantly improved AI performance in this complex multi-layer game environment.
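A toy sketch of the kind of CMA-ES loop implied above, using the `cma` package: chess-like heuristic weights are used as the transferred starting point and evolved against a fitness signal. The fitness function here is a dummy stand-in (in the real setting it would score games played with the candidate weights), and the piece-value vector is purely illustrative:

```python
import cma  # pip install cma
import numpy as np

def evaluation_weights_fitness(weights: np.ndarray) -> float:
    """Dummy fitness. A real version would play Dragonchess games with an
    evaluation function weighted by `weights` and return e.g. (1 - win rate)."""
    target = np.array([1.0, 3.0, 3.5, 5.0, 9.0])  # made-up 'ideal' values for the variant
    return float(np.sum((weights - target) ** 2))

# Start from ordinary chess piece values and let CMA-ES adapt them to the new game.
x0 = [1.0, 3.0, 3.0, 5.0, 9.0]
es = cma.CMAEvolutionStrategy(x0, 0.5)
es.optimize(evaluation_weights_fitness)
print(es.result.xbest)
```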
AI · Neutral · arXiv – CS AI · Mar 11 · 4/10
🧠 Researchers have developed a comprehensive multi-model approach for autonomous driving that integrates deep learning and computer vision techniques for traffic sign classification, vehicle detection, lane detection, and behavioral cloning. The study utilizes pre-trained and custom neural networks with data augmentation and transfer learning techniques, testing on datasets including the German Traffic Sign Recognition Benchmark and Udacity simulator data.
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠 Researchers examined transfer learning effectiveness for sign language recognition by comparing iconic signs between different language pairs (Chinese to Arabic and Greek to Flemish). The study achieved modest improvements of 7.02% for Arabic and 1.07% for Flemish using Google Mediapipe for feature extraction and neural network architectures.
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠 Researchers propose directional CDNV (decision-axis variance) as a key geometric quantity explaining why self-supervised learning representations transfer well with few labels. The study shows that small variability along class-separating directions enables strong few-shot transfer and low interference across multiple tasks.
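One plausible reading of the quantity described above, sketched numerically: project each class's features onto the axis between the two class means and compare the within-class variance along that axis to the squared distance between the means. The exact definition in the paper may differ:

```python
import numpy as np

def directional_cdnv(feats_a: np.ndarray, feats_b: np.ndarray) -> float:
    """Within-class variance along the class-separating direction, normalized
    by the squared distance between the class means (illustrative definition)."""
    mu_a, mu_b = feats_a.mean(0), feats_b.mean(0)
    direction = (mu_a - mu_b) / np.linalg.norm(mu_a - mu_b)  # class-separating axis
    var_a = np.var(feats_a @ direction)
    var_b = np.var(feats_b @ direction)
    return (var_a + var_b) / (2.0 * np.sum((mu_a - mu_b) ** 2))

# Small projected variance relative to the mean gap -> easy few-shot separation.
rng = np.random.default_rng(0)
a = rng.normal(loc=0.0, scale=0.1, size=(100, 64))
b = rng.normal(loc=1.0, scale=0.1, size=(100, 64))
print(directional_cdnv(a, b))
```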
AI · Neutral · arXiv – CS AI · Mar 5 · 4/10
🧠 Researchers developed a Bayesian framework combining particle filters and Gaussian processes for robotic tactile object recognition and pose estimation. The system can identify known objects, detect novel objects, and transfer knowledge to learn new shapes through active touch exploration.
AI · Neutral · arXiv – CS AI · Mar 4 · 4/10
🧠 Researchers developed a transfer learning approach for detecting peatland fires using deep learning models adapted from conventional wildfire detection systems. The method addresses the unique challenges of peatland fires, which have distinct characteristics like low flame intensity and persistent smoke that make them difficult to detect with standard wildfire detection models.
AI · Neutral · arXiv – CS AI · Mar 2 · 5/10
🧠 Research comparing CNN architectures for brain tumor classification found that general-purpose models like ConvNeXt-Tiny (93% accuracy) outperformed domain-specific medical pre-trained models like RadImageNet DenseNet121 (68% accuracy). The study suggests that contemporary general-purpose CNNs with diverse pre-training may be more effective for medical imaging tasks in data-scarce scenarios.
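A standard transfer-learning recipe in the spirit of the comparison above: load an ImageNet-pretrained ConvNeXt-Tiny from torchvision and swap its classification head for the target task. The four-class setup and frozen backbone are assumptions for the sketch, not the study's exact protocol:

```python
import torch.nn as nn
from torchvision import models

def build_tumor_classifier(num_classes: int = 4, freeze_backbone: bool = True) -> nn.Module:
    # General-purpose backbone with ImageNet pre-training.
    model = models.convnext_tiny(weights=models.ConvNeXt_Tiny_Weights.IMAGENET1K_V1)
    if freeze_backbone:
        for p in model.parameters():
            p.requires_grad = False
    # Replace the final linear layer with a head for the target classes.
    in_features = model.classifier[-1].in_features
    model.classifier[-1] = nn.Linear(in_features, num_classes)
    return model

model = build_tumor_classifier()
```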
AI · Neutral · arXiv – CS AI · Feb 27 · 4/10
🧠 Researchers evaluated seven pre-trained CNN architectures for IoT DDoS attack detection, finding that DenseNet and MobileNet models provide the best balance of accuracy, reliability, and interpretability under resource constraints. The study emphasizes the importance of combining performance metrics with explainability when deploying AI security models in IoT environments.
AI · Neutral · OpenAI News · Apr 5 · 4/10
🧠 A transfer learning contest is being launched to evaluate reinforcement learning algorithms' ability to generalize from previous experience. The contest appears to focus on measuring how well AI models can apply learned knowledge to new situations.
AI · Neutral · arXiv – CS AI · Mar 3 · 4/10
🧠 Researchers propose DASP (Decoupling Adaptation for Stability and Plasticity), a novel framework for adapting multi-modal AI models to changing test environments. The method addresses key challenges of negative transfer and catastrophic forgetting by using asymmetric adaptation strategies that treat biased and unbiased modalities differently.