#transfer-learning News & Analysis

99 articles tagged with #transfer-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

99 articles

AINeutralarXiv – CS AI · Jun 256/10

🧠

Offline Multi-agent Continual Cooperation via Skill Partition and Reuse

Researchers introduce COMAD, a framework for multi-agent reinforcement learning systems to continually discover and reuse coordination skills from offline data without catastrophic forgetting. The approach uses skill partitioning and density-based reusability estimation to enable agents to efficiently transfer knowledge across sequential tasks in open environments.

AIBullisharXiv – CS AI · Jun 256/10

🧠

Learning Action Priors for Cross-embodiment Robot Manipulation

Researchers propose a two-stage training framework for Vision-Language-Action (VLA) models that pretrains the action module with motion priors before multimodal alignment. This approach enables robots to learn temporal dynamics more efficiently and generalizes better across different embodiments and real-world tasks with limited data.

AINeutralarXiv – CS AI · Jun 256/10

🧠

Transferability for General Reasoning: An Automated Curriculum for Multi-Domain RLVR

Researchers propose Transfer-Aware Curriculum (TAC), a machine learning optimization technique that dynamically adjusts training priorities across multiple domains by measuring how well improvements in one area transfer to others. The method achieves superior performance on reasoning tasks compared to fixed curricula, suggesting that cross-domain transferability is a critical factor for training more capable AI systems.

🧠 Llama

AINeutralarXiv – CS AI · Jun 236/10

🧠

NeuroShield: A Device-Agnostic Foundation Model for EEG Authentication

NeuroShield is a foundation model that enables EEG-based biometric authentication across different hardware devices and recording configurations. The model was pretrained on over 15,000 subjects and demonstrates significant accuracy improvements while generalizing to unseen equipment and data formats.

AINeutralarXiv – CS AI · Jun 236/10

🧠

DSSCNet: A Transfer Learning Framework for Cross-Corpus Dysarthric Speech Severity Classification

Researchers introduce DSSCNet, a deep learning framework using transfer learning to improve dysarthric speech severity classification across different datasets. The model achieves 75.80% accuracy on TORGO and 68.25% on UA-Speech corpora, demonstrating significant improvements in speaker-independent assessment and cross-corpus generalization for assistive speech technologies.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Structured Hyperedge Adaptation for Parameter-Efficient Fine-Tuning of Vision Transformers

Researchers introduce HyperAdapter, a parameter-efficient fine-tuning method for vision transformers that adapts model weights through hypergraph-structured token groupings rather than individual tokens. The approach demonstrates consistent performance improvements over existing adapter methods while maintaining computational efficiency, suggesting that adaptation space design is critical for vision transformer transfer learning.

AINeutralarXiv – CS AI · Jun 236/10

🧠

DiT-Reward: Generative Representations for Text-to-Image Reward Modeling

Researchers introduce DiT-Reward, a reward model derived from pretrained Diffusion Transformers that outperforms existing benchmarks like HPSv3 for evaluating text-to-image generation quality. The approach demonstrates that representations learned during generative model training transfer effectively to reward prediction tasks, achieving measurable improvements in preference prediction accuracy and inference speed.

🧠 Stable Diffusion

AINeutralarXiv – CS AI · Jun 195/10

🧠

Leveraging systems' non-linearity to tackle the scarcity of data in the design of Intelligent Fault Diagnosis Systems

Researchers propose a novel Deep Transfer Learning approach for Intelligent Fault Diagnosis Systems that addresses data scarcity by leveraging system non-linearities and multi-excitation vibration analysis. The method combines pre-trained CNNs with a new data visualization and augmentation technique, validated on railway pantograph structures.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Playful Agentic Robot Learning

Researchers introduce RATs (Robotics Agent Teams), an agentic robot learning system that uses self-directed play to acquire reusable skills before receiving downstream tasks. The approach demonstrates significant performance improvements on robotics benchmarks and enables learned skills to transfer across different agents without finetuning.

AINeutralarXiv – CS AI · Jun 196/10

🧠

Hybrid ANN-SNN Pipeline with Local Plasticity

Researchers propose a hybrid pipeline combining pretrained EfficientNet encoders with spiking neural networks (SNNs) trained via biologically-inspired local learning rules. The system achieves 99.09% accuracy on ImageNet while reducing computational overhead and enabling neuromorphic hardware deployment.

AIBullisharXiv – CS AI · Jun 116/10

🧠

Time-Series Foundation Model Embeddings for Remaining Useful Life Estimation

Researchers introduce a data-efficient approach for Remaining Useful Life (RUL) prediction in industrial equipment using frozen pretrained time-series foundation models (Chronos-2) combined with lightweight regression heads. Testing on real-world sensor data demonstrates superior performance compared to traditional recurrent, convolutional, and Transformer-based models, suggesting foundation models offer practical advantages for predictive maintenance without extensive feature engineering.

AINeutralarXiv – CS AI · Jun 116/10

🧠

Tabular Foundation Models for Clinical Survival Analysis via Survival-Aware Adaptation

Researchers propose a lightweight adaptation method to apply tabular foundation models to clinical survival analysis, demonstrating that pretrained representations combined with survival-aware objectives outperform traditional approaches. Testing on MIMIC-IV and eICU datasets shows 1.4-1.7% improvements over strong baselines like DeepSurv in predicting patient mortality and time-to-event outcomes.

AINeutralarXiv – CS AI · Jun 116/10

🧠

Bridging the Morphology Gap: Adapting VLA Models to Dexterous Manipulation via Intent-Conditioned Fine-Tuning

Researchers introduce InDex, a framework that adapts Vision-Language-Action (VLA) models from simple parallel grippers to complex dexterous robotic hands through intent-conditioned fine-tuning. The approach uses a two-stage architecture that preserves spatial reasoning capabilities while efficiently learning fine-grained multi-finger control with minimal training data.

AIBullisharXiv – CS AI · Jun 116/10

🧠

CHORUS: Decentralized Multi-Embodiment Collaboration with One VLA Policy

Researchers introduce CHORUS, a framework that enables decentralized multi-robot coordination using a single pretrained vision-language-action (VLA) model. Rather than requiring centralized control or per-robot policies, CHORUS allows each robot to operate independently using only its own observations and a robot-identifying prompt, achieving significant performance improvements in real-world collaborative tasks.

AINeutralarXiv – CS AI · Jun 106/10

🧠

Tractogram foundation model

Researchers introduce TractFM, a foundation model that learns reusable representations from whole-brain diffusion MRI tractography data by combining local streamline encoding with permutation-equivariant processing. The model demonstrates strong transfer learning capabilities across different tractography algorithms, datasets, and prediction tasks, achieving accurate tract parcellation and demographic predictions without task-specific fine-tuning.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Bidirectional Small-Granularity Search between Code and Text

Researchers introduce a bidirectional search task linking code snippets with text descriptions and vice versa, addressing the gap between scientific publications and their implementations. They present a large dataset with automatically-generated training data and manually-annotated test sets, along with a modular encoder-based approach that achieves strong in-domain results with promising out-of-domain generalization.

🧠 GPT-4

AINeutralarXiv – CS AI · Jun 96/10

🧠

A Systematic Study of Behavioral Cloning for Scientific Data Annotation

Researchers introduce a behavioral cloning framework for scientific data annotation that learns from expert annotation strategies rather than direct prediction. The study demonstrates that larger models trained on multiple annotation tasks develop hierarchical skills, generalize across tasks, and internally represent latent variables of the annotation process, offering a foundation for automating labor-intensive verification and correction workflows.

AINeutralarXiv – CS AI · Jun 96/10

🧠

A Topological Characterization of Graph Neural Networks via Stochastic Block Model Embeddings on the n-Sphere

Researchers propose a novel topological framework for analyzing and comparing trained Graph Neural Networks by mapping induced stochastic block models onto an n-dimensional sphere, creating low-dimensional 'fingerprints' that enable transfer-learning candidate retrieval across model zoos without retraining.

AINeutralarXiv – CS AI · Jun 96/10

🧠

The Cross-Architecture Substrate: A Domain-Transcendent, Calibration-Surviving Geometric Invariant of Modern Vision Encoders

Researchers discovered that thirteen different vision neural networks, despite being trained for distinct tasks (classification, contrast learning, image-text matching), converge on the same sixteen-dimensional geometric structure called the 'cross-architecture substrate.' This invariant structure persists across multiple visual domains and survives calibration testing, suggesting a universal representational principle in modern vision encoders that could enable new transfer learning and distillation techniques.

AIBullisharXiv – CS AI · Jun 96/10

🧠

Self-Supervised Vision Transformers for CBCT-Based Detection of Temporomandibular Joint Osteoarthritis

Researchers demonstrate that self-supervised Vision Transformers, particularly the DINO family, can effectively detect temporomandibular joint osteoarthritis from cone-beam CT scans with 90.2% AUC when partially adapted. The study shows that strategic backbone unfreezing of final transformer blocks outperforms fully frozen models and supervised baselines, providing practical guidance for deploying foundation models in medical imaging with limited training data.

AINeutralarXiv – CS AI · Jun 95/10

🧠

A Universal Dense Football Event Representation Based on TabTransformer

Researchers propose a TabTransformer-based neural network that learns dense representations of football event data by treating categorical features as learned embeddings rather than one-hot encodings. The approach captures sport-specific action semantics during pretraining, enabling superior performance on downstream tasks like action value estimation and play style recognition.

AINeutralarXiv – CS AI · Jun 96/10

🧠

Beyond Humans: Multispecies Animal Face Recognition Using Transfer Learning

Researchers demonstrate that transfer learning with Vision Transformer (ViT) models can effectively identify individual animals across multiple species—dogs, primates, and cattle—achieving up to 96.85% verification accuracy on dogs without species-specific training data. This non-invasive facial recognition approach could replace physical identification methods like microchips for pet recovery, endangered species tracking, and agricultural monitoring.

AINeutralarXiv – CS AI · Jun 96/10

🧠

In-Context Reinforcement Learning via Communicative World Models

Researchers introduce CORAL, a framework that enables reinforcement learning agents to adapt to new tasks without retraining by separating world modeling from control through emergent communication between two agents. The approach demonstrates improved sample efficiency and zero-shot adaptation across diverse environments, advancing in-context reinforcement learning capabilities.

AINeutralarXiv – CS AI · Jun 85/10

🧠

Phonetic Error Analysis of Raw Waveform Acoustic Models

Researchers achieved state-of-the-art performance on raw waveform acoustic models for phone recognition using CNN-LSTM architectures, with error rates of 13.9%/15.3% on TIMIT benchmarks. Analysis reveals that different phonetic classes benefit differently from model components, and transfer learning from WSJ data improves consonant recognition significantly more than vowels.

AIBullisharXiv – CS AI · Jun 56/10

🧠

Sample-efficient Low-level Motion Planning for Robotic Manipulation Tasks via Zero-shot Transfer Learning

Researchers propose iCEM+TL, a framework combining the Cross-Entropy Method with transfer learning to improve robotic manipulation planning efficiency. The approach achieves up to 23% success rate improvements in complex tasks like stacking and shelf placement, with validation demonstrated on a real Franka Emika robot.

← PrevPage 2 of 4Next →