#knowledge-transfer News & Analysis

19 articles tagged with #knowledge-transfer. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

19 articles

AIBullisharXiv – CS AI · 4d ago7/10

🧠

MUSE-Autoskill: Self-Evolving Agents via Skill Creation, Memory, Management, and Evaluation

Researchers propose MUSE-Autoskill, a framework enabling LLM agents to autonomously create, store, and refine reusable skills throughout their operational lifecycle. The system treats skills as long-lived, testable assets with integrated memory and evaluation mechanisms, demonstrating improved task success rates and cross-agent knowledge transfer on benchmark tests.

AIBullisharXiv – CS AI · May 127/10

🧠

Evidence Over Plans: Online Trajectory Verification for Skill Distillation

Researchers introduce SPARK, a framework that verifies AI agent skills through direct environment interaction rather than relying on pre-written plans. The Posterior Distillation Index (PDI) metric ensures skills are grounded in actual task evidence, producing student models that match or exceed human-written skills while reducing inference costs by up to 1,000x.

AIBullisharXiv – CS AI · May 127/10

🧠

Priming: Hybrid State Space Models From Pre-trained Transformers

Researchers introduce Priming, a method that converts pre-trained Transformers into efficient Hybrid State-Space models through knowledge transfer rather than training from scratch. The technique recovers downstream performance using less than 0.5% of original pre-training tokens and enables the first large-scale comparison of SSM architectures, with Hybrid GKA 32B achieving 3.8-point reasoning improvements while delivering 2.3x faster decoding.

🧠 Llama

AIBullisharXiv – CS AI · May 97/10

🧠

LANTERN: LLM-Augmented Neurosymbolic Transfer with Experience-Gated Reasoning Networks

Researchers introduce LANTERN, a framework that uses large language models to automatically generate task descriptions and intelligently aggregate knowledge from multiple source tasks for reinforcement learning. The system achieves 40-60% improvements in sample efficiency by adaptively weighting source policies based on task similarity and managing teacher-student knowledge transfer through uncertainty-aware gating.

AIBullisharXiv – CS AI · May 97/10

🧠

Continually Evolving Skill Knowledge in Vision Language Action Model

Researchers introduce Stellar VLA, a continual learning framework for vision-language-action models that improves knowledge accumulation without adding network parameters. The approach uses knowledge-guided expert routing and hierarchical task structures, achieving strong performance on robotics benchmarks with minimal data replay and validated real-world transfer capabilities.

AIBullisharXiv – CS AI · Apr 157/10

🧠

Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning

Researchers propose a case-based learning framework enabling LLM-based autonomous agents to extract and reuse knowledge from past tasks, improving performance on complex real-world problems. The method outperforms traditional zero-shot, few-shot, and prompt-based baselines across six task categories, with gains increasing as task complexity rises.

AIBullisharXiv – CS AI · Apr 157/10

🧠

Chain-of-Models Pre-Training: Rethinking Training Acceleration of Vision Foundation Models

Researchers present Chain-of-Models Pre-Training (CoM-PT), a novel method that accelerates vision foundation model training by up to 7.09X through sequential knowledge transfer from smaller to larger models in a unified pipeline, rather than training each model independently. The approach maintains or improves performance while significantly reducing computational costs, with efficiency gains increasing as more models are added to the training sequence.

AIBullisharXiv – CS AI · Feb 277/106

🧠

Knowledge Fusion of Large Language Models Via Modular SkillPacks

Researchers introduce GraftLLM, a new method for transferring knowledge between large language models using 'SkillPack' format that preserves capabilities while avoiding catastrophic forgetting. The approach enables efficient model fusion and continual learning for heterogeneous models through modular knowledge storage.

AINeutralarXiv – CS AI · May 126/10

🧠

LLM-guided Semi-Supervised Approaches for Social Media Crisis Data Classification

Researchers evaluate LLM-guided semi-supervised learning methods for classifying crisis-related social media data, finding that LG-CoTrain significantly outperforms traditional approaches in low-resource settings while compact models can rival large zero-shot LLMs. This demonstrates practical pathways for deploying AI in disaster response applications with minimal labeled training data.

AIBullisharXiv – CS AI · May 126/10

🧠

Any2Any 3D Diffusion Models with Knowledge Transfer: A Radiotherapy Planning Study

Researchers introduced DiffKT3D, a 3D diffusion model framework that applies knowledge transfer from video diffusion models to radiotherapy dose prediction. The approach achieves state-of-the-art results by reducing prediction error by 7% compared to previous benchmarks while maintaining clinical alignment through reinforcement learning post-training.

AIBullisharXiv – CS AI · May 116/10

🧠

Knowledge Transfer Scaling Laws for 3D Medical Imaging

Researchers demonstrate that different 3D medical imaging domains (CT, MRI, PET) transfer knowledge asymmetrically during pretraining, following predictable power-law patterns. By optimizing data allocation based on these transfer dynamics, they achieve up to 58% performance gains over proportional sampling, revealing a hub-and-island structure where certain domains act as foundational knowledge sources for others.

AIBullisharXiv – CS AI · Apr 66/10

🧠

Haiku to Opus in Just 10 bits: LLMs Unlock Massive Compression Gains

Researchers developed new compression techniques for LLM-generated text, achieving massive compression ratios through domain-adapted LoRA adapters and an interactive 'Question-Asking' protocol. The QA method uses binary questions to transfer knowledge between small and large models, achieving compression ratios of 0.0006-0.004 while recovering 23-72% of capability gaps.

AIBullisharXiv – CS AI · Mar 266/10

🧠

From Physician Expertise to Clinical Agents: Preserving, Standardizing, and Scaling Physicians' Medical Expertise with Lightweight LLM

Researchers developed Med-Shicheng, a framework that enables lightweight LLMs to learn and transfer medical expertise from distinguished physicians. Built on a 1.5B parameter model, it achieves performance comparable to much larger models like GPT-5 while running on resource-constrained hardware.

🧠 GPT-5

AIBullisharXiv – CS AI · Mar 176/10

🧠

Deconfounded Lifelong Learning for Autonomous Driving via Dynamic Knowledge Spaces

Researchers propose DeLL, a new framework for autonomous driving systems that addresses lifelong learning challenges through dynamic knowledge spaces and causal inference mechanisms. The system uses Dirichlet process mixture models to prevent catastrophic forgetting and improve adaptability to new driving scenarios while maintaining previously learned knowledge.

AINeutralarXiv – CS AI · Mar 166/10

🧠

Continual Learning in Large Language Models: Methods, Challenges, and Opportunities

This comprehensive survey examines continual learning methodologies for large language models, focusing on three core training stages and methods to mitigate catastrophic forgetting. The research reveals that while current approaches show promise in specific domains, fundamental challenges remain in achieving seamless knowledge integration across diverse tasks and temporal scales.

AIBullisharXiv – CS AI · Mar 126/10

🧠

HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

Researchers introduce HEAL (Hindsight Entropy-Assisted Learning), a new framework for distilling reasoning capabilities from large AI models into smaller ones. The method overcomes traditional limitations by using three core modules to bridge reasoning gaps and significantly outperforms standard distillation techniques.

🏢 Perplexity

AIBullisharXiv – CS AI · Mar 36/104

🧠

TiTok: Transfer Token-level Knowledge via Contrastive Excess to Transplant LoRA

TiTok is a new framework for transferring LoRA (Low-Rank Adaptation) parameters between different Large Language Model backbones without requiring additional training data or discriminator models. The method uses token-level contrastive learning to achieve 4-10% performance gains over existing approaches in parameter-efficient fine-tuning scenarios.

AIBullisharXiv – CS AI · Mar 27/1016

🧠

MINT: Multimodal Imaging-to-Speech Knowledge Transfer for Early Alzheimer's Screening

Researchers developed MINT, a framework that transfers knowledge from MRI brain scans to speech analysis for early Alzheimer's detection. The system achieves comparable performance to speech-only methods while being grounded in neuroimaging biomarkers, enabling population-scale screening without requiring expensive MRI scans at inference.

AINeutralOpenAI News · Oct 184/106

🧠

Semi-supervised knowledge transfer for deep learning from private training data

The article title suggests a research paper on semi-supervised knowledge transfer techniques for deep learning systems that use private training data. However, no article body content was provided for analysis.