AINeutralarXiv – CS AI · 7h ago6/10
🧠
Repurposing Adversarial Perturbations for Continual Learning: From Defense to Active Alignment
Researchers introduce AdvCL, a novel framework that repurposes adversarial perturbations to improve continual learning in large language models by addressing forgetting, limited transfer, and adversarial vulnerability. The approach combines three modules—Intra-Smooth, Proto-Clip, and Inter-Align—to provide geometric control signals that stabilize model adaptation across sequential tasks while maintaining robustness.