#knowledge-removal News & Analysis

5 articles tagged with #knowledge-removal. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

5 articles

AIBullisharXiv – CS AI · Mar 127/10

🧠

Explainable LLM Unlearning Through Reasoning

Researchers introduce Targeted Reasoning Unlearning (TRU), a new method for removing specific knowledge from large language models while preserving general capabilities. The approach uses reasoning-based targets to guide the unlearning process, addressing issues with previous gradient ascent methods that caused unintended capability degradation.

AINeutralarXiv – CS AI · Jun 56/10

🧠

Learning What to Forget: Improving LLM Unlearning via Learned Token-Level Importance

Researchers introduce Alternating Token-Weighted Unlearning (ATWU), a new method for removing specific knowledge from language models while maintaining their general capabilities. The approach identifies which tokens are most relevant for forgetting by measuring conflict with model retention objectives, achieving state-of-the-art results without requiring external supervision or auxiliary models.

AINeutralarXiv – CS AI · Jun 26/10

🧠

Visual-Noise Guided In-Context Distillation for Multimodal Large Language Model Unlearning

Researchers propose Visual-Noise Guided In-Context Distillation (VGID), a novel framework for removing sensitive knowledge from multimodal large language models without full retraining. The method combines visual perturbation with textual in-context unlearning to achieve parameter-level knowledge removal while maintaining model performance, addressing critical privacy and safety concerns in MLLMs.

AINeutralarXiv – CS AI · Apr 206/10

🧠

Harmonizing Multi-Objective LLM Unlearning via Unified Domain Representation and Bidirectional Logit Distillation

Researchers propose a multi-objective unlearning framework for Large Language Models that simultaneously removes hazardous information, preserves general utility, avoids over-refusal, and resists adversarial attacks. The method uses unified domain representation and bidirectional logit distillation to harmonize competing optimization goals, achieving state-of-the-art performance across diverse unlearning requirements.

AIBullisharXiv – CS AI · Mar 27/1024

🧠

DUET: Distilled LLM Unlearning from an Efficiently Contextualized Teacher

Researchers propose DUET, a new distillation-based method for LLM unlearning that removes undesirable knowledge from AI models without full retraining. The technique combines computational efficiency with security advantages, achieving better performance in both knowledge removal and utility preservation while being significantly more data-efficient than existing methods.