AI · Neutral · arXiv – CS AI · 14h ago · 6/10
Towards Localized and Disentangled Knowledge Editing for Multimodal Large Language Models
Researchers propose LDKE, a new framework for editing knowledge in Multimodal Large Language Models that addresses two critical failure modes: causal misalignment (edits confined to specific samples) and feature entanglement (unintended alterations to related information). The method uses localized layer identification and input disentanglement to enable precise, generalized edits while preserving unrelated knowledge.