AINeutralarXiv – CS AI · 3h ago6/10
🧠
From Fact Overwriting to Knowledge Evolution: Causal Editing via On-Policy Self-Distillation
Researchers present CODE, a novel approach to knowledge editing in large language models that replaces fact overwriting with causal reasoning. By embedding causal narratives and on-policy distillation into model parameters, CODE reduces self-refutation rates from 95.6% to 1.8%, enabling LLMs to evolve knowledge coherently rather than storing isolated facts.