LLM Memory & RAG relevance: 8/10

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Xiyu Liu, Qingyi Si, Zhengxiao Liu, Chenxu Yang, Naibin Gu, Zheng Lin
arXiv: 2603.15518v1 Published: 2026-03-16 Updated: 2026-03-16

AI Summary

Targeting the poor generalization of large language models in same-subject knowledge editing, the paper proposes RoSE, which restores the model's ability to recall edited knowledge when following user instructions.

Key Contributions

  • Identifies the geometric root of generalization collapse in same-subject knowledge editing
  • Proposes Isotropic Geometric Alignment to minimize representational deviation
  • Proposes Hierarchical Knowledge Integration to smooth the optimization landscape

Methodology

RoSE combines Isotropic Geometric Alignment, which minimizes representational deviation, with Hierarchical Knowledge Integration, which smooths the optimization landscape; together these restore instruction-following after edits.
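For context, the "standard covariance constraint" criticized in the abstract is the preservation term in locate-then-edit updates such as ROME, where a new fact is written as a rank-one edit to an MLP projection W. A minimal sketch of that standard update follows; it is background, not RoSE's own objective, which this digest does not spell out:

$$
W' = W + \frac{(v^{*} - W k)\,(C^{-1} k)^{\top}}{k^{\top} C^{-1} k},
\qquad C = K K^{\top},
$$

where k is the key activation of the edited subject, v^{*} is the target value encoding the new fact, and K stacks key activations sampled from generic text. The "Covariance Trap" plausibly refers to this C: it is meant to protect unrelated knowledge, yet the paper argues it amplifies the activation drift that prompt variations induce in k.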

Original Abstract

While locate-then-edit knowledge editing efficiently updates knowledge encoded within Large Language Models (LLMs), a critical generalization failure mode emerges in the practical same-subject knowledge editing scenario: models fail to recall the updated knowledge when following user instructions, despite successfully recalling it in the original edited form. This paper identifies the geometric root of this generalization collapse as a fundamental conflict where the inner activation drifts induced by prompt variations exceed the model's geometric tolerance for generalization after editing. We attribute this instability to a dual pathology: (1) The joint optimization with orthogonal gradients collapses solutions into sharp minima with narrow stability, and (2) the standard covariance constraint paradoxically acts as a Covariance Trap that amplifies input perturbations. To resolve this, we introduce RoSE (Robust Same-subject Editing), which employs Isotropic Geometric Alignment to minimize representational deviation and Hierarchical Knowledge Integration to smooth the optimization landscape. Extensive experiments demonstrate that RoSE significantly improves instruction-following capabilities, laying the foundation for robust interactive parametric memory of LLM agents.
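To make the geometry concrete, here is a hedged numpy sketch of a ROME-style rank-one edit under two preservation constraints: the standard key covariance C, and an identity matrix as an illustrative isotropic stand-in. The isotropic variant is an assumption for illustration only and is not claimed to be RoSE's Isotropic Geometric Alignment, whose exact formulation is not given in this digest.

```python
import numpy as np

# Toy dimensions for illustration; real MLP projections are far larger.
d = 64
rng = np.random.default_rng(0)

W = rng.normal(size=(d, d))            # MLP projection being edited
K = rng.normal(size=(d, 10_000))       # key activations cached from generic text
C = K @ K.T / K.shape[1]               # key covariance (preservation term)

k = rng.normal(size=d)                 # key activation of the edited subject
v_star = rng.normal(size=d)            # target value encoding the new fact

def rank_one_edit(W, k, v_star, M):
    """W' = W + (v* - W k)(M^{-1} k)^T / (k^T M^{-1} k).

    M = C gives the standard covariance-constrained (ROME-style) edit;
    M = I gives an isotropic variant used here purely for contrast.
    """
    Minv_k = np.linalg.solve(M, k)
    return W + np.outer(v_star - W @ k, Minv_k) / (k @ Minv_k)

W_cov = rank_one_edit(W, k, v_star, C)          # covariance-constrained
W_iso = rank_one_edit(W, k, v_star, np.eye(d))  # isotropic constraint

# Both variants recall the edit exactly on the original key.
assert np.allclose(W_cov @ k, v_star)
assert np.allclose(W_iso @ k, v_star)

# Under a small prompt-induced drift of the key, recall degrades;
# how fast it degrades depends on the geometry of the constraint.
k_drift = k + 0.05 * rng.normal(size=d)
print("drift error, covariance constraint:", np.linalg.norm(W_cov @ k_drift - v_star))
print("drift error, isotropic constraint: ", np.linalg.norm(W_iso @ k_drift - v_star))
```

Both edits recall the fact exactly on the original key; the printed drift errors show how each constraint responds once the key shifts, which is the failure mode the abstract describes for instruction-style prompts.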

Tags

Knowledge Editing, Large Language Models, Generalization, Instruction Following

arXiv Category

cs.CL