
Title: SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering

Abstract: The general capabilities of large language models (LLMs) make them the infrastructure for various AI applications, but updating their internal knowledge requires significant resources. Model editing has recently emerged as a promising technique for efficiently updating a small amount of an LLM's knowledge and has attracted much attention. In particular, local editing methods, which directly update model parameters, are well suited to such small-scale updates. These methods update weights by computing least-squares closed-form solutions and identify edited knowledge through vector-level matching at inference time, achieving promising results. However, they still require considerable time and resources to complete the computation. Moreover, vector-level matching lacks reliability, and such updates disrupt the original organization of the model's parameters. To address these issues, we propose a detachable and expandable Subject Word Embedding Altering (SWEA) framework, which finds editing embeddings through token-level matching and adds them to the subject word embeddings in the Transformer input. To obtain these editing embeddings, we propose an optimizing-then-suppressing fusion method, which first optimizes learnable embedding vectors for the editing target and then suppresses the Knowledge Embedding Dimensions (KEDs) to obtain the final editing embeddings. We thus propose the SWEA$\oplus$OS method for editing factual knowledge in LLMs. SWEA$\oplus$OS achieves overall state-of-the-art (SOTA) performance on the \textsc{CounterFact} and zsRE datasets. To further validate its reasoning ability when editing knowledge, we evaluate it on the more complex \textsc{RippleEdits} benchmark, where the results show that SWEA$\oplus$OS also possesses SOTA reasoning ability.
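
As a rough sketch of the mechanism the abstract describes, the snippet below illustrates token-level matching of the subject in the prompt, a schematic optimizing-then-suppressing fusion, and the addition of editing embeddings to the subject word embeddings at the Transformer input. All function names, tensor shapes, and the choice to zero the KEDs are illustrative assumptions, not the authors' implementation; the actual code is at the repository linked under Comments:

    import torch

    def find_subject_spans(input_ids, subject_ids):
        # Token-level matching: return every start index where the
        # subject's token-id sequence occurs in the prompt.
        ids = input_ids.tolist()
        n = len(subject_ids)
        return [i for i in range(len(ids) - n + 1) if ids[i:i + n] == subject_ids]

    def fuse_optimize_then_suppress(optimized_embeds, ked_indices):
        # Schematic fusion: start from embedding vectors optimized for
        # the editing target, then suppress the Knowledge Embedding
        # Dimensions (KEDs). Zeroing the given dimensions is an
        # assumption; the paper's suppression operator may differ.
        editing_embeds = optimized_embeds.clone()
        editing_embeds[:, ked_indices] = 0.0
        return editing_embeds

    def apply_swea(input_embeds, input_ids, subject_ids, editing_embeds):
        # Add the editing embeddings to the subject's word embeddings in
        # the Transformer input; all other positions are left untouched.
        edited = input_embeds.clone()
        n = len(subject_ids)
        for start in find_subject_spans(input_ids, subject_ids):
            edited[start:start + n] += editing_embeds
        return edited

    # Toy usage with made-up shapes (hidden size 8, a two-token subject).
    hidden = 8
    input_ids = torch.tensor([5, 42, 7, 9, 3])          # prompt token ids
    subject_ids = [42, 7]                                # subject token ids
    input_embeds = torch.randn(len(input_ids), hidden)   # embedded prompt
    optimized = torch.randn(len(subject_ids), hidden)    # learned per-token vectors
    editing = fuse_optimize_then_suppress(optimized, ked_indices=[0, 3])
    edited_embeds = apply_swea(input_embeds, input_ids, subject_ids, editing)

Because the edit lives entirely in embeddings added at the input, removing or extending an edit only means deleting or adding entries in the editing-embedding table, which is what makes the framework detachable and expandable.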
Comments: Under review; our code is available at this https URL
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as: arXiv:2401.17809 [cs.CL]
  (or arXiv:2401.17809v3 [cs.CL] for this version)

Submission history

From: Xiaopeng Li [view email]
[v1] Wed, 31 Jan 2024 13:08:45 GMT (206kb,D)
[v2] Thu, 15 Feb 2024 15:43:55 GMT (380kb,D)
[v3] Tue, 23 Apr 2024 01:08:44 GMT (391kb,D)
