Computer Science > Machine Learning

Title: Remembering Transformer for Continual Learning

Abstract: Neural networks face the challenge of Catastrophic Forgetting (CF) in continual learning, where knowledge from new tasks interferes with previously learned knowledge. We propose the Remembering Transformer, inspired by the brain's Complementary Learning Systems (CLS), to tackle this issue. The Remembering Transformer employs a mixture-of-adapters and a generative model-based routing mechanism that alleviates CF by dynamically routing task data to the relevant adapters. Our approach achieves new state-of-the-art performance on a range of vision continual learning tasks with high parameter efficiency.
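The mechanism described above (per-task adapters selected by a generative router) can be illustrated with a minimal PyTorch sketch. This is not the paper's code: the module names, the choice of a tiny autoencoder per task as the generative router, the reconstruction-error routing rule, and all dimensions are illustrative assumptions based only on the abstract.

import torch
import torch.nn as nn

class Adapter(nn.Module):
    # Small bottleneck adapter applied residually to a frozen feature.
    def __init__(self, dim, bottleneck=16):
        super().__init__()
        self.down = nn.Linear(dim, bottleneck)
        self.up = nn.Linear(bottleneck, dim)

    def forward(self, h):
        return h + self.up(torch.relu(self.down(h)))

class TaskAutoencoder(nn.Module):
    # Tiny autoencoder; low reconstruction error signals "this task's data".
    def __init__(self, dim, code=8):
        super().__init__()
        self.enc = nn.Linear(dim, code)
        self.dec = nn.Linear(code, dim)

    def forward(self, x):
        return self.dec(torch.relu(self.enc(x)))

class MixtureOfAdapters(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.dim = dim
        self.adapters = nn.ModuleList()
        self.routers = nn.ModuleList()

    def add_task(self):
        # Grow one adapter and one routing autoencoder per new task.
        self.adapters.append(Adapter(self.dim))
        self.routers.append(TaskAutoencoder(self.dim))

    def route(self, x):
        # Route each sample to the task whose autoencoder reconstructs it best.
        errs = torch.stack(
            [((ae(x) - x) ** 2).mean(dim=-1) for ae in self.routers], dim=-1)
        return errs.argmin(dim=-1)

    def forward(self, h):
        task = self.route(h)
        out = torch.empty_like(h)
        for t, adapter in enumerate(self.adapters):
            mask = task == t
            if mask.any():
                out[mask] = adapter(h[mask])
        return out

# Usage: features from a frozen backbone, two tasks already registered.
moa = MixtureOfAdapters(dim=64)
moa.add_task(); moa.add_task()
h = torch.randn(8, 64)   # batch of backbone features
y = moa(h)               # routed through per-task adapters

Routing by lowest reconstruction error means no task identity is needed at inference time; each autoencoder acts as a density model for its own task's data, which is one plausible reading of "generative model-based routing".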
Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2404.07518 [cs.LG]
  (or arXiv:2404.07518v2 [cs.LG] for this version)

Submission history

From: Yuwei Sun [view email]
[v1] Thu, 11 Apr 2024 07:22:14 GMT (4834kb,D)
[v2] Tue, 23 Apr 2024 08:02:23 GMT (5074kb,D)
