Continual prune-and-select: Class-incremental learning with specialized subnetworks

Aleksandr  Dekhovich; David M.J. Tax; Marel H.F. Sluiter; Miguel A. Bessa

doi:10.1007/s10489-022-04441-z

Continual prune-and-select: Class-incremental learning with specialized subnetworks

Aleksandr Dekhovich, David M.J. Tax, Marel H.F. Sluiter, Miguel A. Bessa^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Scientific › peer-review

1 Citation (Scopus)

31 Downloads (Pure)

Abstract

The human brain is capable of learning tasks sequentially mostly without forgetting. However, deep neural networks (DNNs) suffer from catastrophic forgetting when learning one task after another. We address this challenge considering a class-incremental learning scenario where the DNN sees test data without knowing the task from which this data originates. During training, Continual Prune-and-Select (CP&S) finds a subnetwork within the DNN that is responsible for solving a given task. Then, during inference, CP&S selects the correct subnetwork to make predictions for that task. A new task is learned by training available neuronal connections of the DNN (previously untrained) to create a new subnetwork by pruning, which can include previously trained connections belonging to other subnetwork(s) because it does not update shared connections. This enables to eliminate catastrophic forgetting by creating specialized regions in the DNN that do not conflict with each other while still allowing knowledge transfer across them. The CP&S strategy is implemented with different subnetwork selection strategies, revealing superior performance to state-of-the-art continual learning methods tested on various datasets (CIFAR-100, CUB-200-2011, ImageNet-100 and ImageNet-1000). In particular, CP&S is capable of sequentially learning 10 tasks from ImageNet-1000 keeping an accuracy around 94% with negligible forgetting, a first-of-its-kind result in class-incremental learning. To the best of the authors’ knowledge, this represents an improvement in accuracy above 10% when compared to the best alternative method.

Original language	English
Pages (from-to)	17849-17864
Number of pages	16
Journal	Applied Intelligence
Volume	53
Issue number	14
DOIs	https://doi.org/10.1007/s10489-022-04441-z
Publication status	Published - 2023

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Catastrophic forgetting
Class-incremental learning
Continual learning
Sparse network representation

Access to Document

10.1007/s10489-022-04441-z

s10489-022-04441-zFinal published version, 1.73 MB

Cite this

@article{5afd326982c6439cb232cf9fed210aca,

title = "Continual prune-and-select: Class-incremental learning with specialized subnetworks",

abstract = "The human brain is capable of learning tasks sequentially mostly without forgetting. However, deep neural networks (DNNs) suffer from catastrophic forgetting when learning one task after another. We address this challenge considering a class-incremental learning scenario where the DNN sees test data without knowing the task from which this data originates. During training, Continual Prune-and-Select (CP&S) finds a subnetwork within the DNN that is responsible for solving a given task. Then, during inference, CP&S selects the correct subnetwork to make predictions for that task. A new task is learned by training available neuronal connections of the DNN (previously untrained) to create a new subnetwork by pruning, which can include previously trained connections belonging to other subnetwork(s) because it does not update shared connections. This enables to eliminate catastrophic forgetting by creating specialized regions in the DNN that do not conflict with each other while still allowing knowledge transfer across them. The CP&S strategy is implemented with different subnetwork selection strategies, revealing superior performance to state-of-the-art continual learning methods tested on various datasets (CIFAR-100, CUB-200-2011, ImageNet-100 and ImageNet-1000). In particular, CP&S is capable of sequentially learning 10 tasks from ImageNet-1000 keeping an accuracy around 94% with negligible forgetting, a first-of-its-kind result in class-incremental learning. To the best of the authors{\textquoteright} knowledge, this represents an improvement in accuracy above 10% when compared to the best alternative method.",

keywords = "Catastrophic forgetting, Class-incremental learning, Continual learning, Sparse network representation",

author = "Aleksandr Dekhovich and Tax, {David M.J.} and Sluiter, {Marel H.F.} and Bessa, {Miguel A.}",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2023",

doi = "10.1007/s10489-022-04441-z",

language = "English",

volume = "53",

pages = "17849--17864",

journal = "Applied Intelligence",

issn = "0924-669X",

publisher = "Springer",

number = "14",

}

TY - JOUR

T1 - Continual prune-and-select

T2 - Class-incremental learning with specialized subnetworks

AU - Dekhovich, Aleksandr

AU - Tax, David M.J.

AU - Sluiter, Marel H.F.

AU - Bessa, Miguel A.

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2023

Y1 - 2023

N2 - The human brain is capable of learning tasks sequentially mostly without forgetting. However, deep neural networks (DNNs) suffer from catastrophic forgetting when learning one task after another. We address this challenge considering a class-incremental learning scenario where the DNN sees test data without knowing the task from which this data originates. During training, Continual Prune-and-Select (CP&S) finds a subnetwork within the DNN that is responsible for solving a given task. Then, during inference, CP&S selects the correct subnetwork to make predictions for that task. A new task is learned by training available neuronal connections of the DNN (previously untrained) to create a new subnetwork by pruning, which can include previously trained connections belonging to other subnetwork(s) because it does not update shared connections. This enables to eliminate catastrophic forgetting by creating specialized regions in the DNN that do not conflict with each other while still allowing knowledge transfer across them. The CP&S strategy is implemented with different subnetwork selection strategies, revealing superior performance to state-of-the-art continual learning methods tested on various datasets (CIFAR-100, CUB-200-2011, ImageNet-100 and ImageNet-1000). In particular, CP&S is capable of sequentially learning 10 tasks from ImageNet-1000 keeping an accuracy around 94% with negligible forgetting, a first-of-its-kind result in class-incremental learning. To the best of the authors’ knowledge, this represents an improvement in accuracy above 10% when compared to the best alternative method.

AB - The human brain is capable of learning tasks sequentially mostly without forgetting. However, deep neural networks (DNNs) suffer from catastrophic forgetting when learning one task after another. We address this challenge considering a class-incremental learning scenario where the DNN sees test data without knowing the task from which this data originates. During training, Continual Prune-and-Select (CP&S) finds a subnetwork within the DNN that is responsible for solving a given task. Then, during inference, CP&S selects the correct subnetwork to make predictions for that task. A new task is learned by training available neuronal connections of the DNN (previously untrained) to create a new subnetwork by pruning, which can include previously trained connections belonging to other subnetwork(s) because it does not update shared connections. This enables to eliminate catastrophic forgetting by creating specialized regions in the DNN that do not conflict with each other while still allowing knowledge transfer across them. The CP&S strategy is implemented with different subnetwork selection strategies, revealing superior performance to state-of-the-art continual learning methods tested on various datasets (CIFAR-100, CUB-200-2011, ImageNet-100 and ImageNet-1000). In particular, CP&S is capable of sequentially learning 10 tasks from ImageNet-1000 keeping an accuracy around 94% with negligible forgetting, a first-of-its-kind result in class-incremental learning. To the best of the authors’ knowledge, this represents an improvement in accuracy above 10% when compared to the best alternative method.

KW - Catastrophic forgetting

KW - Class-incremental learning

KW - Continual learning

KW - Sparse network representation

UR - http://www.scopus.com/inward/record.url?scp=85146173366&partnerID=8YFLogxK

U2 - 10.1007/s10489-022-04441-z

DO - 10.1007/s10489-022-04441-z

M3 - Article

AN - SCOPUS:85146173366

SN - 0924-669X

VL - 53

SP - 17849

EP - 17864

JO - Applied Intelligence

JF - Applied Intelligence

IS - 14

ER -

Continual prune-and-select: Class-incremental learning with specialized subnetworks

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this