Neural network relief: a pruning algorithm based on neural activity

Aleksandr Dekhovich; David M.J. Tax; Marcel H.F. Sluiter; Miguel A. Bessa

doi:10.1007/s10994-024-06516-z

Corresponding author: Miguel A. Bessa

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Current deep neural networks (DNNs) are overparameterized and use most of their neuronal connections during inference for each task. The human brain, however, developed specialized regions for different tasks and performs inference with a small fraction of its neuronal connections. We propose an iterative pruning strategy introducing a simple importance-score metric that deactivates unimportant connections, tackling overparameterization in DNNs and modulating the firing patterns. The aim is to find the smallest number of connections that is still capable of solving a given task with comparable accuracy, i.e. a simpler subnetwork. We achieve comparable performance for LeNet architectures on MNIST, and significantly higher parameter compression than state-of-the-art algorithms for VGG and ResNet architectures on CIFAR-10/100 and Tiny-ImageNet. Our approach also performs well for the two different optimizers considered—Adam and SGD. The algorithm is not designed to minimize FLOPs when considering current hardware and software implementations, although it performs reasonably when compared to the state of the art.
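The abstract names an importance-score metric based on neural activity but does not define it. As a rough illustration only, the sketch below assumes one plausible activity-based score: the average share that each incoming connection contributes to a neuron's total absolute input signal over a batch of samples. It then keeps the smallest set of connections whose scores sum to at least a threshold. The function names, the `alpha` parameter, and the exact formula are assumptions made for this example, not taken from the record, and the prune-and-retrain loop that makes the strategy iterative is omitted.

```python
def importance_scores(weights, bias, inputs):
    """Assumed score: mean share of each connection in the neuron's absolute input signal.

    weights: incoming weights of a single neuron
    bias:    the neuron's bias term
    inputs:  non-empty list of samples, each a list of input activations
    """
    if not inputs:
        raise ValueError("at least one input sample is required")
    totals = [0.0] * len(weights)
    for sample in inputs:
        contributions = [abs(w * x) for w, x in zip(weights, sample)]
        denominator = sum(contributions) + abs(bias)
        if denominator == 0.0:
            continue  # the neuron receives no signal for this sample
        for i, c in enumerate(contributions):
            totals[i] += c / denominator
    return [t / len(inputs) for t in totals]


def prune_neuron(weights, bias, inputs, alpha=0.95):
    """Zero every connection outside the smallest set whose scores sum to >= alpha.

    alpha is a hypothetical threshold; if the scores never reach it,
    all connections are kept.
    """
    scores = importance_scores(weights, bias, inputs)
    order = sorted(range(len(weights)), key=lambda i: scores[i], reverse=True)
    kept, running = set(), 0.0
    for i in order:
        if running >= alpha:
            break
        kept.add(i)
        running += scores[i]
    return [w if i in kept else 0.0 for i, w in enumerate(weights)]


# The weak middle connection carries well under 1% of the signal and is removed.
pruned = prune_neuron([2.0, 0.01, -1.0], 0.0, [[1.0, 1.0, 1.0], [1.0, 1.0, 1.0]])
print(pruned)
```

Scoring connections by the signal they actually carry, rather than by weight magnitude alone, is what ties the pruning decision to the data the network sees; a large weight on an input that is almost always zero would score low under this assumed metric.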

Original language	English
Pages (from-to)	2597-2618
Number of pages	22
Journal	Machine Learning
Volume	113
Issue number	5
DOIs	https://doi.org/10.1007/s10994-024-06516-z
Publication status	Published - 2024

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Connection pruning
Iterative pruning
Neural network pruning
Sparsity

Access to Document

10.1007/s10994-024-06516-z

Cite this

@article{1c5b4a481f6c46fea48082354e07aa5c,

title = "Neural network relief: a pruning algorithm based on neural activity",

abstract = "Current deep neural networks (DNNs) are overparameterized and use most of their neuronal connections during inference for each task. The human brain, however, developed specialized regions for different tasks and performs inference with a small fraction of its neuronal connections. We propose an iterative pruning strategy introducing a simple importance-score metric that deactivates unimportant connections, tackling overparameterization in DNNs and modulating the firing patterns. The aim is to find the smallest number of connections that is still capable of solving a given task with comparable accuracy, i.e. a simpler subnetwork. We achieve comparable performance for LeNet architectures on MNIST, and significantly higher parameter compression than state-of-the-art algorithms for VGG and ResNet architectures on CIFAR-10/100 and Tiny-ImageNet. Our approach also performs well for the two different optimizers considered—Adam and SGD. The algorithm is not designed to minimize FLOPs when considering current hardware and software implementations, although it performs reasonably when compared to the state of the art.",

keywords = "Connection pruning, Iterative pruning, Neural network pruning, Sparsity",

author = "Dekhovich, Aleksandr and Tax, {David M.J.} and Sluiter, {Marcel H.F.} and Bessa, {Miguel A.}",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2024",

doi = "10.1007/s10994-024-06516-z",

language = "English",

volume = "113",

pages = "2597--2618",

journal = "Machine Learning",

issn = "0885-6125",

publisher = "Springer",

number = "5",

}

TY - JOUR

T1 - Neural network relief

T2 - a pruning algorithm based on neural activity

AU - Dekhovich, Aleksandr

AU - Tax, David M.J.

AU - Sluiter, Marcel H.F.

AU - Bessa, Miguel A.

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2024

Y1 - 2024

AB - Current deep neural networks (DNNs) are overparameterized and use most of their neuronal connections during inference for each task. The human brain, however, developed specialized regions for different tasks and performs inference with a small fraction of its neuronal connections. We propose an iterative pruning strategy introducing a simple importance-score metric that deactivates unimportant connections, tackling overparameterization in DNNs and modulating the firing patterns. The aim is to find the smallest number of connections that is still capable of solving a given task with comparable accuracy, i.e. a simpler subnetwork. We achieve comparable performance for LeNet architectures on MNIST, and significantly higher parameter compression than state-of-the-art algorithms for VGG and ResNet architectures on CIFAR-10/100 and Tiny-ImageNet. Our approach also performs well for the two different optimizers considered—Adam and SGD. The algorithm is not designed to minimize FLOPs when considering current hardware and software implementations, although it performs reasonably when compared to the state of the art.

KW - Connection pruning

KW - Iterative pruning

KW - Neural network pruning

KW - Sparsity

UR - http://www.scopus.com/inward/record.url?scp=85186585134&partnerID=8YFLogxK

U2 - 10.1007/s10994-024-06516-z

DO - 10.1007/s10994-024-06516-z

M3 - Article

AN - SCOPUS:85186585134

SN - 0885-6125

VL - 113

SP - 2597

EP - 2618

JO - Machine Learning

JF - Machine Learning

IS - 5

ER -
