To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA

Azade Rezaeezade; Guilherme Perin; Stjepan Picek

doi:10.1007/978-3-031-17433-9_17

To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA

Azade Rezaeezade^*, Guilherme Perin, Stjepan Picek

^*Corresponding author for this work

Cyber Security

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

21 Downloads (Pure)

Abstract

Profiling side-channel analysis allows evaluators to estimate the worst-case security of a target. When security evaluations relax the assumptions about the adversary’s knowledge, profiling models may easily be sub-optimal due to the inability to extract the most informative points of interest from the side-channel measurements. When used for profiling attacks, deep neural networks can learn strong models without feature selection with the drawback of expensive hyperparameter tuning. Unfortunately, due to very large search spaces, one usually finds very different model behaviors, and a widespread situation is to face overfitting with typically poor generalization capacity. Usually, overfitting or poor generalization would be mitigated by adding more measurements to the profiling phase to reduce estimation errors. This paper provides a detailed analysis of different deep learning model behaviors and shows that adding more profiling traces as a single solution does not necessarily help improve generalization. We recognize the main problem to be the sub-optimal selection of hyperparameters, which is then difficult to resolve by simply adding more measurements. Instead, we propose to use small hyperparameter tweaks or regularization as techniques to resolve the problem.

Original language	English
Title of host publication	Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings
Editors	Lejla Batina, Joan Daemen
Publisher	Springer
Pages	397-421
Number of pages	25
ISBN (Print)	978-3-031-17432-2
DOIs	https://doi.org/10.1007/978-3-031-17433-9_17
Publication status	Published - 2022
Event	13th International Conference on Progress in Cryptology in Africa, AFRICACRYPT 2022 - Fes, Morocco Duration: 18 Jul 2022 → 20 Jul 2022

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	13503 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	13th International Conference on Progress in Cryptology in Africa, AFRICACRYPT 2022
Country/Territory	Morocco
City	Fes
Period	18/07/22 → 20/07/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Deep learning
Generalization
Overfitting
Side-channel analysis

Access to Document

10.1007/978-3-031-17433-9_17

978-3-031-17433-9_17Final published version, 1.87 MB

Cite this

Rezaeezade, A., Perin, G., & Picek, S. (2022). To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA. In L. Batina, & J. Daemen (Eds.), Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings (pp. 397-421). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13503 LNCS). Springer. https://doi.org/10.1007/978-3-031-17433-9_17

Rezaeezade, Azade ; Perin, Guilherme ; Picek, Stjepan. / To Overfit, or Not to Overfit : Improving the Performance of Deep Learning-Based SCA. Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings. editor / Lejla Batina ; Joan Daemen. Springer, 2022. pp. 397-421 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

@inproceedings{8ea7dff1462343f5a4b713f53f1c049f,

title = "To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA",

abstract = "Profiling side-channel analysis allows evaluators to estimate the worst-case security of a target. When security evaluations relax the assumptions about the adversary{\textquoteright}s knowledge, profiling models may easily be sub-optimal due to the inability to extract the most informative points of interest from the side-channel measurements. When used for profiling attacks, deep neural networks can learn strong models without feature selection with the drawback of expensive hyperparameter tuning. Unfortunately, due to very large search spaces, one usually finds very different model behaviors, and a widespread situation is to face overfitting with typically poor generalization capacity. Usually, overfitting or poor generalization would be mitigated by adding more measurements to the profiling phase to reduce estimation errors. This paper provides a detailed analysis of different deep learning model behaviors and shows that adding more profiling traces as a single solution does not necessarily help improve generalization. We recognize the main problem to be the sub-optimal selection of hyperparameters, which is then difficult to resolve by simply adding more measurements. Instead, we propose to use small hyperparameter tweaks or regularization as techniques to resolve the problem.",

keywords = "Deep learning, Generalization, Overfitting, Side-channel analysis",

author = "Azade Rezaeezade and Guilherme Perin and Stjepan Picek",

note = "Green Open Access added to TU Delft Institutional Repository {\textquoteleft}You share, we take care!{\textquoteright} – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public. ; 13th International Conference on Progress in Cryptology in Africa, AFRICACRYPT 2022 ; Conference date: 18-07-2022 Through 20-07-2022",

year = "2022",

doi = "10.1007/978-3-031-17433-9_17",

language = "English",

isbn = "978-3-031-17432-2",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "397--421",

editor = "Lejla Batina and Joan Daemen",

booktitle = "Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings",

}

Rezaeezade, A , Perin, G & Picek, S 2022, To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA. in L Batina & J Daemen (eds), Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 13503 LNCS, Springer, pp. 397-421, 13th International Conference on Progress in Cryptology in Africa, AFRICACRYPT 2022, Fes, Morocco, 18/07/22. https://doi.org/10.1007/978-3-031-17433-9_17

To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA. / Rezaeezade, Azade ; Perin, Guilherme ; Picek, Stjepan.
Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings. ed. / Lejla Batina; Joan Daemen. Springer, 2022. p. 397-421 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 13503 LNCS).

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

TY - GEN

T1 - To Overfit, or Not to Overfit

T2 - 13th International Conference on Progress in Cryptology in Africa, AFRICACRYPT 2022

AU - Rezaeezade, Azade

AU - Perin, Guilherme

AU - Picek, Stjepan

N1 - Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

N2 - Profiling side-channel analysis allows evaluators to estimate the worst-case security of a target. When security evaluations relax the assumptions about the adversary’s knowledge, profiling models may easily be sub-optimal due to the inability to extract the most informative points of interest from the side-channel measurements. When used for profiling attacks, deep neural networks can learn strong models without feature selection with the drawback of expensive hyperparameter tuning. Unfortunately, due to very large search spaces, one usually finds very different model behaviors, and a widespread situation is to face overfitting with typically poor generalization capacity. Usually, overfitting or poor generalization would be mitigated by adding more measurements to the profiling phase to reduce estimation errors. This paper provides a detailed analysis of different deep learning model behaviors and shows that adding more profiling traces as a single solution does not necessarily help improve generalization. We recognize the main problem to be the sub-optimal selection of hyperparameters, which is then difficult to resolve by simply adding more measurements. Instead, we propose to use small hyperparameter tweaks or regularization as techniques to resolve the problem.

AB - Profiling side-channel analysis allows evaluators to estimate the worst-case security of a target. When security evaluations relax the assumptions about the adversary’s knowledge, profiling models may easily be sub-optimal due to the inability to extract the most informative points of interest from the side-channel measurements. When used for profiling attacks, deep neural networks can learn strong models without feature selection with the drawback of expensive hyperparameter tuning. Unfortunately, due to very large search spaces, one usually finds very different model behaviors, and a widespread situation is to face overfitting with typically poor generalization capacity. Usually, overfitting or poor generalization would be mitigated by adding more measurements to the profiling phase to reduce estimation errors. This paper provides a detailed analysis of different deep learning model behaviors and shows that adding more profiling traces as a single solution does not necessarily help improve generalization. We recognize the main problem to be the sub-optimal selection of hyperparameters, which is then difficult to resolve by simply adding more measurements. Instead, we propose to use small hyperparameter tweaks or regularization as techniques to resolve the problem.

KW - Deep learning

KW - Generalization

KW - Overfitting

KW - Side-channel analysis

UR - http://www.scopus.com/inward/record.url?scp=85141657113&partnerID=8YFLogxK

U2 - 10.1007/978-3-031-17433-9_17

DO - 10.1007/978-3-031-17433-9_17

M3 - Conference contribution

AN - SCOPUS:85141657113

SN - 978-3-031-17432-2

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 397

EP - 421

BT - Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings

A2 - Batina, Lejla

A2 - Daemen, Joan

PB - Springer

Y2 - 18 July 2022 through 20 July 2022

ER -

Rezaeezade A , Perin G , Picek S. To Overfit, or Not to Overfit: Improving the Performance of Deep Learning-Based SCA. In Batina L, Daemen J, editors, Progress in Cryptology - AFRICACRYPT 2022 - 13th International Conference on Cryptology in Africa, AFRICACRYPT 2022, Proceedings. Springer. 2022. p. 397-421. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). doi: 10.1007/978-3-031-17433-9_17