Sparse Bayesian deep learning for dynamic system identification

Hongpeng Zhou; I. Chahine; Wei Xing Zheng; Wei Pan

doi:10.1016/j.automatica.2022.110489

Corresponding author: Wei Pan

Robot Dynamics

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

This paper proposes a sparse Bayesian treatment of deep neural networks (DNNs) for system identification. Although DNNs show impressive approximation ability in various fields, several challenges still exist for system identification problems. First, DNNs are known to be so complex that they can easily overfit the training data. Second, the selection of the input regressors for system identification is nontrivial. Third, uncertainty quantification of the model parameters and predictions is necessary. The proposed Bayesian approach offers a principled way to alleviate the above challenges by approximating the marginal likelihood (model evidence) and constructing structured group-sparsity-inducing priors. The identification algorithm is derived as an iterative regularised optimisation procedure that can be solved as efficiently as training typical DNNs. Remarkably, an efficient and recursive Hessian calculation method for each layer of DNNs is developed, turning the intractable training/optimisation process into a tractable one. Furthermore, a practical calculation approach based on the Monte-Carlo integration method is derived to quantify the uncertainty of the parameters and predictions. The effectiveness of the proposed Bayesian approach is demonstrated on several linear and nonlinear system identification benchmarks, where it achieves good and competitive simulation accuracy. The code to reproduce the experimental results is open-sourced and available online.

Original language	English
Article number	110489
Number of pages	11
Journal	Automatica
Volume	144
DOIs	https://doi.org/10.1016/j.automatica.2022.110489
Publication status	Published - 2022

Keywords

Deep neural networks
Group sparsity
Regularised system identification
Sparse Bayesian learning

Access to Document

10.1016/j.automatica.2022.110489

1-s2.0-S000510982200348X-main: Final published version, 1.13 MB, Licence: CC BY

Cite this

@article{72ef293a5e1b478a932cac7b900b669c,
title = "Sparse Bayesian deep learning for dynamic system identification",
abstract = "This paper proposes a sparse Bayesian treatment of deep neural networks (DNNs) for system identification. Although DNNs show impressive approximation ability in various fields, several challenges still exist for system identification problems. First, DNNs are known to be so complex that they can easily overfit the training data. Second, the selection of the input regressors for system identification is nontrivial. Third, uncertainty quantification of the model parameters and predictions is necessary. The proposed Bayesian approach offers a principled way to alleviate the above challenges by approximating the marginal likelihood (model evidence) and constructing structured group-sparsity-inducing priors. The identification algorithm is derived as an iterative regularised optimisation procedure that can be solved as efficiently as training typical DNNs. Remarkably, an efficient and recursive Hessian calculation method for each layer of DNNs is developed, turning the intractable training/optimisation process into a tractable one. Furthermore, a practical calculation approach based on the Monte-Carlo integration method is derived to quantify the uncertainty of the parameters and predictions. The effectiveness of the proposed Bayesian approach is demonstrated on several linear and nonlinear system identification benchmarks, where it achieves good and competitive simulation accuracy. The code to reproduce the experimental results is open-sourced and available online.",
keywords = "Deep neural networks, Group sparsity, Regularised system identification, Sparse Bayesian learning",
author = "Hongpeng Zhou and I. Chahine and Zheng, {Wei Xing} and Wei Pan",
year = "2022",
doi = "10.1016/j.automatica.2022.110489",
language = "English",
volume = "144",
journal = "Automatica",
issn = "0005-1098",
publisher = "Elsevier",
}

TY  - JOUR
T1  - Sparse Bayesian deep learning for dynamic system identification
AU  - Zhou, Hongpeng
AU  - Chahine, I.
AU  - Zheng, Wei Xing
AU  - Pan, Wei
PY  - 2022
Y1  - 2022
N2  - This paper proposes a sparse Bayesian treatment of deep neural networks (DNNs) for system identification. Although DNNs show impressive approximation ability in various fields, several challenges still exist for system identification problems. First, DNNs are known to be so complex that they can easily overfit the training data. Second, the selection of the input regressors for system identification is nontrivial. Third, uncertainty quantification of the model parameters and predictions is necessary. The proposed Bayesian approach offers a principled way to alleviate the above challenges by approximating the marginal likelihood (model evidence) and constructing structured group-sparsity-inducing priors. The identification algorithm is derived as an iterative regularised optimisation procedure that can be solved as efficiently as training typical DNNs. Remarkably, an efficient and recursive Hessian calculation method for each layer of DNNs is developed, turning the intractable training/optimisation process into a tractable one. Furthermore, a practical calculation approach based on the Monte-Carlo integration method is derived to quantify the uncertainty of the parameters and predictions. The effectiveness of the proposed Bayesian approach is demonstrated on several linear and nonlinear system identification benchmarks, where it achieves good and competitive simulation accuracy. The code to reproduce the experimental results is open-sourced and available online.
AB  - This paper proposes a sparse Bayesian treatment of deep neural networks (DNNs) for system identification. Although DNNs show impressive approximation ability in various fields, several challenges still exist for system identification problems. First, DNNs are known to be so complex that they can easily overfit the training data. Second, the selection of the input regressors for system identification is nontrivial. Third, uncertainty quantification of the model parameters and predictions is necessary. The proposed Bayesian approach offers a principled way to alleviate the above challenges by approximating the marginal likelihood (model evidence) and constructing structured group-sparsity-inducing priors. The identification algorithm is derived as an iterative regularised optimisation procedure that can be solved as efficiently as training typical DNNs. Remarkably, an efficient and recursive Hessian calculation method for each layer of DNNs is developed, turning the intractable training/optimisation process into a tractable one. Furthermore, a practical calculation approach based on the Monte-Carlo integration method is derived to quantify the uncertainty of the parameters and predictions. The effectiveness of the proposed Bayesian approach is demonstrated on several linear and nonlinear system identification benchmarks, where it achieves good and competitive simulation accuracy. The code to reproduce the experimental results is open-sourced and available online.
KW  - Deep neural networks
KW  - Group sparsity
KW  - Regularised system identification
KW  - Sparse Bayesian learning
UR  - http://www.scopus.com/inward/record.url?scp=85134638748&partnerID=8YFLogxK
U2  - 10.1016/j.automatica.2022.110489
DO  - 10.1016/j.automatica.2022.110489
M3  - Article
AN  - SCOPUS:85134638748
SN  - 0005-1098
VL  - 144
JO  - Automatica
JF  - Automatica
M1  - 110489
ER  -
