Distributionally Robust Inverse Covariance Estimation: The Wasserstein Shrinkage Estimator

Viet Anh Nguyen; Daniel Kuhn; Peyman Mohajerin Esfahani

doi:10.1287/opre.2020.2076

Distributionally Robust Inverse Covariance Estimation: The Wasserstein Shrinkage Estimator

Viet Anh Nguyen, Daniel Kuhn, Peyman Mohajerin Esfahani

Research output: Contribution to journal › Article › Scientific › peer-review

9 Citations (Scopus)

37 Downloads (Pure)

Abstract

We introduce a distributionally robust maximum likelihood estimation model with a Wasserstein ambiguity set to infer the inverse covariance matrix of a p-dimensional Gaussian random vector from n independent samples. The proposed model minimizes the worst case (maximum) of Stein’s loss across all normal reference distributions within a prescribed Wasserstein distance from the normal distribution characterized by the sample mean and the sample covariance matrix. We prove that this estimation problem is equivalent to a semidefinite program that is tractable in theory but beyond the reach of general-purpose solvers for practically relevant problem dimensions p. In the absence of any prior structural information, the estimation problem has an analytical solution that is naturally interpreted as a nonlinear shrinkage estimator. Besides being invertible and well conditioned even for p > n, the new shrinkage estimator is rotation equivariant and preserves the order of the eigenvalues of the sample covariance matrix. These desirable properties are not imposed ad hoc but emerge naturally from the underlying distributionally robust optimization model. Finally, we develop a sequential quadratic approximation algorithm for efficiently solving the general estimation problem subject to conditional independence constraints typically encountered in Gaussian graphical models.

Original language	English
Pages (from-to)	490-515
Journal	Operations Research
Volume	70
Issue number	1
DOIs	https://doi.org/10.1287/opre.2020.2076
Publication status	Published - 2022

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Data-driven optimization
Distributionally robust optimization
Maximum likelihood estimation
Shrinkage estimator
Wasserstein distance

Access to Document

10.1287/opre.2020.2076

opre.2020.2076Final published version, 2.13 MB

Cite this

@article{70ea84d7db7e4c14a60b038187c86433,

title = "Distributionally Robust Inverse Covariance Estimation: The Wasserstein Shrinkage Estimator",

abstract = "We introduce a distributionally robust maximum likelihood estimation model with a Wasserstein ambiguity set to infer the inverse covariance matrix of a p-dimensional Gaussian random vector from n independent samples. The proposed model minimizes the worst case (maximum) of Stein{\textquoteright}s loss across all normal reference distributions within a prescribed Wasserstein distance from the normal distribution characterized by the sample mean and the sample covariance matrix. We prove that this estimation problem is equivalent to a semidefinite program that is tractable in theory but beyond the reach of general-purpose solvers for practically relevant problem dimensions p. In the absence of any prior structural information, the estimation problem has an analytical solution that is naturally interpreted as a nonlinear shrinkage estimator. Besides being invertible and well conditioned even for p > n, the new shrinkage estimator is rotation equivariant and preserves the order of the eigenvalues of the sample covariance matrix. These desirable properties are not imposed ad hoc but emerge naturally from the underlying distributionally robust optimization model. Finally, we develop a sequential quadratic approximation algorithm for efficiently solving the general estimation problem subject to conditional independence constraints typically encountered in Gaussian graphical models.",

keywords = "Data-driven optimization, Distributionally robust optimization, Maximum likelihood estimation, Shrinkage estimator, Wasserstein distance",

author = "Nguyen, {Viet Anh} and Daniel Kuhn and Esfahani, {Peyman Mohajerin}",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2022",

doi = "10.1287/opre.2020.2076",

language = "English",

volume = "70",

pages = "490--515",

journal = "Operations Research",

issn = "0030-364X",

publisher = "INFORMS Inst.for Operations Res.and the Management Sciences",

number = "1",

}

TY - JOUR

T1 - Distributionally Robust Inverse Covariance Estimation

T2 - The Wasserstein Shrinkage Estimator

AU - Nguyen, Viet Anh

AU - Kuhn, Daniel

AU - Esfahani, Peyman Mohajerin

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2022

Y1 - 2022

N2 - We introduce a distributionally robust maximum likelihood estimation model with a Wasserstein ambiguity set to infer the inverse covariance matrix of a p-dimensional Gaussian random vector from n independent samples. The proposed model minimizes the worst case (maximum) of Stein’s loss across all normal reference distributions within a prescribed Wasserstein distance from the normal distribution characterized by the sample mean and the sample covariance matrix. We prove that this estimation problem is equivalent to a semidefinite program that is tractable in theory but beyond the reach of general-purpose solvers for practically relevant problem dimensions p. In the absence of any prior structural information, the estimation problem has an analytical solution that is naturally interpreted as a nonlinear shrinkage estimator. Besides being invertible and well conditioned even for p > n, the new shrinkage estimator is rotation equivariant and preserves the order of the eigenvalues of the sample covariance matrix. These desirable properties are not imposed ad hoc but emerge naturally from the underlying distributionally robust optimization model. Finally, we develop a sequential quadratic approximation algorithm for efficiently solving the general estimation problem subject to conditional independence constraints typically encountered in Gaussian graphical models.

AB - We introduce a distributionally robust maximum likelihood estimation model with a Wasserstein ambiguity set to infer the inverse covariance matrix of a p-dimensional Gaussian random vector from n independent samples. The proposed model minimizes the worst case (maximum) of Stein’s loss across all normal reference distributions within a prescribed Wasserstein distance from the normal distribution characterized by the sample mean and the sample covariance matrix. We prove that this estimation problem is equivalent to a semidefinite program that is tractable in theory but beyond the reach of general-purpose solvers for practically relevant problem dimensions p. In the absence of any prior structural information, the estimation problem has an analytical solution that is naturally interpreted as a nonlinear shrinkage estimator. Besides being invertible and well conditioned even for p > n, the new shrinkage estimator is rotation equivariant and preserves the order of the eigenvalues of the sample covariance matrix. These desirable properties are not imposed ad hoc but emerge naturally from the underlying distributionally robust optimization model. Finally, we develop a sequential quadratic approximation algorithm for efficiently solving the general estimation problem subject to conditional independence constraints typically encountered in Gaussian graphical models.

KW - Data-driven optimization

KW - Distributionally robust optimization

KW - Maximum likelihood estimation

KW - Shrinkage estimator

KW - Wasserstein distance

UR - http://www.scopus.com/inward/record.url?scp=85124941201&partnerID=8YFLogxK

U2 - 10.1287/opre.2020.2076

DO - 10.1287/opre.2020.2076

M3 - Article

AN - SCOPUS:85124941201

SN - 0030-364X

VL - 70

SP - 490

EP - 515

JO - Operations Research

JF - Operations Research

IS - 1

ER -

Distributionally Robust Inverse Covariance Estimation: The Wasserstein Shrinkage Estimator

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this