Secure genotype imputation using homomorphic encryption

Junwei Zhou; Botian Lei; Huile Lang; Emmanouil Panaousis; Kaitai Liang; Jianwen Xiang

doi:10.1016/j.jisa.2022.103386

Corresponding author: Junwei Zhou

Cyber Security

Research output: Contribution to journal › Article › Scientific › peer-review

Abstract

Genotype imputation estimates missing genotypes in an individual's genetic sequence from a haplotype or genotype reference panel. It boosts the potential of genome-wide association studies and is essential in genetic data analysis. However, genetic sequences are privacy-sensitive: they can reveal an individual's identity and even disease information. This work proposes a secure genotype imputation model that uses a linear regression model and a homomorphic encryption scheme to impute missing genotypes over ciphertext. The inference model is trained with floating-point plaintext parameters, which are then rounded to integers to avoid high-complexity homomorphic evaluation of floating-point operations, without requiring bootstrapping. Even though the rounded parameters of the inference model differ from those of the trained model, we find that the rounding has no effect on the outcome of the homomorphic prediction. Thus, a highly efficient genotype imputation inference model over ciphertext is obtained while maintaining a high security level. Simulation results indicate that the accuracy of the secure inference model is almost the same as that of the original model trained with floating-point parameters. The secure inference model's accuracy is 98.6% for a single genotype.
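
The abstract does not spell out how the trained parameters are rounded, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a fixed-point scale factor (the hypothetical `SCALE = 1000`), made-up regression weights, and genotypes encoded as allele dosages 0, 1 or 2. Encryption is left out entirely: the point is that the integer score uses only additions and multiplications by integer constants, which is the kind of arithmetic an integer homomorphic encryption scheme can evaluate on ciphertexts.

```python
from typing import List, Tuple

SCALE = 1000  # hypothetical fixed-point scale; not taken from the paper


def quantize(weights: List[float], bias: float, scale: int = SCALE) -> Tuple[List[int], int]:
    """Scale the float regression parameters and round them to integers."""
    return [round(w * scale) for w in weights], round(bias * scale)


def to_genotype(score: float) -> int:
    """Snap a regression score to the nearest genotype in {0, 1, 2}."""
    return min(2, max(0, round(score)))


def predict_float(weights: List[float], bias: float, known: List[int]) -> int:
    """Reference prediction with the original float parameters."""
    return to_genotype(bias + sum(w * g for w, g in zip(weights, known)))


def predict_int(int_weights: List[int], int_bias: int, known: List[int],
                scale: int = SCALE) -> int:
    """Prediction with rounded parameters; the score itself is integer-only."""
    int_score = int_bias + sum(w * g for w, g in zip(int_weights, known))
    # In an encrypted setting, only this final decoding step would be
    # performed by the data owner after decryption (assumption).
    return to_genotype(int_score / scale)


# Toy model: four known neighbouring genotypes predict one missing genotype.
weights = [0.4132, 0.2871, -0.1049, 0.3560]  # illustrative values only
bias = 0.0517
int_weights, int_bias = quantize(weights, bias)

samples = [[0, 1, 2, 1], [2, 2, 0, 1], [1, 0, 0, 0], [2, 1, 1, 2], [1, 1, 0, 1]]
float_preds = [predict_float(weights, bias, s) for s in samples]
int_preds = [predict_int(int_weights, int_bias, s) for s in samples]

print("float model:  ", float_preds)
print("integer model:", int_preds)
```

On these toy inputs the rounded model predicts the same genotypes as the float model, which mirrors the abstract's claim in miniature; it says nothing about the accuracy reported in the paper.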

Original language	English
Article number	103386
Number of pages	7
Journal	Journal of Information Security and Applications
Volume	72
DOIs	https://doi.org/10.1016/j.jisa.2022.103386
Publication status	Published - 2023

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Genetic security
Genotype imputation
Homomorphic encryption
Privacy computing
Privacy-preserving

Access to Document

10.1016/j.jisa.2022.103386

1-s2.0-S2214212622002307-main: Final published version, 736 KB

Cite this

@article{90cd9fd8110b4ab08b601d6ae8d5cbe8,
title = "Secure genotype imputation using homomorphic encryption",
abstract = "Genotype imputation estimates missing genotypes in an individual's genetic sequence from a haplotype or genotype reference panel. It boosts the potential of genome-wide association studies and is essential in genetic data analysis. However, genetic sequences are privacy-sensitive: they can reveal an individual's identity and even disease information. This work proposes a secure genotype imputation model that uses a linear regression model and a homomorphic encryption scheme to impute missing genotypes over ciphertext. The inference model is trained with floating-point plaintext parameters, which are then rounded to integers to avoid high-complexity homomorphic evaluation of floating-point operations, without requiring bootstrapping. Even though the rounded parameters of the inference model differ from those of the trained model, we find that the rounding has no effect on the outcome of the homomorphic prediction. Thus, a highly efficient genotype imputation inference model over ciphertext is obtained while maintaining a high security level. Simulation results indicate that the accuracy of the secure inference model is almost the same as that of the original model trained with floating-point parameters. The secure inference model's accuracy is 98.6% for a single genotype.",
keywords = "Genetic security, Genotype imputation, Homomorphic encryption, Privacy computing, Privacy-preserving",
author = "Junwei Zhou and Botian Lei and Huile Lang and Emmanouil Panaousis and Kaitai Liang and Jianwen Xiang",
note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",
year = "2023",
doi = "10.1016/j.jisa.2022.103386",
language = "English",
volume = "72",
journal = "Journal of Information Security and Applications",
issn = "2214-2134",
publisher = "Elsevier",
}

TY  - JOUR
T1  - Secure genotype imputation using homomorphic encryption
AU  - Zhou, Junwei
AU  - Lei, Botian
AU  - Lang, Huile
AU  - Panaousis, Emmanouil
AU  - Liang, Kaitai
AU  - Xiang, Jianwen
N1  - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
PY  - 2023
Y1  - 2023
AB  - Genotype imputation estimates missing genotypes in an individual's genetic sequence from a haplotype or genotype reference panel. It boosts the potential of genome-wide association studies and is essential in genetic data analysis. However, genetic sequences are privacy-sensitive: they can reveal an individual's identity and even disease information. This work proposes a secure genotype imputation model that uses a linear regression model and a homomorphic encryption scheme to impute missing genotypes over ciphertext. The inference model is trained with floating-point plaintext parameters, which are then rounded to integers to avoid high-complexity homomorphic evaluation of floating-point operations, without requiring bootstrapping. Even though the rounded parameters of the inference model differ from those of the trained model, we find that the rounding has no effect on the outcome of the homomorphic prediction. Thus, a highly efficient genotype imputation inference model over ciphertext is obtained while maintaining a high security level. Simulation results indicate that the accuracy of the secure inference model is almost the same as that of the original model trained with floating-point parameters. The secure inference model's accuracy is 98.6% for a single genotype.
KW  - Genetic security
KW  - Genotype imputation
KW  - Homomorphic encryption
KW  - Privacy computing
KW  - Privacy-preserving
UR  - http://www.scopus.com/inward/record.url?scp=85143506262&partnerID=8YFLogxK
U2  - 10.1016/j.jisa.2022.103386
DO  - 10.1016/j.jisa.2022.103386
M3  - Article
AN  - SCOPUS:85143506262
SN  - 2214-2134
VL  - 72
JO  - Journal of Information Security and Applications
JF  - Journal of Information Security and Applications
M1  - 103386
ER  -
