| Literature DB >> 32693798 |
Miran Kim1, Yongsoo Song2, Baiyu Li3, Daniele Micciancio3.
Abstract
BACKGROUND: The sharing of biomedical data is crucial to enable scientific discoveries across institutions and improve health care. For example, genome-wide association studies (GWAS) based on a large number of samples can identify disease-causing genetic variants. The privacy concern, however, has become a major hurdle for data management and utilization. Homomorphic encryption is one of the most powerful cryptographic primitives which can address the privacy and security issues. It supports the computation on encrypted data, so that we can aggregate data and perform an arbitrary computation on an untrusted cloud environment without the leakage of sensitive information.Entities:
Keywords: Genome-wide association studies; Homomorphic encryption; Logistic regression
Mesh:
Year: 2020 PMID: 32693798 PMCID: PMC7372846 DOI: 10.1186/s12920-020-0724-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
HE parameter sets
| NumIter | log | log | log | log | log | ||
|---|---|---|---|---|---|---|---|
| Set-I | 1 | 15 | 15 | 43 | 51 | 60 | 713 |
| Set-II | 2 | 15 | 19 | 43 | 51 | 60 | 885 |
| Set-III | 3 | 16 | 23 | 45 | 54 | 62 | 1106 |
Experimental results for iDASH dataset with 245 samples, each has 10643 SNPs and 3 covariates (4 cores)
| Stage | Set-I | Set-II | Set-III | |||
|---|---|---|---|---|---|---|
| Key Generation | 4.460 s | 2.321 GB | 6.665 s | 3.584 GB | 9.699 s | 10.721 GB |
| Encryption | 7.059 s | 5.406 GB | 7.066 s | 6.669 GB | 23.023 s | 12.137 GB |
| Training with covariates | 2.622 s | 7.176 GB | 9.367 s | 7.186 GB | 62.922 s | 12.137 GB |
| Training with all SNPs | 40.442 s | 10.339 GB | 42.567 s | 11.176 GB | 108.24 s | 12.137 GB |
| − | − | − | ||||
| Decryption | 0.025 s | 10.339 GB | 0.025 s | 11.176 GB | 0.055 s | 12.137 GB |
| Reconstruction | 0.794 ms | 10.339 GB | 0.794 ms | 11.176 GB | 2.821 ms | 12.137 GB |
Fig. 1Comparison with the semi-parallel model (p-value cut-off: 10−5)
Fig. 2Comparison with the gold standard model (p-value cut-off: 10−5)
F1-Scores on different models
| Cut-off | v.s. Plain semi-parallel model | v.s. Plain gold standard model | ||||
|---|---|---|---|---|---|---|
| Set-I | Set-II | Set-III | Set-I | Set-II | Set-III | |
| 10−2 | 0.9807 | 0.9830 | 0.9964 | 0.9818 | 0.9808 | 0.9710 |
| 10−3 | 0.9749 | 0.9810 | 0.9975 | 0.9878 | 0.9887 | 0.9740 |
| 10−4 | 0.9745 | 0.9798 | 0.9969 | 0.9878 | 0.9888 | 0.9729 |
| 10−5 | 0.9828 | 0.9852 | 0.9971 | 0.9946 | 0.9970 | 0.9805 |
DeLong’s Test for AUCs of our solution with Set-II against the plain semi-parallel model
| Cut-off | Mean and stdev of the test results |
|---|---|
| 10−2 | 0.4038 ±0.3001 |
| 10−3 | 0.5357 ±0.2704 |
| 10−4 | 0.6404 ±0.2638 |
| 10−5 | 0.8959 ±0.2195 |