| Literature DB >> 32693805 |
Marcelo Blatt1, Alexander Gusev1,2, Yuriy Polyakov3, Kurt Rohloff1, Vinod Vaikuntanathan1.
Abstract
BACKGROUND: Genome-Wide Association Studies (GWAS) refer to observational studies of a genome-wide set of genetic variants across many individuals to see if any genetic variants are associated with a certain trait. A typical GWAS analysis of a disease phenotype involves iterative logistic regression of a case/control phenotype on a single-neuclotide polymorphism (SNP) with quantitative covariates. GWAS have been a highly successful approach for identifying genetic-variant associations with many poorly-understood diseases. However, a major limitation of GWAS is the dependence on individual-level genotype/phenotype data and the corresponding privacy concerns.Entities:
Keywords: Cryptography; Genome-wide association studies; Homomorphic encryption
Mesh:
Year: 2020 PMID: 32693805 PMCID: PMC7372898 DOI: 10.1186/s12920-020-0719-9
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Maximum storage requirements for N=245; M=10,643; K=3
| Ciphertexts [GB] | Evaluation Keys [GB] | |||||
|---|---|---|---|---|---|---|
| Rotation | Relinearization | |||||
| 0.0085 | 0.0085 | 0.0085 | 0.84 | 2.87 | 3.65 | 0.42 |
Runtimes and peak RAM utilization on a UTHealth ITS VM (4 cores, 16 GB RAM, 200 GB hard drive, AWS T2 Xlarge equivalent, official iDASH’18 evaluation environment) and a server node with 2 x 14 cores of Intel(R) Xeon(R) CPU E5-2680 v4 at 2.40GHz (500 GB RAM and 2 TB hard drive)
| System | KeyGen | Enc | Eval | Dec | Peak RAM | ||
|---|---|---|---|---|---|---|---|
| [min] | [min] | [min] | [s] | [GB] | |||
| UTHealth ITS VM (iDASH) | 245 | 14,841 | 0.35 | 0.34 | 3.46 | 0.06 | 9.99 |
| 28-core server node | 245 | 10,643 | 0.12 | 0.059 | 1.45 | 0.06 | 12.2 |
| 28-core server node | 300 | 20,000 | 0.12 | 0.088 | 1.88 | 0.11 | 16.2 |
| 28-core server node | 1,000 | 131,071 | 0.12 | 0.72 | 10.44 | 0.4 | 116 |
Fig. 1Accuracy of our encrypted computing prototype w.r.t the plaintext reference implementation [3]
F1 score as a function of the degree d of the Chebyshev polynomials used to approximate the logit-function at d=8 and t=1
| 1 | 3 | 5 | 7 | 9 | 11 | 13 | 15 | |
|---|---|---|---|---|---|---|---|---|
| 0.9914 | 0.9924 | 0.9927 | 0.9931 | 0.9932 | 0.9933 | 0.9933 | 0.9933 |
Runtime profiling on the 28-core node; time in seconds; numbers in header row denote step #’s in Algorithm 3; numbers in parentheses are for the single-threaded experiment; → denotes the conversion from packed-matrix to packed-integer encoding
| 1–5 | 6–7 + | 8 | 9 | 10 | 11 | ||||
|---|---|---|---|---|---|---|---|---|---|
| 245 | 10,643 | 13.3 | 23.4 | 4.6 | 27.4 | 10.4 | 1.8 | 5.5 | 0.62 |
| (27.4) | (40.2) | (6.5) | (419) | (59.3) | (12.0) | (84.1) | (1.64) | ||
| 300 | 20,000 | 13.1 | 23.5 | 4.6 | 33.2 | 25.7 | 3.8 | 7.3 | 1.5 |
| 1,000 | 131,071 | 12.7 | 22.9 | 4.2 | 132.8 | 360.6 | 47.2 | 25.0 | 21.0 |