| Literature DB >> 31141561 |
Hui Ma1, Xuyang Guo2, Yuan Ping1,3, Baocang Wang1,4, Yuehua Yang1, Zhili Zhang1, Jingxian Zhou3.
Abstract
With the prosperity of machine learning and cloud computing, meaningful information can be mined from mass electronic medical data which help physicians make proper disease diagnosis for patients. However, using medical data and disease information of patients frequently raise privacy concerns. In this paper, based on single-layer perceptron, we propose a scheme of privacy-preserving clinical decision with cloud support (PPCD), which securely conducts disease model training and prediction for the patient. Each party learns nothing about the other's private information. In PPCD, a lightweight secure multiplication is presented and introduced to improve the model training. Security analysis and experimental results on real data confirm the high accuracy of disease prediction achieved by the proposed PPCD without the risk of privacy disclosure.Entities:
Mesh:
Year: 2019 PMID: 31141561 PMCID: PMC6541381 DOI: 10.1371/journal.pone.0217349
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Architecture of the proposed PPCD.
Summary of notations.
| Notation | Definition |
|---|---|
| Hospital’s public key of the Paillier encryption scheme | |
| Hospital’s private key of the Paillier encryption scheme | |
| Undiagnosed Patient’s public key of the Paillier encryption | |
| Undiagnosed Patient’s private key of the Paillier encryption | |
| The Paillier’s encryption function | |
| The Paillier’s decryption function | |
| Sign(.) | Activation function of SLP |
| Symptom vector of patient | |
| Output value, | |
| The | |
| Encrypted symptom vector of patient | |
| Weight ciphertext vector of | |
| The | |
| Ciphertext of | |
| Ciphertext of | |
| | | The absolute value of |
| The random numbers, | |
| Time cost of one exponentiation operation | |
| Time cost of one multiplication operation | |
| Time cost of one modular inverse operation | |
| # | Not equal to |
Description of the attended four parties.
| Parts | Descriptions |
|---|---|
| Diagnosed Patient(DP) | DP encrypts the symptoms data with the hospital’s public key |
| Undiagnosed Patients(UP) | UP provides the encrypted disease symptoms data for hospital to make decisions |
| Hospital | As a medical service provider, the hospital is a trusted party who is in charge of generating, distributing and management of public key and private key. Meanwhile, the hospital performs model training together with the cloud server and disease predicting for UP based on patient’s symptoms |
| Cloud Server (CS) | CS with almost unlimited storage trains the disease model according to the outsourced medical data. The trained model is securely stored in the hospital |
Medical data for the k-th disease.
| Medical sample | Medical data | Desired output |
|---|---|---|
| { | ||
| { | ||
| ⋯ | ⋯ ⋯ | ⋯ ⋯ |
| { |
Summary of computational cost for x in PPCD.
| Phase | Step | Entity | Computational cost |
|---|---|---|---|
| Disease learning | Step 1 | Hospital | |
| Step 2 | Cloud | (2 | |
| Hospital | 2 | ||
| Step 3 | Hospital | ||
| Step 4 | Cloud | ||
| Disease prediction | Step 1 | Hospital | ( |
Summary of communication overhead in PPCD.
| Phase | Step | Communication overhead |
|---|---|---|
| Outsourcing DP’s data | ||
| Disease learning | Step 1 | |
| Step 2 | 2 | |
| Step 4 | ||
| Disease prediction |
Description of the benchmark data sets.
| Data sets | size | dims | #classes | attributes |
|---|---|---|---|---|
| WBCD | 683 | 9 | 2 | clump thickness; uniformity of cell size; uniformity of cell shape; marginal adhesion; single epithelial cell size; bare nuclei; bland chromatin; normal nucleoli; mitoses |
| HDD | 297 | 13 | 2 | age; sex; cp; trestbpl; chol; fbs; restecg; thalach; exang; oldpeak; slope; ca; thal |
| AID | 120 | 6 | 2 | temperature; occurrence of nausea; lumbar pain; urine pushing; micturition pains; burning of urethra, itch, swelling of urethra outlet |
Accuracy comparisons of SLP in PD and PPCD in ED on WBCD.
| Output/Target | Class 1 | Class 2 | Overall | |
|---|---|---|---|---|
| SLP(PD) | Class 1 | 426(62.3%) | 18(2.6%) | 96.0% |
| Class 2 | 8(1.2%) | 231(33.8%) | 96.7% | |
| Overall | 98.2% | 92.8% | 96.2% | |
| PPCD(ED) | Class 1 | 423(61.9%) | 21(3.1%) | 95.3% |
| Class 2 | 9(1.3%) | 230(33.7%) | 96.2% | |
| Overall | 97.9% | 91.6% | 95.6% | |
Accuracy comparisons of SLP in PD and PPCD in ED for NRPO of AID.
| Output/Target | Class 1 | Class 2 | Overall | |
|---|---|---|---|---|
| SLP(PD) | Class 1 | 48(52.2%) | 2(1.7%) | 96.0% |
| Class 2 | 6(3%) | 64(42.4%) | 91.4% | |
| Overall | 88.9% | 97% | 93.3% | |
| PPCD(ED) | Class 1 | 46(52.2%) | 4(1.7%) | 92% |
| Class 2 | 6(4.4%) | 64(41.8%) | 91.4% | |
| Overall | 88.5% | 94.1% | 91.7% | |
Runtime comparisons of PPCD in ED and SLP in PD.
| Dataset | Phase | PPCD(s) | SLP(s) |
|---|---|---|---|
| Breast cancer | Data encryption | 6.125 | --- |
| Model training | 2993.100 | 0.012 | |
| Disease predicting | 0.098 | 0.005 | |
| Heart disease | Data encryption | 3.259 | ---- |
| Model training | 1860.505 | 0.010 | |
| Disease predicting | 0.145 | 0.002 | |
| AID(UIB) | Data encryption | 1.564 | --- |
| Model training | 743.875 | 0.010 | |
| Disease predicting | 0.143 | 0.001 | |
| AID(NRPO) | Data encryption | 1.467 | --- |
| Model training | 683.387 | 0.080 | |
| Disease predicting | 0.148 | 0.001 |
Note: "---" means not available.
Accuracy comparisons of SLP in PD and PPCD in ED on HDD.
| Output/Target | Class 1 | Class 2 | Overall | |
|---|---|---|---|---|
| SLP(PD) | Class 1 | 155(52.2%) | 5(1.7%) | 96.9% |
| Class 2 | 11(3.7%) | 126(42.4%) | 92.0% | |
| Overall | 93.4% | 96.2% | 94.6% | |
| PPCD(ED) | Class 1 | 155(52.2%) | 5(1.7%) | 96.9% |
| Class 2 | 13(4.4%) | 124(41.8%) | 90.5% | |
| Overall | 92.3% | 96.1% | 93.9% | |
Accuracy comparisons of SLP in PD and PPCD in ED for IUB of AID.
| Output/Target | Class 1 | Class 2 | Overall | |
|---|---|---|---|---|
| SLP(PD) | Class 1 | 57(47.5%) | 2(1.7%) | 96.7% |
| Class 2 | 6(5%) | 55(45.8%) | 90.2% | |
| Overall | 90.5% | 96.5% | 93.3% | |
| PPCD(ED) | Class 1 | 55(45.8%) | 4(3.3%) | 93.2% |
| Class 2 | 5(4.2%) | 56(46.7%) | 91.8% | |
| Overall | 91.7% | 93.3% | 92.5% | |