| Literature DB >> 34928973 |
Yongha Son1, Kyoohyung Han1, Yong Seok Lee2, Jonghan Yu3, Young-Hyuck Im4, Soo-Yong Shin5,6.
Abstract
Protecting patients' privacy is one of the most important tasks when developing medical artificial intelligence models since medical data is the most sensitive personal data. To overcome this privacy protection issue, diverse privacy-preserving methods have been proposed. We proposed a novel method for privacy-preserving Gated Recurrent Unit (GRU) inference model using privacy enhancing technologies including homomorphic encryption and secure two party computation. The proposed privacy-preserving GRU inference model validated on breast cancer recurrence prediction with 13,117 patients' medical data. Our method gives reliable prediction result (0.893 accuracy) compared to the normal GRU model (0.895 accuracy). Unlike other previous works, the experiment on real breast cancer data yields almost identical results for privacy-preserving and conventional cases. We also implement our algorithm to shows the realistic end-to-end encrypted breast cancer recurrence prediction.Entities:
Mesh:
Year: 2021 PMID: 34928973 PMCID: PMC8687538 DOI: 10.1371/journal.pone.0260681
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A RNN layer.
Sequential RNN units.
Fig 2Evaluations of sigmoid with degree 16 approximations.
A: Approx. on [−8, 8]. B: Approx. on [−32, 32]. C: Approx. on [−8, 8] with input substitution.
Fig 3Workflow of GRU layer evaluation.
Benchmarks for one privacy-preserving GRU cell evaluations.
| Linear | Non-linear | Input Adjustment | |||||
|---|---|---|---|---|---|---|---|
| Latency | Comm. | Latency | Comm. | Latency | Comm. | ||
| Input: 70 | Single query | 91 | - | 338 | 3.22 | 50 | 2.50 |
| 32-Batch query | 245 | - | 338 | 3.22 | 977 | 27.69 | |
| Input: 32 | Single query | 48 | - | 338 | 3.22 | 40 | 2.19 |
| 32-Batch query | 245 | - | 338 | 3.22 | 632 | 17.94 | |
Runtimes are in milliseconds and comm. in MB
Fig 4Our GRU model for predicting breast cancer recurrence.
Performance of privacy-preserving inference of our cancer prediction model.
| Encryption | Inference | Decryption | Total | C-index (Plain) | |||||
|---|---|---|---|---|---|---|---|---|---|
| Latency | Comm. | Latency | Comm. | Latency | Comm. | Latency | Comm. | ||
| Single query | 57 | 0.16 | 4889 | 57.10 | < 1 | 0.08 | 4947 | 57.34 | 0.893 (0.895) |
| 32-Batch query | 57 | 0.16 | 13800 | 267.12 | < 1 | 0.08 | 13858 | 267.36 | |
Runtimes are in milliseconds and comm. in MB. The inference costs heavily depends on the number of time records, and table represents costs for average of the number of records (about 5.13). The accuracy measured by C-index remains almost same with plain inference (Plain).