| Literature DB >> 28683736 |
Takako Takai-Igarashi1, Kengo Kinoshita2,3, Masao Nagasaki2, Soichi Ogishima2, Naoki Nakamura2,4,5, Sachiko Nagase2,4, Satoshi Nagaie2, Tomo Saito2, Fuji Nagami2,4,5, Naoko Minegishi2,4, Yoichi Suzuki2,4, Kichiya Suzuki2,4,5, Hiroaki Hashizume2, Shinichi Kuriyama2,4,6, Atsushi Hozawa2,4, Nobuo Yaegashi2,4,5, Shigeo Kure2,4,5, Gen Tamiya2, Yoshio Kawaguchi2, Hiroshi Tanaka2, Masayuki Yamamoto7,8.
Abstract
BACKGROUND: With the goal of realizing genome-based personalized healthcare, we have developed a biobank that integrates personal health, genome, and omics data along with biospecimens donated by volunteers of 150,000. Such a large-scale of data integration involves obvious risks of privacy violation. The research use of personal genome and health information is a topic of global discussion with regard to the protection of privacy while promoting scientific advancement. The present paper reports on our plans, current attempts, and accomplishments in addressing security problems involved in data sharing to ensure donor privacy while promoting scientific advancement.Entities:
Keywords: Biobank; Data sharing policy; Personal genome data; Personal health data; Personalized healthcare; Privacy violation risk; Remote access; Security policy
Mesh:
Year: 2017 PMID: 28683736 PMCID: PMC5501115 DOI: 10.1186/s12911-017-0494-5
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Security categories defined by the TMM data sharing policy and attributes of the categorized dataset
| Security categorya | Datasetb | Minimum necessaryc | Identifiabilityd | Research usee | Remote accessf |
|---|---|---|---|---|---|
| Very strong | Entire dataset | No | Significantly high | No | No |
| Strong | Extracted dataset | Yes | Significantly high | Yes | Yes, under security control, data transfer is not allowed |
| Standard | Extracted dataset | Yes | Negligible | Yes | Yes, under security control, data transfer is allowed |
| Open | Extracted dataset | No | No | Yes | Yes |
aSecurity category defined by the TMM data sharing policy, bWhole or part of sharable dataset in the TMM biobank, cApplicability of the minimum necessary principle, dStrength of identifiability, eAllowance of research use, fAvailability of remote access
Fig. 1Segmented data storage in the TMM supercomputer. Segments A, B, C, and D are assigned to Open, Standard, Strong, and Very Strong data, respectively, in accordance with the security classification defined in the TMM data sharing policy. Segment A is open to the public and is freely accessible with no restriction. Segment B allows remote access and data export under the required security controls. Segment C allows remote access from security rooms equipped with all required security countermeasures. Segment D is rigorously closed and is prohibited from any remote access
Fig. 2Data processing operations. Biospecimens and health data are collected with Cohort IDs. The De-identification process converts Cohort IDs into Biobank IDs in the case of biospecimens, while the process converts Cohort IDs into QA_test IDs in the case of health data. The TMM biobank stores biospecimens with Biobank IDs and health data with QA_test IDs. Genome and omics data are analyzed and stored with Biobank IDs. After the approval of a research application, datasets of minimum necessary data are extracted from the TMM biobank. The extracted datasets are combined and anonymized by ID conversion into Distribution IDs. The combined dataset is then used and shared with Distribution IDs for the approved research application
Fig. 3Network configuration. The network configuration consists of three types of closed sub-networks; A) a network for identifiable data, B) a network for de-identified data, and C) a network for shared data. Only de-identification or anonymization actions can traverse between network A and network B or network B and network C, respectively. The two actions require off-line data transfers using portable devices with biometric authentication. VPN connections enable remote access from distant places of recruitment (community support centers and clinics) and data sharing (remote security rooms). An IP-VPN or wireless IP-VPN with a one-time password system meets the technological requirements of the security policy. The TMM biobank does not allow any remote access