| Literature DB >> 31389335 |
Ho Heon Kim1, Bora Kim2, Segyeong Joo3,4, Soo-Yong Shin5,6, Hyo Soung Cha7, Yu Rang Park1.
Abstract
BACKGROUND: There has been significant effort in attempting to use health care data. However, laws that protect patients' privacy have restricted data use because health care data contain sensitive information. Thus, discussions on privacy laws now focus on the active use of health care data beyond protection. However, current literature does not clarify the obstacles that make data usage and deidentification processes difficult or elaborate on users' needs for data linking from practical perspectives.Entities:
Keywords: data anonymization; data linking; data protection; data sharing; health care data demand; privacy act
Mesh:
Year: 2019 PMID: 31389335 PMCID: PMC6701164 DOI: 10.2196/14126
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Profile of online survey respondents (N=118).
| Characteristics | Respondents, n (%) | |
| 20-29 | 21 (17.8) | |
| 30-39 | 56 (47.5) | |
| 40-49 | 34 (28.8) | |
| 50-59 | 5 (4.2) | |
| Other | 2 (1.7) | |
| General hospital | 62 (52.5) | |
| University | 51 (43.2) | |
| Industry | 15 (12.7) | |
| Academic society | 6 (5.1) | |
| Other | 3 (2.6) | |
| Research | 84 (71.2) | |
| Data analysis | 18 (15.3) | |
| Planning | 11 (9.3) | |
| Device development | 5 (4.2) | |
| ≥10 years | 9 (7.6) | |
| 5 years to <10 years | 32 (27.1) | |
| 1 year to <5 years | 56 (47.5) | |
| <1 year | 21 (17.8) | |
Data needs, obstacles, and developmental proposals for data utilization.
| Characteristics | Respondents, n (%) | Total (N=118), n (%) | |||||
| General hospital (n=62) | University (n=51) | Industry (n=15) | Academic society (n=6) | Other (n=3) | |||
| Clinical data (collected during care process in hospital) | 56 (90.3) | 29 (56.9) | 10 (66.7) | 5 (83.3) | 0 (0.0) | 82 (69.5) | |
| Public data (managed by nation) | 40 (64.5) | 32 (62.7) | 10 (66.7) | 5 (83.3) | 3 (100.0) | 76 (64.4) | |
| Research data (clinical research or trial data) | 38 (61.3) | 33 (64.7) | 3 (2.0) | 5 (83.3) | 0 (0.0) | 61 (51.7) | |
| Life log data (patient generated health data) | 17 (27.4) | 16 (31.4) | 8 (53.3) | 4 (66.7) | 1 (33.3) | 36 (30.5) | |
| Genetic data | 13 (21.0) | 18 (35.3) | 1 (6.7) | 2 (33.3) | 1 (33.3) | 28 (23.7) | |
| Conflict of laws | 30 (48.4) | 19 (37.3) | 8 (53.3) | 2 (33.3) | 0 (0.0) | 53 (44.9) | |
| Data standardization | 24 (38.7) | 23 (45.1) | 5 (33.3) | 3 (50.5) | 3 (100.0) | 50 (42.4) | |
| Strict social recognition | 5 (8.1) | 4 (7.8) | 2 (13.3) | 1 (16.7) | 0 (0.0) | 9 (7.6) | |
| Other | 1 (1.6) | 1 (4.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 2 (1.7) | |
| None | 2 (3.2) | 3 (5.9) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 4 (3.4) | |
| Law revision | 29 (46.8) | 22 (43.1) | 7 (46.7) | 4 (66.7) | 2 (66.7) | 54 (45.8) | |
| Technical measures | 22 (35.5) | 23 (43.1) | 6 (40.0) | 3 (50.0) | 1 (33.3) | 47 (39.8) | |
| Utilization support | 12 (19.4) | 7 (13.7) | 0 (0.0) | 0 (0.0) | 0 (0.0) | 16 (13.6) | |
| Public consensus | 6 (9.7) | 11 (21.6) | 2 (13.3) | 1 (16.7) | 1 (33.3) | 22 (18.6) | |
Comparison of data demand, obstacles, and improvement suggestions between health care data use for commercial and public purposes (N=115). Sample size excludes the n=3 for other minor purposes.
| Measures | Respondents, n (%) | Chi-square (df) | |||
| Public purpose (n=81) | Commercial purpose (n=34) | ||||
| Not applicable | Not applicable | ||||
| Clinical data | 53 (65.4) | 28 (82.4) | |||
| Public data | 53 (65.4) | 23 (67.6) | |||
| Research data | 44 (54.3) | 16 (47.1) | |||
| Life log data | 18 (22.2) | 18 (52.9) | |||
| Genetic data | 17 (21.0) | 9 (26.5) | |||
| 2.9 (4) | .38 | ||||
| Conflict with laws | 38 (46.9) | 13 (38.2) | |||
| Data standardization | 35 (43.2) | 15 (44.1) | |||
| Strict social recognition | 6 (7.4) | 3 (8.8) | |||
| None | 1 (1.2) | 2 (5.9) | |||
| Other | 1 (1.2) | 1 (2.9) | |||
| Not applicable | Not applicable | ||||
| Law revision | 38 (46.9) | 15 (44.1) | |||
| Technical method | 33 (40.7) | 14 (41.2) | |||
| Data utilization support | 15 (18.5) | 1 (2.9) | |||
| Public consensus | 14 (17.3) | 6 (17.6) | |||
Responses about the current state of data deidentification (N=118).
| Measures | Respondents, n (%) | |
| Yes | 101 (85.6) | |
| No | 17 (14.4) | |
| 1 method | 37 (31.4) | |
| 2 methods | 33 (28.0) | |
| 3 methods | 18 (15.3) | |
| 4 methods | 4 (3.4) | |
| 5 methods | 9 (7.6) | |
| Pseudonymization | 72 (71.3) | |
| Masking | 57 (56.4) | |
| Data reduction | 37 (36.6) | |
| Data suppression | 30 (29.7) | |
| Aggregation | 22 (21.8) | |
| Strict social culture | 28 (27.7) | |
| Absence of clear deidentification guideline | 24 (23.8) | |
| Usefulness of deidentified data | 15 (14.9) | |
| Lack of understanding of deidentification policy and technology | 14 (13.9) | |
| Lack of relevant institution support | 11 (10.9) | |
| Lack of deidentification measure for unstructured data | 9 (8.9) | |
Demand for health care data linking.
| Measures | Participants, n (%) | Chi-square (df) | |||||
| Public | Commercial | Other | Total | ||||
| 0.2 (1) | .64 | ||||||
| Required | 67 (82.7) | 30 (88.2) | 1 (33.3) | 98 (83.1) | |||
| Not required | 14 (17.3) | 4 (11.8) | 2 (66.7) | 20 (16.9) | |||
| 3.6 (2) | .16 | ||||||
| Obtain longitudinal data | 39 (58.2) | 23 (76.7) | 0 (0.0) | 62 (63.3) | |||
| Obtain larger number of subjects | 15 (22.4) | 5 (16.7) | 0 (0.0) | 20 (20.4) | |||
| Develop policy predicated on data | 13 (19.4) | 2 (6.7) | 1 (100.0) | 16 (16.3) | |||
| 2.5 (3) | .47 | ||||||
| Deregulation | 22 (32.8) | 11 (36.7) | 0 (0.0) | 33 (33.7) | |||
| Data standardization | 28 (41.8) | 10 (33.3) | 0 (0.0) | 38 (38.8) | |||
| Effective guidelines including procedure, responsibility, and technology | 11 (16.4) | 8 (26.7) | 1 (100.0) | 20 (20.4) | |||
| Improvement of social recognition | 6 (9.0) | 1 (3.3) | 0 (0.0) | 7 (7.1) | |||
| 18.8 (6) | .003 | ||||||
| Development of health care policy | 28 (41.8) | 2 (6.7) | 1 (100.0) | 31 (31.6) | |||
| Development of diagnostic technology | 15 (22.4) | 12 (40.0) | 0 (0.0) | 27 (27.6) | |||
| Development of treatment modality | 12 (17.9) | 4 (13.3) | 0 (0.0) | 16 (16.3) | |||
| General research | 8 (11.9) | 4 (13.3) | 0 (0.0) | 12 (12.2) | |||
| Development of medical device | 2 (3.0) | 6 (20.0) | 0 (0.0) | 8 (8.2) | |||
| Development of new drug | 1 (1.5) | 1 (3.3) | 0 (0.0) | 2 (2.0) | |||
| Other | 1 (1.5) | 1 (3.3) | 0 (0.0) | 2 (2.0) | |||
| Subtotal | 67 (100.0) | 30 (100.0) | 1 (100.0) | 98 (100.0) | |||
aPublic versus commercial.