| Literature DB >> 32867839 |
James Gow1,2, Colin Moffatt3,4, Jamie Blackport4.
Abstract
BACKGROUND: Rare disease patients often struggle to find both medical advice and emotional support for their diagnosis. Consequently, many rare disease patient support forums have appeared on hospital webpages, social media sites, and on rare disease foundation sites. However, we argue that engagement in these groups may pose a healthcare data privacy threat to many participants, since it makes a series of patient indirect identifiers 'readily available' in combination with rare disease conditions. This information produces a risk of re-identification because it may allow a motivated attacker to use the unique combination of a patient's identifiers and disease condition to re-identify them in anonymized data.Entities:
Keywords: HIPAA; Healthcare; Privacy; Rare disease; Support forums
Mesh:
Year: 2020 PMID: 32867839 PMCID: PMC7457524 DOI: 10.1186/s13023-020-01497-3
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
Fig. 4Tabulated Example of the Collected Patient Direct and Indirect Identifiers. The left column represents the direct or indirect identifier collected and the right column provides an example of how a structured record may appear for a sample patient
Fig. 1Group Size for Analyzed Patients. The x axis represents the number of people in the US population that share the patient’s combination of indirect identifiers. The y axis is the number of patients in this study that were a part of a group of that size. Highlighted in red are the two group sizes which constitute a high risk for re-identification
Fig. 2High-risk Patients by Disease Condition. The x axis represents the patient’s rare disease condition and the y axis represents what percentage of patients with the condition were in high-risk groups. For conditions with 100% high-risk patients the bars are coloured red, for 90% they are orange, for 80% they are yellow, and for <50% they are coloured blue
Fig. 3Scatter Plot of Patient Group Size and Age/Sex/ZIP Group Size. The x axis is the Log base 10 value of the number of people in the US that share each patient’s age, sex, and 3-digit ZIP as was collected. The y axis is the Log base 10 of their group size after incorporating the role of disease condition. Rare disease patients are shown in red and the control group of non-rare disease (multiple sclerosis) patients are in black. The horizontal line is at the value 0.301 which equals the Log base 10 of 5. The linear regression line with R-squared equal to 0.7695 for the rare disease patients is also shown