| Literature DB >> 25794882 |
Khaled El Emam1, Sam Rodgers2, Bradley Malin3.
Abstract
Entities:
Mesh:
Year: 2015 PMID: 25794882 PMCID: PMC4707567 DOI: 10.1136/bmj.h1139
Source DB: PubMed Journal: BMJ ISSN: 0959-8138

Extent of anonymisation that needs to be applied for different types of data releases balanced against other controls
Probability of re-identification of anonymised data in BORN (Ontario birth registry dataset) for various combinations of quasi-identifiers
| Mother’s date of birth | Baby’s date of birth | Mother’s postal code | Baby’s sex | Probability of re‑identification* |
|---|---|---|---|---|
| X | 0.014 | |||
| X | 0.005 | |||
| X | X | 0.88 | ||
| X | X | X | 1.00 | |
| X | X | X | X | 1.00 |
| X | X | X | 0.91 | |
| X | X | 0.98 | ||
| X | X | 0.85 | ||
| X | 0.19 |
X indicates that a variable is included in the calculation of probability.
*Probability was measured using the average re-identification risk metric defined elsewhere.7
Changes in probability of re-identification of anonymised data in BORN (Ontario birth registry dataset) for different levels of generalisation of quasi-identifiers
| Scenario | Mother’s date of birth or age | Baby’s date of birth | Mother’s postal code | Baby’s sex | Probability of re‑identification* |
|---|---|---|---|---|---|
| S1 | Year | day, month, year | 3 character | Unchanged | 0.973 |
| S2 | Year | month, year | 3 character | Unchanged | 0.677 |
| S3 | Age in 5-year groups | month, year | 3 character | Unchanged | 0.327 |
| S4 | Age ≤19, 20-30, 30-40, >40 | month, year | 3 character | Unchanged | 0.23 |
| S5 | Age ≤19, 20-30, 30-40, >40 | month, year | 1 character | Unchanged | 0.007 |
| S6 | Year | month, year | 1 character | Unchanged | 0.034 |
| S7 | Age in 5-year groups | quarter, year | 3 character | Unchanged | 0.152 |
| S8 | Age ≤19, 20-30, 30-40, >40 | quarter, year | 3 character | Unchanged | 0.1 |
*Probability was measured using the average re-identification risk metric defined elsewhere.7