
Evaluating predictors of geographic area population size cut-offs to manage re-identification risk.

Khaled El Emam, Ann Brown, Philip AbdelMalik.

Abstract

OBJECTIVE: In public health and health services research, the inclusion of geographic information in data sets is critical. Because of concerns over the re-identification of patients, data from small geographic areas are either suppressed or the geographic areas are aggregated into larger ones. Our objective is to estimate the population size cut-off at which a geographic area is sufficiently large so that no data suppression or further aggregation is necessary.
DESIGN: The 2001 Canadian census data were used to conduct a simulation to model the relationship between geographic area population size and uniqueness for some common demographic variables. Cut-offs were computed for geographic area population size, and prediction models were developed to estimate the appropriate cut-offs.
MEASUREMENTS: Re-identification risk was measured using uniqueness. Geographic area population size cut-offs were estimated using the maximum number of possible values in the data set and a traditional entropy measure.
RESULTS: The model that predicted population cut-offs using the maximum number of possible values in the data set had R² values around 0.9, and relative error of prediction less than 0.02 across all regions of Canada. The models were then applied to assess the appropriate geographic area size for the prescription records provided by retail and hospital pharmacies to commercial research and analysis firms.
CONCLUSIONS: To manage re-identification risk, the prediction models can be used by public health professionals, health researchers, and research ethics boards to decide when the geographic area population size is sufficiently large.
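
The quantities named under MEASUREMENTS can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy records, and the choice to count distinct values from the observed data (rather than from a predefined variable domain) are all assumptions made for illustration. Uniqueness is taken here as the fraction of records whose combination of demographic values occurs exactly once in the area, and entropy as the Shannon entropy of the joint distribution of those values.

```python
from collections import Counter
from math import log2, prod


def uniqueness(records):
    """Fraction of records whose value combination occurs exactly once."""
    counts = Counter(records)
    unique = sum(1 for c in counts.values() if c == 1)
    return unique / len(records)


def max_combinations(records):
    """Product of the number of distinct observed values per variable.

    Assumption: stands in for the 'maximum number of possible values';
    a real analysis might use each variable's full domain instead.
    """
    columns = zip(*records)
    return prod(len(set(col)) for col in columns)


def entropy(records):
    """Shannon entropy (in bits) of the joint distribution of the records."""
    n = len(records)
    return -sum((c / n) * log2(c / n) for c in Counter(records).values())


# Hypothetical toy area: (age group, sex) for six residents.
area = [
    ("30-39", "F"), ("30-39", "F"), ("40-49", "M"),
    ("40-49", "F"), ("50-59", "M"), ("30-39", "M"),
]

print(uniqueness(area))        # share of residents unique on (age group, sex)
print(max_combinations(area))  # 3 age groups x 2 sexes
print(entropy(area))
```

In this toy area four of the six residents are unique on the two variables, which is the kind of small-area risk that suppression or aggregation is meant to reduce.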

Year:  2008        PMID: 19074299      PMCID: PMC2649314          DOI: 10.1197/jamia.M2902

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


