Literature DB >> 23990852

MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Hao Wang1, Jerome P Reiter.   

Abstract

When releasing data to the public, data stewards are ethically and often legally obligated to protect the confidentiality of data subjects' identities and sensitive attributes. They also strive to release data that are informative for a wide range of secondary analyses. Achieving both objectives is particularly challenging when data stewards seek to release highly resolved geographical information. We present an approach for protecting the confidentiality of data with geographic identifiers based on multiple imputation. The basic idea is to convert geography to latitude and longitude, estimate a bivariate response model conditional on attributes, and simulate new latitude and longitude values from these models. We illustrate the proposed methods using data describing causes of death in Durham, North Carolina. In the context of the application, we present a straightforward tool for generating simulated geographies and attributes based on regression trees, and we present methods for assessing disclosure risks with such simulated data.

Entities:  

Keywords:  Confidentiality; disclosure; dissemination; spatial; synthetic; tree

Year:  2012        PMID: 23990852      PMCID: PMC3753824          DOI: 10.1214/11-AOAS506

Source DB:  PubMed          Journal:  Ann Appl Stat        ISSN: 1932-6157            Impact factor:   2.083


  4 in total

Review 1.  Geographically masking health data to preserve confidentiality.

Authors:  M P Armstrong; G Rushton; D L Zimmerman
Journal:  Stat Med       Date:  1999-03-15       Impact factor: 2.373

2.  MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Authors:  Hao Wang; Jerome P Reiter
Journal:  Ann Appl Stat       Date:  2012-03-01       Impact factor: 2.083

3.  Confidentiality and spatially explicit data: concerns and challenges.

Authors:  Leah K VanWey; Ronald R Rindfuss; Myron P Gutmann; Barbara Entwisle; Deborah L Balk
Journal:  Proc Natl Acad Sci U S A       Date:  2005-10-17       Impact factor: 11.205

4.  Gaussian predictive process models for large spatial data sets.

Authors:  Sudipto Banerjee; Alan E Gelfand; Andrew O Finley; Huiyan Sang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-09-01       Impact factor: 4.488

  4 in total
  9 in total

1.  MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Authors:  Hao Wang; Jerome P Reiter
Journal:  Ann Appl Stat       Date:  2012-03-01       Impact factor: 2.083

2.  Using mobile location data in biomedical research while preserving privacy.

Authors:  Daniel M Goldenholz; Shira R Goldenholz; Kaarkuzhali B Krishnamurthy; John Halamka; Barbara Karp; Matthew Tyburski; David Wendler; Robert Moss; Kenzie L Preston; William Theodore
Journal:  J Am Med Inform Assoc       Date:  2018-10-01       Impact factor: 4.497

3.  Multiple-Shrinkage Multinomial Probit Models with Applications to Simulating Geographies in Public Use Data.

Authors:  Lane F Burgette; Jerome P Reiter
Journal:  Bayesian Anal       Date:  2013-06-01       Impact factor: 3.728

4.  Imputation of confidential data sets with spatial locations using disease mapping models.

Authors:  Thais Paiva; Avishek Chakraborty; Jerry Reiter; Alan Gelfand
Journal:  Stat Med       Date:  2014-01-07       Impact factor: 2.373

5.  Protecting Confidentiality in Cancer Registry Data With Geographic Identifiers.

Authors:  Mandi Yu; Jerome Phillip Reiter; Li Zhu; Benmei Liu; Kathleen A Cronin; Eric J Rocky Feuer
Journal:  Am J Epidemiol       Date:  2017-07-01       Impact factor: 4.897

6.  Confidentiality considerations for use of social-spatial data on the social determinants of health: Sexual and reproductive health case study.

Authors:  Danielle F Haley; Stephen A Matthews; Hannah L F Cooper; Regine Haardörfer; Adaora A Adimora; Gina M Wingood; Michael R Kramer
Journal:  Soc Sci Med       Date:  2016-08-08       Impact factor: 4.634

7.  Multiple Imputation: A Flexible Tool for Handling Missing Data.

Authors:  Peng Li; Elizabeth A Stuart; David B Allison
Journal:  JAMA       Date:  2015-11-10       Impact factor: 56.272

8.  A Geoprivacy by Design Guideline for Research Campaigns That Use Participatory Sensing Data.

Authors:  Ourania Kounadi; Bernd Resch
Journal:  J Empir Res Hum Res Ethics       Date:  2018-04-23       Impact factor: 1.742

Review 9.  Ensuring Confidentiality of Geocoded Health Data: Assessing Geographic Masking Strategies for Individual-Level Data.

Authors:  Paul A Zandbergen
Journal:  Adv Med       Date:  2014-04-29
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.