Literature DB >> 24395116

Imputation of confidential data sets with spatial locations using disease mapping models.

Thais Paiva1, Avishek Chakraborty, Jerry Reiter, Alan Gelfand.   

Abstract

Data that include fine geographic information, such as census tract or street block identifiers, can be difficult to release as public use files. Fine geography provides information that ill-intentioned data users can use to identify individuals. We propose to release data with simulated geographies, so as to enable spatial analyses while reducing disclosure risks. We fit disease mapping models that predict areal-level counts from attributes in the file and sample new locations based on the estimated models. We illustrate this approach using data on causes of death in North Carolina, including evaluations of the disclosure risks and analytic validity that can result from releasing synthetic geographies.
Copyright © 2014 John Wiley & Sons, Ltd.

Entities:  

Keywords:  confidentiality; disclosure; geography; imputation; synthetic

Mesh:

Year:  2014        PMID: 24395116      PMCID: PMC4008679          DOI: 10.1002/sim.6078

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  9 in total

Review 1.  Geographically masking health data to preserve confidentiality.

Authors:  M P Armstrong; G Rushton; D L Zimmerman
Journal:  Stat Med       Date:  1999-03-15       Impact factor: 2.373

2.  MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Authors:  Hao Wang; Jerome P Reiter
Journal:  Ann Appl Stat       Date:  2012-03-01       Impact factor: 2.083

3.  Disease mapping and spatial regression with count data.

Authors:  Jon Wakefield
Journal:  Biostatistics       Date:  2006-06-29       Impact factor: 5.899

4.  Confidentiality and spatially explicit data: concerns and challenges.

Authors:  Leah K VanWey; Ronald R Rindfuss; Myron P Gutmann; Barbara Entwisle; Deborah L Balk
Journal:  Proc Natl Acad Sci U S A       Date:  2005-10-17       Impact factor: 11.205

5.  Empirical Bayes estimates of age-standardized relative risks for use in disease mapping.

Authors:  D Clayton; J Kaldor
Journal:  Biometrics       Date:  1987-09       Impact factor: 2.571

6.  A multiple imputation approach to disclosure limitation for high-age individuals in longitudinal studies.

Authors:  Di An; Roderick J A Little; James W McNally
Journal:  Stat Med       Date:  2010-07-30       Impact factor: 2.373

7.  Improving the performance of predictive process modeling for large datasets.

Authors:  Andrew O Finley; Huiyan Sang; Sudipto Banerjee; Alan E Gelfand
Journal:  Comput Stat Data Anal       Date:  2009-06-15       Impact factor: 1.681

8.  Gaussian predictive process models for large spatial data sets.

Authors:  Sudipto Banerjee; Alan E Gelfand; Andrew O Finley; Huiyan Sang
Journal:  J R Stat Soc Series B Stat Methodol       Date:  2008-09-01       Impact factor: 4.488

9.  Estimating Identification Disclosure Risk Using Mixed Membership Models.

Authors:  Daniel Manrique-Vallier; Jerome P Reiter
Journal:  J Am Stat Assoc       Date:  2012-12-01       Impact factor: 5.033

  9 in total
  4 in total

1.  Protecting Confidentiality in Cancer Registry Data With Geographic Identifiers.

Authors:  Mandi Yu; Jerome Phillip Reiter; Li Zhu; Benmei Liu; Kathleen A Cronin; Eric J Rocky Feuer
Journal:  Am J Epidemiol       Date:  2017-07-01       Impact factor: 4.897

2.  Selecting Optimal Subset to release under Differentially Private M-estimators from Hybrid Datasets.

Authors:  Meng Wang; Zhanglong Ji; Hyeon-Eui Kim; Shuang Wang; Li Xiong; Xiaoqian Jiang
Journal:  IEEE Trans Knowl Data Eng       Date:  2017-11-14       Impact factor: 6.977

3.  Confidentiality considerations for use of social-spatial data on the social determinants of health: Sexual and reproductive health case study.

Authors:  Danielle F Haley; Stephen A Matthews; Hannah L F Cooper; Regine Haardörfer; Adaora A Adimora; Gina M Wingood; Michael R Kramer
Journal:  Soc Sci Med       Date:  2016-08-08       Impact factor: 4.634

4.  A Geoprivacy by Design Guideline for Research Campaigns That Use Participatory Sensing Data.

Authors:  Ourania Kounadi; Bernd Resch
Journal:  J Empir Res Hum Res Ethics       Date:  2018-04-23       Impact factor: 1.742

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.