Literature DB >> 28453646

Protecting Confidentiality in Cancer Registry Data With Geographic Identifiers.

Mandi Yu, Jerome Phillip Reiter, Li Zhu, Benmei Liu, Kathleen A Cronin, Eric J Rocky Feuer.   

Abstract

The National Cancer Institute's Surveillance, Epidemiology, and End Results Program releases research files of cancer registry data. These files include geographic information at the county level, but no finer. Access to finer geography, such as census tract identifiers, would enable richer analyses-for example, examination of health disparities across neighborhoods. To date, tract identifiers have been left off the research files because they could compromise the confidentiality of patients' identities. We present an approach to inclusion of tract identifiers based on multiply imputed, synthetic data. The idea is to build a predictive model of tract locations, given patient and tumor characteristics, and randomly simulate the tract of each patient by sampling from this model. For the predictive model, we use multivariate regression trees fitted to the latitude and longitude of the population centroid of each tract. We implement the approach in the registry data from California. The method results in synthetic data that reproduce a wide range (but not all) of analyses of census tract socioeconomic cancer disparities and have relatively low disclosure risks, which we assess by comparing individual patients' actual and synthetic tract locations. We conclude with a discussion of how synthetic data sets can be used by researchers with cancer registry data. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Entities:  

Keywords:  Surveillance, Epidemiology, and End Results Program; breast cancer; classification and regression trees; health disparities; multiple imputation; partial synthetic data

Mesh:

Year:  2017        PMID: 28453646      PMCID: PMC5860429          DOI: 10.1093/aje/kwx050

Source DB:  PubMed          Journal:  Am J Epidemiol        ISSN: 0002-9262            Impact factor:   4.897


  15 in total

Review 1.  Geographically masking health data to preserve confidentiality.

Authors:  M P Armstrong; G Rushton; D L Zimmerman
Journal:  Stat Med       Date:  1999-03-15       Impact factor: 2.373

Review 2.  Geocoding and monitoring of US socioeconomic inequalities in mortality and cancer incidence: does the choice of area-based measure and geographic level matter?: the Public Health Disparities Geocoding Project.

Authors:  Nancy Krieger; Jarvis T Chen; Pamela D Waterman; Mah-Jabeen Soobader; S V Subramanian; Rosa Carson
Journal:  Am J Epidemiol       Date:  2002-09-01       Impact factor: 4.897

3.  Mapping health data: improved privacy protection with donut method geomasking.

Authors:  Kristen H Hampton; Molly K Fitch; William B Allshouse; Irene A Doherty; Dionne C Gesink; Peter A Leone; Marc L Serre; William C Miller
Journal:  Am J Epidemiol       Date:  2010-09-03       Impact factor: 4.897

4.  MULTIPLE IMPUTATION FOR SHARING PRECISE GEOGRAPHIES IN PUBLIC USE DATA.

Authors:  Hao Wang; Jerome P Reiter
Journal:  Ann Appl Stat       Date:  2012-03-01       Impact factor: 2.083

5.  Multiple-Shrinkage Multinomial Probit Models with Applications to Simulating Geographies in Public Use Data.

Authors:  Lane F Burgette; Jerome P Reiter
Journal:  Bayesian Anal       Date:  2013-06-01       Impact factor: 3.728

6.  Imputation of confidential data sets with spatial locations using disease mapping models.

Authors:  Thais Paiva; Avishek Chakraborty; Jerry Reiter; Alan Gelfand
Journal:  Stat Med       Date:  2014-01-07       Impact factor: 2.373

7.  Geographic disparities in late-stage breast cancer diagnosis in California.

Authors:  Tzy-Mey Kuo; Lee R Mobley; Luc Anselin
Journal:  Health Place       Date:  2010-11-26       Impact factor: 4.078

8.  Rural - urban inequalities in late-stage breast cancer: spatial and social dimensions of risk and access.

Authors:  Sara McLafferty; Fahui Wang; Lan Luo; Jared Butler
Journal:  Environ Plann B Plann Des       Date:  2011-08

Review 9.  Cancer disparities by race/ethnicity and socioeconomic status.

Authors:  Elizabeth Ward; Ahmedin Jemal; Vilma Cokkinides; Gopal K Singh; Cheryll Cardinez; Asma Ghafoor; Michael Thun
Journal:  CA Cancer J Clin       Date:  2004 Mar-Apr       Impact factor: 508.702

10.  Detecting an association between socioeconomic status and late stage breast cancer using spatial analysis and area-based measures.

Authors:  Jill Amlong MacKinnon; Robert C Duncan; Youjie Huang; David J Lee; Lora E Fleming; Lydia Voti; Mark Rudolph; James D Wilkinson
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2007-04       Impact factor: 4.254

View more
  4 in total

1.  Using mobile location data in biomedical research while preserving privacy.

Authors:  Daniel M Goldenholz; Shira R Goldenholz; Kaarkuzhali B Krishnamurthy; John Halamka; Barbara Karp; Matthew Tyburski; David Wendler; Robert Moss; Kenzie L Preston; William Theodore
Journal:  J Am Med Inform Assoc       Date:  2018-10-01       Impact factor: 4.497

Review 2.  Causes of Socioeconomic Disparities in Colorectal Cancer and Intervention Framework and Strategies.

Authors:  John M Carethers; Chyke A Doubeni
Journal:  Gastroenterology       Date:  2019-11-01       Impact factor: 22.682

3.  Multilevel mediation analysis on time-to-event outcomes: Exploring racial/ethnic disparities in breast cancer survival in California.

Authors:  Qingzhao Yu; Mandi Yu; Joe Zou; Xiaocheng Wu; Scarlett L Gomez; Bin Li
Journal:  Res Methods Med Health Sci       Date:  2021-11-20

4.  A Multilevel Approach to Investigate Relationships Between Healthcare Resources and Lung Cancer.

Authors:  Darryl Somayaji; Young S Seo; Gregory E Wilding; Ekaterina Noyes
Journal:  Nurs Res       Date:  2022-05-05       Impact factor: 2.364

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.