
An effective and efficient approach for manually improving geocoded data.

Daniel W Goldberg, John P Wilson, Craig A Knoblock, Beate Ritz, Myles G Cockburn.

Abstract

BACKGROUND: The process of geocoding produces output coordinates of varying quality. Previous studies have shown that simply excluding records with low-quality geocodes from analysis can introduce significant bias, but, depending on the number and severity of the inaccuracies, including them may also lead to bias. Little quantitative research has been presented on the cost or effectiveness of correcting geocodes through manual interactive processes, so the most cost-effective methods for improving geocoded data are unclear. The present work investigates the time and effort required to correct geocodes in five health-related datasets that are representative of data commonly used in health GIS.
RESULTS: Geocode correction was attempted on five health-related datasets containing a total of 22,317 records. The complete processing of these data took 11.4 weeks (427 hours), averaging 69 seconds of processing time per record. Overall, the geocodes associated with 12,280 records (55%) were successfully improved, taking 95 seconds of processing time per corrected record on average across all five datasets. Geocode correction improved the overall match rate (the number of successful matches out of the total attempted) from 79.3% to 95%. The spatial shift between the locations of the original successfully matched geocodes and their corrected counterparts averaged 9.9 km per corrected record. After geocode correction, the numbers of city-accuracy and USPS ZIP code-accuracy geocodes fell from 10,959 and 1,031 to 6,284 and 200, respectively, while the number of building centroid-accuracy geocodes rose from 0 to 2,261.
CONCLUSION: The results indicate that manual geocode correction using a web-based interactive approach is a feasible and cost-effective method for improving the quality of geocoded data. The level of effort required varies with the type of data geocoded. These results can be used to choose between data improvement options (e.g., manual intervention, pseudocoding/geo-imputation, field GPS readings).
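
As a quick sanity check on the arithmetic in the RESULTS paragraph, the sketch below recomputes the average processing time per record and the share of records improved from the reported totals. The constant and function names are illustrative only and do not come from the paper; the inputs are the figures quoted in the abstract.

```python
# Figures quoted in the RESULTS paragraph of the abstract.
TOTAL_RECORDS = 22_317      # records across the five datasets
IMPROVED_RECORDS = 12_280   # records whose geocodes were improved
TOTAL_HOURS = 427           # total processing time in hours


def seconds_per_record(hours: float, records: int) -> float:
    """Average processing time per record, in seconds."""
    return hours * 3600 / records


def percent(part: int, whole: int) -> float:
    """Share of `part` in `whole`, as a percentage."""
    return 100 * part / whole


avg_seconds = seconds_per_record(TOTAL_HOURS, TOTAL_RECORDS)
improved_pct = percent(IMPROVED_RECORDS, TOTAL_RECORDS)

print(f"{avg_seconds:.0f} s per record")
print(f"{improved_pct:.0f}% of records improved")
```

Both values round to the figures given in the abstract (69 seconds and 55%).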

Year: 2008   PMID: 19032791   PMCID: PMC2612650   DOI: 10.1186/1476-072X-7-60

Source DB: PubMed   Journal: Int J Health Geogr   ISSN: 1476-072X   Impact factor: 3.918


