OBJECTIVE: Geocoding and characterizing geographic, community, and environmental characteristics of study participants is frequently done in epidemiological studies. However, participant addresses are identifiable protected health information (PHI) and geocoding must be conducted in a Health Insurance Portability and Accountability Act-compliant manner. Our objective was to create a software application for this process that addresses limitations in current approaches. MATERIALS AND METHODS: We used a containerization platform to create DeGAUSS (Decentralized Geomarker Assessment for Multi-Site Studies), a software application that facilitates reproducible geocoding and geomarker assessment while maintaining the confidentiality of PHI. To validate the software, 215 350 addresses in Hamilton County, Ohio, were geocoded using DeGAUSS, ArcGIS, Google, and SAS and compared to a gold-standard approach. We distributed the DeGAUSS software to sites in an ongoing multisite study (Electronic Medical Records and Genomics, or eMERGE), and individual sites independently geocoded and assigned median census tract-level income and distance to nearest major roadway to their participants' addresses, removed associated PHI, and returned deidentified data. RESULTS: Within a multisite study, 52 244 study participants' addresses across 5 sites were geocoded with a median distance to roadway of 10 022m and a median census tract income of $57 266, demonstrating the feasibility of DeGAUSS within a multisite study. Compared to other commonly used geocoding platforms, DeGAUSS had similar geocoding and geomarker assessment accuracies. CONCLUSION: The open source DeGAUSS software overcomes multiple challenges in the use of address data in multisite studies and also serves as a more general reproducible research tool for geocoding and geomarker assessment.
OBJECTIVE: Geocoding and characterizing geographic, community, and environmental characteristics of study participants is frequently done in epidemiological studies. However, participant addresses are identifiable protected health information (PHI) and geocoding must be conducted in a Health Insurance Portability and Accountability Act-compliant manner. Our objective was to create a software application for this process that addresses limitations in current approaches. MATERIALS AND METHODS: We used a containerization platform to create DeGAUSS (Decentralized Geomarker Assessment for Multi-Site Studies), a software application that facilitates reproducible geocoding and geomarker assessment while maintaining the confidentiality of PHI. To validate the software, 215 350 addresses in Hamilton County, Ohio, were geocoded using DeGAUSS, ArcGIS, Google, and SAS and compared to a gold-standard approach. We distributed the DeGAUSS software to sites in an ongoing multisite study (Electronic Medical Records and Genomics, or eMERGE), and individual sites independently geocoded and assigned median census tract-level income and distance to nearest major roadway to their participants' addresses, removed associated PHI, and returned deidentified data. RESULTS: Within a multisite study, 52 244 study participants' addresses across 5 sites were geocoded with a median distance to roadway of 10 022m and a median census tract income of $57 266, demonstrating the feasibility of DeGAUSS within a multisite study. Compared to other commonly used geocoding platforms, DeGAUSS had similar geocoding and geomarker assessment accuracies. CONCLUSION: The open source DeGAUSS software overcomes multiple challenges in the use of address data in multisite studies and also serves as a more general reproducible research tool for geocoding and geomarker assessment.
Authors: Toshifumi Yodoshi; Sarah Orkin; Ana-Catalina Arce Clachar; Kristin Bramlage; Qin Sun; Lin Fei; Andrew F Beck; Stavra A Xanthakos; Andrew T Trout; Marialena Mouzaki Journal: J Pediatr Date: 2020-08 Impact factor: 4.406
Authors: Cole Brokamp; Andrew F Beck; Neera K Goyal; Patrick Ryan; James M Greenberg; Eric S Hall Journal: Ann Epidemiol Date: 2018-11-29 Impact factor: 3.797
Authors: Sarah Orkin; Cole Brokamp; Toshifumi Yodoshi; Andrew T Trout; Chunyan Liu; Syeda Meryum; Stuart Taylor; Christopher Wolfe; Rachel Sheridan; Aradhna Seth; Mohammad Alfrad Nobel Bhuiyan; Sanita Ley; Ana Catalina Arce-Clachar; Kristin Bramlage; Robert Kahn; Stavra Xanthakos; Andrew F Beck; Marialena Mouzaki Journal: J Pediatr Gastroenterol Nutr Date: 2020-03 Impact factor: 2.839
Authors: Patrick H Ryan; Cole Brokamp; Jeff Blossom; Nathan Lothrop; Rachel L Miller; Paloma I Beamer; Cynthia M Visness; Antonella Zanobetti; Howard Andrews; Leonard B Bacharier; Tina Hartert; Christine C Johnson; Dennis Ownby; Robert F Lemanske; Heike Gibson; Weeberb Requia; Brent Coull; Edward M Zoratti; Anne L Wright; Fernando D Martinez; Christine M Seroogy; James E Gern; Diane R Gold Journal: J Clin Transl Sci Date: 2021-02-05
Authors: Sharad I Wadhwani; Cole Brokamp; Erika Rasnick; John C Bucuvalas; Jennifer C Lai; Andrew F Beck Journal: Am J Transplant Date: 2020-08-04 Impact factor: 8.086