| Literature DB >> 32708093 |
Alejandro Valencia1, Lisa Stillwell2, Stephen Appold3, Saravanan Arunachalam4, Steven Cox2, Hao Xu2, Charles P Schmitt5, Shepherd H Schurman5, Stavros Garantziotis5, William Xue5, Stanley C Ahalt2, Karamarie Fecho2,6.
Abstract
Environmental exposures have profound effects on health and disease. While public repositories exist for a variety of exposures data, these are generally difficult to access, navigate, and interpret. We describe the research, development, and application of three open application programming interfaces (APIs) that support access to usable, nationwide, exposures data from three public repositories: airborne pollutant estimates from the US Environmental Protection Agency; roadway data from the US Department of Transportation; and socio-environmental exposures from the US Census Bureau's American Community Survey. Three open APIs were successfully developed, deployed, and tested using random latitude/longitude values and time periods as input parameters. After confirming the accuracy of the data, we used the APIs to extract exposures data on 2550 participants from a cohort within the Environmental Polymorphisms Registry (EPR) at the National Institute of Environmental Health Sciences, and we successfully linked the exposure estimates with participant-level data derived from the EPR. We then conducted an exploratory, proof-of-concept analysis of the integrated data for a subset of participants with self-reported asthma and largely replicated our prior findings on the impact of select exposures and demographic factors on asthma exacerbations. Together, the three open exposures APIs provide a valuable resource, with application across environmental and public health fields.Entities:
Keywords: airborne pollutants; application programming interfaces; asthma; asthma exacerbations; demographic factors; environmental health; open data; public health; roadway exposure; socio-economic factors
Mesh:
Substances:
Year: 2020 PMID: 32708093 PMCID: PMC7400024 DOI: 10.3390/ijerph17145243
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 3.390
Figure 1Screenshot of Swagger user interface to Translator Airborne Pollutant Exposures application programming interfaces (API).
Figure 2Example JSON output for Raleigh, North Carolina (latitude, longitude: 35.7796, −78.6382) for the Airborne Pollutant Exposures API (A), Roadway Exposures API (B), and Socio-environmental Exposures API (C). A start and an end date of 01–01–2010 and 01–01–2010 was used for the Airborne Pollutant Exposures API; a default setting of 500 m was used for the Roadway Exposures API; the 2012–2016 survey period was selected for the Socio-environmental Exposures API; and the UTC time zone offset was chosen.
EPR data extraction and integration results for the three open Translator Exposures APIs.
| General Statistics | Comments |
|---|---|
| 4130 total participants | all participants had a self-reported diagnosis of asthma |
| 2550 participants with geocodes | 5 participants had a foreign address; one participant resided in Alaska/Hawaii; 1576 participants were not geocoded due to home addresses that were listed as P.O. boxes or routes, or addresses that were outdated, incomplete, or missing |
| 2550 participants with complete airborne pollutant exposures data | extraction included the number of observations per airborne pollutant per participant |
| 2550 participants with complete roadway data | 954 participants (37.4%) had a primary residence located 500 m or greater from a major roadway (i.e., the maximum distance reported by the API) |
| 2550 participants with complete ACS data | extraction included standard error estimates for each socio-environmental exposure, as those are available via the API as standard ACS metrics |
| 2550 participants with integrated exposures and survey data | integration was at the participant level |
Note: ACS = American Community Survey; EPR = Environmental Polymorphisms Registry.
EPR data on a subset of participants with a self-report of asthma, integrated at the participant-level with data derived from the Translator Exposures APIs.
| Feature Variable | 0 ED or Urgent Care Visits for Asthma Prior 12 Months | 1+ ED or Urgent Care Visits for Asthma Prior 12 Months | TOTAL | |
|---|---|---|---|---|
|
| ||||
| Female | 593 (86.2%) | 95 (13.8%) | 688 (100.0%) | |
| Male | 213 (90.6%) | 22 (9.36%) | 235 (100.0%) | |
|
| ||||
| African American | 179 (77.5%) | 52 (22.5%) | 231 (100.0%) | |
| Caucasian | 562 (90.6%) | 58 (9.35%) | 620 (100.0%) | |
| Other | 65 (90.3%) | 7 (9.72%) | 72 (100.0%) | |
|
| ||||
| Yes | 327 (84.7%) | 59 (15.3%) | 386 (100.0%) | |
| No | 472 (89.1%) | 58 (10.9%) | 530 (100.0%) | |
|
| ||||
| Yes | 345 (85.0%) | 61 (15.0%) | 406 (100.0%) | |
| No | 445 (89.4%) | 53 (10.6%) | 498 (100.0%) | |
|
| ||||
| Bin 1 | 20 (100.0%) | 0 (0.00%) | 20 (100.0%) | |
| Bin 2 | 149 (90.3%) | 16 (9.70%) | 165 (100.0%) | |
|
| 604 (87.0%) | 90 (13.0%) | 694 (100.0%) | |
|
| ||||
| ≤250 m | 332 (87.6%) | 47 (12.4%) | 379 (100.0%) | |
| >250 m | 441 (88.2%) | 59 (11.8%) | 500 (100.0%) | |
|
| ||||
| Bin 1 | 311 (83.2%) | 63 (16.8%) | 374 (100.0%) | |
| Bin 2 | 355 (90.6%) | 37 (9.44%) | 392 (100.0%) | |
| Bin 3 | 107 (94.7%) | 6 (5.31%) | 113 (100.0%) |
Abbreviations: ED, emergency department; PM2.5, particulate matter ≤ 2.5-µm in diameter. * Sample sizes reflect the starting sample size of n = 932 EPR participants with a self-reported diagnosis of asthma, of which n = 923 participants had data on the primary endpoint of self-reported ED or urgent care visits for asthma (one or more in prior 12 months), and n = 879 of those had geocodes and therefore had exposures data available from the Translator Exposures APIs. Sample sizes vary by feature variable due to missing data arising from incomplete surveys. † Significance level was set at p < 0.10. ‡ Binned using pandas cut function to facilitate comparison with [24] and account for the granularity of the data, with airborne pollutant exposure estimates obtained at a resolution of 12 km-squared and provisioned at a resolution of the US Census Track centroid, and median household income estimates provisioned at the US Census Block Group level.