Literature DB >> 29152601

A novel approach to analyzing lung cancer mortality disparities: Using the exposome and a graph-theoretical toolchain.

Paul D Juarez1, Darryl B Hood2, Gary L Rogers3, Suzanne H Baktash4, Arnold M Saxton5, Patricia Matthews-Juarez1, Wansoo Im1, Myriam Patricia Cifuentes2, Charles A Phillips6, Maureen Y Lichtveld7, Michael A Langston4.   

Abstract

OBJECTIVES: The aim is to identify exposures associated with lung cancer mortality and mortality disparities by race and gender using an exposome database coupled to a graph theoretical toolchain.
METHODS: Graph theoretical algorithms were employed to extract paracliques from correlation graphs using associations between 2162 environmental exposures and lung cancer mortality rates in 2067 counties, with clique doubling applied to compute an absolute threshold of significance. Factor analysis and multiple linear regressions then were used to analyze differences in exposures associated with lung cancer mortality and mortality disparities by race and gender.
RESULTS: While cigarette consumption was highly correlated with rates of lung cancer mortality for both white men and women, previously unidentified novel exposures were more closely associated with lung cancer mortality and mortality disparities for blacks, particularly black women.
CONCLUSIONS: Exposures beyond smoking moderate lung cancer mortality and mortality disparities by race and gender. POLICY IMPLICATIONS: An exposome approach and database coupled with scalable combinatorial analytics provides a powerful new approach for analyzing relationships between multiple environmental exposures, pathways and health outcomes. An assessment of multiple exposures is needed to appropriately translate research findings into environmental public health practice and policy.

Entities:  

Keywords:  Disparities; exposome; lung cancer; mortality; social determinants

Year:  2017        PMID: 29152601      PMCID: PMC5687093     

Source DB:  PubMed          Journal:  Environ Dis        ISSN: 2468-5690


INTRODUCTION

Lung cancer remains the leading cause of cancer mortality in both males and females in the United States.[ Based on 2009–2013 SEER data, the National Cancer Institute projected that lung and bronchus cancer is associated with an estimated 158,080 deaths in the US., 415,787 individuals would be living with the disease, and 224,390 new cases would be diagnosed in 2016.[ These figures translate to an overall, age adjusted incidence rate of 57.3/100,000, and a mortality rate of 46.0.[ Despite a more than 50% decrease in smoking rates from 1970 to 2014 (37.4%–16.8%),[ the number of deaths caused by lung cancer has more than doubled from 61,700 in 1970[ to an estimated 159,260 in 2014.[

Mortality

While smoking has been identified as contributing to 87% of lung cancer deaths overall,[ numerous other etiological factors have been identified. Radon has been attributed to approximately 10% of lung cancer mortality, accounting for an estimated 21,000 lung cancer deaths each year.[ Exposure to secondhand smoke has been estimated to account for 4% of lung cancer deaths.[ A 2002 American Cancer Society study found that long term exposure to combustion related particulate matter (PM2.5) led to an 8% increase in lung cancer mortality.[ A recent systematic review of the effects of air pollution found the meta relative risk for lung cancer associated with PM2.5 was 1.09 (95% confidence interval [CI]: 1.04, 1.14) and the meta relative risk of lung cancer associated with PM10 was 1.08 (95% CI: 1.00, 1.17). In addition, meta relative risk estimates for adenocarcinoma associated with PM2.5 and PM10 were 1.40 (95% CI: 1.07, 1.83) and 1.29 (95% CI: 1.02, 1.63), respectively.[ Similarly, occupational exposures (smelters, blast furnaces and foundries, rubber manufacturing, paving, roofing, painting, and chimney sweeping) and associated chemical exposures, including certain metals (chromium, cadmium and arsenic), volatile organic compounds, radiation and diesel exhaust together, have been associated with an additional 9% to 15% of lung cancer deaths. Individual etiological risk factors linked to lung cancer mortality when combined, exceed 100%.[

Disparities

Smoking rates do not adequately account for race×gender, lung cancer mortality disparities. Age adjusted, adult smoking rates (2015)[ and age adjusted, lung cancer mortality rates (2009–2013)[ were 17.2% and 57.7 for white males (WM); 16.0% and 38.39 for white females (WF); 20.9% and 70.6 for black males (BM); and 13.3% and 35.3 for black females (BF). Similarly, males and females who smoke were 23 and 13 times more likely to develop lung cancer, respectively, compared to those who never smoked.[ Poor and medically underserved populations are more likely to be diagnosed with late-stage cancers than compared to those treated more effectively or cured if diagnosed earlier.[ Social determinants of lung cancer mortality disparities also have been associated with increased risk for lung cancer mortality, including a broad range of indicators such as behavioral factors (e.g., smoking, higher rates of alcohol use, and obesity), socioeconomic status, education, occupation, living conditions, lack of health care coverage, mistrust of the health care system, and fatalistic attitudes about cancer. Financial barriers, cultural beliefs, and lack of access to culturally competent health care by low income and/or racial/ethnic minority groups also have been associated with lung cancer mortality disparities. Aizer et al.[ found that differences in lung cancer mortality rates between Blacks and Whites persist even after adjusting for sociodemographic factors, year and stage of diagnosis, and receipt of definitive treatment. It is unclear, however, whether the mechanisms and pathways through which social determinants affect lung cancer mortality and mortality disparities are etiological, mediating, or simply co occurring.

Multiple exposures

While cigarette consumption clearly accounts for the greatest attributable risk, it remains unclear the extent to which other environmental exposures contribute independently, interactively, or synergistically. Persons who are exposed to radon, PM2.5, workplace chemicals, pesticides, or chemicals in the home and who smoke are at greater risk for dying from lung cancer than those who smoke but who do not experience similar exposures. Living with a smoker likewise increases a nonsmoker’s chances of developing lung cancer by 20%–30%,[ accounting for approximately 3,000 excess lung cancer deaths each year.[ Similarly, lung cancer risk associated with PM2.5 is greatest for former smokers (1.44 [95% CI: 1.04, 2.01]) as compared to never smokers (1.18 [95% CI: 1.00, 1.39]). Deaths attributed to radon exposure also are more likely to occur among smokers than nonsmokers.[ While persons exposed to asbestos are five times more likely to develop lung cancer than those not exposed to asbestos, the risk for lung cancer mortality increases 50 fold for those who are exposed to asbestos and who smoke.[ Till date, a few studies have attempted to examine the effects of multiple chemical and nonchemical stressors on lung cancer mortality or mortality disparities, by race and gender. The evidence clearly supports the need for applying a risk model that is capable of examining how multiple exposures across various domains act as etiologic, mediating, or co occurring factors to affect lung cancer mortality and mortality disparities.[

Exposome

The exposome has been previously defined by Wild[ as cumulative exposures across the lifespan, from conception to death. Juarez et al.[ demonstrated the general utility of the exposome approach using a graph theoretical toolchain to assess the effects of over 600 measures of environmental exposures on preterm births. That study examined relationships between annual, county level variables across three domains, and preterm births using graph theoretical algorithms and scalable combinatorial analyses. By contrast, this study more than triples the number of environmental stressors included in the analysis, particularly measures previously linked to lung cancer mortality. The goal of this research was to use an exposome database comprised 2162 chemical and nonchemical environmental stressors coupled with a graph theoretical toolchain and a data driven approach to identify putative relationships between exposures from natural, built, and social environment domains and lung cancer mortality and mortality disparities across four race and gender groups: WM, WF, BM, and BF.

METHODS

We integrated a portfolio of advanced computational tools and more conventional biostatistics, to elucidate latent relationships between annual county level measures of environmental stressors across the natural, built, and social environment domains with lung cancer mortality and mortality disparities rates, by race and gender. The overall approach we employed is depicted in Figure 1.
Figure 1

Graph theoretical toolchain. These steps were undertaken to assess exposure impact of multiple chemical and non-chemical environmental exposures on lung cancer mortality and mortality disparities using a public health exposome approach. Date from diverse sources were collected, curated and prepared for further interrogation. Modern combinatorial tools were used to distill highly correlated subgraphs for more traditional statistical analysis. These results can be used by domain scientists within community settings to generate and test hypotheses and to translate findings into public and environmental health policy and practice. The first four operations performed in this paper were used to demonstrate the proof of concept of the public health exposome approach while the latter two were designed to motive action

All exposure and health data were obtained from publically available sources and standardized as annual, county level, age adjusted rates per 100,000/population. Data were geo coded using ArcGIS 10.5 and analyzed by county. Due to small numbers of annual lung cancer deaths by race and gender, particularly in rural, homogeneous, and sparsely populated counties, data were pooled across multiple years (1999–2013) to derive an average, age adjusted, annualized, county rate per 100,000, by race, gender, and age (combined 45–84 years of age). Only counties with a minimum, combined total of ten mortality cases of the lung and bronchus (ICD 10 Codes: C33 (Malignant neoplasm of trachea), C34.0 (Main bronchus Malignant neoplasms), C34.1 (Upper lobe, bronchus or lung Malignant neoplasms), C34.2 (Middle lobe, bronchus or lung Malignant neoplasms), C34.3 (Lower lobe, bronchus or lung Malignant neoplasms), C34.8 (Overlapping lesion of bronchus and lung Malignant neoplasms), and C34.9 (Bronchus or lung, unspecified Malignant neoplasms) for each of the four, race×gender groups were included in the study. Racial differences were limited to blacks and whites based on the small number of counties that had a minimum of ten lung cancer deaths for other racial groups and exceeded the CDC Wonder suppression policy. A total of 2,101 measures of diverse stressors from the three described environment domains for 2,067 (of 3,144) counties and county equivalents were used in this study. Examples of measures of the natural environment included meteorological conditions, chemical emissions, and land cover/use; measures of the built environment included health care access, neighborhood resources, and occupational codes; social environmental stressors included population level measures of social, demographic, economic, and political variables. See Table 1 in supplemental material for a complete list, source, and year of exposure variables. Mortality rates due to cancer of the lung and bronchus by county for WM, WF, BM, and BF were obtained from the CDC Wonder website https://wonder.cdc.gov/. Pooling, selection of counties, and smoothing were used in response to the CDC policy of suppressing data for counties in which there were fewer than ten reported cases. For counties in which persons of all four race×gender groups were counted, but no lung cancer deaths were reported, rates were smoothed with techniques designed for this purpose.[ Suppressed mortality values were otherwise set to missing. All exposure and health data were obtained from publically available sources and standardized as annual, county level, age adjusted rates per 100,000/population. As there is a known lag of 20–30 years between environmental exposures and lung cancer mortality, we limited exposure data to the years 1980–2010. No Institutional Review Board approval was required as mortality rates and environmental stressors measurements were publically available secondary data.

Scalable computational analysis

We applied graph theoretical algorithms to the data. Pearson correlation coefficients were first calculated between each pair of variables (environmental exposure and lung cancer mortality rate). The clique doubling technique[ was employed to compute an (absolute) threshold of significance, which was |r|>0.14. By applying this threshold and by anchoring on each of the four race×gender lung cancer mortality responses, we created four graphs (WM, WF, BM, and BF) for further analysis as described by Langston et al.[ Vertex and edge counts were as follows. WM: 530, 80249; WF: 477, 65149; BM: 483, 66915; and BF: 486, 61167. Paracliques[ were extracted from these graphs using a glom term[ set to 1 and an anchor variable that was guaranteed to reside in the first and largest paraclique. Other paracliques also were considered, because those represented latent, putative relationships with the potential to be equally revealing. To reduce redundancy and extract underlying traits that bear the highest amount of data variability, we conducted a factor analysis procedure with varimax rotation using SAS 9.4 (SAS Institute, Cary, NC, USA) on the pool of variables from the first paraclique. Factor scores were calculated using the original variables so that we could make direct comparisons of factors within and between regression models; this resulted in 172 factors. A subset of 120 factors was selected by stepwise regression (due to computational limitations) and used in all possible regression analyses for each of the four, race×gender, lung cancer mortality variables, and differences between variables. A P = 0.0001 was the threshold used to determine statistical significance. Using parsimony, R square, and Akaike information criterion (AIC), we identified the highest contributing factors for each of the four race×gender groups. The 20 most commonly occurring factors for each regression model were then analyzed in final multiple regression models, allowing factors to be compared for differential effects on race×gender, lung cancer mortality, and lung cancer mortality disparity rates. These effects then were computed by differences among the single rates. Standardized regression coefficients (β) were used to compare the relative importance of factors explaining variability of the eight, dependent variables of the models of lung cancer mortality rates and disparities.[ Final regression models incorporated spatial autocorrelation based on location of county centers (Moran’s I = 0.0838, P < 0.001). We set absolute values of coefficient values above 0.5 to characterize strong factor contributions, between 0.3 and 0.5 for moderate contributions and below 0.3 for weak ones. Geographical information systems (GIS) were used to generate maps to visualize spatial distributions of each of the factors and assist with data interpretation (see Appendix 1: Maps, supplemental materials).

RESULTS

Lung cancer mortality

Mean rates and standard deviations of age adjusted, lung cancer mortality rates per 100,000 in the 2067 counties were 193.59 ± 61.11 for WM, 110.15 ± 33.41 for WF, 120.7 ± 122.27 for BM, and 42.18 ± 49.92 for BF. Standardized regression models were used to render the cumulative effect of combined factors for the highest zero order correlations and to confirm the main role of the most important variables in each model (nonstandardized regression models are presented in Tables 2 and 3 of the supplemental materials). Cigarette consumption contributed the greatest explanation of lung cancer mortality rates for both WM and WF (β = 0.47 and β = 0.60, respectively) while % vulnerable African Americans (comprised variables: % African American, low birth weight, very low birthweight, unmarried, chlamydia, and gonorrhea) contributed the greatest explanation of lung cancer mortality for BM and BF (β = 0.44 and β = 0.38, respectively). % disabled and rent were found to have significant, yet weak, positive coefficients across all four, race×gender models [Table 1].
Table 1

List of Variables by Domain, Clique, Factor and Year

Exposome domainFactor namesVariable numberVariable nameClique factorYear
NaturalEthyl dichloride and ethylene oxidV110Cancer risk in a million due to ethyl dichlorideClique 1 Factor 552005
V112Cancer risk in a million due to ethylene oxide2005
Avg daily min/maxTempV131Max Temp JulyClique 1 Factor 12000
V132Max Temp July2005
V137Min Temp July2000
V13Percentage housing units heated by electricity2000
V188Low literacy_percentage
V20LT_hi_school_percentage2005
V2115Cancer risk in a million due to acetaldehyde2005
V2131Cancer risk in a million due to formaldehyde
V2151AvgDaily_Min_Air_Temp
V220M_LT65_NO_HLTH_INS_percentage2006
V468MILK_PRICE
V499AvgDailyMax Heat Index_F
V591F_divorce2009
V604DM_Temp_99 TO
V607DAYS_HI_90
V608DAYS_HI_100
V609DAYS_MX_T_90
V620Premature
V621Under_18
V662Land_surf_temp_day
V663Land_surf_night
V664Temp_min
V665Sunlight
V700AvgDailySunlight1979
V701AvgDayLandSurfaceTemp_F
V702AvgNightLandSurfaceTemp_F
V833Avgdaily_max_heat_index
PrecipitationV665PrecipClique 1 Factor 31
V951Ave Daily Precip1980
V952Ave Daily Precip1985
V953Ave Daily Precip1990
PM2.5V111Cancer risk in a million due to Ethylene dibromideClique 1 Factor 8
Cancer risk in a million due to2005
V2118acrylonitrile2005
V588Ave Fine Part
BuiltAccess to neighborhood facilitiesV423House No Car GT 10 Miles to Store2006
V424Low income GT 102010
Miles to store2010
V65Percentage population,2010
low access to store 20102010
V66Percentage population, low-income access2010
to store 2010
V67Percentage population, children low access2010
to store 20102010
V68Percentage population, seniors, low access to store

Exposome domainVariable categoryVariable numberVariable nameClique factorYear

Farming dependentV234Farming-dependent typology code 2004Clique1 Factor 3512004
SocialPercentage vulnerable African AmericanV184Black Isolation Index 2000Clique 1 Factor 92000
V211Black Pop percentage
V29AA Pop percentage
V487Non-Hispanicblack percentage2008
V618Low birth weight
V619Very low birth weight
V623Unmarried
V638#/1000 black protestant
V761Chlamydia2006
V762Gonorrhea2006
V936Probability that blacks will meet other blacks1990
Blue collar workersV174Percentage Black BlueCollar Workers 2000Clique 1 Factor 652000
V176Renting blacks percentage2000
DiversityV501DiversityClique 1 Factor 302000
V932Thiel Index (diversity)1990
V9391990 White’s RCL measure1990
V9401990 Spatial Proximity Index1990
DisabledV633Dis_Am per 1000 AllClique 1 Factor 1242003–2005
V634Dis_Am_White2003–2005
RentV2157Rent estimates at the 50th percentile 0Clique 1 Factor 122010
V2158Rent estimates at the 50th percentile 12010
V2159Rent estimates at the 50th percentile _22010
V2160Rent estimates at the 50th percentile _32010
V2161Rent estimates at the 50th percentile 42010
SES/education/incomeV227W Collar Wrkr percentageClique 1 Factor 3022010
V22Bachlr Degree + percentage2010
V23Grad or Prof Degree percentage2010
V26Educ Index
V46Manage Prof _occs percentage
V538Median House Inc W2010
V541Per Cap Inc W2010
V587Ed hi school W percentage2010
V615Ave life expectancy2000
V942Household income total pop2000
V943Household income (for population age 65 or older)2000
Percentage democrats HH IncomeV192Democrats percentageClique 1 Factor 3512004
V193Democrats percentage2008
V25Median Personal EarningClique 1 Factor 3062010
V27Income Index
V36Labor Force Part GE16 percentage
V492Med Household Income2008
V584Med household Income W2000
Percentage republican PovertyV189Republican percentageClique 1 Factor 252004
V190Republican percentage2008
V186GINIClique 1 Factor 72000
V241Low Educ 042004
V242Low Employ 042004
V243Persist Poverty 042004
V35Gini Coefficient
V37Poverty below fed percentage
V389Medicaid eligible total
V38Child poverty percentage
V390Medicaid Eligible M
V391Medicaid Eligible F
V393Medi/Medi Dual eligible
V403Food stamp recipients percentage2005
V41Children less than 5 poverty percentage
V422Low income GT 1 mile to store2006
V441Adults 65+ poverty percentage
V458Snap St2008
V493Free lunch percentage2008
V495Poverty rate 082008
V496Child poverty percentage2000
V536Income less than poverty W2010
V585Poverty white percentage2000
V945RS 002000
V946Atkin2000
V947RS901990
Cigarette consumptionV222Unemployment rateClique 1 Factor 2
V383Medicare enrollment Disab Tot
V384MEDCR_ENROL_DISABL_HI_ percentage2000
V385MEDCR_ENROL_DISABL_SMI percentage2000
V392MEDCD_ELIG_BLIND2000
V456LOW Income_SP percentage2007
V475DIABETES_ADULTS percentage2000
V476OBESE_ADULTS percentage2000
V51Prod_trans_moving_occs percentage
V520Educ_Less_HS_M_W2010
V521Educ_HS_M_W2010
V524Educ Less HS F W2010
V525Educ HS F W2010
V543SNAP W2010
V557Blue Col W2000
V558Blue Col WM2000
V565adj_ictive percentage2009
V586Ed low W percentage2000
V58LT HS percentage2000
V59HS degree percentage2000
V602Single Family W2010
V661Age Adj Obesity2009
V703Ave M2000
V704Ave M2005
V705Ave F2000
V706Ave F2005
V778Cig M 961996
V779Cig M 971997
V780Cig M 981998
V781Cig M 991999
V782Cig M 002000
V783Cig M 012001
V784Cig M 022002
V785Cig M 032003
V786Cig M 042004
V787Cig M 052005
V788Cig M 062006
V789Cig M 072007
V790Cig M 082008
V791Cig M 092009
V792Cig M 102010
V793Cig F 961996
V794Cig F 971997
V795Cig F 981998
V796Cig F 991999
V797Cig F 002000
V798Cig F 012001
V799Cig F 022002
V800Cig F 032003
V801Cig F 042004
V802Cig F 052005
V803Cig F 062006
V804Cig F 072007
V805Cig F 082008
V806Cig F 092009
V807Cig F 102010
V808Cig B 961996
V809Cig B 971997
V810Cig B 981998
V811Cig B 991999
V812Cig B 002000
V813Cig B 012001
V814Cig B 022002
V815Cig B 032003
V816Cig B 042004
V817Cig B 052005
V818Cig B 062006
V819Cig B 072007
V820Cig B 082008
V821Cig B 092009
V822Cig B 102010
Marital statusV505Mar Stat Mar WClique 1 Factor 142010
V506Mar Status Mar WM2010
V507Mar Status Mar WF2010
Percentage catholicV640Percentage catholicClique 1 Factor 100

SES: Socioeconomic status, PM2.5: Particulate matter

For WM, other significant factors with weak positive coefficients included average daily min/max average temperature, % disabled, household income, poverty, PM2.5, precipitation, rent, and % of population age 19–64. % Catholic, % vulnerable African American, and access to neighborhood facilities had statistically significant but weak negative coefficients [Table 1]. For WF, factors with significant, but weak, positive correlations in explaining lung cancer mortality, in descending order were: rent, daily min/max average temperature, % of population age 19–64, marital status, and % disabled. Access to neighborhood facilities, PM2.5, % Catholic, farm dependent, and % vulnerable African American had weak negative coefficients. For BM, % vulnerable African American had the highest but moderate contribution (β = 0.44), followed by weak positive contributions for rent, % disabled, education, average min/max daily temperature, precipitation, and PM2.5, whereas cigarette consumption was nonsignificant at P < 0.0001 threshold, with a weak β = 0.06 (P < 0.05). In the case of BF, 20 factors accounted for a R2 = 0.48. Nine factors had significant positive P values, whereas two factors had negative, significant coefficients. Among these, % vulnerable African American was the highest contributing factor, with a moderate β = 0.38, followed by weak contributions of education, % disability, diversity, cigarette consumption, rent, and PM2.5, with β between 0.10 and 0.20. A factor comprised of ethyl dichloride and ethylene oxide, and PM2.5 had weak, negative β coefficients.

Lung cancer mortality disparities

Additional regression models were used to calculate the relative contribution of environmental exposures on lung mortality disparities rates between WM and BM; WF and BF; WM and WF, and BM and BF (race); and WM and BM, and WF and BF (gender) at the P < 0.0001 threshold. Seven factors contributed positively and three negatively to black: white, racial, lung cancer morality disparities [Figures 2 and 3]. Positive β included % vulnerable African American, education, rent, % disability, % catholic, and PM2.5. Factors with negative β were cigarette consumption, poverty, and % population age 19–64. % Vulnerable African American had a strong effect and the others contributed weakly. Six coefficients contributed negatively and none positively to M/F gender disparities including % vulnerable African American, min/max average temperature, rent, average precipitation, % disability, and PM2.5. Disparities between WM and BM were accounted for largely by % vulnerable African American (β = 0.51). Other positive, but weak coefficients included rent, % disability, and education. Negative β included cigarette consumption, poverty, and % population 19–64. Significant β that contributed weakly to disparities between WF and BF included education, diversity, rent, and % disability. Cigarette consumption contributed negatively and weakly to gender disparities.
Figure 2

Comparison of standardized regression coefficients of factors included in four models to explain lung cancer mortality rates for WM, WF, BM, and BF population. Factors are a combination of multiple years of data

Figure 3

Comparison of standardized regression coefficients of factors included in four models to explain lung cancer mortality disparities rates for BF-WF, BM-WM, B-W, M-F population. Factors are a combination of multiple years of data

DISCUSSION

Results of this study suggest that county level, race, and gender differences in cigarette consumption, % vulnerable African American, level of education, % blue collar workers, access to neighborhood resources, housing as a % of income, and diversity, as well as differences in direct exposures to ethyl dichloride and ethylene oxide, min/max average temperature, PM2.5 and precipitation are associated with lung cancer mortality and/or race×gender mortality disparities. Of particular interest is the impact of cigarette consumption on lung cancer mortality disparities. While cigarette consumption is clearly the leading cause of lung cancer overall, it contributes less to our understanding of lung cancer mortality between BM and BF as compared to WM and WF and contributes little to our understanding of race×gender mortality disparities. Interpretation of our findings based on the previous research suggests that cigarette consumption, ethyl dichloride and ethylene oxide, and PM2.5 are etiologic chemical agents associated with lung cancer mortality and mortality disparities. In parallel, % vulnerable African American, level of education, % blue collar workers, % disability, access to neighborhood resources, housing as a % of income, and diversity would appear to be moderating social determinants that impact lung cancer mortality and mortality disparities. Our mapping of exposures using GIS suggests that other variables, such as temperature, precipitation, % Catholic, % democrat, and % republican, may be co occurring or spurious and simply reflect regional differences found in Southern states [Supplemental Figures 1–24: Maps in Supplemental materials].

Public health implications

From primary prevention to survivorship, the pathway to lung cancer mortality and race×gender disparities is profoundly affected by environmental exposures. To date, limited research has examined the combined effects of multiple factors that affect lung cancer mortality and mortality disparities. By curating large amounts of disparate, heterogeneous data, an exposome approach provides public health researchers with an opportunity to harness existing secondary data, generate and test hypotheses, and consider the complex role of chemical and nonchemical environmental stressors. The exposome database and graph theoretical toolchain can also be used to assess the effectiveness of specific risk reduction interventions that test the intervention itself without the traditional limitations inherent to the technical validity of the public health action to be tested. This is particularly relevant where social determinants often act as powerful confounders to underlying etiologic factors that cause poor health outcomes hampering conclusive findings. While lung cancer mortality was used as a “demonstration case,” this approach has applicability to other priority adverse health conditions.

Enabling evidence based science

A major contribution of the public health exposome is that it provides a novel approach for considering the effects of multiple environmental stressors on health outcomes and racial disparities. A second contribution is enabling a dual derivation of testable hypotheses. The graph theoretical toolchain is capable of transforming high volume, disparate heterogeneous data comprised chemical and nonchemical environmental stressors to support both hypothesis generating and hypothesis testing inquiries. This data driven approach is epidemiologically significant in that it provides new opportunities for identifying populations at risk, risk and protective factors, and spatial and temporal measures of exposure. Together, these approaches increase the likelihood that environmental health research will address the public health concerns of affected communities, provide opportunities for meaningful, bi directional, community engaged research, and lay the fertile foundation for community academic partnerships working to collaboratively translate research findings into effective public health policy and practice.

CONCLUSIONS

The exposome paradigm offers a new risk assessment approach to assess the effects of multiple chemical and nonchemical environmental stressors on health outcomes and disparities. It provides public health providers and officials the tools to use “big data” and computational tools in conjunction with traditional biostatistics to analyze complex exposome relationships and to develop and evaluate targeted community health promotion, risk reduction, and health disparities interventions. Graph theoretical algorithms and computational analyses are capable of transforming high volume, heterogeneous, secondary exposure data, spanning the natural, built, and social environments, beyond that which is typically used in traditional, narrowly focused, observational studies. A public health exposome approach provides epidemiologically significant opportunities to identify environmental exposures associated with complex health outcomes and disparities and supports further biostatistical analysis, including factor analysis and multiple regression, multi level, and spatial temporal analyses, GIS and data visualization, and predictive modeling. The use of these analytics is particularly relevant in health disparities research, where mediating and moderating factors influencing disparities often are powerful confounders.

Limitations

Limitations in this study include the validity and reliability of existing public available data sets; environmental stressor data reflect different years; data are population level measures; and not all individuals in a given county are equally affected by a specific stressor.

Directions for future work

An exposome approach, database, and graph theoretical toolchain provides public health professionals with a novel set of tools for analyzing large, multiple, heterogeneous, secondary data sets that can be used both for generating and testing hypotheses and for targeting and evaluating public health interventions. This novel study demonstrates how the public health exposome approach and database comprised chemical and nonchemical stressors from the natural, built, and social environments coupled with a graph theoretical toolchain affords us an opportunity to examine the effects of multiple exposures across various domains on lung cancer mortality and mortality disparities [Figures 2 and 3]. While lung cancer mortality was used here as a “demonstration case,” the benefits of a public health exposome approach coupled with scalable combinatorial analytics are universal and can be applied to many complex health issues. The complex causes and correlates of poor health outcomes and health disparities support the need to move beyond individual risk assessment models to cumulative risk assessment models which not only incorporate multiple exposures across various domains but also can identify exposures across the life course and the life stage at which the exposures occurs. We currently are updating the public health exposome database to include smaller spatial and temporal units (from county to sub county areas and annual to daily measures—where available) while expanding the database to span the full 30 years of environmental stressors. This will allow us to model both the spatial and temporal dimensions of environmental exposures, more accurately distinguish between etiologic, mediating, and co occurring factors, and move toward a more robust cumulative assessment of environmental exposures across the lifespan. These measures should help us achieve the full potential of the exposome.
Table 2

Regression models (non-standardized coefficients) for lung cancer mortality rates by race and gender for 20 environmental exposure factors

Variable nameBlack female (R2=0.60)Black male (R2=0.57)White female (R2=0.53)White male (R2=0.62)




St.BPSt.BPSt.BPSt.BP
Percentage vulnerable African-American0.376<0.00010.441<0.0001−0.1440.2441−0.0820.5462
SES/education/income0.269<0.00010.141<0.00010.0910.293−0.030<0.0001
Ambulatory care discharges0.152<0.00010.152<0.00010.108<0.00010.106<0.0001
Blue collar workers0.143<0.00010.199<0.00010.064<0.00010.063<0.0001
Diversity0.143<0.00010.0320.00430.0230.1249−0.0300.2782
Poverty−0.1380.0005−0.0770.0080.0140.02590.0960.03
Cigarette consumption0.1360.00090.0640.00040.596<0.00010.474<0.0001
Adulthood−0.0660.0012−0.0300.09990.118<0.00010.0580.0056
Ethyl dichloride and ethylene oxide0.0930.0020.0300.14470.0100.1517−0.0210.075
PM2.50.0880.00320.0670.0417−0.1010.06950.0900.4733
Rent0.1100.0040.0350.38670.163<0.0001−0.0400.22
Marital status0.0530.00850.0350.14340.115<0.00010.053<0.0001
Percentage catholic0.0400.00910.0140.1163−0.1140.0023−0.1090.0003
Percentage income housing−0.0910.015−0.0350.26960.0570.76220.0980.0026
Access to neighborhood facilities0.0210.0786−0.0070.9579−0.088<0.0001−0.066<0.0001
Ave daily minimum/maximum temperature0.0240.08870.1110.00010.148<0.00010.216<0.0001
Precipitation0.0170.27150.0930.027−0.0360.94980.0870.1711
Percentage democrats−0.0710.3504−0.0700.5195−0.0460.3428−0.0150.5248
Percentage republicans0.0010.69170.0190.5166−0.1230.3188−0.0450.7091
Farming dependent−0.0070.91640.0060.391−0.119<0.0001−0.0270.003

SES: Socioeconomic status, PM2.5: Particulate matter

Table 3

Regression Summary (non-standardized coefficients): Lung Cancer Mortality Disparities by Race and Gender Differences for 20 Environmental Exposure Factors

EffectDifference black
Difference female – male (R2=0.40)
White (R2=0.64)
White female (R2=0.55)
White male (R2=0.57)

BPBPBPBP
Intercept39.742329.441822.6925−155.5
Percentage vulnerable African-American68.4521<0.000120.4956<0.000147.9361<0.0001−24.8206<0.0001
Education42.9121<0.000111.2358<0.000130.3656<0.0001−1.35330.7505
Am care discharges29.4984<0.00016.87070.000122.3544<0.0001−22.2268<0.0001
Blue-collar workers22.4462<0.00014.7944<0.000118.3492<0.0001−19.3076<0.0001
Diversity13.8324<0.00017.0338<0.00016.87540.00113.92080.0563
Percentage catholic0.09709<0.00010.03935<0.00010.059270.00080.0086170.6136
Adulthood−3.4361<0.0001−1.8126<0.0001−1.82530.00740.20710.7572
Access to neighborhood facilities9.82960.0015.1129<0.00014.51750.05623.2430.1653
Poverty−16.5910.0016−4.61280.0209−13.51090.00110.20.9599
Cigarette consumption−17.7160.0022−11.29<0.0001−7.41220.1023−14.85880.0006
Farming dependent20.61370.008710.250.000910.87430.0812−9.54160.12
Household income−12.36980.0111−4.9650.0089−7.83620.0422−3.0920.4134
PM2.513.78560.07387.18250.00216.25240.2682−12.30420.009
Precipitation7.12630.14892.38930.15564.92390.1889−10.60060.0017
Rent−5.46790.201−0.70010.6647−4.18810.21219.06120.0049
Percentage republicans13.07220.22225.33620.20927.38870.3863−4.45550.5982
Ethyl dichloride and ethylene oxide3.5190.47122.53480.14442.42630.5198−2.41570.4868
Percentage democrats−6.21860.5643−1.80810.673−4.13030.63091.07890.8993
Temperature1.53810.8149−1.81340.39014.22150.3885−19.6804<0.0001
Marital status−3.09140.9599−5.50850.8198−1.64010.9732−23.9430.6188

PM2.5: Particulate matter

  14 in total

1.  A matter of race: early-versus late-stage cancer diagnosis.

Authors:  Beth A Virnig; Nancy N Baxter; Elizabeth B Habermann; Roger D Feldman; Cathy J Bradley
Journal:  Health Aff (Millwood)       Date:  2009 Jan-Feb       Impact factor: 6.301

2.  The impact of data suppression on local mortality rates: the case of CDC WONDER.

Authors:  Chetan Tiwari; Kirsten Beyer; Gerard Rushton
Journal:  Am J Public Health       Date:  2014-06-12       Impact factor: 9.308

3.  Lower Bounds on Paraclique Density.

Authors:  Ronald D Hagan; Michael A Langston; Kai Wang
Journal:  Discrete Appl Math       Date:  2016-05-11       Impact factor: 1.139

4.  Cancer statistics, 1970.

Authors:  E Silverberg; R N Grant
Journal:  CA Cancer J Clin       Date:  1970 Jan-Feb       Impact factor: 508.702

5.  Current Cigarette Smoking Among Adults - United States, 2005-2015.

Authors:  Ahmed Jamal; Brian A King; Linda J Neff; Jennifer Whitmill; Stephen D Babb; Corinne M Graffunder
Journal:  MMWR Morb Mortal Wkly Rep       Date:  2016-11-11       Impact factor: 17.586

6.  Lung cancer, cardiopulmonary mortality, and long-term exposure to fine particulate air pollution.

Authors:  C Arden Pope; Richard T Burnett; Michael J Thun; Eugenia E Calle; Daniel Krewski; Kazuhiko Ito; George D Thurston
Journal:  JAMA       Date:  2002-03-06       Impact factor: 56.272

7.  Comparison of threshold selection methods for microarray gene co-expression matrices.

Authors:  Bhavesh R Borate; Elissa J Chesler; Michael A Langston; Arnold M Saxton; Brynn H Voy
Journal:  BMC Res Notes       Date:  2009-12-02

Review 8.  Occupational and environmental causes of lung cancer.

Authors:  R William Field; Brian L Withers
Journal:  Clin Chest Med       Date:  2012-12       Impact factor: 2.878

9.  Exploration of preterm birth rates using the public health exposome database and computational analysis methods.

Authors:  Anne D Kershenbaum; Michael A Langston; Robert S Levine; Arnold M Saxton; Tonny J Oyana; Barbara J Kilbourne; Gary L Rogers; Lisaann S Gittner; Suzanne H Baktash; Patricia Matthews-Juarez; Paul D Juarez
Journal:  Int J Environ Res Public Health       Date:  2014-11-28       Impact factor: 3.390

Review 10.  Outdoor particulate matter exposure and lung cancer: a systematic review and meta-analysis.

Authors:  Ghassan B Hamra; Neela Guha; Aaron Cohen; Francine Laden; Ole Raaschou-Nielsen; Jonathan M Samet; Paolo Vineis; Francesco Forastiere; Paulo Saldiva; Takashi Yorifuji; Dana Loomis
Journal:  Environ Health Perspect       Date:  2014-06-06       Impact factor: 9.031

View more
  7 in total

Review 1.  Lung cancer health disparities.

Authors:  Bríd M Ryan
Journal:  Carcinogenesis       Date:  2018-05-28       Impact factor: 4.944

2.  Unsupervised dimensionality reduction for exposome research.

Authors:  Vrinda Kalia; Douglas I Walker; Katherine M Krasnodemski; Dean P Jones; Gary W Miller; Marianthi-Anna Kioumourtzoglou
Journal:  Curr Opin Environ Sci Health       Date:  2020-05-19

3.  Application of the Public Health Exposome Framework to Estimate Phenotypes of Resilience in a Model Ohio African-American Women's Cohort.

Authors:  Patricia Cifuentes; John Reichard; Wansoo Im; Sakima Smith; Cynthia Colen; Carmen Giurgescu; Karen Patricia Williams; Shannon Gillespie; Paul D Juarez; Darryl B Hood
Journal:  J Urban Health       Date:  2019-03       Impact factor: 3.671

4.  Applying an Exposome-Wide (ExWAS) Approach to Cancer Research.

Authors:  Paul D Juarez; Patricia Matthews-Juarez
Journal:  Front Oncol       Date:  2018-08-27       Impact factor: 6.244

Review 5.  Towards Mass Spectrometry-Based Chemical Exposome: Current Approaches, Challenges, and Future Directions.

Authors:  Jingchuan Xue; Yunjia Lai; Chih-Wei Liu; Hongyu Ru
Journal:  Toxics       Date:  2019-08-18

Review 6.  The Chemical Exposome of Human Aging.

Authors:  Biswapriya B Misra
Journal:  Front Genet       Date:  2020-11-23       Impact factor: 4.599

7.  Rethinking of Environmental Health Risks: A Systematic Approach of Physical-Social Health Vulnerability Assessment on Heavy-Metal Exposure through Soil and Vegetables.

Authors:  Jun Yang; Silu Ma; Yongwei Song; Fei Li; Jingcheng Zhou
Journal:  Int J Environ Res Public Health       Date:  2021-12-19       Impact factor: 3.390

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.