| Literature DB >> 36083583 |
Weichuan Dong1, Wyatt P Bensken1, Uriel Kim2, Johnie Rose1,3,4, Qinjin Fan5, Nicholas K Schiltz1,4,6, Nathan A Berger3,7, Siran M Koroukian1,3,4.
Abstract
Importance: The association between cancer mortality and risk factors may vary by geography. However, conventional methodological approaches rarely account for this variation. Objective: To identify geographic variations in the association between risk factors and cancer mortality. Design, Setting, and Participants: This geospatial cross-sectional study used county-level data from the National Center for Health Statistics for individuals who died of cancer from 2008 to 2019. Risk factor data were obtained from County Health Rankings & Roadmaps, Health Resources and Services Administration, and Centers for Disease Control and Prevention. Analyses were conducted from October 2021 to July 2022. Main Outcomes and Measures: Conventional random forest models were applied nationwide and by US region, and the geographical random forest model (accounting for local variation of association) was applied to assess associations between a wide range of risk factors and cancer mortality.Entities:
Mesh:
Year: 2022 PMID: 36083583 PMCID: PMC9463612 DOI: 10.1001/jamanetworkopen.2022.30925
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
County-Level Demographic and Risk Factor Variables
| Source (year published) and variable | Description | Year of data | Original source |
|---|---|---|---|
| County Health Rankings & Roadmaps (2016) | |||
| Non-Hispanic Black | Percentage of Non-Hispanic African American people | 2014 | Census-PE |
| Hispanic population | Percentage of Hispanic people | 2014 | Census-PE |
| Population aged ≥65 y | Percentage of people aged ≥65 y | 2014 | Census-PE |
| Median household income | The income where half of households in a county earn more and half of households earn less | 2014 | Census-SAIPE |
| Unemployment | Percentage of population (aged ≥16 y) unemployed but seeking work | 2014 | BLS |
| Smoking | Percentage of adults who are current smokers (age-adjusted) | 2014 | CDC-BRFSS |
| Excessive drinking | Percentage of adults reporting binge or heavy drinking (age-adjusted) | 2014 | CDC-BRFSS |
| Poor or fair health | Percentage of adults reporting fair or poor health (age-adjusted) | 2014 | CDC-BRFSS |
| Frequent physical distress | Percentage of adults reporting ≥14 d of poor physical health per month (age-adjusted) | 2014 | CDC-BRFSS |
| Frequent mental distress | Percentage of adults reporting ≥14 d of poor mental health per month (age-adjusted) | 2014 | CDC-BRFSS |
| Not proficient in English | Percentage of people (aged ≥5 y) who reported speaking English less than very well | 2010-2014 | Census-ACS |
| County Health Rankings & Roadmaps (2017) | |||
| Rural population | Percentage of people living in rural areas | 2010 | Census-PE |
| Physical inactivity | Percentage of adults (aged ≥18 y) reporting no leisure-time physical activity (age-adjusted) | 2013 | CDC-DIA |
| Adult obesity | Percentage of the adult population (aged ≥18 y) that reports a body mass index ≥30 (age-adjusted) | 2013 | CDC-DIA |
| Diabetes | Percentage of adults (aged ≥20 y) with diagnosed diabetes (age-adjusted) | 2013 | CDC-DIA |
| Uninsured adults | Percentage of adults aged <65 y without health insurance | 2014 | Census-SAHIE |
| Social associations | Number of membership associations per 10 000 population | 2014 | Census-CBP |
| Insufficient sleep | Percentage of adults who report <7 h of sleep on average (age-adjusted) | 2014 | CDC-BRFSS |
| Sexually transmitted infections | Number of newly diagnosed chlamydia cases per 100 000 population | 2014 | CDC-NCHHSTP |
| Preventable hospital stays | Rate of hospital stays for ambulatory-care sensitive conditions per 100 000 Medicare enrollees | 2014 | DAHC |
| Health care costs | Per capita spending of Medicare enrollees | 2014 | DAHC |
| Food environment index | Index of factors that contribute to a healthy food environment, from 0 (worst) to 10 (best) | 2014 | USDA-FEA |
| Mammography use (aged 65-69 y) | Percentage of female Medicare enrollees (aged 65-69 y) who received an annual mammography screening | 2014 | DAHC |
| Severe housing problems | Percentage of households with at least 1 of 4 housing problems: overcrowding, high housing costs, lack of kitchen facilities, or lack of plumbing facilities | 2009-2013 | HUD-CHAS |
| Access to exercise opportunities | Percentage of population with adequate access to locations for physical activity | 2010 & 2014 | ESRI and Census (TF) |
| Violent crime | Number of reported violent crime offenses per 100 000 population | 2012-2014 | FBI-UCR |
| Income inequality | Ratio of household income at the 80th percentile to income at the 20th percentile | 2011-2015 | Census-ACS |
| Children in single-parent households | Percentage of children who live in a household headed by a single parent | 2011-2015 | Census-ACS |
| Driving alone to work | Percentage of the workforce that drives alone to work | 2011-2015 | Census-ACS |
| Long commute, driving alone | Among workers who commute in their car alone, the percentage who commute >30 min | 2011-2015 | Census-ACS |
| HRSA–Area Health Resources Files (2019) | |||
| Female-headed households | Percentage of female-headed households | 2010 | Census (decennial) |
| Rural-urban continuum code | An ordinal variable classifying metropolitan and nonmetropolitan counties by the population size of their metropolitan area and by degree of urbanization and adjacency to a metropolitan area or areas | 2013 | USDA-ERS |
| Urban influence code | An ordinal variable classifying metropolitan and nonmetropolitan counties by the population size of their metropolitan area and by size of the largest city or town and proximity to metropolitan and micropolitan areas | 2013 | USDA-ERS |
| Poverty | Percentage of people in poverty | 2014 | Census-SAIPE |
| Income <200% of federal poverty level | Percentage of people (aged 18-64 y) with income <200% of federal poverty level | 2014 | Census-SAHIE |
| Receipt of SNAP benefits | Percentage of households with ≥1 individual who received SNAP benefits | 2014 | Census-SNAPF |
| Medicare eligibility | Percentage of people eligible for Medicare | 2014 | CMS |
| Primary care physicians | Primary care physicians in patient care per 100 000 people | 2014 | AMA |
| Obstetrician-gynecologists | Obstetrician-gynecologists in patient care per 100 000 people | 2015 | AMA |
| Radiation oncologists | Radiation oncologists per 100 000 people | 2015 | AMA |
| Radiologists | Diagnostic radiologists in patient care per 100 000 people | 2015 | AMA |
| Hospitals | Hospitals per 100 000 people | 2015 | AMA |
| Community health centers | Community health centers per 100 000 people | 2014 | HRSA |
| Health professional shortage area | An ordinal variable identifying counties experiencing a shortage of health professionals (primary care physicians) | 2015 | HRSA |
| Without high school degree | Percentage of people (aged ≥25 y) with no high school diploma | 2011-2015 | Census-ACS |
| National Cancer Institute (2016) | |||
| Mammography use (aged ≥40 y) | Percentage of female individuals (aged ≥40 y) who received a mammography screening within 2 y | 2008-2010 | CDC-BRFSS and NHIS |
| Colorectal screening (aged ≥50 y) | Percentage of people (aged ≥50 y) who ever had colorectal cancer test (home-based fecal occult blood test in the past 2 y or ever had a colorectal endoscopy) | 2008-2010 | CDC-BRFSS and NHIS |
| CDC Division for Heart Disease and Stroke Prevention (2020) | |||
| Cardiovascular disease | Total cardiovascular disease hospitalization rate per 1000 Medicare beneficiaries (aged ≥65 y) | 2013-2015 | CMS |
| Stroke | Stroke hospitalization rate per 1000 Medicare beneficiaries (aged ≥65 y) | 2013-2015 | CMS |
Abbreviations: ACS, American Community Survey; AMA, American Medical Association; BLS, Bureau of Labor Statistics; BRFSS, Behavioral Risk Factor Surveillance System; CBP, County Business Patterns; CDC, Centers for Disease Control and Prevention; CHAS, Comprehensive Housing Affordability Strategy data; CMS, Centers for Medicare & Medicaid Services; DAHC, Dartmouth Atlas of Health Care; DIA, Diabetes Interactive Atlas; ERS, Economic Research Service; FEA, Food Environment Atlas; HRSA, Health Resources and Services Administration; HUD, US Department of Housing and Urban Development; NCHHSTP, National Center for HIV/AIDS, Viral Hepatitis, STD, and TB prevention; NHIS, National Health Interview Survey; PE, Population Estimates; SAHIE, Small Area Health Insurance Estimates; SNAP, Supplemental Nutrition Assistance Program; SNAPF, Supplemental Nutrition Assistance Program File; TF, Tigerline File; UCR, Uniform Crime Reporting; USDA, US Department of Agriculture.
Body mass index is calculated as weight in kilograms divided by height in meters squared.
Figure 1. US County-Level Cancer Mortality Rate by Quartile During 2008 to 2019
Black lines delineate regions of the US (Northeast, Midwest, South, and West).
Figure 2. Relative Importance Plot of Cancer Mortality Risk Factors From the Conventional Random Forest Models
The most important variable is at the top and set to 100%. The importance of the rest of the variables is shown relative to the top one. In panel B, states included in Midwest region were IN, IA, IL, KS, MI, MO, MN, NE, ND, OH, SD, and WI. In panel C, states included in the Northeast region were CT, ME, MA, NH, NJ, NY, PA, RI, and VT. In panel D, states included in West region were AZ, CA, CO, ID, MT, NV, NM, OR, UT, WA, and WY. In panel E, states included in South region were AL, AR, DE, DC, FL, GA, KY, LA, MD, MS, NC, OK, SC, TN, TX, VA, and WV. SNAP indicates Supplemental Nutrition Assistance Program.
Figure 3. Relative Importance of Selected Cancer Risk Factors From the Geographical Random Forest Analysis
The highest value of variable importance among all risk factors is set to 100%. All other values of variable importance are scaled relative to the highest value. Black lines delineate regions of the US (Northeast, Midwest, South, and West). SNAP indicates Supplemental Nutrition Assistance Program.
Figure 4. Risk Factors Prevalence and Areas With High Variable Importance From the Geographical Random Forest Analysis
All variables use their own respective scales and are classified by quartile. Black lines delineate regions of the US (Northeast, Midwest, South, and West). SNAP indicates Supplemental Nutrition Assistance Program.