| Literature DB >> 33981823 |
Rachel C Nethery1, Tamara Rushovich2, Emily Peterson3, Jarvis T Chen2, Pamela D Waterman2, Nancy Krieger2, Lance Waller3, Brent A Coull1.
Abstract
Across the United States public health community in 2020, in the midst of a pandemic and increased concern regarding racial/ethnic health disparities, there is widespread concern about our ability to accurately estimate small-area disease incidence rates due to the absence of a recent census to obtain reliable population denominators. 2010 decennial census data are likely outdated, and intercensal population estimates from the Census Bureau, which are less temporally misaligned with real-time disease incidence data, are not recommended for use with small areas. Machine learning-based population estimates are an attractive option but have not been validated for use in epidemiologic studies. Treating 2010 decennial census counts as a "ground truth", we conduct a case study to compare the performance of alternative small-area population denominator estimates from surrounding years for modeling real-time disease incidence rates. Our case study focuses on modeling health disparities in census tract incidence rates in Massachusetts, using population size estimates from the American Community Survey (ACS), the most commonly-used intercensal small-area population data in epidemiology, and WorldPop, a machine learning model for high-resolution population size estimation. Through simulation studies and an analysis of real premature mortality data, we evaluate whether WorldPop denominators can provide improved performance relative to ACS for quantifying disparities using both census tract-aggregate and race-stratified modeling approaches. We find that biases induced in parameter estimates due to temporally incompatible incidence and denominator data tend to be larger for race-stratified models than for area-aggregate models. In most scenarios considered here, WorldPop denominators lead to greater bias in estimates of health disparities than ACS denominators. These insights will assist researchers in intercensal years to select appropriate population size estimates for modeling disparities in real-time disease incidence. We highlight implications for health disparity studies in the coming decade, as 2020 census counts may introduce new sources of error.Entities:
Keywords: Health disparities; Population denominators; Real-time incidence modeling
Year: 2021 PMID: 33981823 PMCID: PMC8081984 DOI: 10.1016/j.ssmph.2021.100786
Source DB: PubMed Journal: SSM Popul Health ISSN: 2352-8273
Fig. 1Spatial distribution of census tract-level 2010 decennial census population counts compared to 5-year ACS and WorldPop population estimates for years 2008–2010, Massachusetts, USA.
Fig. 2Scatterplots of difference in WorldPop 2010 and decennial census age-stratified CT population estimates vs. percent of the CT in poverty.
Fig. 3MA census tract ACS total population size estimates (A) and Black population size estimates (B) over time as a proportion of 2010 decennial census population size. Colors represent 2010 decennial census population size bins. (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Incidence rate ratio estimates (95% credible intervals) from premature mortality models. In the CT-aggregate models, the Race variable is an ecologic variable (CT proportion Black), while in the race-stratified models, the Race variable is a group-level binary indicator of Black (versus non-Hispanic White). In both models, the Poverty variable is CT-aggregate proportion in poverty. Continuous covariates are centered and scaled.
| CT-aggregate | Race-stratified | ||||||
|---|---|---|---|---|---|---|---|
| ACS | WP | Census | ACS | WP | Census | ||
| 2008 | Intercept | 0.97 (0.95,0.99) | 0.95 (0.93,0.97) | – | 1.05 (1.03,1.08) | 1.02 (1.00,1.05) | – |
| Race | 1.09 (1.05,1.14) | 1.10 (1.06,1.15) | – | 0.95 (0.88,1.03) | 0.97 (0.89,1.05) | – | |
| Poverty | 1.22 (1.17,1.27) | 1.10 (1.06,1.14) | – | 1.34 (1.29,1.39) | 1.19 (1.15,1.24) | – | |
| 2009 | Intercept | 0.97 (0.95,0.99) | 0.95 (0.93,0.97) | – | 1.06 (1.03,1.08) | 1.03 (1.01,1.06) | – |
| Race | 1.09 (1.05,1.13) | 1.10 (1.05,1.14) | – | 0.92 (0.84,1.00) | 0.92 (0.84,1.01) | – | |
| Poverty | 1.22 (1.18,1.26) | 1.10 (1.06,1.14) | – | 1.34 (1.29,1.39) | 1.20 (1.16,1.24) | – | |
| 2010 | Intercept | 0.97 (0.95,0.99) | 0.95 (0.93,0.97) | 0.97 (0.95,0.99) | 1.07 (1.04,1.10) | 1.04 (1.02,1.07) | 1.07 (1.05,1.09) |
| Race | 1.09 (1.05,1.14) | 1.10 (1.05,1.14) | 1.09 (1.05,1.14) | 0.88 (0.82,0.95) | 0.88 (0.81,0.97) | 0.89 (0.81,0.96) | |
| Poverty | 1.22 (1.18,1.26) | 1.11 (1.07,1.15) | 1.19 (1.15,1.23) | 1.35 (1.31,1.40) | 1.21 (1.17,1.26) | 1.32 (1.28,1.36) | |
Fig. 4Simulation results using standardized denominators. Parameter estimates using ACS and WorldPop population size estimates in CT-aggregate (A) and race-stratified (B) models. Data are generated using 2010 decennial census population sizes. True values of each parameter are denoted by the black horizontal lines.