| Literature DB >> 32450424 |
Abstract
Recent advances in understanding of biological mechanisms and adverse outcome pathways for many exposure-related diseases show that certain common mechanisms involve thresholds and nonlinearities in biological exposure concentration-response (C-R) functions. These range from ultrasensitive molecular switches in signaling pathways, to assembly and activation of inflammasomes, to rupture of lysosomes and pyroptosis of cells. Realistic dose-response modeling and risk analysis must confront the reality of nonlinear C-R functions. This paper reviews several challenges for traditional statistical regression modeling of C-R functions with thresholds and nonlinearities, together with methods for overcoming them. Statistically significantly positive exposure-response regression coefficients can arise from many non-causal sources such as model specification errors, incompletely controlled confounding, exposure estimation errors, attribution of interactions to factors, associations among explanatory variables, or coincident historical trends. If so, the unadjusted regression coefficients do not necessarily predict how or whether reducing exposure would reduce risk. We discuss statistical options for controlling for such threats, and advocate causal Bayesian networks and dynamic simulation models as potentially valuable complements to nonparametric regression modeling for assessing causally interpretable nonlinear C-R functions and understanding how time patterns of exposures affect risk. We conclude that these approaches are promising for extending the great advances made in statistical C-R modeling methods in recent decades to clarify how to design regulations that are more causally effective in protecting human health.Entities:
Keywords: Bayesian network; Causality; Dose-response threshold; Dynamic simulation model; Lead; Measurement error; Model specification error; Molybdenum; Nonlinear dose-response modeling; Nonparametric regression; Regulatory risk assessment; Residual confounding; be
Mesh:
Year: 2020 PMID: 32450424 PMCID: PMC7235595 DOI: 10.1016/j.envres.2020.109638
Source DB: PubMed Journal: Environ Res ISSN: 0013-9351 Impact factor: 6.498
Statistical techniques for commonly encountered data imperfections.
| Data/Study Imperfection | Examples of appropriate techniques and software |
|---|---|
| Model misspecification errors; unknown shapes of exposure-response dependencies | Flexible nonparametric models (e.g., MARS, |
| Exposure estimation errors and errors in estimated or measured covariates (explanatory variables) | Errors-in-variables methods (e.g., the MMC package in R, |
| Omitted variables; unobserved or unmeasured risk factors, confounders, and modifiers | latent variable techniques and finite mixture distribution modeling methods (e.g., |
| Missing data values | Multiple imputation algorithms (e.g., MICE, |
| Inter-individual heterogeneity and variability in causal exposure-response curves | Finite mixture distribution modeling, clustering, individual conditional expectation methods (e.g., |
| Correlated or interdependent explanatory variables | Probabilistic graphical methods, e.g., Bayesian networks ( |
| Interactions among risk factors or other explanatory variables | Nonparametric detection, estimation, and visualization of interactions ( |
| Uncertain internal validity (soundness of causal inferences) | Use quasi-experiment designs (or randomization and design of experiments where possible) to control for standard threats to internal validity, e.g., using PlanOut and PlanAlyzer software ( |
| Uncertain external validity (generalizability of findings) | Multisite causal mediation analysis ( |
Fig. 1Linear and nonlinear (smoothing) regressions for log(T) and log(Mo) as functions of BMI. The nonlinear curves are fit to the data using locally weighted scatterplot smoothing (LOWESS).
Fig. 2Bayesian network (BN) model structure (left) and predictions (right) (Source:Hack et al., 2010). Abbreviations: 8-OHdG = 8-hydroxyguanosine (a biomarker of oxidative stress); CFU-GEMM = colony-forming unit-granulocyte, erythrocyte, monocyte, megakaryocyte (a precursor to RBCs and WBCs); BFU-E = burst-forming unit-erythroid (a RBC precursor cell type); CFU-GM = colony forming unit – granulocyte-macrophage (a WBC precursor); RBC = red blood cell count; ttMA = trans, trans muconic acid; WBC = white blood cell count. Diamonds on right indicate observed data. Blue curves are random samples from the uncertain dose-response relationship. . (For interpretation of the references to colour in this figure legend, the reader is referred to the Web version of this article.)
Fig. 3Administering the same total amount of exposure (e.g., 100 ppm-hours per week) in different time patterns changes the maximum internal dose received.
Fig. 4Higher administered concentrations that reduce clearance rates are disproportionately efficient in producing internal concentrations.
Fig. 5Mortality during follow-up vs. blood lead level (μg/dL) for non-smokers in NHANES data.