| Literature DB >> 33194957 |
Marion Belloni1, Olivier Laurent1, Chantal Guihenneuc2, Sophie Ancelet1.
Abstract
As multifactorial and chronic diseases, cancers are among these pathologies for which the exposome concept is essential to gain more insight into the associated etiology and, ultimately, lead to better primary prevention strategies for public health. Indeed, cancers result from the combined influence of many genetic, environmental and behavioral stressors that may occur simultaneously and interact. It is thus important to properly account for multifactorial exposure patterns when estimating specific cancer risks at individual or population level. Nevertheless, the risk factors, especially environmental, are still too often considered in isolation in epidemiological studies. Moreover, major statistical difficulties occur when exposures to several factors are highly correlated due, for instance, to common sources shared by several pollutants. Suitable statistical methods must then be used to deal with these multicollinearity issues. In this work, we focused on the specific problem of estimating a disease risk from highly correlated environmental exposure covariates and a censored survival outcome. We extended Bayesian profile regression mixture (PRM) models to this context by assuming an instantaneous excess hazard ratio disease sub-model. The proposed hierarchical model incorporates an underlying truncated Dirichlet process mixture as an attribution sub-model. A specific adaptive Metropolis-Within-Gibbs algorithm-including label switching moves-was implemented to infer the model. This allows simultaneously clustering individuals with similar risks and similar exposure characteristics and estimating the associated risk for each group. Our Bayesian PRM model was applied to the estimation of the risk of death by lung cancer in a cohort of French uranium miners who were chronically and occupationally exposed to multiple and correlated sources of ionizing radiation. Several groups of uranium miners with high risk and low risk of death by lung cancer were identified and characterized by specific exposure profiles. Interestingly, our case study illustrates a limit of MCMC algorithms to fit full Bayesian PRM models even if the updating schemes for the cluster labels incorporate label-switching moves. Then, although this paper shows that Bayesian PRM models are promising tools for exposome research, it also opens new avenues for methodological research in this class of probabilistic models.Entities:
Keywords: Bayesian inference; ionizing radiation; lung cancer; multicollinearity; profile regression; survival data; truncated Dirichlet process mixture
Mesh:
Year: 2020 PMID: 33194957 PMCID: PMC7652768 DOI: 10.3389/fpubh.2020.557006
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Main characteristics of the post-55 French cohort of uranium miners.
| No. of miners | 3,377 |
| Age at entry into study, mean [min, max] | 28.3 [16.9, 57.7] |
| Duration of work in years, mean [min, max] | 16.7 [1.0, 40.9] |
| Duration of follow-up in years, mean [min, max] | 32.8 [0.1, 51.0] |
| Alive <85 years old | 2,412 (71.4) |
| Alive ≥85 years old | 74 (2.2) |
| Death from lung cancer | 94 (2.8) |
| Death from another cause | 777 (23.0) |
| Lost to follow-up | 20 (0.6) |
| Exposed miners, | 2,910 (86.2) |
| Duration of exposure (in years), mean [min, max] | 12.9 [1.0, 35.0] |
| Cumulative exposure (in WLM), mean [min, max] | 17.8 [0.003, 128.4] |
| Exposed miners, | 3,240 (95.9) |
| Duration of exposure (in years), mean [min, max] | 13.2 [1.0, 36.0] |
| Cumulative exposure (in mSv), mean [min, max] | 54.9 [0.2, 470.1] |
| Exposed miners, | 2,746 (81.3) |
| Duration of exposure (in years), mean [min, max] | 12.9 [1.0-35.0] |
| Cumulative exposure (in kBq·m−3·h), mean [min, max] | 1.64 [0.01, 10.4] |
Results only on measured exposures.
Figure 1Scatter plots of the observed cumulative exposures to γ-rays and radon (left-hand panel), γ-rays and uranium dust (at the center), radon and uranium dust (right-hand panel).
Prior probability distributions assigned to the unknown parameters of a Bayesian PRM model including the disease sub-model, the exposure sub-model and the attribution sub-model.
| Disease sub-model | β | Normal | N (0, 106) |
| λ1 | Gamma | G (23.7, 4.9·108) | |
| λ2 | Gamma | G (35.5, 2.6·107) | |
| λ3 | Gamma | G (88.1, 1.6·107) | |
| λ4 | Gamma | G (29.7, 3.2·106) | |
| Exposure sub-model | Normal | N (0.10, 2.25) | |
| Normal | N (−2.3, 8.08) | ||
| Normal | N (1.01, 11.79) | ||
| Normal | N (0, 106) | ||
| Uniform | U [0,100] | ||
| Dirichlet | D [0.5, …, 0.5] | ||
| Attribution sub-model | α | Uniform | U [0.3, 10] |
Figure 2Directed Acyclic Graph associated to the full Bayesian PRM model. Circles indicate unknown quantities and rectangles indicate observed variables. Single arrows indicate oriented probabilistic links between two quantities and double arrows indicate oriented deterministic links between two quantities. denotes the observed value of any categorical covariate q for uranium miner i and denotes the observed value of any continuous covariate q of uranium miner i.
Label switching moves.
The switching between labels j and k is accepted with probability min(1, r.
, ,
and
Figure 3Estimated number of non-empty clusters according to the initial value of α.
DIC and WAIC of Bayesian PRM model according to the fixed number K of non empty clusters.
| 5 | 146,345 | 110,872 |
| 6 | 136,714 | 108,773 |
| 7 | 118,602 | 107,004 |
| 8 | 104,566 | 105,704 |
Figure 4Number of French uranium miners (top left), number of deaths by lung cancer (bottom left) and instantaneous excess hazard ratio (per 100 WLM) of death by lung cancer (β) in each cluster (right), when fitting a Bayesian RPRM model assuming 8 non-empty clusters from the French cohort of uranium miners. The cluster including non-exposed miners is not displayed. The boxes represent the three quartiles (1st quartile, median, and 3rd quartile) of the posterior distribution of β and the whiskers of the boxplots show the 95% credible interval of the posterior distribution for each group.
Figure 5Characterization of the exposure profiles associated to each cluster, when fitting a Bayesian RPRM model assuming 8 non-empty clusters. The cluster including non-exposed miners is not displayed.