| Literature DB >> 32321578 |
Ryan J Langdon1,2, Rhona A Beynon1,2, Kate Ingarfield3,4,5, Riccardo E Marioni6,7, Daniel L McCartney6,7, Richard M Martin1,2,3, Andy R Ness3, Michael Pawlita8, Tim Waterboer8, Caroline Relton1,2,3, Steven J Thomas3, Rebecca C Richmond9,10.
Abstract
BACKGROUND: DNA methylation (DNAm) variation is an established predictor for several traits. In the context of oropharyngeal cancer (OPC), where 5-year survival is ~ 65%, DNA methylation may act as a prognostic biomarker. We examined the accuracy of DNA methylation biomarkers of 4 complex exposure traits (alcohol consumption, body mass index [BMI], educational attainment and smoking status) in predicting all-cause mortality in people with OPC.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32321578 PMCID: PMC7178612 DOI: 10.1186/s13148-020-00850-4
Source DB: PubMed Journal: Clin Epigenetics ISSN: 1868-7075 Impact factor: 6.551
Baseline descriptive statistics of included participants (N = 364)
| Alive ( | Dead ( | ||||
|---|---|---|---|---|---|
| Characteristic | Frequency | Frequency | |||
| Male | 209 | 76.6% | 75 | 82.4% | 0.242 |
| Female | 64 | 23.4% | 16 | 17.6% | |
| < 44 | 20 | 7.3% | 3 | 3.3% | 0.016 |
| 45 to 54 | 83 | 30.4% | 22 | 24.2% | |
| 55 to 64 | 113 | 41.4% | 34 | 37.4% | |
| 65 to 74 | 48 | 17.6% | 22 | 24.2% | |
| 75 + | 9 | 3.3% | 10 | 11.0% | |
| Low (I–II) | 39 | 14.3% | 8 | 8.8% | 0.176 |
| High (III–IV) | 234 | 85.7% | 83 | 91.2% | |
| Negative | 61 | 22.3% | 48 | 52.7% | < 0.001 |
| Positive | 212 | 77.7% | 43 | 47.3% | |
| Not overweight | 73 | 38.0% | 31 | 55.4% | 0.021 |
| Overweight or obese | 119 | 62.0% | 25 | 44.6% | |
| None | 164 | 60.1% | 34 | 37.4% | < 0.001 |
| Mild | 73 | 26.7% | 29 | 31.9% | |
| Moderate/severe | 36 | 13.2% | 28 | 30.8% | |
| School education | 116 | 42.5% | 45 | 49.5% | 0.470 |
| College | 111 | 40.7% | 34 | 37.4% | |
| Degree | 46 | 16.8% | 12 | 13.2% | |
| Never | 96 | 35.2% | 11 | 12.1% | < 0.001 |
| Former | 140 | 51.3% | 49 | 53.8% | |
| Current | 37 | 13.6% | 31 | 34.1% | |
| Non-drinker | 75 | 27.5% | 22 | 24.2% | 0.119 |
| Moderate | 68 | 24.9% | 15 | 16.5% | |
| Hazardous-harmful | 130 | 47.6% | 54 | 59.3% | |
BMI body mass index, HPV human papillomavirus, N number.aComorbidity was defined using the Adult Comorbidity Evaluation-27 (ACE-27) index [37]. For the purposes of analysis, moderate and severe comorbidity groups were combined
Origins of alcohol consumption DNAm scores employed in the current analysis
| Phenotype | Origin publication | EWAS model | # CpG sites |
|---|---|---|---|
| Alcohol consumption | ‘A DNA methylation biomarker of alcohol consumption’ Liu et al. [ | EWAS (450 K) were conducted initially using linear models per cohort. Next, an inverse variance-weighted random-effects model was used to meta-analyse 8 European-ancestry cohorts. CpGs from the meta-analysis were taken forward and included in a least absolute shrinkage and selection operator (LASSO) regression in an independent cohort, with four selection criteria used to select CpGs with predictive value of alcohol consumption | Model 1: 5, model 2: 23, model 3: 78, model 4: 144 |
| ‘Epigenetic prediction of complex traits and death’ McCartney et al. [ | EWAS (MethylationEPIC) were conducted using a LASSO regression model with k-fold ( | 450 |
Origins of BMI DNAm scores employed in the current analysis
| Phenotype | Origin publication | EWAS model | # CpG sites |
|---|---|---|---|
| BMI | ‘Epigenetic prediction of complex traits and death’ McCartney et al. [ | EWAS (MethylationEPIC) were conducted using a LASSO regression model with k-fold ( | 1109 |
| ‘Bayesian reassessment of the epigenetic architecture of complex traits’ Trejo Banos et al. [ | EWAS (MethylationEPIC) were conducted using a Bayesian framework. | 144 |
Origins of educational attainment DNAm scores employed in the current analysis
| Phenotype | Origin publication | EWAS model | # CpG sites |
|---|---|---|---|
| Educational attainment | ‘Epigenetic prediction of complex traits and death’ McCartney et al. [ | EWAS (MethylationEPIC) were conducted using a LASSO regression model with k-fold ( | 373 |
Origins of smoking DNAm scores employed in the current analysis
| Phenotype | Origin publication | EWAS model | # CpG sites |
|---|---|---|---|
| Smoking | ‘Epigenetic Signatures of Cigarette Smoking’ Joehanes et al. [ | Linear mixed models were conducted, then combined in a random-effects model meta-analysis (450 K). After meta-analysis, one set of CpGs was selected based on a Bonferroni | Bonferroni model: 2623, FDR model: 18760 |
| ‘Self-reported smoking, serum cotinine, and blood DNA methylation’ Zhang et al. [ | An EWAS (450 K) of cotinine concentration was conducted using median quantile regression, then CpG sites were individually validated against estimated average cigarettes per day using restricted cubic spline regression. Results were filtered by optimising AUCs derived from logistic regression for smoking status (current vs never; former vs never). | 4 | |
| ‘Bayesian reassessment of the epigenetic architecture of complex traits’ Trejo Banos et al. [ | EWAS (MethylationEPIC) were conducted using a Bayesian framework. | 59 | |
| ‘Epigenetic prediction of complex traits and death’ McCartney et al. [ | EWAS (MethylationEPIC) were conducted using a LASSO regression model with k-fold ( | 233 |
Proportions of phenotypic variance explained by the DNAm risk scores employed
| Methylation score | Variance explained in phenotype |
|---|---|
| Smoking | |
| Trejo Bayesian (59 CpG sites) | 48.7% |
| 47.0% | |
| McCartney LASSO (233 CpG sites) | 43.5% |
| Joehanes (Bonferroni) (2623 CpG sites) | 40.5% |
| Joehanes (FDR) (18,670 CpG sites) | 33.5% |
| Zhang (4 CpG sites) | 5.2% |
| Alcohol | |
| Liu model 4 (144 CpG sites) | 16.5% |
| Liu model 3 (78 CpG sites) | 15.8% |
| Liu model 1 (5 CpG sites) | 13.9% |
| Liu model 2 (23 CpG sites) | 10.3% |
| McCartney LASSO (450 CpG sites) | 10.0% |
| BMI | |
| Trejo Bayesian (144 CpG sites) | 24.5% |
| McCartney LASSO (1109 CpG sites) | 22.2% |
| Educational attainment | |
| McCartney LASSO (373 CpG sites) | 0.4% |
Association of phenotypic and DNAm-based predictors of smoking, alcohol drinking, BMI and education with mortality
| Minimally adjusteda | Fully adjustedb | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Exposure | HR | ll | ul | HR | ll | ul | |||||
| Self-reported phenotype | |||||||||||
| Ever vs never smoker | 364 | 3.29 | 1.75 | 6.18 | 2.22 × 10−4 | 364 | 2.21 | 1.14 | 4.30 | 0.019 | |
| Hazardous to harmful drinker vs not | 364 | 1.62 | 1.06 | 2.49 | 0.027 | 364 | 1.34 | 0.86 | 2.09 | 0.202 | |
| Higher education vs school education | 364 | 0.81 | 0.54 | 1.22 | 0.320 | 364 | 0.87 | 0.57 | 1.31 | 0.503 | |
| BMI | 248 | 0.93 | 0.87 | 0.99 | 0.028 | 248 | 0.98 | 0.92 | 1.06 | 0.664 | |
| DNAm score | |||||||||||
| McCartney LASSO (233 CpG sites) | 364 | 1.53 | 1.24 | 1.88 | 7.89 × 10-5 | 364 | 1.20 | 0.94 | 1.52 | 0.144 | |
| Trejo Bayesian (59 CpG sites) | 364 | 1.70 | 1.37 | 2.11 | 1.49 × 10-6 | 364 | 1.26 | 0.93 | 1.72 | 0.140 | |
| 364 | 0.59 | 0.48 | 0.74 | 1.72 × 10-6 | 364 | 0.79 | 0.58 | 1.07 | 0.125 | ||
| Joehanes (FDR) (18,760 CpG sites) | 364 | 1.70 | 1.34 | 2.15 | 1.27 × 10−5 | 364 | 1.35 | 0.99 | 1.84 | 0.056 | |
| Joehanes (Bonferroni) (2623 CpG sites) | 364 | 1.67 | 1.36 | 2.05 | 7.57 × 10−7 | 364 | 1.38 | 1.04 | 1.83 | 0.025 | |
| Zhang (4 CpG sites) | 364 | 1.48 | 1.16 | 1.88 | 1.48 × 10−3 | 364 | 1.28 | 1.02 | 1.60 | 0.036 | |
| Liu (5 CpG sites) | 364 | 1.32 | 1.10 | 1.57 | 2.50 × 10−3 | 364 | 1.19 | 0.97 | 1.47 | 0.094 | |
| Liu (23 CpG sites) | 364 | 1.26 | 1.04 | 1.52 | 0.019 | 364 | 1.10 | 0.89 | 1.36 | 0.357 | |
| Liu (78 CpG sites) | 364 | 1.25 | 1.07 | 1.45 | 5.02 × 10−3 | 364 | 1.20 | 0.99 | 1.45 | 0.067 | |
| Liu (144 CpG sites) | 364 | 1.24 | 1.07 | 1.44 | 5.31 × 10−3 | 364 | 1.21 | 1.00 | 1.46 | 0.052 | |
| McCartney LASSO (450 CpG sites) | 364 | 1.28 | 1.03 | 1.60 | 0.024 | 364 | 1.05 | 0.79 | 1.41 | 0.723 | |
| Trejo Bayesian (144 CpG sites) | 364 | 0.78 | 0.63 | 0.97 | 0.024 | 248 | 0.77 | 0.56 | 1.08 | 0.132 | |
| McCartney LASSO (1109 CpG Sites) | 364 | 0.85 | 0.68 | 1.06 | 0.146 | 248 | 0.77 | 0.57 | 1.04 | 0.093 | |
| McCartney LASSO (373 CpG sites) | 364 | 0.76 | 0.61 | 0.96 | 0.021 | 364 | 0.87 | 0.68 | 1.12 | 0.270 | |
N number, HR hazard ratio, ll lower confidence interval, ul upper confidence interval. aSelf-reported phenotypes adjusted for age and gender; epigenetic scores adjusted for age, gender, cell counts and batch effects. bPhenotypes additionally adjusted for clinical variables (TNM stage, HPV status and comorbidity), and a combination of smoking, alcohol intake, education and BMI, as appropriate to the model; risk scores additionally adjusted for clinical variables, the corresponding phenotype predicted by the score of interest and the remaining self-reported phenotypes (excluding BMI). cSample numbers vary due to missing BMI data
Fig. 1Flow diagram of HN5000 participants included in the analysis. *Data available for age, gender, TNM stage, HPV status, comorbidity, education, self-reported smoking status and alcohol consumption