| Literature DB >> 29419728 |
Anne Marie Svane1, Mette Soerensen2, Jesper Lund3, Qihua Tan4,5, Juulia Jylhävä6, Yunzhang Wang7, Nancy L Pedersen8, Sara Hägg9, Birgit Debrabant10, Ian J Deary11,12, Kaare Christensen13,14, Lene Christiansen15, Jacob B Hjelmborg16.
Abstract
Several studies have linked DNA methylation at individual CpG sites to aging and various diseases. Recent studies have also identified single CpGs whose methylation levels are associated with all-cause mortality. In this study, we perform an epigenome-wide study of the association between CpG methylation and mortality in a population of 435 monozygotic twin pairs from three Danish twin studies. The participants were aged 55-90 at the time of blood sampling and were followed for up to 20 years. We validated our results by comparison with results from a British and a Swedish cohort, as well as results from the literature. We identified 2806 CpG sites associated with mortality (false discovery rate ( FDR ) < 0.05 ), of which 24 had an association p-value below 10 - 7 . This was confirmed by intra-pair comparison controlling for confounding effects. Eight of the 24 top sites could be validated in independent datasets or confirmed by previous studies. For all these eight sites, hypomethylation was associated with poor survival prognosis, and seven showed monozygotic correlations above 35%, indicating a potential moderate to strong heritability, but leaving room for substantial shared or unique environmental effects. We also set up a predictor for mortality using least absolute shrinkage and selection operator (LASSO) regression. The predictor showed good performance on the Danish data under cross-validation, but did not perform very well in independent samples.Entities:
Keywords: DNA methylation; EWAS; heritability; mortality; survival analysis; survival prediction; twin study
Year: 2018 PMID: 29419728 PMCID: PMC5852574 DOI: 10.3390/genes9020078
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Characteristics of the sample population. Cardiovascular diseases include apoplexy and age-related cardiovascular weakness.
| Dataset | BWD | LSADT | MADT | Total |
|---|---|---|---|---|
| 150 | 238 | 482 | 870 | |
| Women, | 74 (49%) | 156 (76%) | 218 (46%) | 448 |
| Deaths, | 11 (7.3%) | 213 (89%) | 34 (7.0%) | 258 |
| Age at blood sample | 57–74 | 73–90 | 55–79 | 55–90 |
| Year of blood sample | 2009–2010 | 1996–1997 | 2008–2011 | |
| Follow-up year | 2016 | 2016 | 2016 | |
| Cause of death: | ||||
| Cancer | 1 | 35 | 4 | 40 |
| Cardiovascular disease | 0 | 72 | 2 | 74 |
| Respiratory causes | 1 | 18 | 1 | 20 |
BWT: a study of birth weight-discordant twins; LSADT: Longitudinal Study of Aging Danish Twins; MADT: Middle Aged Danish Twins Study.
The 24 CpG sites with in the univariate analysis.
| CpG Site | HR | FDR | Chromosome | Gene Name | MZ Correlation | |
|---|---|---|---|---|---|---|
| cg07626482 | 0.64 | 3.21l | 1.42 | 19 | 0.43 *** | |
| cg12627844 | 0.69 | 5.20 | 1.15 | 2 | 0.44 *** | |
| cg05339037 | 0.77 | 1.18 | 1.31 | 19 | 0.44 *** | |
| cg03487027 | 0.73 | 1.40 | 1.31 | 10 | 0.51 *** | |
| cg20162822 | 0.78 | 1.81 | 1.31 | 17 | −0.05 | |
| cg25121437 | 0.73 | 2.04 | 1.31 | 11 | 0.29 *** | |
| cg10589813 | 0.71 | 2.76 | 1.31 | 20 | 0.41 *** | |
| cg08627624 | 0.69 | 2.85 | 1.31 | 10 | −0.04 | |
| cg06598544 | 0.78 | 3.13 | 1.31 | 20 | 0.31 *** | |
| cg23591302 ** | 0.74 | 3.14 | 1.31 | 12 | 0.15 *** | |
| cg13580749 | 0.72 | 3.25 | 1.31 | 9 | 0.18 *** | |
| cg15013177 * | 0.77 | 3.98 | 1.31 | 3 | 0.12 *** | |
| cg17087741 ** | 0.71 | 4.16 | 1.31 | 2 | 0.45 *** | |
| cg04716447 ** | 1.42 | 4.16 | 1.31 | 12 | 0.18 *** | |
| cg05232694 | 0.72 | 4.64 | 1.33 | 20 | 0.31 *** | |
| cg06172950 | 0.74 | 4.81 | 1.33 | 13 | 0.27 *** | |
| cg26186132 | 0.71 | 5.66 | 1.41 | 6 | −0.07 | |
| cg11339912 | 0.72 | 5.74 | 1.41 | 5 | 0.39 *** | |
| cg24750887 | 1.42 | 6.22 | 1.44 | 4 | 0.05 | |
| cg15763258 | 0.73 | 6.52 | 1.44 | 11 | 0.36 *** | |
| cg15871086 | 0.74 | 7.65 | 1.61 | 18 | 0.36 *** | |
| cg02657160 | 0.70 | 8.81 | 1.72 | 3 | 0.18 *** | |
| cg02711608 ** | 0.69 | 8.99 | 1.72 | 19 | SLC1A5 | 0.48 *** |
| cg17459949 | 0.73 | 9.79 | 1.80 | 10 | −0.06 |
FDR: false discovery rate. * Polymorphic site according to [16]. ** Single nucleotide polymorphism (SNP) within-probe binding region (according to annotation file). *** Significant at the 0.05 level. † A significant hazard ratio (HR) at 0.05 level in intra-pair comparison.
Hazard ratio and association p-value for the top 24 sites in a Cox frailty model adjusted for sex, age, and twin structure in the discovery and validation datasets. Five CpGs were not available in the Swedish data.
| CpG Site | Danish Twins | Lothian Birth Cohort | Swedish Twins | |||
|---|---|---|---|---|---|---|
| HR | HR | HR | ||||
| cg07626482 | 0.64 | 3.21 | 0.55 | 3.18 | 0.88 | 0.09 |
| cg12627844 | 0.69 | 5.20 | 0.94 | 0.53 | - | - |
| cg05339037 | 0.77 | 1.18 | 0.42 | 1.70 | 0.85 | 0.02 |
| cg03487027 | 0.73 | 1.40 | 1.09 | 4.81 | 0.99 | 0.87 |
| cg20162822 | 0.78 | 1.81 | 0.96 | 0.64 | 1.08 | 0.31 |
| cg25121437 | 0.73 | 2.04 | 0.91 | 0.48 | 1.07 | 0.30 |
| cg10589813 | 0.71 | 2.76 | 0.56 | 2.71 | 0.85 | 0.03 |
| cg08627624 | 0.69 | 2.85 | 1.08 | 0.48 | - | - |
| cg06598544 | 0.78 | 3.13 | 1.06 | 0.13 | 1.04 | 0.57 |
| cg23591302 | 0.74 | 3.14 | 1.05 | 0.11 | - | - |
| cg13580749 | 0.72 | 3.25 | 1.00 | 1.00 | - | - |
| cg15013177 | 0.77 | 3.98 | 1.09 | 0.42 | 0.97 | 0.66 |
| cg17087741 | 0.71 | 4.16 | 0.57 | 3.22 | - | - |
| cg04716447 | 1.42 | 4.16 | 1.37 | 0.15 | 1.06 | 0.47 |
| cg05232694 | 0.72 | 4.64 | 0.86 | 0.07 | 0.90 | 0.17 |
| cg06172950 | 0.74 | 4.81 | 0.77 | 0.034 | 0.98 | 0.75 |
| cg26186132 | 0.71 | 5.66 | 1.18 | 0.33 | 1.10 | 0.17 |
| cg11339912 | 0.72 | 5.74 | 0.65 | 2.27 | 0.91 | 0.18 |
| cg24750887 | 1.42 | 6.22 | 1.05 | 0.52 | 1.02 | 0.82 |
| cg15763258 | 0.73 | 6.52 | 0.70 | 0.03 | 0.97 | 0.69 |
| cg15871086 | 0.74 | 7.65 | 0.66 | 6.82 | 0.97 | 0.71 |
| cg02657160 | 0.70 | 8.81 | 1.01 | 0.93 | 0.90 | 0.13 |
| cg02711608 | 0.69 | 8.99 | 0.53 | 5.30 | 0.99 | 0.94 |
| cg17459949 | 0.73 | 9.79 | 1.07 | 0.11 | 0.97 | 0.68 |
Figure 1The 10-fold cross-validated Harrell’s C for the 24 most significant sites in the univariate analysis. Sex, age, and cohort are included in the model. The red line corresponds to the model based on sex, age, and cohort only.
The variables chosen by the least absolute shrinkage and selection operator (LASSO) for the mortality predictor. The second column gives the coefficients for the linear predictor. All methylation values are normalized to have a standard deviation of 1.
| Covariate | Coefficient ** | SE ** | Chromosome | Gene Name | MZ Correlation | |
|---|---|---|---|---|---|---|
| cg02537149 | 0.158 | 0.0664 | 0.0173 | 1 | 0.00 | |
| cg02691019 | 0.0906 | 0.0564 | 0.108 | 16 | −0.07 | |
| cg04716447 * | 0.308 | 0.0623 | 7.36 | 12 | 0.18 *** | |
| cg05232694 | −0.155 | 0.0686 | 0.0234 | 20 | 0.31 *** | |
| cg07626482 | −0.160 | 0.0820 | 0.0508 | 19 | 0.43 *** | |
| cg12880095 | 0.115 | 0.0621 | 0.0650 | 17 | 0.53 *** | |
| cg14304264 | 0.140 | 0.0705 | 0.0467 | 15 | 0.00 | |
| cg17459949 | −0.176 | 0.0640 | 5.95 | 10 | −0.06 | |
| cg20164226 * | 0.0579 | 0.0579 | 0.317 | 7 | 0.27 *** | |
| cg21381949 | −0.112 | 0.0592 | 0.0585 | 3 | 0.12 *** | |
| cg22304262 | −0.273 | 0.0822 | 8.97 | 19 | 0.38 *** | |
| cg24750887 | 0.205 | 0.0663 | 1.96 | 4 | 0.05 | |
| cg24967142 | 0.219 | 0.0669 | 1.04 | 12 | −0.08 | |
| cg26186132 | −0.228 | 0.0688 | 9.11 | 6 | −0.07 | |
| Sex (female) | −0.702 | 0.140 | 5.71 | |||
| Age (in years) | 0.139 | 0.0165 | <2 |
* SNP within the probe-binding region. ** Coefficients, standard error, and p-value in a Cox model fitted with all 14 sites, sex, and, age. *** Significant at the 0.05 level.
Figure 2The 10-fold cross-validated time-varying AUC for the LASSO predictor based on all 14 CpG sites, sex, and age. This is compared to the model based on sex and age only.