| Literature DB >> 30348905 |
Michael J Thompson1, Steve Horvath2, Karolina Chwiałkowska3, Liudmilla Rubbi1, Aldons J Lusis4, Richard C Davis4, Anuj Srivastava5, Ron Korstanje6, Gary A Churchill6, Matteo Pellegrini1.
Abstract
Human DNA-methylation data have been used to develop highly accurate biomarkers of aging ("epigenetic clocks"). Recent studies demonstrate that similar epigenetic clocks for mice (Mus Musculus) can be slowed by gold standard anti-aging interventions such as calorie restriction and growth hormone receptor knock-outs. Using DNA methylation data from previous publications with data collected in house for a total 1189 samples spanning 193,651 CpG sites, we developed 4 novel epigenetic clocks by choosing different regression models (elastic net- versus ridge regression) and by considering different sets of CpGs (all CpGs vs highly conserved CpGs). We demonstrate that accurate age estimators can be built on the basis of highly conserved CpGs. However, the most accurate clock results from applying elastic net regression to all CpGs. While the anti-aging effect of calorie restriction could be detected with all types of epigenetic clocks, only ridge regression based clocks replicated the finding of slow epigenetic aging effects in dwarf mice. Overall, this study demonstrates that there are trade-offs when it comes to epigenetic clocks in mice. Highly accurate clocks might not be optimal for detecting the beneficial effects of anti-aging interventions.Entities:
Keywords: DNA methylation; biological age; epigenetic clock; mouse
Mesh:
Substances:
Year: 2018 PMID: 30348905 PMCID: PMC6224226 DOI: 10.18632/aging.101590
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Summary performance statistics of epigenetic aging models (“clocks”).
| All CpGs Clock | Training set | Ridge | 1.00 | 0.1 | 193651 | 0 |
| Elastic net | 0.99 | 0.7 | 582 | 0 | ||
| LO-Batch-Out | Ridge | 0.79 | 3.1 | 193641 | 0 | |
| Elastic | 0.82 | 2.5 | 529 | 81 | ||
| LO-Sample-Out | Ridge | 0.85 | 2.1 | 193651 | 0 | |
| Elastic | 0.89 | 1.8 | 444 | 81 | ||
| 10-fold CV | Ridge | 0.88 | 0.3 | 193641 | 0 | |
| Elastic | 0.89 | 1.2 | 463 | 134 | ||
| Conserved CpGs Clock | Training | Ridge | 0.85 | 2.7 | 952 | 0 |
| Elastic | 0.91 | 1.9 | 274 | 0 | ||
| LO-Batch-Out | Ridge | 0.64 | 4.0 | 952 | 0 | |
| Elastic | 0.68 | 3.8 | 214 | 39 | ||
| LO-Sample-Out | Ridge | 0.75 | 3.3 | 952 | 0 | |
| Elastic | 0.78 | 2.4 | 236 | 6 | ||
| 10-fold CV | Ridge | 0.77 | 3.5 | 952 | 0 | |
| Elastic | 0.80 | 2.5 | 247 | 23 |
Accuracy of estimating chronological age for 4 different epigenetic clocks. The 4 clocks differ in terms of the CpGs that were used in their construction (first column) and in terms of the underlying regression model (third column). The second column describes the method for estimating the predictive accuracy. The training set estimates are overly optimistic and should be ignored. Leave-one-batch out estimates and leave-one-sample-out estimates provide accuracy estimates that are far less biased than those obtained in the training set. The mean model size refers to the number of CpGs selected by the penalized regression model. Since the ridge regression is based on all CpGs, the standard deviation is zero.
Figure 1Accuracy of ridge regression epigenetic age predictions. DNA methylation age (y-axis) versus chronological age (x-axis) for all mouse samples. (a) Performance of ridge regression clock based on all 192K CpGs in all training samples. The training set estimates of the accuracy are overly optimistic and should be ignored. (b) Results by tissue type of cross-validated predictions obtained by iteratively withholding one “batch” (tissue x publication). For the batch cross-validation of this clock, the global Pearson correlation between predicted and chronological age was 0.79 (p < 2E-195) with a mae of 3.1 months. All models in these iterative cross-validations had the same size of 193,651 CpGs. (c) Scatter plots by tissue type based on DNAm age estimates made with an iterative leave-one-sample-out cross-validation. The correlation between predicted and chronologic age was 0.85 (p < 6E-258) with a mae of 2.1 months.
Figure 2Age acceleration due to diet treatments. Results obtained from ridge regression clock. A meta-analysis p-value for the 3 calorie-restriction (CR) experiments is included. (a) Calorie restriction versus standard diet in the C57BL/J strain. (b) Calorie restriction versus standard chow diet in the B6D2F1 strain. (c) Calorie restriction versus standard diet for the HET3 strain. d) Rapamycin enriched diet versus standard diet for the HET3 strain.
Figure 3Age acceleration and Dwarfism in mice. Results obtained from ridge regression clock. A meta-analysis p-value for the 3 experiments is included. (a) Genetic knockout dwarf mice versus wild type. (b) Snell dwarf mice versus wild type. c) Ames Dwarf mice versus wild type.
Figure 4Age acceleration and maternal diet. Results obtained from ridge regression clock. (a) Offspring of mothers fed a high fat diet (HFD) who were fed either a high fat or low fat diet (LFD). (b) Offspring of mothers fed a low fat diet who were fed either a high fat or low fat diet.
Figure 5Genome-wide association results for DNAm Age. (a) Manhattan plot presenting genome-wide association results for DNAm Age. Epigenetic age predictions were calculated using all CpGs clock with ridge regression and leave-one-sample-out estimates. GWAS analysis was based on linear mixed model and a set of 196,148 SNPs (MAF > 0.05) from HMDP mice strains. (b) This SNP as identified using GWAS analysis of epigenetic age predictions. It is located in an LD block on chromosome 6 and contains the genes Npy, Mpp6, Gsdme and Osbp13. A one-sided t-test of DNAm ages between the two allelic groups shown is statistically significant. (c) It is located in an LD block on chromosome 6 and contains the genes Npy, Mpp6, Gsdme and Osbp13. A one-sided t-test of DNAm ages between the two allelic groups shown is statistically significant.
Top ten SNPs from GWAS analysis of DNAm age predictions corresponding to peaks connected to LD blocks in HDMP (~2 Mbp). P-values were computed with a linear mixed-model (LMM).
| JAX00189882 | 6 | 77104479 | 1,08E-04 | Ctnna2, Lrrtm1 |
| JAX00141186 | 6 | 55124351 | 1,59E-04 | Plekha8, Mturn, Znrf2, Nod1, Ggct, Gars, Crhr2, Inmt, Mindy4, Aqp1, Ghrhr, Adcyap1r1 |
| JAX00613802 | 6 | 73641278 | 1,67E-04 | Dnah6, Suclg1, 4931417E11Rik |
| JAX00651898 | 7 | 115070069 | 3,06E-04 | Calca, Calcb, Insc, Sox6 |
| JAX00373522 | 14 | 14657081 | 4,48E-04 | Olfr720, Olfr31, Il3ra, Slc4a7, Nek10 |
| JAX00049927 | 14 | 9498134 | 4,83E-04 | Fhit |
| JAX00140799 | 6 | 49976398 | 5,06E-04 | Npy, Mpp6, Gsdme, Osbp13 |
| JAX00189488 | 4 | 95990791 | 5,48E-04 | Fggy, Hook1, Cyp2j13, Cyp2j12, Cyp2j11, Cyp2j8 |
| JAX00087970 | 19 | 25717022 | 5,66E-04 | Kank1, Dmrt1, Dmrt3, Dmrt2 |
| JAX00374020 | 14 | 17587871 | 7,02E-04 | Thrb |
Datasets.
| Novel. Current study | Adipose (56) | C57BL/6J (200) | N=364 |
| Stubbs (2017) | Cortex (16) | C57BL/6 | N=61 |
| Cole (2017) [45] | Liver (32) | Ames Prop1 Dwarf (16) | N = 32 |
| Petkovich (2017) [44] | Blood (231) | C57BL/6 (161) | N = 231 |
| Novel. Current study. JAX lab | Kidney (190) | Diversity Outbred (190) | N = 190 |
| Reizel (2015) [76] | Cerebellum (8) | C57BL/6 | N = 92 |
| Cannon (2016) [67] | Heart (5) | C57BL/6 | N=32 |
| Cannon (2014) [52] | Liver (40) | C57BL/6 | N = 40 |
| Orozco (2014) [66] | Liver (105) | 91 different strains | N = 105 |