| Literature DB >> 34341706 |
Fedor Galkin1,2, Polina Mamoshina1, Kirill Kochetov1, Denis Sidorenko3, Alex Zhavoronkov1,3,4.
Abstract
DNA methylation aging clocks have become an invaluable tool in biogerontology research since their inception in 2013. Today, a variety of machine learning approaches have been tested for the purpose of predicting human age based on molecular-level features. Among these, deep learning, or neural networks, is an especially promising approach that has been used to construct accurate clocks using blood biochemistry, transcriptomics, and microbiomics data-feats unachieved by other algorithms. In this article, we explore how deep learning performs in a DNA methylation setting and compare it to the current industry standard-the 353 CpG clock published in 2013. The aging clock we are presenting (DeepMAge) is a neural network regressor trained on 4,930 blood DNA methylation profiles from 17 studies. Its absolute median error was 2.77 years in an independent verification set of 1,293 samples from 15 studies. DeepMAge shows biological relevance by assigning a higher predicted age to people with various health-related conditions, such as ovarian cancer, irritable bowel diseases, and multiple sclerosis. copyright:Entities:
Keywords: DNA methylation; aging; artificial intelligence; epigenetics
Year: 2021 PMID: 34341706 PMCID: PMC8279523 DOI: 10.14336/AD.2020.1202
Source DB: PubMed Journal: Aging Dis ISSN: 2152-5250 Impact factor: 6.745
Accuracy metrics for DeepMAge. The accuracy achieved in cross-validation (CV column, MedAE = 2.24 years) was only slightly reduced during verification (healthy verification column, MedAE = 2.77 years). Accuracy declined in the samples with various health-related conditions (case verification column, MedAE = 4.35 years).
| CV | Healthy verification | Case training | Case verification | |
|---|---|---|---|---|
| MedAE, years | 2.24 | 2.77 | 3.29 | 4.18 |
| MAE, years | 3.21 | 3.80 | 4.74 | 5.08 |
| R2 | 0.96 | 0.93 | 0.88 | 0.82 |
| Pearson’s r | 0.98 | 0.97 | 0.94 | 0.94 |
| RMSE, years | 4.55 | 5.44 | 7.51 | 6.24 |
| N | 4,930 | 1,293 | 1,093 | 439 |
CV = Cross-validation; MAE = Mean absolute error; MedAE = Median absolute error; R2 = Coefficient of determination; RMSE = Root mean square error; N = Number of samples in the subsample
Figure 1.Scatter plot of DeepMAge predictions in 4 data cohorts”. DeepMAge accurately predicted the chronological age of healthy people from the training set (A), healthy people from the verification set (B), and remained accurate in the aggregations of case cohorts from the studies included in the training set (C) and the verification set (D). Scatter plot in panel A shows the per-fold predictions obtained during CV, and the other panels show the predictions by the final model. MedAE = Median absolute error measured in years, N = Number of samples in a corresponding cohort (see Supplementary Figures 1-3 for a more detailed visualization).
Figure 2.The DeepMAge prediction age distribution in the verification set closely resembled the real age distribution. Distributions were obtained using Gaussian kernel with 0.3σ bandwidth, where σ is the standard deviation of the age values.
DeepMAge prediction errors are not significantly different for younger males and females. Sex-related differences in age prediction for older adults are inconsistent between the CV and the verification sets.
| Set | Error, years | Absolute Error, years | N | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Years | (20-45) | (45-55) | (55-65) | (65-75) | (20-75) | (20-45) | (45-55) | (55-65) | (65-75) | (20-75) | ||
| Verification | Male | +0.48 | -2.50 | -1.46* | -4.76* | -0.87* | +2.97 | +4.04 | +3.98 | +6.04* | +3.68 | 574 |
| Female | +0.23 | -3.58 | -0.06* | -1.78* | -0.12* | +3.24 | +4.48 | +3.50 | +4.13* | +3.40 | 494 | |
| N | 707 | 62 | 163 | 136 | 1068 | 707 | 62 | 163 | 136 | 1068 | ||
| CV | Male | +0.62 | +2.14* | +0.62* | +0.81* | 0.97* | +2.84 | +3.80 | +4.00 | +4.89 | +3.53 | 1452 |
| Female | +0.65 | +0.41* | -0.54 * | -2.17* | -0.34* | +2.76 | +3.59 | +3.77 | +4.58 | +3.59 | 2058 | |
| N | 1323 | 670 | 897 | 620 | 3510 | 1323 | 670 | 897 | 620 | 3510 | ||
The significant differences are marked with “*” (p-value < 0.05 in the MW test). CV = Cross-validation; MW = Mann-Whitney U test; N = Number of samples in the age range or sex subsample.
Five diseases (including ovarian cancer and multiple sclerosis) were associated with significantly higher age predictions (p-value (MW) < 0.05).
| GEO ID | Mean error in control | Mean error in cases | N control | N case | N total | DeepMAge sample | Case cohort description | ||
|---|---|---|---|---|---|---|---|---|---|
| GSE53740* | -0.37 | +0.63 | 2.70E-2 | 1.50E-1 | 197 | 186 | 383 | Training | Neurodegenerative tauopathy |
| GSE19711* | -2.97 | -1.27 | 9.84E-6 | 4.39E-1 | 272 | 264 | 536 | Training | Ovarian cancer |
| GSE77696 | +4.43 | +3.96 | 1.31E-1 | 5.29E-2 | 117 | 261 | 378 | Training | HIV |
| GSE106648* | -1.84 | +0.26 | 2.17E-8 | 2.52E-1 | 139 | 140 | 279 | Training | Multiple sclerosis |
| GSE67530 | -2.66 | -1.63 | 1.12E-1 | 1.01E-1 | 105 | 39 | 144 | Training | Acute respiratory distress syndrome |
| GSE52588 | 0.67 | 1.19 | 1.71E-1 | 4.84E-1 | 58 | 29 | 87 | Training | Down syndrome |
| GSE97362* | 1.24 | -4.04 | 2.05E-3 | 9.30E-2 | 83 | 150 | 233 | Training | CHARGE / Kabuki syndrome |
| GSE84624 | 0.54 | 0.73 | 4.39E-1 | 9.87E-2 | 24 | 24 | 48 | Training | Kawasaki disease |
| GSE112696 | 4.24 | 4.56 | 3.44E-1 | 1.89E-1 | 6 | 6 | 12 | Verification | Myasthenia gravis |
| GSE102177 | 1.99 | 1.91 | 4.94E-1 | 2.38E-1 | 18 | 18 | 36 | Verification | Maternal gestational diabetes |
| GSE87582 | -9.59 | -3.79 | 1.08E-1 | 2.82E-1 | 1 | 20 | 21 | Verification | HIV |
| GSE107737* | -1.98 | 3.66 | 3.63E-3 | 1.56E-1 | 12 | 12 | 24 | Verification | Congenital |
| GSE87640* | -0.20 | 1.03 | 1.24E-3 | 3.57E-1 | 84 | 156 | 240 | Verification | Inflammatory bowel diseases |
| GSE99624 | -1.58 | -3.99 | 6.43E-2 | 3.76E-1 | 16 | 32 | 48 | Verification | Ostheoporosis |
pvalue (MW) is the significance of the MW test for equal mean prediction error between the case and control cohorts in each study; “*” marks the studies with a significant (p-value<0.05) MW test result; p-value(random MW) is the significance of the test for a permuted sample. For the control samples marked as “Training,” the predictions were obtained during CV; for the case samples marked as “Training,” the predictions were obtained with the final model, which had not been previously exposed to these samples. The studies in which the studied condition was significantly associated with higher DeepMAge predictions are marked in green. CV = Cross-validation; GEO ID = Gene Expression Omnibus accession number; HIV = Human Immunodeficiency Virus; MW = Mann-Whitney U test; N = Number of samples in the corresponding GEO project cohorts.
In seven out of 15 verification studies, DeepMAge performed better than the 353 CpG clock according to two quality metrics (MedAE and Pearson’s r).
| GEO ID | MedAE, years | Pearson’s r | N | Age range, years | Male ratio, % | ||
|---|---|---|---|---|---|---|---|
| DeepMAge | 353 CpG | DeepMAge | 353 CpG | ||||
| GSE107459 ** | 1.63 | 3.43 | 0.79 | 0.68 | 127 | 18-35 | 0 |
| GSE102177 * | 1.87 | 1.33 | 0.86 | 0.83 | 18 | 4-14 | 56 |
| GSE34639 * | 1.92 | 0.22 | 0.89 | 0.88 | 48 | 0-1 | 33 |
| GSE105123 ** | 2.06 | 2.87 | 0.47 | 0.38 | 107 | 19-23 | 58 |
| GSE61496 ** | 2.14 | 3.42 | 0.97 | 0.95 | 310 | 30-74 | 53 |
| GSE87640 * | 2.52 | 3.02 | 0.86 | 0.87 | 84 | 20-58 | 62 |
| GSE98876 ** | 2.54 | 4.77 | 0.89 | 0.81 | 71 | 26-69 | 100 |
| GSE79329 ** | 2.63 | 3.74 | 0.92 | 0.89 | 34 | 43-70 | 100 |
| GSE99624 ** | 2.72 | 3.73 | 0.93 | 0.81 | 16 | 49-82 | 38 |
| GSE107737 * | 3.03 | 3.62 | 0.34 | 0.46 | 12 | 18-29 | 100 |
| GSE37008 | 3.74 | 2.26 | 0.81 | 0.81 | 99 | 24-45 | 37 |
| GSE112696 * | 3.75 | 2.78 | 0.34 | 0.23 | 6 | 22-27 | 67 |
| GSE59065 ** | 4.35 | 5.01 | 0.95 | 0.94 | 295 | 22-84 | 48 |
| GSE103911 * | 6.96 | 6.14 | 0.85 | 0.76 | 65 | 27-77 | 71 |
| GSE87582 | 9.59 | 6.41 | - | - | 1 | 60 | 100 |
| Average | 2.77 | 3.51 | 0.97 | 0.93 | 1293 | 0-84 | 52 |
There were only two studies for which DeepMAge was not superior to the 353 CpG clock according to at least one metric. Considering the 15 studies in aggregate, DeepMAge provided superior prediction accuracy. “**” marks the studies in which DeepMAge shows superior performance based on both MedAE and Pearson’s r, “*” marks the studies in which DeepMAge shows superior performance based on only one metric. GEO ID = Gene Expression Omnibus accession number; MedAE = Median absolute error; N = Number of samples in the corresponding GEO project.
Figure 3.DeepMAge, but not the 353 CpG clock, predicted donors with IBD (GEO study accession GSE87640) to be on average 1.23 years older than the healthy donors from the same study (p-value = 1.24E3). Outliers outside the (-20; +20) prediction error window were removed from the image; The box is formed by the interquartile range with the median marked inside it. Whiskers protrude no farther than 1.5 times the interquartile range. GEO = Gene Expression Omnibus; IBD = Inflammatory bowel disease; N= Number of samples in a corresponding cohort.
Figure 4.The DeepMAge clock shares 122 CpGs with the 353 CpG clock and seven CpGs with the 71 CpG clock. The latter two were published in 2013.