| Literature DB >> 31443728 |
Qian Zhang1, Costanza L Vallerga1, Rosie M Walker2, Tian Lin1, Anjali K Henders1, Grant W Montgomery1, Ji He3, Dongsheng Fan3, Javed Fowdar4, Martin Kennedy5, Toni Pitcher6,7, John Pearson5, Glenda Halliday8, John B Kwok8, Ian Hickie8, Simon Lewis8, Tim Anderson6,7, Peter A Silburn9, George D Mellick4, Sarah E Harris2,10, Paul Redmond10, Alison D Murray11, David J Porteous2,10, Christopher S Haley12, Kathryn L Evans2, Andrew M McIntosh2,10,13, Jian Yang1, Jacob Gratten1,9, Riccardo E Marioni2,10, Naomi R Wray1,9, Ian J Deary10,14, Allan F McRae1, Peter M Visscher15.
Abstract
BACKGROUND: DNA methylation changes with age. Chronological age predictors built from DNA methylation are termed 'epigenetic clocks'. The deviation of predicted age from the actual age ('age acceleration residual', AAR) has been reported to be associated with death. However, it is currently unclear how a better prediction of chronological age affects such association.Entities:
Keywords: Age prediction; Ageing; DNA methylation; Epigenetic clock; Mortality
Mesh:
Year: 2019 PMID: 31443728 PMCID: PMC6708158 DOI: 10.1186/s13073-019-0667-1
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Description of DNA methylation cohorts
| Cohort1 | Sample size2 | Number of samples with valid age | Mean age (SD) | Age range | Source | Disease |
|---|---|---|---|---|---|---|
| LBC1921 [ | 692 | 692 | 82.3 (4.3) | [77.8, 90.6] | Blood | Not available |
| LBC1936 [ | 2326 | 2326 | 72.4 (2.8) | [67.7, 77.7] | Blood | Not available |
| BSGS [ | 614 | 614 | 21.4 (14.1) | [9.9, 74.9] | Blood | Not available |
| SGPD | 1962 | 1556 | 67.2 (9.5) | [23.0, 104.0] | Blood | Parkinson’s disease 988, control 974 |
| MND [ | 695 | 600 | 45.2 (15.0) | [17.0, 76.0] | Blood | Motor neuron disease (MND) 497, control 198 |
| GS [ | 5101 | 5100 | 48.5(14.0) | [18.0, 94.5] | Blood | Not available |
| GSE72775 [ | 335 | 335 | 70.2 (10.3) | [36.5, 90.5] | Blood | Not available |
| GSE78874 [ | 259 | 259 | 68.8(9.7) | [36.0, 88.0] | Saliva | Not available |
| GSE72773 [ | 310 | 310 | 65.6 (13.9) | [35.1, 91.9] | Blood | Not available |
| GSE72777 [ | 46 | 46 | 14.7 (10.4) | [2.2, 35.0] | Blood | Not available |
| GSE41169 [ | 95 | 95 | 31.6 (10.3) | [18.0, 65.0] | Blood | Schizophrenia 62, control 33 |
| GSE40279 [ | 656 | 656 | 64.0 (14.7) | [19.0, 101.0] | Blood | Not available |
| GSE42861 [ | 689 | 689 | 51.9 (11.8) | [18.0, 70.0] | Blood | Rheumatoid arthritis 354, control 335 |
| GSE53740 [ | 384 | 383 | 67.8(9.6) | [34.0, 93.0] | Blood | Alzheimer’s disease 15, corticobasal degeneration 1, frontotemporal dementia (FTD) 121, FTD/MND 7, progressive supranuclear palsy 43, control 193, unknown 4 |
1LBC Lothian Birth Cohort, BSGS Brisbane Systems Genomics Study, SGPD Systems Genomic of Parkinson’s Disease consortium, MND Motor Neuron Disease cohort, GS Generation Scotland. Cohorts with prefix GSE are from the GEO database
2The number of samples in each cohort. Some samples in LBC were measured from the same individual but at different chronological age
Fig. 1The relationship between training sample size and predictor error measured at the square root of the mean squared error (RMSE) in test data sets. Each point represents the RMSE of the test result based on predictors with different sample size and methods. Points with RMSE larger than 15 are excluded. Prediction results from Horvath are marked as black dash line, and the black solid line represents prediction result from Hannum’s age predictor
Fig. 2Relationship between the training sample size and the test statistics (t test) from the association between age acceleration residual (AAR) and mortality. Each point represents the test statistic from the survival analysis based on the predicted ages from predictors with different training sample sizes
Summary details of two LBC cohorts and the relationship between all-cause mortality and predicted age from different methods (before and after cell count correction)
| LBC1921 wave one | LBC1936 wave one | |
|---|---|---|
|
| 436 | 906 |
|
| 386 | 214 |
| Chronological age, mean (SD)1 | 79.1 (0.6) | 69.5 (0.8) |
| Before cell count correction | ||
| Hannum, mean (SD) | 80.3 (6.2) | 71.3 (5.7) |
| Hannum, hazard ratio ( | 1.12 (0.016, 1.02–1.23) | 1.18 (0.020, 1.02–1.37) |
| Horvath, mean (SD) | 73.8 (6.9) | 66.1 (6.4) |
| Horvath, hazard ratio ( | 1.09 (0.038, 1.00–1.20) | 1.19 (0.0022, 1.06–1.32) |
| Elastic Net, mean (SD)3 | 77.4 (3.6) | 72.5 (3.2) |
| Elastic Net, hazard ratio ( | 1.08 (0.38, 0.91–1.27) | 1.00 (0.96, 0.79–1.28) |
| BLUP, mean (SD)3 | 77.3 (3.3) | 72.5 (2.8) |
| BLUP, hazard ratio ( | 1.20 (0.066, 0.99–1.46) | 1.25 (0.12, 0.95–1.64) |
| After cell count correction | ||
| Hannum, hazard ratio ( | 1.10 (0.057, 1.00–1.21) | 1.11 (0.15, 0.96–1.29) |
| Horvath, hazard ratio ( | 1.07 (0.13, 0.98–1.17) | 1.14 (0.032, 1.01–1.28) |
| Elastic Net, hazard ratio ( | 1.07 (0.39, 0.91–1.27) | 1.03 (0.79, 0.82–1.31) |
| BLUP, hazard ratio ( | 1.21 (0.05, 1.00–1.48) | 1.21 (0.17, 0.92–1.60) |
1Mean (predicted) age and its standard deviation
2Hazard ratio, P value, and 95% confidence interval from the survival analysis based on the predicted age. Hazard ratios were expressed per 5 years of methylation age acceleration
3Both results of Elastic Net and BLUP were based on the age predictor with the largest training sample size (sample size = 10,411 for LBC1936 and sample size = 12,710 for LBC1921)
Fig. 3The change of odds ratio from the enrichment test with the increase of training sample size (excluding LBC1936). The enrichment test examines whether AAR-associated CpG sites are enriched in probes with cellular heterogeneity
Enrichment test on the AAR-associated CpG sites from different methods based upon samples from LBC1936 wave one
| Number of significant associations ( |
| Number of CpG sites showing significant cellular heterogeneity | Odds ratio ( | |
|---|---|---|---|---|
| Hannum | 12,015 | 3.6 | 4958 | 3.85 ( |
| Horvath | 18,847 | 5.4 | 5955 | 2.53 ( |
| Elastic Net2 | 159 | 2.1 | 21 | 0.78 ( |
| BLUP2 | 793 | 2.6 | 130 | 1.00 ( |
1The odds ratio for the enrichment of EWAS significant CpG sites in the probe set showing significant cellular heterogeneity
2Both results of Elastic Net and BLUP were based on the age predictor with the largest training sample size (training set without LBC1936, sample size = 10,411)
Fig. 4Comparison of prediction performance (a correlation and b root mean square error) between the predictor from this study (based on Elastic Net) and Horvath’s age predictor in non-blood samples