| Literature DB >> 29754148 |
Chunxiao Li1, Wenjing Gao1, Ying Gao1, Canqing Yu1, Jun Lv1, Ruoran Lv2, Jiali Duan2, Ying Sun2, Xianghui Guo3, Weihua Cao1, Liming Li1.
Abstract
The DNA methylation age, a good reflection of human aging process, has been used to predict chronological age of adults and newborns. However, the prediction model for children and adolescents was absent. In this study, we aimed to generate a prediction model of chronological age for children and adolescents aged 6-17 years by using age-specific DNA methylation patterns from 180 Chinese twin individuals. We identified 6,350 age-related CpGs from the epigenome-wide association analysis (N=179). 116 known age-related sites in children were confirmed. 83 novel CpGs were selected as predictors from all age-related loci by elastic net regression and they could accurately predict the chronological age of the pediatric population, with a correlation of 0.99 and the error of 0.23 years in the training dataset (N=90). The predictive accuracy in the testing dataset (N=89) was high (correlation=0.93, error=0.62 years). Among the 83 predictors, 49 sites were novel probes not existing on the Illumina 450K BeadChip. The top two predictors of age were on the PRKCB and REG4 genes, which are associated with diabetes and cancer, respectively. Our results suggest that the chronological age can be accurately predicted among children and adolescents aged 6-17 years by 83 newly identified CpG sites.Entities:
Keywords: DNA methylation; Illumina Infinium MethylationEPIC Beadchip; adolescent; aging; children; prediction
Mesh:
Year: 2018 PMID: 29754148 PMCID: PMC5990383 DOI: 10.18632/aging.101445
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
The number, gender and zygosity distribution of subjects by age.
| Age, y | No. of all | Boys, No. (%) | MZ, No. (%) |
| 6 | 3 | 0 (0) | 3 (100) |
| 7 | 20 | 13 (65) | 8 (40) |
| 8 | 14 | 8 (57) | 8 (57) |
| 9 | 24 | 12 (50) | 16 (67) |
| 10 | 30 | 16 (53) | 14 (47) |
| 11 | 20 | 16 (80) | 10 (50) |
| 12 | 28 | 14 (50) | 12 (43) |
| 13 | 10 | 4 (40) | 6 (60) |
| 14 | 20 | 14 (70) | 8 (40) |
| 15 | 2 | 1 (50) | 0 (0) |
| 16 | 6 | 3 (50) | 4 (67) |
| 17 | 2 | 0 (0) | 2 (100) |
| Total | 179 | 101 (56) | 91 (51) |
a. Age and sex were self-reported by subjects and their parents;
b. The zygosity was determined from gene detection;
c. The mean ± SD of age:10.7±2.5 years; MZ: monozygotic twin.
Figure 1Manhattan plot of epigenome-wide DNA methylation analysis and chronological age. The red horizontal line indicating the P values reached the significant level of FDR < 0.05. The epigenome-wide analysis identified 6,350 CpG sites related with age.
Figure 2aCorrelation between Chronological age and DNAm age. In the training data, chronological age and DNAm age were highly correlated in the training dataset: r = 0.99, median error = 0.23 years.
The top 20 chronological age predictive CpGs in the model.
| Probename | CHR | Gene Name | Gene Group | Relation to CpG Island | Coefficient Values | Methylation β Values, means (SD) |
| cg00497086 | 16 | Body | Open Sea | 10.0 | 0.79 (0.02) | |
| cg01231611 | 1 | TSS200 | Open Sea | -9.7 | 0.86 (0.02) | |
| cg06072257 | 1 | Other | Open Sea | -8.6 | 0.68 (0.02) | |
| cg21242642 | 1 | Other | Open Sea | 6.2 | 0.16 (0.02) | |
| cg06711259* | 22 | 1stExon | N_Shore | -4.0 | 0.80 (0.02) | |
| cg00303541* | 3 | 5'UTR | Island | 3.9 | 0.26 (0.04) | |
| cg03579624* | 3 | Other | N_Shore | 3.6 | 0.28 (0.05) | |
| cg04955914* | 2 | Body | N_Shore | -3.5 | 0.58 (0.02) | |
| cg27406001 | 10 | Other | Open Sea | -3.5 | 0.57 (0.05) | |
| cg10816468 | 6 | Other | Open Sea | -3.1 | 0.64 (0.04) | |
| cg13993467 | 3 | Body | Open Sea | -2.9 | 0.64 (0.04) | |
| cg24388008 | 12 | Other | Open Sea | -2.7 | 0.10 (0.02) | |
| cg02772754 | 22 | Body | Open Sea | 2.6 | 0.51 (0.04) | |
| cg07219494* | 5 | Other | S_Shelf | -2.5 | 0.71 (0.06) | |
| cg13274149* | 9 | 3'UTR | Island | 2.4 | 0.35 (0.05) | |
| cg12642568 | 1 | 5'UTR | N_Shelf | -2.4 | 0.69 (0.02) | |
| cg13612317* | 10 | TSS1500 | S_Shore | -2.1 | 0.60 (0.04) | |
| cg07465899* | 4 | Other | N_Shore | -2.1 | 0.60 (0.02) | |
| cg02478540 | 4 | Other | Open Sea | -2.0 | 0.20 (0.02) | |
| cg16119613* | 12 | Other | N_Shelf | -2.0 | 0.40 (0.03) |
“-” means not on the known gene.
“*” means also on the Illumina 450K Beadchip.
Figure 2bCorrelation between Chronological age and DNAm age. DNAm age were also highly correlated with chronological age in the testing dataset: r = 0.93, median error = 0.64 years. Solid line = regression line.
Figure 3The genomic distribution of age-associated sites compared with all 850K probes passed QC. (a)The gene region distribution: frequency of age-related CpG sites according to the gene location; (b) The CpG islands distribution: frequency of age-related CpG sites according to the proximity to a CpG island. The ordinate represents the % CpG sites. The genomic distributions among the 83 age predictive sites, 6,350 chronological age-related CpG sites, and all the probes passed QC located on the 850 K BeadChip array were different. The annotation to be inside a CpG island was significantly over-represented on the 850k array (18.8%) compared to the 6,350 age-related CpGs and the 83 DNAm age predictors (9.2%, 8.4%), both with P <0.05. There was no differences in the distribution of the CpG sites with regard to other types of genomic distribution. The blue bar represents the all the probes passed QC located on the 850 K BeadChip array; the orange bar represents the 6,350 age-related CpGs; and the grey bar represents the 83 DNAm age predictors.