| Literature DB >> 34375949 |
Aleksandra Pisarek1, Ewelina Pośpiech1, Antonia Heidegger2, Catarina Xavier2, Anna Papież3, Danuta Piniewska-Róg4, Vivian Kalamara5, Ramya Potabattula6, Michał Bochenek1, Marta Sikora-Polaczek7, Aneta Macur8, Anna Woźniak9, Jarosław Janeczko8, Christopher Phillips10, Thomas Haaf6, Joanna Polańska3, Walther Parson2,11, Manfred Kayser5, Wojciech Branicki1,9.
Abstract
DNA methylation analysis is becoming increasingly useful in biomedical research and forensic practice. The discovery of differentially methylated sites (DMSs) that continuously change over an individual's lifetime has led to breakthroughs in molecular age estimation. Although semen samples are often used in forensic DNA analysis, previous epigenetic age prediction studies mainly focused on somatic cell types. Here, Infinium MethylationEPIC BeadChip arrays were applied to semen-derived DNA samples, which identified numerous novel DMSs moderately correlated with age. Validation of the ten most age-correlated novel DMSs and three previously known sites in an independent set of semen-derived DNA samples using targeted bisulfite massively parallel sequencing, confirmed age-correlation for nine new and three previously known markers. Prediction modelling revealed the best model for semen, based on 6 CpGs from newly identified genes SH2B2, EXOC3, IFITM2, and GALR2 as well as the previously known FOLH1B gene, which predict age with a mean absolute error of 5.1 years in an independent test set. Further increases in the accuracy of age prediction from semen DNA will require technological progress to allow sensitive, simultaneous analysis of a much larger number of age correlated DMSs from the compromised DNA typical of forensic semen stains.Entities:
Keywords: DNA methylation; amplicon bisulfite sequencing; epigenetic age; epigenetic age estimation; semen
Mesh:
Year: 2021 PMID: 34375949 PMCID: PMC8386575 DOI: 10.18632/aging.203399
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Age correlation results of the univariable linear regression analysis for the ten best age correlated CpG markers selected from Infinium Methylation EPIC BeadChip array analysis of semen-derived bisulfite converted DNA samples from 38 men.
|
|
|
|
|
|
|
|
| cg02766173 | -4.00 | 0.72 | 3.63×10-7 | 0.03 |
|
| cg10528482 | -4.00 | 0.76 | 3.64×10-8 | 0.01 |
|
| cg00018181 | -1.17 | 0.71 | 7.44×10-7 | 0.04 |
|
| cg01886988 | -4.00 | 0.71 | 5.67×10-7 | 0.04 |
|
| cg17147820 | -1.49 | 0.73 | 1.60×10-7 | 0.02 |
|
| cg09855959 | -4.00 | 0.71 | 6.72×10-7 | 0.04 |
|
| cg18701351 | -4.00 | 0.77 | 1.12×10-8 | 0.01 |
|
| cg19862839 | 0.36 | -0.72 | 3.37×10-7 | 0.03 |
|
| cg07909178 | -0.20 | 0.71 | 7.46×10-7 | 0.04 |
|
| cg17704154 | -4.00 | 0.69 | 1.467×10-6 | 0.04 |
Figure 1Correlation between DNA methylation and chronological age in the model training dataset (N = 125) for six CpG sites included in the final age prediction model for semen.
Final set of age predictive CpGs in semen and characteristics of the multivariable linear regression model for semen.
|
|
|
|
|
|
|
|
|
|
| C2 | - | chr7:102288454 | -0.43 | -3.95 | 1.38×10-4 | 0.36 |
|
| C1 | cg06979108 | chr11:89589683 | 0.42 | 6.23 | 8.73×10-9 | 0.55 |
|
| C1 | - | chr5:525617 | 0.25 | 3.09 | 3.00×10-3 | 0.54 |
|
| C1 | cg05432003 | chr11:312518 | -0.30 | -2.66 | 9.00×10-3 | 0.55 |
|
| C1 | - | chr17:76077680 | 0.74 | 3.56 | 1.00×10-3 | 0.57 |
|
| C5 | - | ch17:76077748 | -0.61 | -2.85 | 5.00×10-3 | 0.60 |
Figure 2Epigenetically predicted vs. chronological age in semen samples based on the model training (N = 125) and model test (N = 54) datasets, respectively. The accuracy of prediction achieved with the developed epigenetic age prediction model for semen equals a MAE of 4.3 years (RMSE = 5.2) in the training set and a MAE of 5.1 years (RMSE = 6.3) in the test set. The six CpGs included in the model explained 60% of the age variation observed in the training set.
Age prediction accuracy in semen using different epigenetic prediction models.
|
|
|
| |||
|
|
|
|
| ||
| Current model (6 CpGs from 5 loci) | 4.3 | 5.1 | 5.2 | 6.3 | |
| Lee et al. 2015 (3 CpGs from the | 4.9 | 5.7 | 5.8 | 7.0 | |
| Current model with novel markers only (5 CpGs from 4 loci) | 4.3 | 5.2 | 5.5 | 6.3 | |
*Present in the current model (6 CpGs from 5 loci).