| Literature DB >> 32580727 |
Yang Han1,2, Julia Franzen1,2, Thomas Stiehl3, Michael Gobs1,2, Chao-Chung Kuo1,2, Miloš Nikolić1,2, Jan Hapala1,2, Barbara Elisabeth Koop4, Klaus Strathmann5, Stefanie Ritz-Timme4, Wolfgang Wagner6,7.
Abstract
BACKGROUND: Age-associated DNA methylation changes provide a promising biomarker for the aging process. While genome-wide DNA methylation profiles enable robust age-predictors by integration of many age-associated CG dinucleotides (CpGs), there are various alternative approaches for targeted measurements at specific CpGs that better support standardized and cost-effective high-throughput analysis.Entities:
Keywords: Aging; Amplicon sequencing; Blood; Buccal swabs; CTCF; DNA methylation; Droplet digital PCR; Epigenetic; Human
Mesh:
Substances:
Year: 2020 PMID: 32580727 PMCID: PMC7315536 DOI: 10.1186/s12915-020-00807-2
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Selection of age-associated CpGs and targeted analysis with pyrosequencing. a Illumina BeadChip profiles of 973 blood samples of 7 studies (all 450 k) were used to select age-associated CpGs by Pearson’s correlation (blue), Spearman’s rank correlation (red), or Pearson’s correlation after logarithmic transformation of age (green). Sixty-five CpGs passed at least one of these thresholds (correlation coefficient: R > 0.5 or R < − 0.5) and the Venn diagram depicts a very high overlap. b, c A multivariable linear model for these 65 age-related CpGs revealed high correlation with chronological age in the training set (b; n = 973), and in an independent validation set (c; n = 3674; color code corresponds to different studies as indicated in Supplementary Fig. S3A). d, e Epigenetic age prediction based on pyrosequencing of 6 CpGs (d) and 9 CpGs (e) in blood samples of the training set (n = 40; blue) and an independent validation set (n = 40; red)
Fig. 2Age-associated DNA methylation measurements with droplet digital PCR. a Two-dimensional amplitude analysis of duplex ddPCR (blue: positive droplets for methylated CCDC102B; green: positive droplets for non-methylated CCDC102B; orange: double-positive droplets; black: negative droplets). b–h DNAm measurements by ddPCR in the training set (n = 40; blue) and independent validation set (n = 40; red). i Epigenetic age prediction based on ddPCR measurements of 7 CpGs in blood samples of the training and validation set
Correlation of DNA methylation and chronological age with different targeted approaches
| Pyrosequencing | ddPCR | BBA-seq | ||||
|---|---|---|---|---|---|---|
| Training ( | Validation ( | Training ( | Validation ( | Training ( | Validation ( | |
| 0.76 | 0.82 | 0.82 | ||||
| 0.53 | 0.62 | 0.58 | ||||
| 0.56 | 0.72 | 0.60 | ||||
| 0.56 | 0.67 | 0.68 | ||||
| 0.45 | – | 0.87 | ||||
| 0.32 | 0.5 | 0.42 | ||||
| 0.29 | 0.43 | 0.37 | ||||
| 0.03 | – | 0.14 | ||||
| 0.8 | 0.89 | 0.82 | ||||
| Mean | 0.48 | 0.66 | 0.59 | |||
1We have always depicted result for the CpG with the highest Pearson correlation within the amplicon. Thus, the CpGs are not always identical (Additional file 1: Table S5, S7 and S9)
Fig. 3Epigenetic age predictions using BBA-seq. a–c Nine amplicons with age-associated CpGs were analyzed by bisulfite barcoded amplicon sequencing (BBA-seq) in a training set of 38 blood samples (blue) and an independent validation set of 39 samples (red). Age predictions were based on a multivariable linear model of 9 CpGs within 9 amplicons (a), Lasso regression model of 17 CpGs within 8 amplicons (b), or elastic net regression model of 26 CpGs within 8 amplicons (c). d, e In analogy, genomic DNA was isolated from buccal swabs and analyzed by BBA-seq (training set: n = 46, blue; independent validation set: n = 49, red) with a multivariable model of 9 CpGs (d), by Lasso regression model of 27 CpGs within 7 amplicons (e), and by elastic net regression model of 26 CpGs within 7 amplicons (f)
Fig. 4Age-associated DNAm changes peak close to CTCF binding sites. a Pearson’s correlation of age with DNAm levels of CpGs within the amplicons of ELOVL2, PDE4C, and FHL2 are plotted for the blood samples of the training set (n = 38, blue) and validation set (n = 39, red). X-axis represents the position of CpGs within the amplicons. b Enrichment of CTCF binding at the position of these amplicons (gray shaded region) was then analyzed in chromatin immune precipitation (ChIP) sequencing data of hESC (GSM822297), K562 (GSM822311), and A549 cells (GSM822289). Peak heights were automatically trimmed by IGV tool (indicated in brackets). The positions of predicted CTCF binding motives are also presented. c Boxplot of normalized read counts of CTCF ChIP-seq data of A549, hESC, and K562 cell lines either at the 65 age-associated CpGs or at 1000 randomly chosen CpGs from 450 k BeadChip array. Read counts from ChIP-seq data were normalized by quantile normalization and analyzed within a window of 500 base pairs (p value was estimated by Mann-Whitney rank test)
Fig. 5Analysis of age-associated DNAm patterns within individual BBA-seq reads. a Heat map to exemplarily depicts frequencies of DNAm patterns within the 36 neighboring CpGs of the ELOVL2 amplicon in BBA-seq data of a young (21 years old) and old sample (72 years old). b For comparison, heatmaps are presented based on random simulation of DNAm patterns under the assumption that DNAm at neighboring CpGs occurs entirely independent (simulations correspond to 21 and 72 year old donors). c Pearson’s correlation of DNAm levels between neighboring CpG sites within ELOVL2 amplicon (BBA-seq data of training set). d For each BBA-seq read of ELOVL2 training set, we estimated the epigenetic age based on the binary sequel of methylated and non-methylated CpGs. The plot depicts relative read count of every donor in the training set that were classified for predicted ages between 0 and 200 years (relative read count normalized by read count per sample). e–g The mean age-predictions based on individual BBA-seq reads were determined for each sample and then plotted against the chronological age of the samples of the training (blue, n = 38) and validation set (red, n = 39). This analysis was performed independently for the amplicons of ELOVL2 (e), PDE4C (f), and FHL2 (g)