| Literature DB >> 31767039 |
Christopher G Bell1, Robert Lowe2, Peter D Adams3,4, Andrea A Baccarelli5, Stephan Beck6, Jordana T Bell7, Brock C Christensen8,9,10, Vadim N Gladyshev11, Bastiaan T Heijmans12, Steve Horvath13,14, Trey Ideker15, Jean-Pierre J Issa16, Karl T Kelsey17,18, Riccardo E Marioni19,20, Wolf Reik21,22, Caroline L Relton23, Leonard C Schalkwyk24, Andrew E Teschendorff25,26, Wolfgang Wagner27, Kang Zhang28, Vardhman K Rakyan29.
Abstract
Epigenetic clocks comprise a set of CpG sites whose DNA methylation levels measure subject age. These clocks are acknowledged as a highly accurate molecular correlate of chronological age in humans and other vertebrates. Also, extensive research is aimed at their potential to quantify biological aging rates and test longevity or rejuvenating interventions. Here, we discuss key challenges to understand clock mechanisms and biomarker utility. This requires dissecting the drivers and regulators of age-related changes in single-cell, tissue- and disease-specific models, as well as exploring other epigenomic marks, longitudinal and diverse population studies, and non-human models. We also highlight important ethical issues in forensic age determination and predicting the trajectory of biological aging in an individual.Entities:
Mesh:
Year: 2019 PMID: 31767039 PMCID: PMC6876109 DOI: 10.1186/s13059-019-1824-y
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1a Chronological age estimation error. With increasing training sample size, improved measurement of chronological age is expected, even using current array data (adapted from Zhang et al. [46]). y-axis: root mean square error (RMSE) of the predicted age. b DNA methylation clocks contain both chronological and biological information. The relative proportions of each will depend on the CpG probes employed in the construction of the clock. Therefore, there are multiple clocks that can be deconvoluted from aging-related epigenetic changes. Moving forward, more precise chronological (forensic age clock) and biological clocks, specific for particular diseases, informative of health or disease state need to be defined and separated. c Epigenetic age trajectory. Epigenetic age is not linear over the life course. Chronological age in years (x-axis) and epigenetic age in years (y-axis)
Major biological and analytic issues with epigenetic DNA methylation clocks
| Significant issue | Current problem | Potential solutions/advances |
|---|---|---|
| Biological age measures. No single measure or “gold standard” of biological age is likely to be possible | Simultaneously measuring multiple contributing biological processes that are changing with age | Focused analysis on specific components of aging biology and/or specific age-related diseases |
| Prediction versus mechanistic insight | Predictors are by design not optimal for mechanistic insight but are nevertheless used | Separate prediction (using sparse CpG sets) from mechanistic studies (based on whole DNA methylome/integrated epigenome) |
| Age-associated changes in non-dividing cells | Uncertainty over mechanism and current ability to dissect apart intrinsic (intracellular) from extrinsic (whole tissue) changes | Analyze aging in single cells. Also, determine whether construction of single-cell clocks is possible |
| Confusion between epigenetic correlations of aging and the aging process itself | Aging is a complex multi-systemic process. Specific evidence is required that the epigenetic changes observed in DNA methylation clocks are driving the aging process itself | To reduce confusion for those outside the epigenomics field, publications need to be clear that epigenetic observations usually only represent a biomarker of aging |
| Bias of DNA methylation clock training sets | Uncertainty of the contribution of deviation between predicted and actual age to biological aging or prediction error. Clocks trained on small samples are prone to confounding by cell composition | Larger studies, as well as increasingly focused tissue/disease-relevant clocks and cell type-specific information |
| Contribution to DNA methylation clock signals of cell type proportions and rare cells and/or clonality | Uncertainty whether cell type deconvolution increases or decreases biomarker power for different diseases | Refined single-cell analysis to separate tissue cell proportional changes from intrinsic cell changes for specific diseases. Also, purified cell type analyses and further refinement of cell type deconvolution methodology |
| Pan-tissue aging changes | Separation of true pan-tissue changes from cell proportion changes | Single-cell analysis to identify cell proportion changes from individual cell changes. Also, purified cell type and statistical cell type deconvolution analyses |
| Aging-related increased variability in DNA methylation versus directional changes | Difficult to deconvolute intrinsic from extrinsic changes in heterogeneous cell type-derived DNA, as well as to delineate technical from biological variation | Single-cell analysis to differentiate cell proportion from individual cell changes. Use of multiple technical and statistical methodologies to dissect origin of variability, including deep-targeted BS-seq and third-generation sequencing |
| Construction of a clock at the single-cell level | Currently technically challenging, particularly due to missing data in each individual cell | Imputation may be helpful, but ultimately improvements in single-cell analysis will be required |
| Identification of disease-related changes | Uncertainty whether capturing the most disease relevant changes | Improved methodology: latest array increased enhancer CpGs focus—improved high-throughput power to identify tissue-specific and disease-specific loci. Also, increased deep BS sequencing |
| Disease mechanism is unknown | Role of aging-related epigenomic deterioration contributing to age-related disease pathology is not appreciated | Discovery of disease-related mechanisms through disease-relevant cell and tissue-type epigenomic analysis of aging-related changes |
| Regulatory role of DNA methylation is more complex than classical models | Complex interplay between transcription factors and epigenomic factors impacts on outcome within different functional loci (promoters, enhancers, insulators, transcribed regions, etc.) | Detailed experimental evidence within specific genetic loci and in disease-relevant cell types, including appropriate disease stressors, to infer potential repressive and/or activating roles |
| Differentiation between DNA methylation loss due to reduction in active processes required for maintenance, or active enzymatic-driven loss | Firm evidence required in appropriate cell types of decay without cell division. May be more prevalent at dynamic enhancer regions. Neuronal cells have high post-mitotic expression of DNMTs and TETs plus high 5-hydroxymethylcytosine (5hmC) | Detailed models studying DNMT and TET expression in disease-appropriate cell types. Assaying the specific products of TET activity, such as 5hmC |
| Functionality of DNA methylation changes is often assumed | Crossing statistical significance thresholds does not infer function. Statistical differences between quantitative and categorial measures | Acknowledged functional evidence deficiency in results and that further integrated disease-relevant tissue genomic/epigenomic/transcriptomic analyses in appropriate models are required |
| Low reproducibility of DNA methylation clocks in model organisms reduces the utility of published clocks | Technical issue because of the reliance on low-depth sequencing due to the lack of available commercial DNA methylation arrays in non-human species | Higher-depth base-resolution sequencing studies are required to improve portability of DNA methylation clocks between experiments. Also, new methodology robust to stochastic missing data |
| Aging DNA methylation sites are only partially conserved among different mammalian species | Reduced insights to be made from comparative studies | Integrative whole epigenome analyses to identify common mechanistic processes |
| Role of DNA methylation and rare modifications, such as 5-hydroxymethylation (5hmC) in specific functional loci, such as enhancers | Large-scale base-resolution analyses currently performed using bisulfite conversion. This does not differentiate between 5hmC and 5mC | Oxidative bisulfite sequencing and new methodologies, such as a non-destructive DNA deaminase, and third-generation direct modification analysis |
| Interconnected role of DNA modifications and chromatin modifications | Unknown directionality and causative effects of cross talk between these different epigenomic modifications | In vitro, organoid, and model organism evaluation of epigenetic machinery with age. Integration of DNA modifications, histone post-translational marks, and transcriptional data into a single integrated aging model |
| Population variation in DNA methylation clock measures | Genetic variation may be influencing clock measures directly, or impacting on relevant causative factors, such as inflammation and immunological aging | Integrating genetic effectors into clock and age-related measures, including haplotypic information. This will also lead to insights into causal or mechanistic pathways |
| Many different DNA methylation clock models | Many available clocks and ad hoc application and interpretation of results can result in suboptimal robustness of findings | Systematic evaluation of methods with a priori assumptions about the meaning of associations of various measures |
| Forensic use of DNA methylation clocks to determine legal age | Robustness of DNA methylation clocks across populations, tissues, and environments is unknown. Furthermore, the impact of acute and chronic inflammatory processes needs assessment | Assess variability by the analysis of large, diverse, and well-powered datasets in the range of tissues likely to be employed (whole blood, buccal cells, etc.) |
| DNA methylation clocks as a de facto measure of an individual’s “health” | Associations with biological aging are cross-sectional and epidemiological. Accuracy within an individual and in other populations the clock is not derived from is unknown | Longitudinal studies required to assess clock changes within an individual over time. Requires appropriately powered studies across diverse populations. Re-commercialization—public must be protected by provision of accurate data regarding estimate/error rates |
Fig. 2All clock probes are strongly biased to reside within active functional loci. This is due to their construction from promoter-focused arrays. Overlap of CpGs from four DNA methylation clocks with the six Core Encode Combined Chromatin Segmentation tracks [130] from ENCODE Analysis Data at UCSC. a Horvath clock [24]. b Hannum et al. clock [23]. c PhenoAge clock [43]. d epiTOC clock [31]. Location is assessed for overlap with the seven functional categories: PF (promoter flanking—light red), TSS (transcription start site and promoter region—red), CTCF (blue), WE (weak enhancer—yellow), E (enhancer—gold), T (transcribed region—green), and R (repressed—grey), from any of the six Core Encode cell types (Gm12878, H1hesc, Helas3, Hepg2, Huvec, K562). This percentage overlap is shown on the y-axis and is compared with the percentage overlap for all ~28 × 106 CpGs in the human genome on the x-axis. Calculated via bedtools [131]. The size of the circle is proportional to the entire genome space for each functional category (~10(genome size proportion)). e-h Direct overlap comparison for four DNA methylation clocks (Horvath clock, Hannum et al. clock, PhenoAge clock, epiTOC clock) as well as Illumina array CpGs (27k, 450k, EPIC) and all genomic CpGs (far right bar) with: e the Combined Segmentation track for blood-derived tissue (GM12878) [130]. Functional segments are delineated as PF (promoter flanking), TSS (transcription start site and promoter region), CTCF, WE (weak enhancer), E (enhancer), T (transcribed region), and R (repressed). NC, not covered CpGs in this Combined Segmentation overlap; f Gencode [132] Exon and Transcripts; g UCSC [133]-defined CpG islands and shore regions (+/−2 kb); h Major repeat classes (UCSC RepeatMasker [133]), including DNA repeat elements (DNA_repeats), long interspersed nuclear elements (LINE), low complexity repeats and other rare repeat classes, long terminal repeat elements (LTR), simple repeats (microsatellites aka short tandem repeats), and short interspersed nuclear elements (SINE), of which ~63% are Alu elements
Fig. 3Single-cell analysis. Distinct cell variation in aging epigenetic clock changes may exist that would not be apparent in bulk comparison. Black and white squares represent methylated and unmethylated loci, respectively. Each row represents a single cell’s epigenome (represented as haploid for simplicity) with increased variability present in individual 2
Summary of recommendations arising from the challenges of studying DNA methylation clocks in the context of aging
| Challenges and recommendations | |
| 1. Delineation of the chronological and biological components of DNA methylation clocks | |
| • Quantify the accuracy and robustness of “forensic” age estimates from different DNA sources | |
| • Isolate pan-tissue “biological” aging changes for novel insights into aging | |
| 2. Functional characterization of tissue-specific and disease-specific clocks | |
| • Refine tissue- and disease-specific clocks for disease-specific measures | |
| • Deeper understanding of the pathogenesis of specific age-related diseases | |
| • All published clock algorithms should be transparent and publicly available | |
| 3. Integration of epigenetics into large and diverse longitudinal population studies | |
| • For predictive biomarkers of clinical utility | |
| • Understand the cause and consequences of clock measures and any rate change on aging-related disease and longevity | |
| 4. Genome-wide analyses of aging and exploration of additional epigenomic marks | |
| • Identity novel and potentially more sensitive chronological or disease-specific clock-like mechanisms | |
| 5. Single-cell analysis of aging changes and disease | |
| • Explore functionality of clock-like and other aging-related epigenetic changes | |
| • Define the components of tissue-specific changes | |
| 6. Generation of robust non-human data of aging | |
| • Explore fundamental biology of aging using DNA methylation clocks in model organisms | |
| • Expand and standardize the application of DNA methylation clocks to test longevity interventions in mice | |
| 7. Inclusion of epigenetics within current genetic ethical and legal frameworks | |
| • To educate and protect the public from misinformation and misuse |