| Literature DB >> 30837884 |
Emma Cazaly1, Joseph Saad1, Wenyu Wang1, Caroline Heckman1, Miina Ollikainen1,2, Jing Tang1,3,4.
Abstract
Epigenetic research involves examining the mitotically heritable processes that regulate gene expression, independent of changes in the DNA sequence. Recent technical advances such as whole-genome bisulfite sequencing and affordable epigenomic array-based technologies, allow researchers to measure epigenetic profiles of large cohorts at a genome-wide level, generating comprehensive high-dimensional datasets that may contain important information for disease development and treatment opportunities. The epigenomic profile for a certain disease is often a result of the complex interplay between multiple genetic and environmental factors, which poses an enormous challenge to visualize and interpret these data. Furthermore, due to the dynamic nature of the epigenome, it is critical to determine causal relationships from the many correlated associations. In this review we provide an overview of recent data analysis approaches to integrate various omics layers to understand epigenetic mechanisms of complex diseases, such as obesity and cancer. We discuss the following topics: (i) advantages and limitations of major epigenetic profiling techniques, (ii) resources for standardization, annotation and harmonization of epigenetic data, and (iii) statistical methods and machine learning methods for establishing data-driven hypotheses of key regulatory mechanisms. Finally, we discuss the future directions for data integration that shall facilitate the discovery of epigenetic-based biomarkers and therapies.Entities:
Keywords: data integration; data resources; drug discovery; epigenetics; functional annotation; profiling techniques
Year: 2019 PMID: 30837884 PMCID: PMC6390500 DOI: 10.3389/fphar.2019.00126
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.810
FIGURE 1The pillars to understanding the functional impact of epigenetics data. The epigenetic links need to be made with sequence variants in genetics as well as changes in transcriptomics. Understanding the impact of epigenetics on intermediate phenotypes for example metabolomics and proteomics may ultimately help explain the disease etiology and help drug discovery. GWASs, genome-wide association studies; EWASs, epigenome wide association studies; meQTL, methylation quantitative trait loci; eQTL, expression quantitative trait loci; TWAS, transcriptome-wide association study.
Summary of major profiling techniques for DNA methylation.
| Technique | Method | Advantages | Limitations |
|---|---|---|---|
| Whole-Genome Bisulfite Sequencing (WGBS) | Bisulfite converted DNA is amplified and sequenced | Genome-wide, single nucleotide resolution | Costly and computationally intensive |
| Reduced-Representation Bisulfite Sequencing (RRBS) | Methylation-insensitive restriction enzymes digest DNA, enriching for CpG regions | Cheaper than WGBS with relatively high coverage | Enzymatic digestion covers most but not all CpG sites |
| Pyrosequencing | DNA is bisulfite converted, amplified, with the ratio of C/T nucleotides measured | Genome-wide or targeted, single nucleotide resolution. Allele-specific primers | Relatively expensive |
| Methylated DNA Immunoprecipitation (MeDip) | Methylated DNA is enriched by immunoprecipitation followed by sequencing or microarray analysis | Random fragmentation by sonication avoids restriction enzyme bias | Varying CpG density can confound methylation estimates |
| Methylation Sensitive Restriction Enzyme Sequencing (MSRE/MRE-Seq or Methyl-seq) | Unmethylated DNA is restriction enzyme digested while methylated DNA is amplified | No bisulfite conversion bias | DNA may be partially digested, limited coverage |
| Combined Bisulfite Restriction Analysis (COBRA) | Bisulfite converted DNA is amplified and restriction enzyme digested | Simple, fast, inexpensive, works on FFPE-treated DNA | DNA may be partially digested, limited coverage |
| Methylation Specific PCR | Bisulfite converted DNA is amplified with methylation specific primers | Simple and inexpensive | Purely qualitative |
| High Resolution Melt Analysis (HRM) | Bisulfite converted DNA is amplified by q-PCR | Most sensitive method for determining methylation at a specific region | Single base resolution not possible |
| Illumina MethylationEPIC BeadChip Microarray (previously 450k, 27k) | Bisulfite (or oxidized + bisulfite) converted DNA is interrogated on a microarray chip | Relatively simple and inexpensive. Extremely popular | Data has limited coverage and requires pre-processing |
| Global DNA Methylation | Methods include LINE1, Alu, LUMA, HPLC-UV | Relatively inexpensive | Does not identify differentially methylated regions |
| Tet-assisted Bisulfite Sequencing (TAB-seq) | 5hmC is protected then oxidized to 5caC then uracil by TET | Differentiation between 5mC and 5hmC at single base resolution | Sensitivity and specificity depends on sequencing depth |
| Oxidative Bisulfite Sequencing (OxBis) | DNA is oxidized then bisulfite converted to 5fC and subsequently uracil | Quantitative genome-wide coverage | Bias to regions of low 5mC. Must be performed in parallel with bisulfite techniques |
| APOBEC-coupled epigenetic sequencing (ACE-seq) | Non-destructive DNA deaminase enzymes discriminate between 5hmC and 5mC | Genome-wide, single nucleotide resolution. Very low DNA input required | Not yet extensively tested |
| Hydroxymethylated DNA Immunoprecipitation (hMeDIP) | Immunoprecipitation and sequencing of hydroxymethylated DNA | Simple and inexpensive | Only semi quantitative and bias to regions of low 5hmC |
Summary of major profiling techniques for Chromatin Accessibility and Histone modifications.
| Technique | Method | Advantages | Limitations |
|---|---|---|---|
| Chromatin Immunoprecipitation (ChIP) | Couples highly specific antibodies for DNA-binding proteins with sequencing, microarrays or PCR | Detect DNA associated proteins and histone modifications | Requires intact cells and chromatin |
| Digital DNase | Enzymes digest nuclease-accessible regions, indicating open chromatin | Maps both nucleosomes and non-histone proteins | High sequencing depth required. Potential actin contamination. |
| NOMe-seq | Single-molecule, high-resolution nucleosome positioning assay | Maps both DNA methylation and nucleosomes at high resolution | Relies on presence of CpG residues |
| Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) | Measures chromatin accessibility based on Tn5 transposase activity. Maps nucleosomes and non-histone proteins | Simple, fast, low input of cells with single nucleotide resolution | Distance between binding sites may bias results |
| Chromosome Conformation Capture (3C, 4C, 5C, Hi-C) | Assess spatial organization of chromatin in a cell | Various modified versions | Often lack genome-wide, single nucleotide resolution |
Epigenetic data repositories and browsers.
| Consortia and resources | Data availability | URLs |
|---|---|---|
| The International Human Epigenome Consortium (IHEC) | Reference epigenomes generated by NIH Roadmap, ENCODE, CEEHRC, BLUEPRINT, DEEP, AMED/CREST, and KEP | IHEC Data Portal |
| NIH Roadmap Epigenomics | Maps of histone modifications, chromatin accessibility, DNA methylation and mRNA Expression in stem cells and primary | VizHub |
| Canadian Epigenetics, Environment and Health Research Consortium (CEEHRC) Network | Reference epigenomes including histone modifications, DNA methylation, mRNA and miRNA of human cancer and normal cells | CEEHRC Data |
| BLUEPRINT Epigenome | Reference epigenomes of human normal and malignant hematopoietic cells | BLUEPRINT Portal |
| The German epigenome programme (DEEP) | Reference epigenomes of human cells and tissues in normal and complex disease states | DEEP Data Portal |
| IHEC Team Japan (AMED-CREST) | Reference epigenomes of human gastrointestinal epithelial cells, vascular endothelial cells and cells of reproductive organs | IHEC Data Portal |
| Korea Epigenome Project (KEP) | Reference epigenome map for common complex diseases | IHEC Data Portal |
| DeepBlue | Epigenomic data server for storing and working with genomic and epigenomic data. Collection of over 30,000 experiment files from the main epigenome mapping projects available. Uploading own data allowed | DeepBlue server |
| Allelic Epigenome Project | Allelic DNA methylome, histone modifications, and transcriptome in human cells and tissues | Genboree |
| GTEx | Genotype and expression profiles in different tissues enabling eQTL studies | GTEx Portal |
| BRAINEAC | Brain eQTL Almanac provides genotype and expression profile across 10 brain regions | BRAINEAC |
| MQTLdb | Methylation and genotype data on mother-child pairs providing access to meQTL mapping across five different stages of life | mQTL Database |
| Fetal brain meQTLs | Epigenome-wide significant meQTLs observed in fetal brain | Fetal Brain meQTL |
| Pancan-meQTL | Database of | Pancan-meQTL |
| Epigenome Browser | UCSC genome browser with tracks from ENCODE project | UCSC Epigenome Browser |
| WashU Epigenome Browser | Web browser with tracks from ENCODE and Roadmap Epigenomics projects | WashU Epigenome Browser |
| Ensembl | ENCODE data used in the regulatory build | Ensembl ENCODE |
| RMBase | Database listing over 100 RNA modifications |