| Literature DB >> 23804400 |
Bo Zhang1, Yan Zhou, Nan Lin, Rebecca F Lowdon, Chibo Hong, Raman P Nagarajan, Jeffrey B Cheng, Daofeng Li, Michael Stevens, Hyung Joo Lee, Xiaoyun Xing, Jia Zhou, Vasavi Sundaram, Ginell Elliott, Junchen Gu, Taoping Shi, Philippe Gascard, Mahvash Sigaroudinia, Thea D Tlsty, Theresa Kadlecek, Arthur Weiss, Henriette O'Geen, Peggy J Farnham, Cécile L Maire, Keith L Ligon, Pamela A F Madden, Angela Tam, Richard Moore, Martin Hirst, Marco A Marra, Baoxue Zhang, Joseph F Costello, Ting Wang.
Abstract
DNA methylation plays key roles in diverse biological processes such as X chromosome inactivation, transposable element repression, genomic imprinting, and tissue-specific gene expression. Sequencing-based DNA methylation profiling provides an unprecedented opportunity to map and compare complete DNA methylomes. This includes one of the most widely applied technologies for measuring DNA methylation: methylated DNA immunoprecipitation followed by sequencing (MeDIP-seq), coupled with a complementary method, methylation-sensitive restriction enzyme sequencing (MRE-seq). A computational approach that integrates data from these two different but complementary assays and predicts methylation differences between samples has been unavailable. Here, we present a novel integrative statistical framework M&M (for integration of MeDIP-seq and MRE-seq) that dynamically scales, normalizes, and combines MeDIP-seq and MRE-seq data to detect differentially methylated regions. Using sample-matched whole-genome bisulfite sequencing (WGBS) as a gold standard, we demonstrate superior accuracy and reproducibility of M&M compared to existing analytical methods for MeDIP-seq data alone. M&M leverages the complementary nature of MeDIP-seq and MRE-seq data to allow rapid comparative analysis between whole methylomes at a fraction of the cost of WGBS. Comprehensive analysis of nineteen human DNA methylomes with M&M reveals distinct DNA methylation patterns among different tissue types, cell types, and individuals, potentially underscoring divergent epigenetic regulation at different scales of phenotypic diversity. We find that differential DNA methylation at enhancer elements, with concurrent changes in histone modifications and transcription factor binding, is common at the cell, tissue, and individual levels, whereas promoter methylation is more prominent in reinforcing fundamental tissue identities.Entities:
Mesh:
Year: 2013 PMID: 23804400 PMCID: PMC3759728 DOI: 10.1101/gr.156539.113
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Benchmarking the performance of M&M. (A) The distribution of P-values generated by M&M when comparing two H1 ESC biological replicates (blue area) and when comparing H1 ESC and fetal NSC (red area). At a P-value cutoff of less than 1 × 10−10 (green line), M&M predicted 70 DMRs between the two H1 samples, and 16,398 DMRs between H1 ESC and fetal NSC. (B) The distribution of P-values generated by MEDIPS for the same comparisons as in A. At a P-value cutoff of less than 1 × 10−10 (green line), MEDIPS predicted 2066 DMRs between the two H1 ESC replicates, and 11,162 DMRs between H1 ESC and fetal NSC. (C) Whole-genome bisulfite sequencing (WGBS) data were used to validate DMRs predicted by M&M between H1 ESC and fetal NSC. DMRs predicted by M&M were ranked according to their P-values, then average DNA methylation levels for each of the top 1000 significantly hypermethylated DMRs (red) and the top 1000 significantly hypomethylated DMRs (blue) in fetal NSC were computed using WGBS data from the same two samples (H1 ESC and fetal NSC). Distribution of the DNA methylation level differences was plotted for hypermethylated DMRs and hypomethylated DMRs separately. The gray area represents the distribution of DNA methylation differences in the whole-genome background, calculated at 500-bp-window resolution using the same WGBS data sets. (D) Same as C, except that DMRs were predicted by MEDIPS. (E) DNA methylation differences between H1 ESC and fetal NSC were calculated using WGBS data for individual CpGs within the top 500, 1000, 2000, 5000, and 10,000 hypermethylated and hypomethylated DMRs (predicted by M&M, at varying cutoffs). These values were plotted as a boxplot. (F) Same as E, except that DMRs were predicted by MEDIPS. (G) Concordance between M&M (red) or MEDIPS (blue) predicted DMRs and differential methylation for these regions calculated from WGBS data. DMRs predicted by M&M and MEDIPS were ranked based on their P-values. At different cutoffs, DMRs were determined to be concordant with WGBS data (if differences in WGBS data were greater than 0.1 and were in the correct direction). (H) Reproducibility of DMR predictions in M&M (red) and MEDIPS (blue). DMR discovery was performed between two cell types from the same individual and repeated in a second individual. DMRs identified in each individual were ranked according to their P-values and intersected between the two individuals. The percentages of overlapping DMRs at different cutoffs were plotted.
Figure 2.M&M analyses of DNA methylation differences across multiple tissue types, cell types, and individuals. (A) P-value distributions of M&M predictions between tissue types (green lines), cell types (blue lines), and individuals (red lines). (B) Biclustering analysis of tissue-specific DMRs. (Left panel) Based on RPKM values of MeDIP-seq; (right panel) based on RPKM values of MRE-seq.
Tissue-specific DMRs
Figure 3.Genomic distribution and functional enrichment of tissue-specific DMR. (A) Genomic distribution of tissue-specific DMRs. (B) Functional enrichment of H1 ESC-specific hypermethylated DMRs by GREAT analysis. (C) Functional enrichment of tissue-specific hypomethylated DMRs by GREAT analysis.
Figure 4.Tissue-specific DMRs are enriched for regulatory histone modifications. (A) H3K4me1 and H3K4me3 profiles at tissue-specific DMRs in H1 ESCs, CD4 memory T cells, breast myoepithelial cells, and fetal brain tissue. (B) ChromHMM regulatory function annotation of tissue-specific DMRs. (C) Expression of genes near tissue-specific DMRs in samples representing different tissues.
Figure 5.Identification of tissue-specific DMRs spanning large chromosomal domains. (A) A breast-specific hypomethylated region containing multiple noncoding RNA genes. (Green box) ∼75-kb region hypomethylated in all breast cell types (luminal [Lum], myoepithelial [Myo], and stem cell-enriched [BSC]). (Red box) Hypermethylation events within the same region in the HCC1954 breast tumor cell line. (B) A large H1 ESC-specific hypermethylated chromosomal domain spanning the PCDHG gene cluster. (Orange box) H1 ESC-specific hypermethylated DMRs in the vicinity of the promoters of several PCDHG gene family members.
Figure 6.Cell type-specific DMRs between CD4 naive cells and CD4 memory cells. (A) Genomic distribution of CD4 memory cell hypomethylated DMRs (green) and CD4 naive cell hypomethylated DMRs (red). (B) Histone modification profiles (H3K4me1 and H3K4me3) of DMRs between CD4 memory cells and CD4 naive cells. (C) Functional enrichment in CD4 memory cell (green) and CD4 naive cell hypomethylated DMRs (red). (D) ChromHMM regulatory function annotation of CD4 memory cell DMRs and CD4 naive cell DMRs. (E) TFBS enrichment of CD4 memory cell DMRs (green) and CD4 naive cell DMRs (red).
Cell type-specific DMRs
Individual-specific DMRs
Figure 7.Individual-specific DMRs. (A) Genomic distribution of individual-specific DMRs identified in blood (blue), breast (red), and fetal brain (green). (B) ChromHMM regulatory function annotation of individual-specific DMRs. (C) Histone modification profiles (H3K4me1 and H3K4me3) of individual-specific DMRs identified in fetal brain. (D) Human Epigenome Browser (Zhou et al. 2011) view of 30 juxtaposed individual DMRs identified in fetal brain with DNA methylation, H3K4me3, and H3K4me1 profiles. (E) Functional enrichment of individual-specific DMRs identified in fetal brain. (F) TFBS enrichment of individual-specific DMRs in fetal brain.
Figure 8.Sequence conservation of DMRs. Vertebrate phastCon scores were obtained at 100-bp resolution for each DMR and their respective upstream/downstream 5-kb regions. Averaged scores in each 100-bp window were plotted. (A) Conservation of tissue-specific DMRs. (B) Conservation of cell type-specific DMRs. (C) Conservation of individual-specific DMRs.