| Literature DB >> 26290333 |
Naoki Kubo1,2, Hidehiro Toh1, Kenjiro Shirane1, Takayuki Shirakawa3, Hisato Kobayashi4, Tetsuya Sato5, Hidetoshi Sone3, Yasuyuki Sato3, Shin-ichi Tomizawa3, Yoshinori Tsurusaki6, Hiroki Shibata7, Hirotomo Saitsu6, Yutaka Suzuki8, Naomichi Matsumoto6, Mikita Suyama5, Tomohiro Kono4,9, Kazuyuki Ohbo3, Hiroyuki Sasaki10.
Abstract
BACKGROUND: In the male germline, neonatal prospermatogonia give rise to spermatogonia, which include stem cell population (undifferentiated spermatogonia) that supports continuous spermatogenesis in adults. Although the levels of DNA methyltransferases change dynamically in the neonatal and early postnatal male germ cells, detailed genome-wide DNA methylation profiles of these cells during the stem cell formation and differentiation have not been reported.Entities:
Mesh:
Year: 2015 PMID: 26290333 PMCID: PMC4546090 DOI: 10.1186/s12864-015-1833-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Overall CG methylation levels and distribution. a Schematic representation of male germ cell development and the cell types analyzed in this study. The interrupted arrow shows that the first round of spermatogenesis skips the stem cell stage. PGC, primordial germ cell. b Developmental changes in the level of CG methylation in the entire genome (line graph) and in 10 kb windows (box plots). Central bar, median; lower and upper box limits, 25th and 75th percentiles, respectively; whiskers, 1.5 times the interquartile range from the 25th to the 75th percentile. The E16.5 PSG data are from Kobayashi et al. [18]. c Distribution of CG methylation across mouse chromosome 1. The methylation levels determined in 500 kb windows are shown for the respective cell types (colored lines). Large PMDs (≥500 kb) identified in P0.5 PSGs by MethylSeekR are highlighted in yellow. The GC contents in 500 kb windows are also shown. cLADs are lamina-associated domains common to embryonic stem cells, neural precursor cells, astrocytes, and embryonic fibroblasts [39]. The CG methylation valleys marked with asterisks are not PMDs but composites of unmethylated CGIs and hypermethylated segments. d Developmental changes in the CG methylation level in 10 kb windows in PMDs and non-PMDs. Those windows overlapping with any PMDs in P0.5 PSGs were considered as PMDs and the rest were considered as non-PMDs. e Transcript levels of genes located in PMDs and non-PMDs. The transcript levels are shown as log2 (FPKM + 1). The PMD genes are RefSeq coding genes whose promoter regions (from 2000 bp upstream to 500 bp downstream of the transcription start site) overlap with any PMDs. The remaining genes are non-PMD
Fig. 2High levels of CH methylation in neonatal PSGs. a Proportions of methylated cytosines at CG and CH sites, and unmethylated cytosines at all sites. The nonconversion rate was subtracted from the level of methylated cytosine at CG and CH sites, assuming that it occurred randomly at any unmethylated cytosine. b Sequence context of methylated CH in P0.5 PSGs. c A scatter plot showing correlations between the levels of CH and CG methylation in P0.5 PSGs. Each dot represents both levels in a 50 kb window. The 50 kb regions were classified into those containing any CGI (almost all of which are non-PMD regions; gray) and those without CGI, and the latter were further classified into PMD (red) or non-PMD regions (blue). d Discrepancy between the CG and CH methylation levels observed at some retrotransposons. The CH (top) and CG methylation (middle) levels in P0.5 PSGs are compared with the CG methylation levels in E13.5 PGCs (bottom) across two genomic regions. An LTR (IAP) (left panel) and LINE copies (right panel) have high levels of CG methylation but very low levels of CH methylation in PSGs. These retrotransposon copies are more CG methylated than the adjacent regions in PGCs. The methylation levels were determined in 500 bp windows. Regions marked with arrowheads are CGI or unmethylated region (UMR). e The CH methylation levels in P0.5 PSGs (blue bar) and the CG methylation levels in E13.5 PGCs (gray bar) in the whole genome and repeat sequences. The levels of the major and minor satellite repeats were calculated from reads aligned to the consensus sequences, while those of the other sequences were calculated from reads uniquely aligned to the reference mouse genome (mm10). f Negative correlation between the CH methylation levels in P0.5 PSGs and the CG methylation levels in E13.5 PGCs. The methylation levels were calculated in 5 kb windows
Fig. 3Distribution of 5hmC in P0.5 PSGs. a 5hmC levels in the whole genome and satellite repeats. The levels in the major and minor satellite repeats were calculated from sequence reads aligned to the consensus sequences. The levels in the whole genome were calculated from reads uniquely aligned to the reference mouse genome (mm10). b Densities of 5hmC and 5mC (per 100 bp) in the whole genome and major and minor satellite repeats. The proportions of 5hmCs/5mCs at CG and CH sites are also shown. c The 5hmC levels of the whole genome and repeat sequence in P0.5 PSGs (blue bar) and the 5mC levels of those in E13.5 PGCs (gray bar). The levels of the satellite repeats were calculated from reads aligned to the consensus sequences, while those of the other sequences were calculated from reads uniquely aligned to the reference mouse genome (mm10). d Positive correlation between the 5hmC levels in P0.5 PSGs and the 5mC levels in E13.5 PGCs. The levels were calculated in 5 kb windows
Fig. 4Identification and characterization of stage-specific DMRs. a A heat-map representation of the changes in CG methylation at the DMRs identified in the transitions from P0.5 PSGs to P7.5 Kit− SGs and from P7.5 Kit− SGs to Kit+ SGs. The color gradation (from green to red, with an intermediate in yellow) shows the methylation levels (from low to high). The six DMR groups identified by cluster analysis using the changing methylation patterns are shown. The methylation levels in E16.5 PSGs and mature spermatozoa are also shown for comparison. b Genomic locations of the DMRs of each cluster relative to the gene structure (top) or PMD/non-PMD regions (bottom). The DMRs in non-PMD regions are further subdivided based on whether they overlap with CGI shores (2 kb from the edge of a CGI) or exons. The circle size represents the number of DMRs. c Levels of H3K4me1, H3K4me3, H3K27ac, and RNA polymerase II (Pol2) relative to the DMRs in the adult mouse testis [50]. The data are shown for combined cluster-1 and −2 DMRs and combined cluster-5 and −6 DMRs. d GO biological process and mouse phenotype terms enriched for the DMRs of clusters 1 and 2 by GREAT analysis
Fig. 5Examples of the DMRs linked with gene expression. Representative CG methylation profiles are shown for the DMRs located near or within genes essential for spermatogenesis (Stra8 and Plzf) or those showing high expression in P7.5 Kit− SGs (Smoc2 and Ski). Histone modification data (H3K27ac and H3K4me1) from the adult mouse testis [50] are also shown. Gene expression levels are shown on the right of each methylation profile. The DMRs are marked with open squares
Fig. 6Motifs enriched in the DMRs. a Enriched sequence motifs identified by the HOMER de novo motif analysis of the cluster-1 and −2 DMRs. The top seven de novo motifs are shown. The fraction containing at least one instance of each motif is given under the P value, with the expected frequency of the motif in random background regions given in parentheses. The closely matched known motifs are shown on the right (top three, similarity score > 0.6). The motif symbols marked with asterisks are sourced from Homo sapiens. b Expression dynamics of messenger RNAs encoding the transcription factors bound to the known motifs in (a). The expression dynamics of the top-ranked transcription factors are shown