| Literature DB >> 31307500 |
Zijuan Li1,2, Meiyue Wang1,2, Kande Lin3, Yilin Xie1,2, Jingyu Guo1,4, Luhuan Ye1,2, Yili Zhuang1,2, Wan Teng2,5, Xiaojuan Ran1,2, Yiping Tong2,5, Yongbiao Xue2,5, Wenli Zhang6, Yijing Zhang7,8.
Abstract
BACKGROUND: Bread wheat is an allohexaploid species with a 16-Gb genome that has large intergenic regions, which presents a big challenge for pinpointing regulatory elements and further revealing the transcriptional regulatory mechanisms. Chromatin profiling to characterize the combinatorial patterns of chromatin signatures is a powerful means to detect functional elements and clarify regulatory activities in human studies.Entities:
Keywords: Allohexaploid; Bread wheat; Chromatin signature; Enhancer; Epigenomic map; Promoter; Regulatory element
Mesh:
Year: 2019 PMID: 31307500 PMCID: PMC6628505 DOI: 10.1186/s13059-019-1746-8
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Chromatin profiling revealed the epigenetic regulation of genes. a Circos plot summarizing the chromosomal distribution of epigenetic marks. The outermost circle depicts the ideograms of each chromosome. The second outermost circle represents gene density, with red and white indicating high and low density, respectively. Bar plots in the middle circles present the density of epigenetic marks, including seven histone modifications, DNase I-hypersensitive site (DHS), and DNA methylation levels. The three innermost circles represent the densities of three major transposable elements (TEs) in wheat (CACTA, Gypsy, and Copia). b Peak distribution of each mark surrounding various genomic features. TSS, transcription start site; TES, transcription end site. c Five groups of genes marked by different combinations of epigenetic modifications. The normalized intensity of each mark surrounding genes was recorded for k-means clustering. d Violin plot presenting the distribution of gene expression densities for various groups. e Boxplot presenting the tissue specificity, which is represented by the coefficient of expression variance (CV) across various tissues. The transcriptomic data for seven tissues were published previously [13]. f Top enriched protein families for each gene group ranked based on the enrichment P value. Each circle represents one enriched term. The color intensity of the circle represents the fold enrichment. The size of the circle represents the number of genes in each group with the given term. g Enrichment of each gene group for conserved genes (old) or non-conserved genes (young)
Fig. 2Sub-genome-biased promoter binding and regulation of homolog triads by various markers. a Ternary plot presenting the relative binding densities of seven epigenetic marks in the promoters of triad genes. Each circle represents a gene triad. The distance for each triad was determined based on the ratio of the normalized read density for one sub-genome to the read density for all sub-genomes. b Fraction of triads with significantly unbalanced binding across sub-genomes. c Enrichment of the overlap between the biased binding of epigenetic marks and the biased expression of target genes. Dark blue represents a significant overlap. A total of 12,669 expressed triad genes were used
Fig. 3Distinctive and predictive chromatin signatures of functional elements. a Chromatin states determined with a multivariate hidden Markov model. The heatmap presents the emission parameters based on genome-wide combinations of epigenetic marks. Dark blue represents a high frequency of a given mark at regions corresponding to the chromatin state. Each row represents one state, and each column represents one chromatin mark, except the last column, which represents the genomic coverage of the given state. Replicates displayed good consistency. b Bar plot indicating the distribution of genomic positions in each state for various genomic features. c Pie plot presenting the fraction of regions in each chromatin state covering mRNA or lncRNA sequencing reads. d Genomic tracks illustrating three predicted genes based on the signatures of chromatin states 1–4. The first gene was annotated according to the IWGSC RefSeq genome assembly (version 1.0), whereas the other two genes were not annotated, but had a high RNA-seq read density. The data in square brackets represent the range of normalized read densities. The number of reads in each position was normalized against the total number of reads (reads per million mapped reads). e, f For each chromatin state, the fractions of open chromatin regions characterized by DHS read density (e) and conservation score across wheat species (f) were calculated. g Boxplot presenting the distribution of DNA methylation ratios of each chromatin state in three sequence contexts. h Fraction of each chromatin state overlapping with a CpG island. i Number of regions in each chromatin state associated with various types of TEs. j Distribution of TEs in various genomic segments associated with distal (R1 and R3) as well as interstitial and proximal (R2 and C) regions in chromatin states 12 and 13. k Cumulative distances of TEs to the nearest genes in chromatin states 12 and 13
Fig. 4Conservation of the epigenetic architecture of regulatory elements across sub-genomes. a Pair-wise comparison (Jaccard similarity) of shared chromatin states between sub-genomes (i.e., the fraction of sub-genome collinear regions from each chromatin state sharing the same state between sub-genomes). b. Genomic tracks illustrating the conservation of epigenetic features in the predicted enhancer regions exhibiting sequence collinearity across three sub-genomes. The number of reads in each position was normalized against the total number of reads (reads per million mapped reads)
Fig. 5Epigenetic and sequence features of predicted promoters and enhancers. a–d Average profile of H3K4me1 (a), H3K4me3 (b), DHS (c), and H3K9ac (d) read densities surrounding the center of gene-proximal and gene-distal regions in chromatin state 5 (S5) and regions in chromatin states 1 (S1) and 2 (S2) corresponding to gene bodies highly conserved across three sub-genomes. e Cis-elements differentially enriched in the promoter and enhancer-like regions in state 5. The bar plot represents the relative log2 fold-change of the occurrence of corresponding motifs in promoters versus enhancer-like regions
Fig. 6Experimental validation of predicted enhancers. a, b The predicted enhancer loci were cloned into pMY155 reporter constructs (a) to examine their regulatory potential based on a luciferase reporter assay (b). Results for the first six predicted regions are presented, and results for the other 26 predicted regions are provided in Additional file 1: Figure S4. c Relative intensity of 26 predicted enhancer-like elements in states 5–7 and six active genic regions in states 1 and 2 in the reporter assay. Two control regions (ctrl) were randomly chosen from genomic regions without any histone modifications characterized in the present study. The order of the region tested is the same with the experimental results shown in Additional file 1: Figure S4 and the quantitative results listed in Additional file 2: Table S5. d Correlation between the relative intensity determined by the reporter assay and the ChIP-sequencing or DNase-sequencing read density in predicted enhancer-like regions of each epigenetic mark. Detailed data are listed in Additional file 2: Table S5. e Scatter plot presenting the correlation between the DHS read density and relative intensity in the reporter assay for the 32 predicted regions. Detailed data are listed in Additional file 2: Table S5