| Literature DB >> 34823554 |
Weipeng Mo1,2,3, Bo Liu1,2,3, Hong Zhang1,2,3, Xianhao Jin1,2,3, Dongdong Lu1,2,3, Yiming Yu1,2,3, Yuelin Liu1,2,3, Jinbu Jia1,2,3, Yanping Long1,2,3, Xian Deng4, Xiaofeng Cao4, Hongwei Guo1,2,3, Jixian Zhai5,6,7.
Abstract
BACKGROUND: The dynamic process of transcription termination produces transient RNA intermediates that are difficult to distinguish from each other via short-read sequencing methods.Entities:
Keywords: Arabidopsis; Nanopore sequencing; Nascent RNA; Transcription termination
Mesh:
Substances:
Year: 2021 PMID: 34823554 PMCID: PMC8613925 DOI: 10.1186/s13059-021-02543-4
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Single-molecule Nascent RNA Sequencing (FLEP-seq) captures various transient RNA intermediates produced during transcription termination. a Schematic of the FLEP-seq library preparation and sequencing. A 3′ adapter is ligated to nascent RNA for reverse transcription and full-length cDNA cloning. b Example of the RNA intermediates from gene DRM2. Upper panel: Nanopore reads are aligned to gene ordered by 3′ end position. The number of individual long reads (n) is indicated. Lower panel: coverage of Illumina short-read nascent RNA data from previous studies [39–42]. The gray dash line indicates the poly(A) site (pA). c Proportion of various RNA intermediates during termination. Readthrough transcripts (blue), 5′ cleavage products (red), 3′ cleavage products (yellow) and poly(A) transcripts (purple). d Sequencing summary for two biological replicates of FLEP-seq libraries in wildtype Col-0. e Advantage of Nanopore (upper) over Illumina [40] (lower) in separating reads from closely adjacent genes. The red dashed box highlights the intergenic region. f Meta-profile showing the 3′ end distribution of non-poly(A) + nascent RNA detected by the FLEP-seq, plaNET-seq [42] reads, and GRO-seq [40] near poly(A) site
Fig. 2The termination landscape in Arabidopsis. a Schematic diagram shows the definition of termination window presented in this study. Reads with 3′ end located more than 50 nt downstream of poly(A) site are used to define termination window. The readthrough distance is calculated as the distance between the poly(A) site and the 3′ end of reads. The median readthrough distance at the corresponding gene is used to represent its termination window size. b Comparison of termination window size quantified by either the median or longest readthrough distance. The gene model is shown at the top and is zoom-in to highlight the region from 200 nt upstream of the poly(A) site to the longest readthrough distance. The boxplots show the readthrough distance distribution for all reads in the region. pA, poly(A) site (also in Fig. 2f, g, and Fig. 4a, d). c, d Scatter plots for the longest (c) and median (d) readthrough distance per gene between replicates (a minimum of 15 readthrough reads for calculating the termination window size). The Pearson’s correlation coefficient and the number of genes (n) are shown. e The distribution of termination window size among genes. f, g Examples of genes with short and long termination windows, respectively
Fig. 4Downstream tRNA promotes efficient termination of upstream protein-coding genes. a, d Examples of Pol II genes that have tRNA gene immediately downstream, arranged in tandem (a) or convergent (d). The number of reads (n) is shown. The poly(A) transcript reads are shown in purple, and their poly(A) tail is shown in red. The readthrough transcript reads are shown in blue. The distance between the termination window end and tRNA is marked by bracket. b, e The distribution for the distance between each pair of the termination window end (TWE) and its downstream tRNA gene. Tandem orientation (b). Convergent orientation (e). c, f Read coverage of genes that have tRNA or non-tRNA genes immediately downstream. In the case of tandem orientation, the start of the downstream gene is used as a reference (c). In the case of convergent orientation, the end of the downstream gene is used as a reference (f). g Model for tRNA-dependent termination in Arabidopsis
Fig. 3The end of termination window precisely matches Pol II occupancy and co-localizes with termination factors. a, c Meta-profile of published ChIP-seq signal for Pol II, Ser2P, Ser5P [53] (a) and BDR1, FPA [54] (b) centered around either the poly(A) site (pA) or the termination window end (TWE). b Meta-profile of read coverage for FLEP-seq and pNET-seq [41] centered around the poly(A) site or the termination window end. d Meta-profiles of Pol II, Pol II Ser2P, BDR1, and FPA ChIP-seq signal in four groups of genes ordered by different termination window sizes. e Venn diagram shows the overlap between BDR1, FPA peaks, and termination windows
Fig. 53′ cleavage products accumulate in the atxrn3 mutant. a 5′ end distribution for the non-poly(A) reads in wildtype atxrn3, fpa, and met1 mutant near poly(A) site. b Examples of genes with accumulated 3′ cleavage products in atxrn3 mutant. Left panel, wildtype; right panel, atxrn3 mutant. c Comparison of 3′ end distribution for the non-poly(A) reads in wildtype and atxrn3, fpa, met1 mutant near poly(A) site (the maximum limit of the y-axes is set at 0.003 to highlight differences at low-signal region). The zoomed-in view is shown in red dashed box. d Comparison of termination window size per gene between wildtype and mutants. The p value was calculated using a Mann–Whitney U test. NS, not significant (p > 0.001)
Fig. 6Cleaved readthrough transcripts can transcribe into downstream genes and produce spliced and polyadenylated mRNAs in atxrn3. a Examples of genes with cleaved readthrough reads extended to downstream genes. Upper panel, wildtype; lower panel, atxrn3 mutant. The black dashed line indicates the poly(A) site. The red dashed box showed the magnified view of cleaved and polyadenylated reads at the subsequent poly(A) site. b Zoomed-in view of poly(A) reads aligned to gene AT1G73530 in two biological replicates of the wildtype and atxrn3 FLEP-seq libraries. c Read coverage of RNA-seq data [14] around the AT1G73540-AT1G73510 region in wildtype and atxrn3 mutant (log2 scale). d Model of AtXRN3-mediated transcription termination