Literature DB >> 26001965

Global analysis of RNA cleavage by 5'-hydroxyl RNA sequencing.

Sally E Peach1, Kerri York2, Jay R Hesselberth3.   

Abstract

RNA cleavage by some endoribonucleases and self-cleaving ribozymes produces RNA fragments with 5'-hydroxyl (5'-OH) and 2',3'-cyclic phosphate termini. To identify 5'-OH RNA fragments produced by these cleavage events, we exploited the unique ligation mechanism of Escherichia coli RtcB RNA ligase to attach an oligonucleotide linker to RNAs with 5'-OH termini, followed by steps for library construction and analysis by massively parallel DNA sequencing. We applied the method to RNA from budding yeast and captured known 5'-OH fragments produced by tRNA Splicing Endonuclease (SEN) during processing of intron-containing pre-tRNAs and by Ire1 cleavage of HAC1 mRNA following induction of the unfolded protein response (UPR). We identified numerous novel 5'-OH fragments derived from mRNAs: some 5'-OH mRNA fragments were derived from single, localized cleavages, while others were likely produced by multiple, distributed cleavages. Many 5'-OH fragments derived from mRNAs were produced upstream of codons for highly electrostatic peptides, suggesting that the fragments may be generated by co-translational mRNA decay. Several 5'-OH RNA fragments accumulated during the induction of the UPR, some of which share a common sequence motif that may direct cleavage of these mRNAs. This method enables specific capture of 5'-OH termini and complements existing methods for identifying RNAs with 2',3'-cyclic phosphate termini.
© The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26001965      PMCID: PMC4787814          DOI: 10.1093/nar/gkv536

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Many endoribonucleases including the tRNA splicing endonuclease (1), Ire1 (2) and the ribonucleases T2 (3) and L (4) create cleavage products with 5′-hydroxyl (5′-OH) and 2′,3′-cyclic phosphate termini, as do self-cleaving ribozymes like the hammerhead, HDV, VS, hairpin and twister ribozymes (5,6). In addition, accelerated RNA cleavage by intramolecular phosphoester transfer at structurally distorted phosphodiester bonds generates 5′-OH and 2′,3′-cyclic phosphate products (7). However, products of RNA cleavage with 5′-OH and 2′,3′-cyclic phosphate termini are not typically identified in global surveys of RNA populations, and thus their frequency and role in RNA metabolism remain unknown. Methods for studying RNAs with specific termini can shed light on RNA transcription initiation, processing, and decay pathways. Several methods have been developed for the global study of mRNA 5′-termini, but most of these do not capture 5′-OH termini, or do so indirectly via a 5′-PO4 intermediate. In a recent example, 5′-OH termini were indirectly inferred from cloning of 5′-PO4 termini created upon phosphorylation of 5′-OH termini (8), but this method requires several enzymatic manipulations prior to the ligation step. Similarly, the Parallel Analysis of RNA Ends (PARE) method (9) begins with ligation of an RNA adapter to the 5′-PO4 terminus of an RNA substrate with T4 RNA ligase, and thus this method does not capture 5′-OH termini. Other methods compare signals among different enzymatic manipulations of 5′ mRNA cap structures to enable specific detection of mRNA 5′-termini (10), but these methods are unable to capture 5′-OH termini. We previously developed a method to identify RNA fragments with 2′,3′-cyclic phosphate and 2′-PO4/3′-OH termini using the Arabidopsis thaliana tRNA ligase (11). Recent studies used this method to discover virus and host targets of ribonuclease L cleavage (4,12), and to identify the Gas2 mRNA as a novel target of the Ire1 endonuclease in the Schizosaccharomyces pombe unfolded protein response (UPR) (13). Fungal and plant tRNA ligases join RNAs with 5′-OH and 2′,3′-cyclic phosphate termini via 5′-PO4 and 5′-adenylate intermediates (14,15), and thus the method employing A. thaliana tRNA ligase does not distinguish between 5′-OH and 5′-PO4 RNA termini. In addition, libraries produced by this method typically contain high levels of ribosomal RNA fragments with cyclic-phosphate termini, which may mask signals from lower abundance fragments from mRNA. Therefore, we were motivated to develop a new method to capture the other products of RNA cleavage—5′-OH RNA fragments—that also allows mRNAs to be enriched via oligo-dT selection of their polyadenylate tails. Escherichia coli RtcB RNA ligase catalyzes a unique chemistry involving the attack of a 3′-guanylylate intermediate by a 5′-OH nucleophile, yielding a 3′-5′ phosphodiester bond and GMP (16). As noted previously (17,18), RtcB RNA ligase therefore represents a unique reagent for direct capture of 5′-OH RNAs. Here, we developed a method using the E. coli RtcB RNA ligase to globally identify 5′-OH RNA fragments in complex RNA populations and applied the method to study 5′-OH RNA fragments in budding yeast.

MATERIALS AND METHODS

Expression and purification of E. coli RtcB RNA ligase

The E. coli RtcB gene was PCR amplified, recombined into pDONR221 (Gateway BP, Invitrogen) and Sanger sequenced to confirm lack of mutations in the open reading frame. The RtcB ORF was recombined into pET-53-DEST (creating an N-terminal 6xHis tag; Gateway LR, Invitrogen) and transformed into BL21 (DE3) RIPL cells (Stratagene). The pET-53-RtcB expression construct is available from Addgene (Plasmid 51282). For RtcB protein expression, cells were grown from a single colony in 1-l terrific broth with selective antibiotics (50 μg/ml ampicillin and 34 μg/ml chloramphenicol) at 37°C to an OD600 of 0.6 and then chilled on ice for 30 min. Ethanol was added to 2%, expression was induced with 0.1 mM isopropylthiogalactoside (IPTG). and cultures were grown 16 h at 17°C. Cells were harvested by centrifugation and lysed for 1 h at 4°C in Buffer A (50 mM Tris pH 7.4, 250 mM NaCl, 10% sucrose) with 0.2 mg/ml lysozyme. Triton X-100 was added to a final concentration of 0.1%, and the cells were sonicated briefly to reduce viscosity. Following clarification by centrifugation at 10 000 × g, the supernatant was incubated for 1 h at 4°C with 3 ml of pre-washed Nickel-NTA beads (Qiagen). The beads were washed with 25 ml of Buffer A, then loaded onto a column and washed twice with 25 ml Buffer E (50 mM Tris–HCl (pH 7.4), 10% glycerol, 250 mM NaCl) containing 15 mM imidazole. The column was eluted with 3 ml each of Buffer E with 50, 150 and 300 mM imidazole. Peak fractions were identified by SDS-PAGE and were pooled and diluted to 50 mM NaCl with Buffer D (50 mM Tris–HCl (pH 7.4), 10% glycerol, 5 mM dithiothreitol (DTT), 1 mM ethylenediaminetetraacetic acid (EDTA)). RtcB protein was further purified on a 1 ml HiTrap Heparin HP column (GE Healthcare Life Sciences), washed with Buffer D containing 50 mM NaCl, and eluted with 3 ml each of Buffer D containing 150, 300 and 500 mM NaCl. RtcB protein eluted predominantly in the 150–300 mM NaCl fractions, with a typical yield of 2 mg/l culture. Peak fractions were pooled, quantitated by Bradford assay, snap frozen in liquid nitrogen and stored at −80°C.

RtcB ligation assay

Oligonucleotides used in this study are listed in Table 1. Ligation assays were used to confirm recombinant RtcB function. A linker oligonucleotide with a 3′ phosphate was incubated at equimolar concentration (200 pM) with another oligonucleotide with a 5′-OH terminus in the presence of 1.25 μM RtcB in buffer (1 mM GTP, 50 mM MOPS pH 7.4, 2 mM MnCl2) for 1 h at 37°C. The reaction was quenched with stop dye (95% formamide, 5 mM EDTA pH 8.0, 0.01% xylene cyanol and bromophenol blue), denatured at 70°C for 3 min, and electrophoresed on 10% polyacrylamide gels containing 7 M urea and 1X 89 mM Tris, 89 mM boric acid and 2 mM EDTA (TBE) and stained with SYBR Gold (Invitrogen) prior to visualization.
Table 1.

Oligonucleotides

NameSequence
RNA ligation assay
M13-linker5′ Amino – rGrUrUrUrUrCrCrCrArGrUrCrArCrGrArC – 3′ phosphate
RPS31-5OH5′ OH – rArArGrArArGrArGrArArArGrArArGrArArGrGrUrCrUrArCrArCrCrArCrC 3′ Amino
5OH library
5OH-Linker5′ desthiobiotin – ACACTCTTTCCCTACACGACGCTCTTCCGATCTrNrNrNrNrNrNrNrN 3′ phosphate
5OH-RT-amino5′ Amino – GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTNNNNNNNNN – 3′
Validation primers
M13-F5′ GTTTTCCCAGTCACGAC
RPS31-F5′ CCCTGGAAGTTGAATCTTCTG
RPS31-R5′ GCGTTAACCTTGTAGACGGAAT

Yeast strains and culturing

Yeast strains used in this study are listed in Table 2. Single colonies were inoculated in YEPD and incubated at 30°C overnight with rotation. Cultures were diluted to an OD600 of 0.1 in YEPD. In untreated conditions, yeast cells were harvested in mid-log phase. To induce the UPR, yeast grown to mid-log phase were treated for 2 h with tunicamycin (final concentration of 2.5 μg/ml, Sigma-Aldrich). Cells were harvested by centrifugation and total RNA was isolated by hot acid phenol extraction and treated with TURBO DNase (Ambion) to remove genomic DNA.
Table 2.

Yeast strains

StrainGenotype
XRN1 (BY4742)MATα his3Δ1leu2Δ0lys2Δ0ura3Δ0
xrn1ΔMATα his3Δ1leu2Δ0lys2Δ0 ura3Δ0 xrn1Δ::hygMX

5′-Hydroxyl RNA library construction and sequencing

Total RNA from budding yeast (15 μg) was enriched for a polyadenylated mRNA fraction using Oligo-dT magnetic beads (Ambion). PolyA-enriched RNA was incubated with 1.1 μM RtcB in buffer (1 mM GTP, 50 mM MOPS pH 7.4, 2 mM MnCl2), 20 units RNase Inhibitor (Enzymatics) and 10 μM linker (‘5OH-Linker’ in Table 1) at 37°C for 1 h. Following phenol/chloroform extraction and ethanol precipitation, RNA was fragmented with Ambion Fragmentation Reagent for 15 min at 70°C, and the reaction was quenched with the addition of EDTA. Samples were denatured in stop dye (95% formamide, 0.01% xylene cyanol / bromophenol blue), heated to 65°C for 5 min and separated on a 10% acrylamide TBEurea gel. Following staining with SYBR Gold (Invitrogen), the gel was visualized by blue light transillumination and a gel slice containing RNA above 80 nt was excised and pulverized. RNA was eluted from the gel by 2 h incubation at 40°C in 0.3 M sodium acetate, pH 5.2, 1 mM EDTA, pH 8.0 and precipitated with 2.5 volumes of ethanol. cDNA was prepared using an Illumina-compatible primer with a 6-basepair degenerate region (oligonucleotide 5OH-RT-amino) and Protoscript II RT (NEB), and was purified using Agencourt RNAClean XP Beads (Beckman Coulter). Second strand synthesis was performed with RNase H, E. coli DNA Polymerase, and E. coli ligase (Enzymatics). Double-stranded DNA products were purified using magnetic Streptavidin beads (Invitrogen), eluted with 25 mM biotin in elution buffer (Omega Bio-Tek) and PCR amplified with Illumina TruSeq primers and Phusion DNA polymerase. PCR reactions were purified with SPRI Size Select Beads (Beckman Coulter) to yield an average product size of 350 bp. Indexed libraries were quantified by Qubit (Invitrogen), mixed to a final concentration of 1–10 nM and sequenced on an Illumina MiSeq in a 50 cycle run. Raw and processed sequencing data are available at NCBI GEO under accession number GSE61527.

Analysis of DNA sequencing data

Sequences in FASTQ format were preprocessed to remove molecular indexes using umitools (https://github.com/brwnj/umitools). Processed reads were aligned to the S. cerevisiae genome (sacCer1) with Bowtie (19) and visualized in the UCSC Genome Browser (20). Coverage signals in bedGraph format were generated, normalized to UMI-corrected Counts Per Million reads mapped (CPM), and analyzed with BEDTools (21) and custom scripts. Studies of mRNA cleavage focused on the collection of 5′-OH fragments in the upper decile of the CPM distribution (i.e. the most abundant 10%) that were reproducible in replicate samples prepared from two independent cultures. Significant discrete 5′-OH signals within mRNAs were identified as previously described (22). Briefly, CPM values for each mRNA were fit to a negative binomial distribution, and P values were computed for each site. P values were adjusted using the Benjamini-Hochberg false discovery rate (FDR) (<0.05). Clustered hits were determined by identifying regions of high 5′-OH signal density using MACS (23) then selecting regions longer than 100 bp and discarding regions included in the list of mRNAs with single fragments. Additional workflow and analysis scripts are available at https://github.com/hesselberthlab/5OH. Processed data can be visualized on the UCSC Genome Browser in the track hub at https://github.com/hesselberthlab/trackhub.

Validation of 5′-hydroxyl RNA fragments by RtcB ligation and RT-PCR

5′-OH RNA fragments were verified by RtcB-mediated ligation of an oligonucleotide linker, followed by reverse transcription and gene-specific PCR. Total RNA (5 μg) was ligated to an RNA oligonucleotide with a 5′-amino group and a 3′-phosphate (M13-Linker, Table1 ) with RtcB. Reactions were phenol-chloroform extracted, ethanol precipitated, and reverse transcribed with Superscript III RT (Invitrogen) and gene-specific primers. cDNA was PCR amplified using primer pairs that hybridize to full length mRNA or the linker-ligated 5′-OH RNA fragment. PCR products were resolved on 10% TBE non-denaturing 29:1 acrylamide gels and stained with SYBR Gold.

RESULTS

Development of a method for capture of 5′-hydroxyl RNAs

To demonstrate the utility of RtcB RNA ligase as a reagent for global capture of 5′-OH RNA, we first asked whether RtcB could ligate unrelated oligonucleotide substrates—a prerequisite for global capture of 5′-OH molecules. We overexpressed and purified E. coli RtcB ligase (Figure 1A) and used the enzyme to perform ligations with oligoribonucleotide substrates with 2′-OH/3′-PO4 and 5′-OH termini with minimal predicted base pairing (Figure 1B). We found that RtcB could ligate these substrates in a bimolecular ligation with ∼25% efficiency, confirming previous studies of RtcB intermolecular ligation (24) and suggesting that RtcB could be used in global profiling of cellular RNAs with 5′-OH termini.
Figure 1.

(A) SDS-PAGE analysis of E. coli RtcB protein. Recombinant RtcB protein (10 μg) was electrophoresed through a 10% SDS-PAGE gel and stained with Coomassie blue. (B) RtcB-mediated bimolecular ligation of RNA substrates with 3′-PO4 and 5′-OH termini was analyzed by polyacrylamide gel electrophoresis. The ligation product at 50 nt was dependent on RtcB, with an efficiency of ∼25%. (C) Schematic of 5′-hydroxyl (5′-OH) cloning. RNA cleavage generates two fragments with 2′,3′-cyclic phosphate and 5′-OH termini. An adaptor with a 5′-desthiobiotin (B) and 3′-phosphate (denoted by P) is ligated to the 5′-OH RNA fragment with E. coli RtcB ligase. This adaptor has eight randomized positions at its 3′ end which serve as a molecular index to uniquely identify individual ligation events upon sequencing (25). Following ligation, excess adaptor is removed by gel purification. A primer with a 3′ random hexamer region is used for reverse transcription to create cDNA, followed by second-strand synthesis to generated double-stranded DNA products. After streptavidin (SAv) isolation, DNA fragments are PCR amplified and analyzed by Illumina sequencing.

(A) SDS-PAGE analysis of E. coli RtcB protein. Recombinant RtcB protein (10 μg) was electrophoresed through a 10% SDS-PAGE gel and stained with Coomassie blue. (B) RtcB-mediated bimolecular ligation of RNA substrates with 3′-PO4 and 5′-OH termini was analyzed by polyacrylamide gel electrophoresis. The ligation product at 50 nt was dependent on RtcB, with an efficiency of ∼25%. (C) Schematic of 5′-hydroxyl (5′-OH) cloning. RNA cleavage generates two fragments with 2′,3′-cyclic phosphate and 5′-OH termini. An adaptor with a 5′-desthiobiotin (B) and 3′-phosphate (denoted by P) is ligated to the 5′-OH RNA fragment with E. coli RtcB ligase. This adaptor has eight randomized positions at its 3′ end which serve as a molecular index to uniquely identify individual ligation events upon sequencing (25). Following ligation, excess adaptor is removed by gel purification. A primer with a 3′ random hexamer region is used for reverse transcription to create cDNA, followed by second-strand synthesis to generated double-stranded DNA products. After streptavidin (SAv) isolation, DNA fragments are PCR amplified and analyzed by Illumina sequencing. We next established a general protocol to capture 5′-OH RNA from complex RNA populations using the RtcB ligase (Figure 1C). We began by chemically fragmenting total RNA from budding yeast, producing many RNA fragments with 5′-OH and 2′,3′-cyclic phosphate termini. RNA fragments with 5′-OH termini were ligated to an oligonucleotide with 5′-desthiobiotin and 3′-PO4 groups using E. coli RtcB ligase. This oligonucleotide linker contains an 8-base random sequence at its 3′ end that serves as a molecular index during sequence analysis, enabling quantitation of unique ligation events and estimation of relative amounts of 5′-OH fragments in the initial RNA population (25). After gel purification to remove excess unligated linker, RNA was reverse transcribed with a second oligonucleotide primer containing an Illumina primer binding site and six randomized bases at the 3′ end to facilitate random priming to RNA templates. Double-stranded RNA:DNA hybrids were converted to double-stranded DNA by second-strand synthesis, and double-stranded DNA fragments containing the desthiobiotin linker were purified on immobilized streptavidin and eluted with free biotin. Purified double-stranded DNA fragments were PCR amplified with primers that incorporate library-specific indexes and flow cell sequences compatible with Illumina sequencing. Analysis of these libraries by gel electrophoresis yielded a broad distribution of double stranded DNA products from 120 bp to 1500 bp with a median size of 350 bp. These products were dependent on RtcB ligation and reverse transcription (data not shown).

Global analysis of 5′-hydroxyl RNAs in budding yeast

We used the cloning protocol (as described in ‘Materials and Methods’ section) to identify 5′-OH RNA fragments in the budding yeast S. cerevisiae. We prepared libraries from oligo-dT enriched yeast total RNA, in order to increase signals that might be present on polyadenylated mRNA fragments. These libraries were sequenced on the Illumina platform to identify 5′-OH RNA fragments. The libraries had an average of 2.7 million reads, with an average of 23% of the reads aligning uniquely to the yeast genome and an additional 30% of reads aligning multiple times. We examined the distribution of uniquely aligning reads within existing RNA annotations, and found that in polyA-enriched samples, 21% mapped to mRNA regions, 0.4% mapped to abundant non-coding RNA species (tRNA, snoRNA, and snRNA), and 65% of the reads were derived from ribosomal RNA. The remaining 13.6% of reads mapped to other regions including retrotransposon-derived and intergenic regions. Using these sequences, we also assessed whether RtcB exhibited a bias for specific sequences at the ligation junction. We found that the base composition of each position downstream of the ligation site (i.e. the 5′-OH terminus) mirrored budding yeast A/T content (62%), suggesting that RtcB does not have strong preference for 5′-OH RNAs with specific sequences. The 8 nt molecular index in ligated adaptors (Figure 1C and 5OH-Linker, Table 1) allowed us to examine base composition upstream of the ligation site, and we found that RtcB preferred adaptors with an adenosine at their 3′ terminus, which was present in 42% of sequences, while the remaining 7 positions had more random composition (data not shown). Because the molecular index is trimmed from reads prior to alignment, this ligation bias has no effect on sequence alignment, but does limit the diversity of possible molecular indexes. To validate the protocol, we analyzed the capture of 5′-OH termini generated in vitro and in vivo. Ribosomal RNAs have 5′-PO4 termini produced by endonucleolytic (26) and exonucleolytic processing (27). These ends should be underrepresented in 5′-OH libraries, but we would expect to recover these termini in 5′-OH libraries prepared from RNA samples that had been dephosphorylated in vitro (Figure 2A). Therefore, we compared libraries prepared from total RNA samples that were treated or not with recombinant shrimp alkaline phosphatase (rSAP) to convert 5′-PO4 RNAs in the samples to 5′-OH groups. An average of 2574 reads aligned to the 5′ end of 5.8S rRNA in the dephosphorylated sample compared to an average of 358 5′-OH fragments aligned to the same position in the untreated samples, (Figure 2B), representing a 7-fold enrichment of reads at the 5′ end of this RNA upon phosphatase treatment and confirming that the method captures 5′-OH RNAs and discriminates against 5′-PO4 RNAs.
Figure 2.

(A) Schematic of RtcB-mediated ligation to a 5′-OH RNA substrate produced by phosphatase-mediated removal of a 5′-PO4 group. (B) Abundance of 5′-OH fragments (per-site, UMI-corrected Counts Per Million reads mapped, CPM) mapped to 5.8S rRNA (157 nt). In an untreated sample (top), 5′-OH signal was uniformly low across the body of the 5.8S rRNA. 5′-OH RNA fragments cloned from total RNA treated with shrimp alkaline phosphatase (bottom) mapped to the 5′-end of 5.8S rRNA (gray arrows), indicating conversion of the 5′-PO4 to a 5′-OH group. (C) Abundance of 5′-OH fragments (CPM) in the snR18 locus show that the 5′-PO4 of snR18, produced by Rnt1-mediated processing of the EFB1 pre-mRNA (29), is specifically captured upon phosphatase treatment and 5′-OH cloning (gray arrows). (D) Schematics of tRNAs. On the left, a mature tRNA with the CCA terminator and the D, anticodon, variable, and TΨC loops. On the right, a portion of an intron-containing pre-tRNA with cleavage sites recognized by tRNA Splicing Endonuclease (SEN) represented by black lines. Sites within tRNAs that were captured abundantly by 5′-OH cloning are shaded. Arrows indicate the most abundant fragment in each tRNA shown in (E). (E) Examples of tRNA fragments captured by 5′-OH cloning in rRNA-depleted total RNA. Fragments from tRNALeuCAA mapped to the 5′-terminus of the 3′ exon (top panel). Anti-codon loop cleavages were found in tRNASerAGA (middle panel). tRNAAspGUC was the most abundant species captured by 5′-OH cloning and primarily showed D-arm and loop cleavage (bottom panel, Supplementary Table S1). Arrows indicate the most abundant fragment in each tRNA and correspond to the arrows in (D). Key regions of the tRNA are indicated by solid bars labeled as Intron, Anticodon loop, and D arm and loop.

(A) Schematic of RtcB-mediated ligation to a 5′-OH RNA substrate produced by phosphatase-mediated removal of a 5′-PO4 group. (B) Abundance of 5′-OH fragments (per-site, UMI-corrected Counts Per Million reads mapped, CPM) mapped to 5.8S rRNA (157 nt). In an untreated sample (top), 5′-OH signal was uniformly low across the body of the 5.8S rRNA. 5′-OH RNA fragments cloned from total RNA treated with shrimp alkaline phosphatase (bottom) mapped to the 5′-end of 5.8S rRNA (gray arrows), indicating conversion of the 5′-PO4 to a 5′-OH group. (C) Abundance of 5′-OH fragments (CPM) in the snR18 locus show that the 5′-PO4 of snR18, produced by Rnt1-mediated processing of the EFB1 pre-mRNA (29), is specifically captured upon phosphatase treatment and 5′-OH cloning (gray arrows). (D) Schematics of tRNAs. On the left, a mature tRNA with the CCA terminator and the D, anticodon, variable, and TΨC loops. On the right, a portion of an intron-containing pre-tRNA with cleavage sites recognized by tRNA Splicing Endonuclease (SEN) represented by black lines. Sites within tRNAs that were captured abundantly by 5′-OH cloning are shaded. Arrows indicate the most abundant fragment in each tRNA shown in (E). (E) Examples of tRNA fragments captured by 5′-OH cloning in rRNA-depleted total RNA. Fragments from tRNALeuCAA mapped to the 5′-terminus of the 3′ exon (top panel). Anti-codon loop cleavages were found in tRNASerAGA (middle panel). tRNAAspGUC was the most abundant species captured by 5′-OH cloning and primarily showed D-arm and loop cleavage (bottom panel, Supplementary Table S1). Arrows indicate the most abundant fragment in each tRNA and correspond to the arrows in (D). Key regions of the tRNA are indicated by solid bars labeled as Intron, Anticodon loop, and D arm and loop. The levels of ribosomal RNA fragments found in the poly(A)-enriched library suggested that oligo-dT capture of polyadenylated mRNA is not sufficient to deplete ribosomal RNA signals from libraries, and that other strategies for ribosomal RNA depletion might be used to further reduce signals from ribosomal RNA. We prepared libraries from an RNA sample following treatment with a commercial strategy to remove ribosomal RNA by hybridization to antisense probes (28). We found that signals from ribosomal RNAs were ∼10-fold less abundant in these libraries compared to libraries prepared from oligo-dT selected RNA, but also unexpectedly found that the commercial kit contained a phosphatase activity that converted the 5′-PO4 termini of ribosomal RNAs to 5′-OH groups, leading to their capture by 5′-OH cloning independent of additional phosphatase treatment (data not shown). Similar to rRNA, some small nucleolar RNAs (snoRNAs) are produced from endonucleolytic and exonucleolytic processing of precursor RNA molecules and have 5′-monophosphate termini (29). Dephosphorylation of the 5′-PO4 termini of these snoRNAs would produce 5′-OH termini, and we found abundant signals for these in 5′-OH libraries following rSAP treatment. For example, the 5′-end of the box C/D snoRNA U18, which is produced by Rnt1 and Rat1 processing of the EFB1 pre-mRNA (30), was 6-fold more abundant in the dephosphorylated sample compared to the untreated sample (Figure 2C). Overall we identified six snoRNAs (snR18, 24, 55, 57, 61, 64) with 5′-PO4 termini that were captured by 5′-OH cloning after in vitro phosphatase treatment. Cleavage of pre-tRNA molecules by the tRNA splicing endonuclease (SEN) creates intron and 3′-exon products with 5′-OH groups (1) (Figure 2D). Yeast pre-tRNA molecules range from 71 to 133 nt and processed tRNAs from 71 to 87 nt. Sequences in our libraries have an effective read length of 42 nt, and thus our libraries can be used to quantify cleavages within ∼30–40 nt of the 5′-end of processed tRNA molecules, depending on variable loop length. We analyzed signals from tRNAs in the 5′-OH library prepared from rRNA-depleted total RNA. The presence of 5′-OH fragments corresponding to the 5′-terminus of the 3′-exon of the intron-containing tRNALeuCAA (Figure 2E, top panel) confirms that this method can capture known 5′-OH termini generated in vivo. We also identified products of cleavage in the anticodon loop of tRNASerAGA (Figure 2E, middle panel), consistent with 2′,3′-cyclic phosphate capture of anticodon loop cleavage (11). Finally, we identified high levels of 5′-OH fragments that mapped to the D arm and loop of 28 tRNA species (Figure 2E, bottom panel; Supplementary Table S1). We compared the 5′-OH fragments in tRNAs to a previous study of 5′-PO4 RNA fragments (22), and found that 5′-PO4 signals were localized mainly to the 5′-ends of tRNA, indicating that the 5′-OH fragments represent a distinct pattern of tRNA cleavages (data not shown). We did not identify the 5′-OH intron products of SEN cleavage, presumably because they are phosphorylated by Trl1 and degraded by Xrn1 (31).

mRNA decay events identified with 5′-hydroxyl RNA capture

In order to focus on 5′-OH fragments present in mRNA, we collected sequence reads from libraries prepared from poly(A)-enriched RNA that mapped uniquely within annotated mRNA transcripts (32), and found 710 mRNAs that had significant signals in two replicate samples (35% overlap among mRNAs). We examined the distribution of 5′-OH fragments within these mRNAs (Figure 3A), and found that 76.7% of 5′-OH fragments mapped within open reading frames, 9.9% were in 5′ untranslated regions (UTRs) and 13.4% were in 3′ UTRs. We found 81 mRNAs with 5′-OH fragments that mapped exclusively to the 5′ and 3′ UTRs, and 551 mRNAs with 5′-OH fragments exclusively in their open reading frames. The remaining 78 mRNAs contained fragments distributed throughout the mRNA. Many 5′-OH fragments were biased toward the 3′-end of mRNAs, possibly reflecting a bias imposed by poly(A) purification prior to 5′-OH cloning (Figure 3A).
Figure 3.

(A) 5′-OH fragments are found in numerous mRNAs. Each of the 710 mRNAs with a 5′-OH signal in the upper decile of replicates from a wild-type strain was divided into proportionally sized bins for the 5′ and 3′ UTR regions (two bins each, left and right of the solid vertical lines), and the open reading frame (ORF; 20 bins). 5′-OH fragment abundance (CPM) in each bin was summed, and mRNAs were sorted by their maximum bin value from 5′ to 3′. Each mRNA is an entry on the y-axis. A total of 11 mRNAs have signal exclusively within their 5′ UTRs (top) and 69 mRNAs have signal exclusively in their 3′ UTRs (bottom). The gray centerline marks the middle of the ORF region; 5′-OH signal was predominantly in the 5′ portion of 143 mRNAs (above horizontal dashed line), while remaining mRNAs had signals mainly localized toward their 3′ ends. (B) Comparison of signals in mRNAs in 5′-OH and RNA-seq libraries. There was a significant correlation (Pearson R = 0.31; P < 10−16) between 5′-OH signals (counts per million reads per kilobase; CPKM) and RNA-seq (FPKM) (33) for 629 mRNAs (a subset of the mRNAs in A). Dotted lines indicate median RNA-seq FPKM and 5′-OH CPKM values, and the linear regression is plotted as a gray line. (C) Plots of 5′-OH abundance (CPM) versus position (bp) for four representative mRNAs (of 187 total, Supplementary Table S2) with multiple, distributed 5′-OH fragments. All mRNAs are plotted from 5′ to 3′, with UTR regions in gray, and ORF regions in black. 5′-OH fragments in the IMH1, RPN2, and YEF3 mRNAs were predominantly localized to their coding regions, and 5′-OH fragments in the SAC1 mRNA were restricted to its 3′ UTR. (D) Three representative mRNAs (of 17 total, Supplementary Table S3) with single, predominant 5′-OH fragments. The ADH1 and YNK1 mRNAs each contained a single predominant 5′-OH fragment in their ORF regions, and the CYS4 mRNA contained a single 5′-OH fragment in its 3′ UTR. The MDH1 mRNA had a predominant 5′-OH fragment 8 nt downstream of its stop codon.

(A) 5′-OH fragments are found in numerous mRNAs. Each of the 710 mRNAs with a 5′-OH signal in the upper decile of replicates from a wild-type strain was divided into proportionally sized bins for the 5′ and 3′ UTR regions (two bins each, left and right of the solid vertical lines), and the open reading frame (ORF; 20 bins). 5′-OH fragment abundance (CPM) in each bin was summed, and mRNAs were sorted by their maximum bin value from 5′ to 3′. Each mRNA is an entry on the y-axis. A total of 11 mRNAs have signal exclusively within their 5′ UTRs (top) and 69 mRNAs have signal exclusively in their 3′ UTRs (bottom). The gray centerline marks the middle of the ORF region; 5′-OH signal was predominantly in the 5′ portion of 143 mRNAs (above horizontal dashed line), while remaining mRNAs had signals mainly localized toward their 3′ ends. (B) Comparison of signals in mRNAs in 5′-OH and RNA-seq libraries. There was a significant correlation (Pearson R = 0.31; P < 10−16) between 5′-OH signals (counts per million reads per kilobase; CPKM) and RNA-seq (FPKM) (33) for 629 mRNAs (a subset of the mRNAs in A). Dotted lines indicate median RNA-seq FPKM and 5′-OH CPKM values, and the linear regression is plotted as a gray line. (C) Plots of 5′-OH abundance (CPM) versus position (bp) for four representative mRNAs (of 187 total, Supplementary Table S2) with multiple, distributed 5′-OH fragments. All mRNAs are plotted from 5′ to 3′, with UTR regions in gray, and ORF regions in black. 5′-OH fragments in the IMH1, RPN2, and YEF3 mRNAs were predominantly localized to their coding regions, and 5′-OH fragments in the SAC1 mRNA were restricted to its 3′ UTR. (D) Three representative mRNAs (of 17 total, Supplementary Table S3) with single, predominant 5′-OH fragments. The ADH1 and YNK1 mRNAs each contained a single predominant 5′-OH fragment in their ORF regions, and the CYS4 mRNA contained a single 5′-OH fragment in its 3′ UTR. The MDH1 mRNA had a predominant 5′-OH fragment 8 nt downstream of its stop codon. We identified a subset of mRNAs with 5′-OH signals that also were expressed under similar conditions as measured by a previous RNA-seq experiment (33). For each of these 629 mRNAs, we calculated a summary statistic by dividing the sum of its CPM values by the length of the mRNA in kilobases (Counts Per Kilobase per Million reads mapped; CPKM), and compared mRNA CPKM values to their expression levels in Reads Per Kilobase per Million mapped (RPKM) (33). There was a modest but significant correlation between levels of 5′-OH fragments and mRNA abundance (Pearson R = 0.31; P < 10−16), indicating that 5′-OH fragment abundance is partly a reflection of mRNA abundance (Figure 3B). We also evaluated whether these fragments were associated with intrinsic structural properties of RNAs. We compared 5′-OH fragment levels to previous measures of global RNA structures using Ribonuclease S1 and V1 mapping (34), and found minimal correlation between these data sets (Pearson R = −0.04; P < 0.048). This suggests that 5′-OH sites are uniformly distributed among RNA secondary structure elements (i.e., single and double-strand regions). There were unique patterns of 5′-OH fragment distribution within mRNAs. We identified 187 mRNAs (Supplementary Table S2) with multiple fragments mapping to localized regions within the gene body, and these 5′-OH fragments clustered within a mean distance of 191 nt. For example, the IMH1, RPN2 and YEF3 mRNAs had multiple 5′-OH fragments within their coding regions, and the SAC1 mRNA had multiple fragments within its 3′ UTR (Figure 3C). The pattern of 5′-OH fragments in these mRNAs suggests that a localized process cleaves these specific mRNAs within a defined region, but cleavage occurs in different locations among several mRNA molecules. We also found 17 mRNAs (Supplementary Table S3) that each had a single predominant 5′-OH fragment (FDR < 0.05). For example, the ADH1 and YNK1 mRNAs had single sites of cleavage in their open reading frames, CYS4 had a predominant cleavage in its 3′ UTR, and a predominant cleavage in the MDH1 mRNA is 8 nt downstream of its stop codon (Figure 3D). We also identified a prominent 5′-OH fragment in the RPS31 mRNA, which encodes an in-frame fusion of ubiquitin to the S31 protein of the ribosomal small subunit (35). The 5′-OH fragment mapped 3 nt downstream of the codons for the Arg-Gly-Gly (RGG) terminus of ubiquitin, suggesting that this fragment might be generated during translation and ubiquitin processing (36) (Figure 4A). However, another mRNA encoding a ubiquitin-ribosomal protein fusion, RPL40A, had significant levels of 5′-OH fragments located farther downstream of the codons encoding the ubiquitin C-terminus, suggesting that ubiquitin processing is not responsible for RPS31 mRNA cleavage (Figure 4B).
Figure 4.

(A) A single predominant 5′-OH RNA fragment (gray arrow) maps to the RPS31 mRNA, encoding the ubiquitin-Rps31 fusion protein. The fragment maps to a position 3 nt downstream of the codons for Arg-Gly-Gly (RGG) residues (vertical black arrow), which signal ubiquitin removal by deubiquitylating enzymes (36). (B) Multiple, distributed 5′-OH signals are further downstream of RGG codons (vertical black arrow) in the RPL40A mRNA, which encodes the ubiquitin-Rpl40a fusion protein. (C) An abundant RNA fragment recovered in ribosome profiling experiments (38) maps to the same position in the RPS31 mRNA as the 5′-OH fragment in (A). This fragment was present in all samples, but was most abundant in ribosome profiling done in cells treated with the non-hydrolysable GTP analog GMP-PNP (shown). (D) The 5′-OH fragment in RPS31 was validated by ligation-mediated RT-PCR. 5′-OH fragments in total RNA were incubated with an RtcB linker (a synthetic oligonucleotide with a 3′-phosphate, M13-linker, Table) in the presence or absence of RtcB ligase. These reactions were divided and reverse transcribed with a primer specific for RPS31 that hybridizes downstream of the 5′-OH terminus. cDNA products of reverse transcription were PCR amplified with primers that hybridize to RPS31 up- and downstream of the 5′-OH terminus and analyzed by gel electrophoresis, yielding abundant fragments at the appropriate size (415 bp), independent of RtcB ligation (left panel). A second PCR was performed with a 5′ primer that hybridizes to the ligated linker, and analysis of these products showed ligation-dependent PCR products at the expected sizes (major 5′-OH fragment at 215 bp; minor products at ∼400 bp). A portion of (A) is displayed on the right for reference. (E) Schematic of 5′-OH signals with respect to regions encoding specific peptide regions in (F). (F) The mean 5′-OH fragment coverage in rRNA-depleted total RNA was calculated 75 nt upstream (negative) and downstream (positive) of the start position of all mRNA sequences encoding at least five consecutive basic or acidic residues, including Arg (n = 12), Lys (n = 59), a mixture of Lys and Arg (n = 280), Glu (n = 105), Asp (n = 102), and a mixture of Glu and Asp (n = 642). A peak of 5′-OH fragments is present ∼30 nt upstream of regions composed of Lys and Arg, but this signal is lower in regions encoding Lys or Arg alone (top panels). Coverage of 5′-OH fragments is elevated higher in regions encoding Glu than regions encoding Asp, or Glu and Asp (bottom panels). (G) The mean 5′-OH fragment coverage was calculated 75 nt upstream (negative) and downstream (positive) of the start position of all mRNA sequences encoding varying lengths of consecutive basic and acidic residues. 5′-OH fragments were enriched upstream of regions encoding polyelectrostatic (both basic and acidic) resides (Arg, Lys, Glu, Asp; left panel) and were highest upstream of regions encoding only Lys and Glu (right panel). In both panels, regions encoding longer consecutive stretches (≥20) of polyelectrostatic amino acids had higher 5′-OH signals than regions encoding shorter stretches (≥5). (H) Representative examples of mRNAs (CBF5 and HSC82) with 5′-OH fragments that map to positions upstream of sequences encoding polyelectrostatic amino acid motifs. Regions encoding polyelectrostatic amino acid stretches are indicated by gray boxes above the gene models.

(A) A single predominant 5′-OH RNA fragment (gray arrow) maps to the RPS31 mRNA, encoding the ubiquitin-Rps31 fusion protein. The fragment maps to a position 3 nt downstream of the codons for Arg-Gly-Gly (RGG) residues (vertical black arrow), which signal ubiquitin removal by deubiquitylating enzymes (36). (B) Multiple, distributed 5′-OH signals are further downstream of RGG codons (vertical black arrow) in the RPL40A mRNA, which encodes the ubiquitin-Rpl40a fusion protein. (C) An abundant RNA fragment recovered in ribosome profiling experiments (38) maps to the same position in the RPS31 mRNA as the 5′-OH fragment in (A). This fragment was present in all samples, but was most abundant in ribosome profiling done in cells treated with the non-hydrolysable GTP analog GMP-PNP (shown). (D) The 5′-OH fragment in RPS31 was validated by ligation-mediated RT-PCR. 5′-OH fragments in total RNA were incubated with an RtcB linker (a synthetic oligonucleotide with a 3′-phosphate, M13-linker, Table) in the presence or absence of RtcB ligase. These reactions were divided and reverse transcribed with a primer specific for RPS31 that hybridizes downstream of the 5′-OH terminus. cDNA products of reverse transcription were PCR amplified with primers that hybridize to RPS31 up- and downstream of the 5′-OH terminus and analyzed by gel electrophoresis, yielding abundant fragments at the appropriate size (415 bp), independent of RtcB ligation (left panel). A second PCR was performed with a 5′ primer that hybridizes to the ligated linker, and analysis of these products showed ligation-dependent PCR products at the expected sizes (major 5′-OH fragment at 215 bp; minor products at ∼400 bp). A portion of (A) is displayed on the right for reference. (E) Schematic of 5′-OH signals with respect to regions encoding specific peptide regions in (F). (F) The mean 5′-OH fragment coverage in rRNA-depleted total RNA was calculated 75 nt upstream (negative) and downstream (positive) of the start position of all mRNA sequences encoding at least five consecutive basic or acidic residues, including Arg (n = 12), Lys (n = 59), a mixture of Lys and Arg (n = 280), Glu (n = 105), Asp (n = 102), and a mixture of Glu and Asp (n = 642). A peak of 5′-OH fragments is present ∼30 nt upstream of regions composed of Lys and Arg, but this signal is lower in regions encoding Lys or Arg alone (top panels). Coverage of 5′-OH fragments is elevated higher in regions encoding Glu than regions encoding Asp, or Glu and Asp (bottom panels). (G) The mean 5′-OH fragment coverage was calculated 75 nt upstream (negative) and downstream (positive) of the start position of all mRNA sequences encoding varying lengths of consecutive basic and acidic residues. 5′-OH fragments were enriched upstream of regions encoding polyelectrostatic (both basic and acidic) resides (Arg, Lys, Glu, Asp; left panel) and were highest upstream of regions encoding only Lys and Glu (right panel). In both panels, regions encoding longer consecutive stretches (≥20) of polyelectrostatic amino acids had higher 5′-OH signals than regions encoding shorter stretches (≥5). (H) Representative examples of mRNAs (CBF5 and HSC82) with 5′-OH fragments that map to positions upstream of sequences encoding polyelectrostatic amino acid motifs. Regions encoding polyelectrostatic amino acid stretches are indicated by gray boxes above the gene models. In the RPS31 mRNA, the 7 codons following the Arg-Gly-Gly residues—and at the site of the 5′-OH fragment—are AAA and AAG codons that encode Lys and Arg residues, which are known to trigger co-translational mRNA cleavage (37). In addition, a similar fragment from the RPS31 mRNA was previously identified in ribosomal profiling experiments (38) (Figure 4C), suggesting that this fragment is associated with ribosomes. We validated the RPS31 mRNA fragment by RtcB-mediated linker ligation followed by reverse transcription and PCR, yielding product sizes consistent with 5′-OH cloning (Figure 4C). Based on this amplification, we estimate that these fragments are <1% of the abundance of full length RPS31 mRNA. Given that the RPS31 5′-OH fragment mapped upstream of a region of six consecutive basic residues, we analyzed the global relationship between 5′-OH fragments and nearby codon content. We first examined annotated open reading frames to identify sequences encoding amino acid sequences of at least five consecutive identical or related residues (e.g. five or more consecutive lysine residues, or a mix of five consecutive lysine and arginine residues), and determined the average coverage of 5′-OH fragments 75 nt upstream and 75 nt downstream from the start of these regions (Figure 4E). We analyzed 5′-OH fragments cloned from rRNA-depleted RNA, and found that 5′-OH fragments accumulated upstream of the 240 instances of consecutive basic residues composed of both lysine and arginine codons (Figure 4F). This relationship was more pronounced than the accumulation of 5′-OH fragments upstream of regions with consecutive lysine or arginine residues alone (Figure 4F). We also found high levels of 5′-OH fragments upstream of the 105 instances of consecutive glutamate residues, but this signal was diminished when we considered consecutive acidic residues composed of aspartate alone (n = 102) or regions composed of both glutamate and aspartate (n = 642) (Figure 4F). The highest levels of 5′-OH fragments were upstream of regions composed of a mixture of acidic and basic residues, particularly the combination of Lys and Glu (Figure 4G). The levels of 5′-OH fragments were higher when longer lengths of consecutive amino acids were considered (Figure 4G). We identified specific examples of mRNAs with 5′-OH fragments upstream of regions encoding consecutive acidic residues including CBF5 and HSC82 (Figure 4H). Analysis of other consecutive codons showed that they are not associated with 5′-OH fragment accumulation (data not shown). These findings suggest a relationship between the electrostatic properties of a nascent peptide and mRNA cleavage events that yield 5′-OH fragments, possibly via a co-translational process such as no-go mRNA decay (39). The eukaryotic 5′→3′ exonuclease Xrn1 specifically recognizes 5′-PO4 RNA substrates, but has greatly reduced activity on 5′-OH RNA in vitro (40). Consistent with this activity, xrn1Δ budding yeast accumulate several types of 5-PO4 RNAs (22), and phosphorylation of the 5′-OH RNA products of tRNA splicing is required for their turnover by Xrn1 (31). Given the high specificity of Xrn1 for 5′-PO4 RNA substrates in vitro, it is possible that other 5′-OH RNA fragments could be subject to a similar turnover pathway with a requisite phosphorylation event prior to Xrn1-mediated turnover. We cloned 5′-OH RNA fragments from xrn1Δ cells and compared the levels of these RNA fragments to those from XRN1 cells. Fifty percent of the mRNAs identified in XRN1 cells were also identified in xrn1Δ cells (n = 507). The Pearson correlation between the strains was 0.54 compared to a correlation of 0.68 between biological replicates of the same strain, indicating that some 5′-OH RNAs may enter an Xrn1 degradation pathway.

Identification of endonucleolytic cleavages produced during the unfolded protein response

During the unfolded protein response (UPR), misfolded or aggregated proteins accumulate in the lumen of the endoplasmic reticulum and activate the kinase-endoribonuclease Ire1 (41). Activated Ire1 endonuclease cleaves an intron from the HAC1 pre-mRNA (42), and the cleaved exons are subsequently ligated by yeast Trl1 tRNA ligase (43). After ligation, the mature HAC1 mRNA is translated, and Hac1 protein traffics to the nucleus where it drives expression of dozens of stress response genes (43). We mapped 5′-OH RNA fragments in cells treated with the N-linked glycosylation inhibitor tunicamycin, which causes protein misfolding and UPR activation. The primary 5′-OH RNA fragments captured in the HAC1 mRNA mapped upstream of consecutive electrostatic residues (Figure 5A), consistent with the overall enrichment of 5′-OH signal upstream of these residues in mRNAs (Figure 4F). The canonical sites of Ire1 cleavage in the HAC1 pre-mRNA were also identified, and 5′-OH signals increased upon in vitro phosphatase treatment, consistent with their phosphorylation in vivo, presumably by Trl1 5′-kinase (43) (Figure 5B).
Figure 5.

(A) The majority of 5′-OH fragments (CPM) in the HAC1 mRNA map upstream of codons for consecutive electrostatic amino acid residues. Their abundance was similar in vehicle-treated (DMSO, top) and tunicamycin-treated (bottom) cells. (B) Ire1 cleavage fragments in HAC1 are captured in 5′-OH libraries. Low levels of fragments resulting from intron cleavage were found in all conditions. The fragments were most abundant in the xrn1Δ strain treated with tunicamycin and subjected to in vitro phosphorylation (top panel). (C) 5′-OH fragment abundance (CPM) in KAR2 mRNA was examined in cells treated with vehicle (DMSO, top) and tunicamycin (middle). A 5′-OH fragment increased 17-fold upon tunicamycin treatment (middle) and mapped to the sequence 5′-CCAUU|A-3′ (bottom). (D) Consensus sequence motif in logo format derived from sequences flanking 5′-OH fragments (10 nt up- and downstream) in mRNAs that increased in abundance 5-fold or more upon tunicamycin treatment. A total of 10 instances of the motif were found among 23 mRNAs (Supplementary Table S4). (E) Two 5′-OH fragments in the TSA1 mRNA increased 11.4-fold and 8.7-fold upon tunicamycin treatment (top and middle panel) and mapped to the consensus motif in (D) (bottom panel, arrows).

(A) The majority of 5′-OH fragments (CPM) in the HAC1 mRNA map upstream of codons for consecutive electrostatic amino acid residues. Their abundance was similar in vehicle-treated (DMSO, top) and tunicamycin-treated (bottom) cells. (B) Ire1 cleavage fragments in HAC1 are captured in 5′-OH libraries. Low levels of fragments resulting from intron cleavage were found in all conditions. The fragments were most abundant in the xrn1Δ strain treated with tunicamycin and subjected to in vitro phosphorylation (top panel). (C) 5′-OH fragment abundance (CPM) in KAR2 mRNA was examined in cells treated with vehicle (DMSO, top) and tunicamycin (middle). A 5′-OH fragment increased 17-fold upon tunicamycin treatment (middle) and mapped to the sequence 5′-CCAUU|A-3′ (bottom). (D) Consensus sequence motif in logo format derived from sequences flanking 5′-OH fragments (10 nt up- and downstream) in mRNAs that increased in abundance 5-fold or more upon tunicamycin treatment. A total of 10 instances of the motif were found among 23 mRNAs (Supplementary Table S4). (E) Two 5′-OH fragments in the TSA1 mRNA increased 11.4-fold and 8.7-fold upon tunicamycin treatment (top and middle panel) and mapped to the consensus motif in (D) (bottom panel, arrows). We compared 5′-OH mRNA signals in untreated cells and cells treated with the tunicamycin, and found a 5′-OH fragment in the KAR2 mRNA (the yeast ortholog of BiP/HSP70) that was 17-fold more abundant in the presence of tunicamycin and occurred in the sequence 5′-CCAUU|A-3′ (Figure 5C). Initially, we attributed this signal to upregulation of KAR2 mRNA expression by the UPR transcriptional response (44), but we also identified six other ER-related mRNAs with 5′-OH fragments present in a similar sequence motif that increased 5-fold or more upon tunicamycin treatment (Figure 5D, Supplementary Table S4), including the UPR-induced thioredoxin peroxidase TSA1 mRNA (45), where 5′-OH fragments accumulated at two such motifs upon tunicamycin treatment (Figure 5E).

DISCUSSION

We developed a method to directly capture RNAs with 5′-OH termini using the unique substrate specificity of E. coli RtcB RNA ligase (16) (Figure 1). This method is highly selective for the capture of 5′-OH products (Figure 2B and C), and yields reproducible signals among several classes of cellular RNA. RNA fragments with 5′-OH termini have not been directly detected in other global studies, and the collection of 5′-OH RNA fragments reported here collectively comprise a previously uninterrogated RNA class. Future applications of this method could include synthetic 5′-OH RNAs spiked in at known concentrations to determine the lower limit of detection and provide a reference for absolute quantitation (46). We identified three categories of tRNA fragments with 5′-OH termini: those derived from cleavage at the boundary of the intron and 3′ exon, cleavage in the anticodon loop, and cleavage in the D-arm/loop (Figure 2D and E). The first of these validates the method by demonstrating capture of a known 5′-OH terminus generated in vitro during the well-characterized processing of an intron-containing pre-tRNA by the tRNA Splicing Endonuclease (SEN) (1). A mechanism of anticodon loop cleavage is also known: Rny1 can cleave the anticodon loop during stress conditions (47), and related fragments have been previously captured by 2′,3′-cyclic phosphate cloning (11). However, the most abundant cleavages observed in 5′-OH cloning were in the D-arm and loop, which has not been previously described. This cleavage may represent a novel tRNA stress fragment or may be a consequence of mapping bias toward the 5′-end of a highly abundant species. We identified a relationship between 5′-OH mRNA fragments and mRNA sequences that encoded consecutive basic peptide sequences (Figure 4A–F). In the RPS31 mRNA, a 5′-OH fragment accumulates upstream of codons encoding a polybasic peptide (Figure 4A). Production of the fragment does not appear to be related to ubiquitin processing, as another ubiquitin fusion RPL40A does not have 5′-OH fragments near the site of ubiquitin cleavage (Figure 4B). A similar fragment in the RPS31 mRNA was also identified in ribosomal profiling experiments done in budding yeast (38) (Figure 4C). These experiments predominantly recover RNA fragments that are protected from ribonuclease digestion by ribosomes, but do so without relying on a specific 5′ end chemistry (48). Additionally, several previous studies showed that nascent peptides with specific electrostatic properties cause ribosome pausing and trigger co-translational mRNA decay. In non-stop decay (NSD), translation of the polyadenylate tails of mRNAs that lack stop codons leads to a nascent polylysine peptide (49), which is proposed to interact with the negatively charged exit tunnel of the ribosome (50) to trigger ribosome stalling and mRNA decay. Similarly, internal polybasic peptides trigger co-translational mRNA decay, and this process is dependent on the non-canonical release factors Dom34 and Hbs1 (37). In addition, global measures of translation indicated that polybasic sequences cause ribosomal stalling, suggesting they interfere with ribosome elongation (51). The presence of this RPS31 5′-OH fragment in ribosome profiling data suggests that it may be associated with ribosomes and may be a product of a ribosome-directed decay process driven by the polybasic coding region. Many studies have used synthetic reporter mRNAs to study the role of nascent peptides in determining decay of a translating mRNA (37,52–53). However, there are no known physiological mRNA substrates of no-go decay (NGD), and the endonuclease activity responsible for mRNA cleavage during NGD is not known (39). Detailed mapping of mRNA cleavage events during NGD showed that mRNA cleavage occurs around 20 nt upstream of a polybasic sequence (37), which is consistent with the distance between 5′-OH fragments and other polyelectrostatic sequences (Figure 4E–G). NGD decay intermediates are degraded by Xrn1 (52). However, insofar as Xrn1 requires 5′-PO4 substrates (40), the products of NGD that are degraded by Xrn1 could be generated by an endonuclease that creates 5′-OH RNA products that are phosphorylated by a cellular 5′-kinase activity (e.g., the tRNA ligase Trl1), promoting their decay by Xrn1— a scenario similar to 5′-kinase-mediated turnover of spliced tRNA introns by Xrn1 (31) and RegB mRNA cleavage products in bacteria (54). Thus, it is possible that many of the 5′-OH fragments we identified are products of NGD. Future studies will determine whether the production of 5′-OH fragments in mRNA sequences encoding highly electrostatic peptides (Figure 4E–H) is dependent on NGD factors such as Dom34 and Hbs1 (37,52). We also identified 5′-OH fragments upstream of polyacidic and other polyelectrostatic sequences composed of a mixture of basic and acidic residues (Figure 4E–H). These signals were more prominent than any regions composed of any other single amino acid. Polyacidic stretches in the C-terminus of several human proteins are associated with ribosome frameshifting in these regions (55). However, it is not clear whether frameshifting is a consequence of the polyacidic nascent peptide, or whether it is a consequence of the propensity for out-of-frame polyglutamate codons to encode new Met initiation codons. Notably, the 5′-OH fragment we identified 8 nt downstream of the MDH1 stop codon (Figure 3D) is near the site of ribosomal stop codon read through in other fungi, which appends a peroxisomal targeting sequence (PTS) to the Mdh1 protein for its localization (56). If a similar phenomenon accounts for the MDH1 mRNA 5′-OH fragment in S. cerevisiae, it suggests that production of 5′-OH fragments is a general feature of mRNA decay at sites of non-canonical ribosome elongation (i.e. frameshifting or read through). A potential artifact of mRNA purification by oligo-dT enrichment is the isolation of A-rich mRNA sequences independent of their non-templated 3′-polyadenylate tails. In particular, some codons for electrostatic residues (Arg, Lys, Glu) are A-rich, and might be preferentially isolated by oligo-dT enrichment. We found that 5′-OH fragments upstream of polylysine stretches (encoded by AAA and AAG codons) were indeed increased in polyA libraries relative to rRNA-depleted libraries, but 5′-OH fragment levels upstream of other mixed polyelectrostatic sequences were present at similar levels in polyA selected and rRNA-depleted libraries. Furthermore, in oligo-dT selected samples and rRNA-depleted samples, we detected no enrichment of 5′-OH signal upstream of polyasparagine residues, which are encoded by AAU and AAC codons and are abundant in yeast mRNAs (n = 138). During activation of the unfolded protein response, the HAC1 pre-mRNA is cleaved by the kinase-endoribonuclease Ire1, producing 2′,3′-cyclic phosphate and 5′-OH RNA fragments (42) that are substrates for ligation by tRNA ligase (43). However, previous studies that identified 5′-PO4 mRNA fragments from dcp2Δ xrn1Δ yeast recovered HAC1 cleavage products at the sites of Ire1 cleavage (22). Using 5′-OH cloning, we recovered HAC1 mRNA cleavage products at their highest levels in xrn1Δ cells after their dephosphorylation in vitro (Figure 5B, top panel). The presence of these 5′-PO4 RNA fragments possibly indicates that the Trl1 5′-kinase step is rapid, but further HAC1 mRNA ligation by Trl1 is slow or often not completed. Corroborating this idea, fungal ligases have rapid 5′-kinase kinetics, but a slower ligation step (57). Similarly, the products of metazoan Regulated IRE1-Dependent Decay (RIDD) are degraded by Xrn1 (58), suggesting that a 5′-kinase activity phosphorylates 5′-OH products of Ire1 cleavage prior to their degradation by Xrn1. Previous studies suggested that Hac1 is the sole target of Ire1 cleavage in budding yeast (59), but a more recent study suggested that Ire1 cleaves different substrates depending on its oligomeric state, and provided evidence that monomeric yeast Ire1 can cleave mRNAs other than HAC1 in vitro (60). We identified 23 mRNAs with 5′-OH fragments that increased more than 5-fold upon tunicamycin treatment, and found that seven of these fragments were present in the consensus motif 5′-CAUU|A-3′, suggesting that they are targeted by a site-specific endonuclease (Figure 5C–E, Supplementary Table S1). An obvious candidate for this endonuclease is Ire1, but this motif differs from the canonical Ire1 recognition motif for HAC1 mRNA in budding yeast (2), as well as motifs in other RIDD targets in fission yeast and metazoans (61). Further studies will determine whether these mRNAs are directly cleaved by Ire1, or are an indirect consequence of UPR activation. The ability to globally map 5′-OH RNA fragments will enable new studies of 5′-kinase enzymes that phosphorylate 5′-OH substrates. For example, the human RNA 5′-kinase hClp1 was first identified as a component of the CF IIm sub-complex of cleavage and polyadenylation specificity factor (62), and has recently been implicated in quality control of RNA 5′-termini during transfer RNA processing (63). Mutations in hClp1 cause severe neurodevelopmental phenotypes, and reduce its 5′-kinase activity in vitro (64,65). Previous studies proposed that reduction in 5′-kinase activity causes accumulation of aberrant—and presumably 5′-OH—RNA fragments (66), but it is currently not known how accumulation of these molecules might lead to human neurodevelopmental defects or the spinal motor neuron deficits that develop in mice expressing kinase-dead Clp1 (66). Turnover of these molecules may be analogous to the turnover of tRNA introns in yeast, where phosphorylation of 5′-OH tRNA introns leads to their rapid 5′→3′ decay by Xrn1 (31). Application of the 5′-OH cloning method in cells harboring mutant hClp1 with reduced 5′-kinase activity might identify new 5′-OH RNA substrates of hClp1, providing insight into this class of neurodevelopmental disorders.

ACCESSION NUMBER

Data deposited in NCBI GEO under accession number GSE61527.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  66 in total

1.  Relationship between internucleotide linkage geometry and the stability of RNA.

Authors:  G A Soukup; R R Breaker
Journal:  RNA       Date:  1999-10       Impact factor: 4.942

2.  5'-exoribonuclease 1: Xrn1.

Authors:  A Stevens
Journal:  Methods Enzymol       Date:  2001       Impact factor: 1.600

3.  The final step in the formation of 25S rRNA in Saccharomyces cerevisiae is performed by 5'-->3' exonucleases.

Authors:  T H Geerlings; J C Vos; H A Raué
Journal:  RNA       Date:  2000-12       Impact factor: 4.942

4.  Human pre-mRNA cleavage factor II(m) contains homologs of yeast proteins and bridges two other cleavage factors.

Authors:  H de Vries; U Rüegsegger; W Hübner; A Friedlein; H Langen; W Keller
Journal:  EMBO J       Date:  2000-11-01       Impact factor: 11.598

5.  Rewriting the rules for end joining via enzymatic splicing of DNA 3'-PO4 and 5'-OH ends.

Authors:  Ushati Das; Anupam K Chakravarty; Barbara S Remus; Stewart Shuman
Journal:  Proc Natl Acad Sci U S A       Date:  2013-11-11       Impact factor: 11.205

6.  Cellular mRNAs access second ORFs using a novel amino acid sequence-dependent coupled translation termination-reinitiation mechanism.

Authors:  Phillip S Gould; Nigel P Dyer; Wayne Croft; Sascha Ott; Andrew J Easton
Journal:  RNA       Date:  2014-01-10       Impact factor: 4.942

7.  CapSeq and CIP-TAP identify Pol II start sites and reveal capped small RNAs as C. elegans piRNA precursors.

Authors:  Weifeng Gu; Heng-Chi Lee; Daniel Chaves; Elaine M Youngman; Gregory J Pazour; Darryl Conte; Craig C Mello
Journal:  Cell       Date:  2012-12-21       Impact factor: 41.582

8.  CLP1 links tRNA metabolism to progressive motor-neuron loss.

Authors:  Toshikatsu Hanada; Stefan Weitzer; Barbara Mair; Christian Bernreuther; Brian J Wainger; Justin Ichida; Reiko Hanada; Michael Orthofer; Shane J Cronin; Vukoslav Komnenovic; Adi Minis; Fuminori Sato; Hiromitsu Mimata; Akihiko Yoshimura; Ido Tamir; Johannes Rainer; Reinhard Kofler; Avraham Yaron; Kevin C Eggan; Clifford J Woolf; Markus Glatzel; Ruth Herbst; Javier Martinez; Josef M Penninger
Journal:  Nature       Date:  2013-03-10       Impact factor: 49.962

9.  The UCSC Genome Browser database: 2014 update.

Authors:  Donna Karolchik; Galt P Barber; Jonathan Casper; Hiram Clawson; Melissa S Cline; Mark Diekhans; Timothy R Dreszer; Pauline A Fujita; Luvina Guruvadoo; Maximilian Haeussler; Rachel A Harte; Steve Heitner; Angie S Hinrichs; Katrina Learned; Brian T Lee; Chin H Li; Brian J Raney; Brooke Rhead; Kate R Rosenbloom; Cricket A Sloan; Matthew L Speir; Ann S Zweig; David Haussler; Robert M Kuhn; W James Kent
Journal:  Nucleic Acids Res       Date:  2013-11-21       Impact factor: 16.971

10.  A widespread self-cleaving ribozyme class is revealed by bioinformatics.

Authors:  Adam Roth; Zasha Weinberg; Andy G Y Chen; Peter B Kim; Tyler D Ames; Ronald R Breaker
Journal:  Nat Chem Biol       Date:  2013-11-17       Impact factor: 15.040

View more
  20 in total

Review 1.  IT'S 2 for the price of 1: Multifaceted ITS2 processing machines in RNA and DNA maintenance.

Authors:  Monica C Pillon; Yu-Hua Lo; Robin E Stanley
Journal:  DNA Repair (Amst)       Date:  2019-07-08

2.  Comparative parallel analysis of RNA ends identifies mRNA substrates of a tRNA splicing endonuclease-initiated mRNA decay pathway.

Authors:  Jennifer E Hurtig; Michelle A Steiger; Vinay K Nagarajan; Tao Li; Ti-Chun Chao; Kuang-Lei Tsai; Ambro van Hoof
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-09       Impact factor: 11.205

Review 3.  Analysis of post-transcriptional RNA metabolism in prokaryotes.

Authors:  Bijoy K Mohanty; Sidney R Kushner
Journal:  Methods       Date:  2018-11-15       Impact factor: 3.608

4.  Yeast Dxo1 is required for 25S rRNA maturation and acts as a transcriptome-wide distributive exonuclease.

Authors:  Jennifer E Hurtig; Ambro van Hoof
Journal:  RNA       Date:  2022-02-09       Impact factor: 5.636

Review 5.  Recent insights into the structure, function, and regulation of the eukaryotic transfer RNA splicing endonuclease complex.

Authors:  Cassandra K Hayne; Tanae A Lewis; Robin E Stanley
Journal:  Wiley Interdiscip Rev RNA       Date:  2022-02-14       Impact factor: 9.349

Review 6.  RNA damage in biological conflicts and the diversity of responding RNA repair systems.

Authors:  A Maxwell Burroughs; L Aravind
Journal:  Nucleic Acids Res       Date:  2016-08-17       Impact factor: 16.971

7.  Translation of poly(A) tails leads to precise mRNA cleavage.

Authors:  Nicholas R Guydosh; Rachel Green
Journal:  RNA       Date:  2017-02-13       Impact factor: 4.942

8.  Ribothrypsis, a novel process of canonical mRNA decay, mediates ribosome-phased mRNA endonucleolysis.

Authors:  Fadia Ibrahim; Manolis Maragkakis; Panagiotis Alexiou; Zissimos Mourelatos
Journal:  Nat Struct Mol Biol       Date:  2018-03-05       Impact factor: 15.369

9.  Characterization of the molecular crosstalk within the essential Grc3/Las1 pre-rRNA processing complex.

Authors:  Monica C Pillon; Mack Sobhany; Robin E Stanley
Journal:  RNA       Date:  2018-02-09       Impact factor: 4.942

Review 10.  Methodologies for Transcript Profiling Using Long-Read Technologies.

Authors:  Spyros Oikonomopoulos; Anthony Bayega; Somayyeh Fahiminiya; Haig Djambazian; Pierre Berube; Jiannis Ragoussis
Journal:  Front Genet       Date:  2020-07-07       Impact factor: 4.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.