| Literature DB >> 24664469 |
Jason Talkish, Gemma May, Yizhu Lin, John L Woolford, C Joel McManus.
Abstract
The functions of RNA molecules are intimately linked to their ability to fold into complex secondary and tertiary structures. Thus, understanding how these molecules fold is essential to determining how they function. Current methods for investigating RNA structure often use small molecules, enzymes, or ions that cleave or modify the RNA in a solvent-accessible manner. While these methods have been invaluable to understanding RNA structure, they can be fairly labor intensive and often focus on short regions of single RNAs. Here we present a new method (Mod-seq) and data analysis pipeline (Mod-seeker) for assaying the structure of RNAs by high-throughput sequencing. This technique can be utilized both in vivo and in vitro, with any small molecule that modifies RNA and consequently impedes reverse transcriptase. As proof-of-principle, we used dimethyl sulfate (DMS) to probe the in vivo structure of total cellular RNAs in Saccharomyces cerevisiae. Mod-seq analysis simultaneously revealed secondary structural information for all four ribosomal RNAs and 32 additional noncoding RNAs. We further show that Mod-seq can be used to detect structural changes in 5.8S and 25S rRNAs in the absence of ribosomal protein L26, correctly identifying its binding site on the ribosome. While this method is applicable to RNAs of any length, its high-throughput nature makes Mod-seq ideal for studying long RNAs and complex RNA mixtures.Entities:
Keywords: RNA structure; RNA-binding protein; high-throughput sequencing
Mesh:
Substances:
Year: 2014 PMID: 24664469 PMCID: PMC3988572 DOI: 10.1261/rna.042218.113
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
FIGURE 1.Mapping DMS modification sites with high-throughput sequencing. (A) RNA from DMS-treated cells is randomly fragmented, ligated to specific 5′ (blue) and 3′ (green) adapter oligonucleotides, and reverse transcribed. cDNA is circularized and products containing 5′ adapter sequences are reduced via subtractive hybridization. Remaining cDNA products are PCR amplified to introduce Illumina library sequences. Each library is given a unique 6-nt barcode and pooled for high-throughput sequencing. Reads containing the 5′ adapter sequence are removed from further analysis. (B) Genome browser view (IGV) showing the number of 5′ read ends from DMS treated (+, purple) and untreated (−, blue) samples. The peak in the “no DMS” sample corresponds to a known site of endogenous rRNA methylation. The difference between these plots is shown in red, after shifting peaks 1 nt 5′ to represent actual sites of chemical modification. (C) Enlarged genome browser view of structure probing on the 5.8S rRNA. Reverse-transcriptase blocks due to DMS modification are mostly A and C nucleotides, and map to known single-stranded regions in 5.8S rRNA.
FIGURE 2.Sites of yeast in vivo DMS modification identified by Mod-seq. (A) Example of Mod-seq sites on the 522-nt-long SRP RNA (SCR1). Nucleotides with significantly enriched sites of DMS modification are shown with colored bars depicting relative modification strength (fold enrichment). (B) Nucleotide composition of all significant modification sites identified. Bar graph shows the number of significant modifications at each of the four RNA nucleotide bases. Bars are colored to represent the distribution of fold enrichment for each nucleotide. Sites with high fold enrichment are almost exclusively As and Cs, while significant modification of Gs have smaller enrichment values.
FIGURE 3.Footprinting the binding site of an RNA binding protein. (A) Mod-seq probing results in the L26 deletion mutant and wild-type cells. (B) Views of the X-ray-crystal structure model of the yeast 80S ribosome generated with Pymol (PDB: 3U5F, 3U5G, 3U5H, 3U5I). The large (left) and small (right) subunits are shown in different shades of gray (upper). Ribosomal protein L26 is shown in a transparent surface rendering. Nucleotides with increased modification by DMS in the absence of L26 protein in vivo are shown in colors representing their fold-enrichment values. Magnified views showing residues with the largest differences in DMS sensitivity (lower; red and orange nucleotides). These nucleotides cluster around the site of L26 binding.