| Literature DB >> 29358234 |
Ester Saus1,2, Jesse R Willis1,2, Leszek P Pryszcz1,2,3, Ahmed Hafez4, Carlos Llorens4, Heinz Himmelbauer1,2,5, Toni Gabaldón1,2,6.
Abstract
RNA molecules play important roles in virtually every cellular process. These functions are often mediated through the adoption of specific structures that enable RNAs to interact with other molecules. Thus, determining the secondary structures of RNAs is central to understanding their function and evolution. In recent years several sequencing-based approaches have been developed that allow probing structural features of thousands of RNA molecules present in a sample. Here, we describe nextPARS, a novel Illumina-based implementation of in vitro parallel probing of RNA structures. Our approach achieves comparable accuracy to previous implementations, while enabling higher throughput and sample multiplexing.Entities:
Keywords: RNA secondary structure; genome-wide enzymatic probing; multiplex sequencing
Mesh:
Substances:
Year: 2018 PMID: 29358234 PMCID: PMC5855959 DOI: 10.1261/rna.063073.117
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
Summary and main characteristics of methods to probe RNA secondary structure
FIGURE 1.Summary of the different steps performed in the nextPARS protocol. From the cells or tissue of interest (A), total RNA is extracted (B) and then poly(A)+ RNA is selected (C) to initially prepare the samples for nextPARS analyses. Once the quality and quantity of poly(A)+ RNA samples is confirmed, RNA samples are denatured and in vitro folded to perform the enzymatic probing of the molecules with the corresponding concentrations of RNase V1 and S1 nuclease (D). For the library preparation using the Illumina TruSeq Small RNA Sample Preparation Kit, an initial phosphatase treatment of the 3′ends and a kinase treatment of the 5′ ends are required (E) to then ligate the corresponding 5′ and 3′ adapters at the ends of the RNA fragments (F). Then a reverse transcription of the RNA fragments and a PCR amplification are performed to obtain the library (G). The library is size-selected to get rid of primers and adapters dimers using an acrylamide gel and a final quality control is performed (H). Libraries are sequenced in single-reads with read lengths of 50 nucleotides (nt) using Illumina sequencing platforms (I) and computational analyses are done as described in the Materials and Methods section in order to map Illumina reads and determine the enzymatic cleavage points, using the first nucleotide in the 5′ end of the reads (which correspond to the 5′end of original RNA fragments) (J).
Correlations within nextPARS replicates, within PARS replicates, and between nextPARS and PARS
FIGURE 2.nextPARS results for TETp4p6 fragment. (A) Sites having a nextPars score higher than 0.5 (predicted paired site) or lower than −0.5 (predicted unpaired site) are indicated as green (+1, double-stranded) and pink (−1, single-stranded), respectively, on the reference secondary structure of TETp4p6 RNA according to the PDB database and visualized using the VARNA program (Visualization Applet for RNA, http://varna.lri.fr/ [Darty et al. 2009]). Nucleotides that do not pass the threshold are assigned as 0. Green crosses (+) show V1 cuts (paired sites) which target double-stranded nucleotides in the reference structure, and pink asterisks (*) show S1 cuts (unpaired sites) which target single-stranded nucleotides in the reference structure. Numbers and percentages of bases detected for each conformation are shown below the RNA molecule. (B) Plot comparing both PARS and nextPARS techniques: normalized number of reads for V1 enzyme are plotted for each technique. (C) Plot comparing the results obtained with nextPARS with those of previously published results obtained by traditional footprinting experiments (Kertesz et al. 2010).
FIGURE 3.Comparison of scoring methods from nextPARS and PARS. Receiver operator characteristic (ROC) curves to assess the ability to determine whether a nucleotide is paired or unpaired. They were generated by varying a score threshold at 101 evenly spaced values over the full range of scores for the five benchmark molecules with known structures. True positives here are considered to be those sites that are paired in the reference structure with a score greater than the given threshold, while true negatives are those sites that are unpaired in the reference structure with a score lower than the given threshold. The blue curves are the nextPARS scores for the benchmark molecules in the two different data sets and the red curves are for the PARS scores in the two data sets. Solid curves are for a data set with its own scoring method, dashed curves are for a data set with the opposite scoring method. Also included in the legend is the area under the curve (AUC).
FIGURE 4.Probing of RNA molecules with RNase A enzyme. Examples of the signals obtained in some RNA molecules when performing nextPARS using RNase A, an enzyme that cuts specifically in single-stranded cytosines (C) and uracils (U). Scores were calculated for each site by first capping all read counts for a given transcript at the 95th percentile and then normalizing to have a maximum of 1 (as done in the “Computation of nextPARS scores” of the Materials and Methods, but since Rnase A is the only enzyme in this case, there will be no subtraction performed, so all values will then fall in the range of 0 to 1). Cuts are considered for signals above a threshold of 0.8. (A) nextPARS signals above the threshold of 0.8 are depicted for TETp4p6 and TETp9-9.1 RNA fragments after probing them by nextPARS using RNase A. Secondary structures of the RNA fragments according to PDB are displayed using VARNA program (Visualization Applet for RNA, http://varna.lri.fr/ [Darty et al. 2009]). In green, nucleotides with a cut signal above 0.8; green crosses (+) show cuts obtained in a C or U; pink asterisks (*) show cuts obtained in a G or A; and blue arrows (→) show cuts obtained in double-stranded positions. (B) Table summarizing the total number (N) and percentages (%) of cuts with a signal above 0.8 threshold obtained in five different RNA fragments with known secondary structure (TETp4p6, TETp9-9.1, SRA, B2, U1): first column, N and % of cuts with a signal above 0.8 in the molecules; second column, N and % of these cuts in C or U nucleotides; and third column, N and % of cuts in G or A nucleotides.