| Literature DB >> 28213527 |
Andy Phan1, Katherine Mailey1, Jessica Saeki1, Xiaobo Gu1,2, Susan J Schroeder3,2.
Abstract
Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites.Entities:
Keywords: RNA secondary structure prediction; internal ribosome entry site; pyrimidine-rich internal loops; thermodynamics; viral RNA
Mesh:
Substances:
Year: 2017 PMID: 28213527 PMCID: PMC5393185 DOI: 10.1261/rna.059865.116
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
FIGURE 1.A comparison of the internal loop characteristics in viral IRES and the thermodynamic database for RNA structure prediction. Data for RNA internal loops are from an IRES database (Mokrejs et al. 2010) (top row). The RNA secondary structures in the IRES database were experimentally determined from phylogeny and chemical or enzymatic probing. Analysis for RNA internal loops in the thermodynamic database (Santa Lucia et al. 1990, 1991; Peritz et al. 1991; Walter et al. 1994; Wu et al. 1995; Schroeder et al. 1996; Xia et al. 1997; Schroeder and Turner 2000, 2001; Burkard et al. 2001; Chen et al. 2004, 2005, 2006, 2009; Bourdelat-Parks and Wartell 2005; Chen and Turner 2006; Badhwar et al. 2007; Christiansen and Znosko 2008, 2009; Hausmann and Znosko 2012) (bottom row). Loop sizes are described by the number of nucleotides on each side of a loop. For example, a 1 × 3 loop has 1 nt opposite 3 nt. N is any number >3 nt. When the number of nucleotides on each side of the loop is different, then the loop is described as asymmetric. A loop with both pyrimidine and purine nucleotides is described as a mix. The IRES database and thermodynamic database contain 107 and 469 total loops, respectively. Note that the thermodynamic database does not include the new measurements presented in this work. For comparisons of loop size, 1 × 2 loops are shown in blue; 1 × 3 loops in peach; 1 × n loops in green; 2 × 2 loops in black; 2 × 3 loops in gray; 2 × n loops in bright pink; 3 × 3 loops in yellow; 3 × n loops in purple; and n × n loops in pastel pink. For comparisons of loop symmetry, symmetric loops are shown in blue, and asymmetric loops are shown in pink. For comparisons of nucleotide content, purine-only loops are shown in black, pyrimidine-only loops in gray; and loops with a mix of purine and pyrimidines in green.
RNA thermodynamic data
FIGURE 2.Loops with all cytidine nucleotides are more stable at pH 5. Comparison of cytidine loop free energies at pH 7 (red bars) and pH 5 (blue bars). Standard error was determined by comparing the free energy errors from the linear and melt curve fits. (*) Data obtained from Santa Lucia et al. (1991). The duplex sequence is 5′CGC CC GCG/3′GCG CC CGC and has only 3 bp in the stems. The difference in the length of the surrounding stems may explain the difference in stabilities for this loop (Schroeder and Turner 2000). (**) Data obtained from Schroeder and Turner (2000).
RNA loop prediction rules
FIGURE 3.Loop initiation terms in the 2016 and 2004 models. Loop initiation parameters include the terms for the free energy penalties for loop asymmetry. The parameters are the constant value in the linear regression analysis of each type of loop.