Literature DB >> 20495679

RNA pseudoknots: folding and finding.

Biao Liu, David H Mathews, Douglas H Turner.   

Abstract

RNA pseudoknots are important for function. Three-dimensional structural information is available, insights into factors affecting pseudoknot stability are being reported, and computer programs are available for predicting pseudoknots.

Entities:  

Year:  2010        PMID: 20495679      PMCID: PMC2873773          DOI: 10.3410/B2-8

Source DB:  PubMed          Journal:  F1000 Biol Rep        ISSN: 1757-594X


Introduction and context

RNA pseudoknots are important for many functions [1-3]. Pseudoknots are formed by pairing between bases in a loop region and complementary bases outside the loop (Figure 1). That is, for any base pairs i-j and k-l, i4] have an additional loop, so the three-loop nomenclature from Brierley et al. [1] is more generally applicable (Figure 1). Loop 2 typically has zero or one nucleotide, whereas loops 1 and 3 and the stems are more variable (Table 1).
Figure 1.

Simian retrovirus-1 (SRV-1) mutants’ frameshift efficiency [49-51] and their predicted free energy changes for the mutations

A three-loop nomenclature is used for the pseudoknot. The free energy changes are predicted by coupling the individual nearest neighbor model [37] with a statistical polymer model [21,22], neglecting tertiary interactions.

Table 1.

Three-dimensional structures of short pseudoknots

Type of Organism or RNARNAFunctionMethodPDB #Ref.Size, ntsStem 1, bpsLoop 1, ntsStem 2, bpsLoop 2, ntsLoop 3, ntsCoaxial stack or bend
Mammalian retrovirusRibosomal frameshiftingCoaxial stack
Simian retrovirus-1 (SRV-1)NMR1E95[51]36616012
Mouse mammary tumor virus (MMTV)NMR1KPD[56]3252618
1KAJ[57]
1RNK[58]
Plant Luteoviridae virusesBeet western yellow virus (BWYV)X-ray437D[5]2652317Bend
1L2X[9]
Sugarcane yellow leaf virus (ScYLV)NMR2AP0[8]2852319
1YG4[2]
Potato leaf roll virus (PLRV)X-ray2A43[7]2642319
Pea enation mosaic virus (PEMV)NMR1KPY[6]3352317
HumanTelomerase RNATelomere maintenanceNMR1YMO[3]4668908Slight bend
2K96[43]
Plant virus (tymovirus) 3′ NCRTurnip yellow mosaic virus (TYMV)t-RNA likeNMR3PHP[59]2334503Coaxial stack
1A60[60]
BacteriaAquifex aeolicus tmRNATrans-translationNMR2G1W[61]2141315Bend

bps, base pairs; NCR, non-coding region; NMR, nuclear magnetic resonance; nts, nucleotides; PDB, Protein Data Bank; Ref., reference.

Simian retrovirus-1 (SRV-1) mutants’ frameshift efficiency [49-51] and their predicted free energy changes for the mutations

A three-loop nomenclature is used for the pseudoknot. The free energy changes are predicted by coupling the individual nearest neighbor model [37] with a statistical polymer model [21,22], neglecting tertiary interactions. bps, base pairs; NCR, non-coding region; NMR, nuclear magnetic resonance; nts, nucleotides; PDB, Protein Data Bank; Ref., reference.

Major recent advances

Folding of pseudoknots

Quite a few three-dimensional (3D) structures have been determined for isolated pseudoknots of fewer than 50 nucleotides (Table 1). Some tertiary interactions are conserved in particular classes and are essential for biological activity [2,3]. Examples include a quadruple-base interaction in pseudoknots from Luteoviridae viruses [2,5-9] and triplexes in telomerase RNA [3] and in viral mRNA that undergo -1 frameshifting [10]. Structures of natural pseudoknots bound to small molecules are being reported [11-13], and structures of larger RNAs are revealing long-range pseudoknots [14-20]. The stability of RNA pseudoknots is a key factor determining structure-function relationships [10] and is important for predicting RNA structure [21-32]. Thermodynamic measurements have started appearing [3,33-36]. A statistical polymer model for loops [21,22] coupled with the INN-HB (Individual Nearest Neighbor-Hydrogen Bonding) model for stems [37] allows estimates of the stability of small pseudoknots. Coaxial stacking of the two stems can be included, although this is not always observed in 3D structures [5]. Contributions from tertiary interactions between the loops and stems are neglected because little is known about their thermodynamics. Mechanical unfolding of single molecules by optical tweezers [10,38,39] reveals that frameshifting efficiency is highly related to the mechanical stability of pseudoknots, as suggested from cryoelectron microscopy [40] and prediction by a statistical polymer model [41]. Single-molecule experiments also indicate that pseudoknot folding and unfolding at low forces are stepwise [42] and that the presence of Mg2+ stabilizes the pseudoknot more than hairpins [39]. A better understanding of pseudoknot thermodynamic and mechanical stability and of folding dynamics will help reveal structure-function relationships [43].

Finding pseudoknots

Finding pseudoknots by computationally folding RNA sequences is a difficult problem. Because of computational cost, most of the popular dynamic programming algorithms for predicting the lowest free energy structure do not allow pseudoknots. With state-of-the-art knowledge, finding the lowest free energy structure with pseudoknots takes an exponentially increasing amount of time as the sequence gets longer; that is, the problem is NP-complete [23,24]. To predict low free energy structures with pseudoknots, roughly four different practical approaches are used by available programs. The first approach is to apply stochastic methods either to simulate folding pathways or to sample structures [25,26,44]. With these algorithms, structures are revised according to an element of chance and new pairs that are pseudoknotted with existing pairs can be added. A variation on this theme follows a folding pathway to find low free energy structures but is deterministic in its choices of stems [27]. The second approach is to use a dynamic programming algorithm in which the possible topologies of the predicted structures are limited [28-30]. The possible topologies predicted by a number of different programs have been examined [31]. A third approach is to assemble structures from component base pairs using a graph-theoretic approach [45,46]. A fourth approach is to iteratively build structures using algorithms that cannot predict pseudoknots with a single iteration [47,48]. The Nuclear magnetic resonance (NMR)-Assisted Prediction of RNA Secondary Structure (NAPSS) algorithm is an iterative approach that includes constraints from simple NMR experiments to improve predictions [32]. A number of the programs cited above either require multiple homologous sequences or are capable of using them to find a conserved structure [44-46,48]. These programs should be more accurate at structure prediction than those that use a single sequence because of the additional information available in the multiple sequences. For example, pairs that are not conserved in the set of sequences can be excluded from the final structure. A recent benchmark of the accuracy of structure prediction using single sequences provides guidance for choosing a program [47]; Table 2 lists programs available for free. For 12 RNA sequences of 210 nucleotides or longer, a pathway folding algorithm [25] had the highest accuracy. The iterative approaches also performed similarly and were time-efficient [47,48].
Table 2.

Available programs for secondary structure prediction including pseudoknots

Program nameWebsite
HotKnots [47]http://www.cs.ubc.ca/labs/beta/Software/HotKnots/
hxmatch [46]http://www.tbi.univie.ac.at/papers/SUPPLEMENTS/HXMATCH/
ILM [48]http://cs.utsa.edu/~jruan/Software.html
KineFold [26]http://kinefold.curie.fr/
pknotsRG [29]http://bibiserv.techfak.uni-bielefeld.de/pknotsrg/
SimulFold [44]http://people.cs.ubc.ca/~irmtraud/simulfold/
vsfold5 [27]http://www.rna.it-chiba.ac.jp/~vsfold/vsfold5/

This table provides a list of programs that are available for free and can make predictions on a desktop computer for sequences that are up to at least 200 nucleotides long. Note that hxmatch is designed to find conserved structures for sequences in a multiple alignment and that ILM can be run with one sequence or with an alignment of multiple sequences. SimulFold works on a set of homologous sequences and infers the sequence alignment. KineFold and vsfold5 are available as web servers. pknotsRG is available for use on a web server or for download.

This table provides a list of programs that are available for free and can make predictions on a desktop computer for sequences that are up to at least 200 nucleotides long. Note that hxmatch is designed to find conserved structures for sequences in a multiple alignment and that ILM can be run with one sequence or with an alignment of multiple sequences. SimulFold works on a set of homologous sequences and infers the sequence alignment. KineFold and vsfold5 are available as web servers. pknotsRG is available for use on a web server or for download.

Future directions

Predictions of the occurrence and stabilities of pseudoknots can be improved. The best-performing program tested by Ren et al. [47] predicted only 57% of known canonical base pairs, and only 39% of the predicted pairs were in the known structures. New insights into the structures and stabilities will suggest computational simplifications for existing strategies and improve approximations of stabilities. New methods to find pseudoknots conserved in multiple sequences and to incorporate more experimental data will restrain folding space. Finding more pseudoknots will likely expand the types of functions known for pseudoknots. Increased understanding of the sequence dependence of pseudoknot stabilities can improve comparisons with functional studies. For example, the relative frameshifting efficiencies of simian retrovirus-1 (SRV-1) pseudoknot and its mutants [49,50] qualitatively correlate with their predicted relative stabilities (Figure 1, revised from [51]). The exception is mutant U11A34, which is predicted to be less stable than the wild-type pseudoknot but shows higher efficiency of frameshifting. This could result from an additional tertiary interaction if a U11A34/A7 base triple is formed. Such comparisons can suggest new experiments. Future work can also resolve current ambiguities. For example, there is disagreement about the role of pseudoknot stability in human telomerase RNA. The pseudoknot domain may act as a molecular switch [52], in which the pseudoknot and the stem 1 hairpin have nearly equal stability [34], or only the pseudoknot conformation may be important for function [53]. A two-base mutation destabilizing the human telomerase pseudoknot is found in some patients with the inherited disease dyskeratosis congenita [54]. A theoretical calculation suggests that folding kinetics of the pseudoknot may determine activity [55]. There is much more to be revealed about the roles of pseudoknots and their modus operandi.
  59 in total

1.  Crystal structure of a luteoviral RNA pseudoknot and model for a minimal ribosomal frameshifting motif.

Authors:  Pradeep S Pallan; William S Marshall; Joel Harp; Frederic C Jewett; Zdzislaw Wawrzak; Bernard A Brown; Alexander Rich; Martin Egli
Journal:  Biochemistry       Date:  2005-08-30       Impact factor: 3.162

2.  Structure of the Tetrahymena ribozyme: base triple sandwich and metal ion at the active site.

Authors:  Feng Guo; Anne R Gooding; Thomas R Cech
Journal:  Mol Cell       Date:  2004-11-05       Impact factor: 17.970

3.  Functional analysis of the pseudoknot structure in human telomerase RNA.

Authors:  Jiunn-Liang Chen; Carol W Greider
Journal:  Proc Natl Acad Sci U S A       Date:  2005-04-22       Impact factor: 11.205

4.  An APL-programmed genetic algorithm for the prediction of RNA secondary structure.

Authors:  F H van Batenburg; A P Gultyaev; C W Pleij
Journal:  J Theor Biol       Date:  1995-06-07       Impact factor: 2.691

5.  Metal ions and flexibility in a viral RNA pseudoknot at atomic resolution.

Authors:  Martin Egli; George Minasov; Li Su; Alexander Rich
Journal:  Proc Natl Acad Sci U S A       Date:  2002-03-19       Impact factor: 11.205

6.  Mutations linked to dyskeratosis congenita cause changes in the structural equilibrium in telomerase RNA.

Authors:  Carla A Theimer; L David Finger; Lukas Trantirek; Juli Feigon
Journal:  Proc Natl Acad Sci U S A       Date:  2003-01-13       Impact factor: 11.205

7.  Predicting RNA pseudoknot folding thermodynamics.

Authors:  Song Cao; Shi-Jie Chen
Journal:  Nucleic Acids Res       Date:  2006-05-18       Impact factor: 16.971

8.  NMR structure of the Aquifex aeolicus tmRNA pseudoknot PK1: new insights into the recoding event of the ribosomal trans-translation.

Authors:  Sylvie Nonin-Lecomte; Brice Felden; Frédéric Dardel
Journal:  Nucleic Acids Res       Date:  2006-04-04       Impact factor: 16.971

9.  Minor groove RNA triplex in the crystal structure of a ribosomal frameshifting viral pseudoknot.

Authors:  L Su; L Chen; M Egli; J M Berger; A Rich
Journal:  Nat Struct Biol       Date:  1999-03

10.  Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics.

Authors:  Jens Reeder; Robert Giegerich
Journal:  BMC Bioinformatics       Date:  2004-08-04       Impact factor: 3.169

View more
  20 in total

1.  TurboKnot: rapid prediction of conserved RNA secondary structures including pseudoknots.

Authors:  Matthew G Seetin; David H Mathews
Journal:  Bioinformatics       Date:  2012-01-27       Impact factor: 6.937

2.  RNA secondary structure analysis using RNAstructure.

Authors:  David H Mathews
Journal:  Curr Protoc Bioinformatics       Date:  2006-03

3.  Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.

Authors:  Christine E Hajdin; Stanislav Bellaousov; Wayne Huggins; Christopher W Leonard; David H Mathews; Kevin M Weeks
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-15       Impact factor: 11.205

Review 4.  RNA families in Epstein-Barr virus.

Authors:  Walter N Moss; Nara Lee; Genaro Pimienta; Joan A Steitz
Journal:  RNA Biol       Date:  2013-12-20       Impact factor: 4.652

5.  Modeling RNA secondary structure folding ensembles using SHAPE mapping data.

Authors:  Aleksandar Spasic; Sarah M Assmann; Philip C Bevilacqua; David H Mathews
Journal:  Nucleic Acids Res       Date:  2018-01-09       Impact factor: 16.971

Review 6.  Computational approaches for the discovery of splicing regulatory RNA structures.

Authors:  Ryan J Andrews; Walter N Moss
Journal:  Biochim Biophys Acta Gene Regul Mech       Date:  2019-04-29       Impact factor: 4.490

7.  A Polymer Physics Framework for the Entropy of Arbitrary Pseudoknots.

Authors:  Ofer Kimchi; Tristan Cragnolini; Michael P Brenner; Lucy J Colwell
Journal:  Biophys J       Date:  2019-07-10       Impact factor: 4.033

8.  Using the RNAstructure Software Package to Predict Conserved RNA Structures.

Authors:  David H Mathews
Journal:  Curr Protoc Bioinformatics       Date:  2014-06-17

9.  Bridging the gap between in vitro and in vivo RNA folding.

Authors:  Kathleen A Leamy; Sarah M Assmann; David H Mathews; Philip C Bevilacqua
Journal:  Q Rev Biophys       Date:  2016-06-24       Impact factor: 5.318

10.  Longitudinal analysis of intra-host simian immunodeficiency virus recombination in varied tissues of the rhesus macaque model for neuroAIDS.

Authors:  Susanna L Lamers; David J Nolan; Samantha L Strickland; Mattia Prosperi; Gary B Fogel; Maureen M Goodenow; Marco Salemi
Journal:  J Gen Virol       Date:  2013-08-20       Impact factor: 3.891

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.