| Literature DB >> 27242032 |
Eugene Baulin1, Victor Yacovlev2, Denis Khachko3, Sergei Spirin4, Mikhail Roytberg5.
Abstract
The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing user to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA-protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification.Database URL: http://server3.lpm.org.ru/urs/.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27242032 PMCID: PMC4885603 DOI: 10.1093/database/baw085
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.The stem H and its loop. Each base is represented with a dot; bases are enumerated from 1 to 57. Stems are represented with arcs. The stem H (in green) has the left wing at positions 9–10 and the right wing at positions 47–48. There are two H-ECR’s (see ‘Loops’ subsection below) inside H, the pseudoknotted nested ECR at positions 18–31, see blue arcs, and the ECR at positions 34–43, see the orange arc. The loop related to the stem H comprises positions 11–46 except at position lying inside H-related ECRs. Thus the loop of the stem H has the following structure: a side 11–17; a face of the pseudoknotted ECR {18, 31}, a side 32–33; a face of the stem {34; 43}, a side 44–46. Note that the side 11–17 contains a wing 13–15. Therefore the loop is a pseudoknotted multiple junction. The figure is prepared based on one of the figures from the website www.e-rna.org.
Figure 2.Loops of different stems can be of different types. Stem 1 – pseudoknotted multiple junction; Stem 2 – classical hairpin; Stems 3, 4, 7, 8 – pseudoknotted hairpins; Stem 5 – pseudoknotted internal loop; Stem 6 – isolated internal loop. Boxes and the structures outlined in purple highlight pseudoknots. The loop of stem 5 is shown in green.
Figure 3.Signature of the pseudoknotted ECR. (a) The ECR contains six stems; each stem is labeled with a letter (see the text). The word abcdDCefBAFE composed of such letters is a full signature of the pseudoknot. (b) The nested stems named cC and dD at (a) are removed. The letters for the remaining stems are reassigned. The word abcdBADC is an upper signature of the pseudoknot. (c) We combine each family of parallel stems into one arc. The letters are reassigned. The word abAB is a signature of the pseudoknot. The figure is prepared after the site www.e-rna.org.
Tertiary interactions in classical RNA structures and in pseudoknots
HL stands for Helix-Loop, HH stands for Helix-Helix, LL stands for Loop-Loop. Local interactions are the interactions inside one structural element or between two adjacent elements (e.g. between hairpin loop and its stem); other interactions are considered as long-range interactions. Each line corresponds to an interaction type. The cells contain corresponding (i) number of pairs of the type (‘Number of pairs’); (ii) fraction of LL and non-LL pairs among all pairs of the type (‘Fractions of structure types’); (iii) fraction of LL or non-LL pairs of the type among all LL or non-LL pairs (‘Fractions of pair types’). In the fields «Fractions of structure types» and «Fractions of pair types» the numbers >70% are colored in orange, the numbers from 40 to 70% are colored in yellow, the numbers from 20% to 40% are colored in green, and the numbers from 10 to 20% are colored in blue.