| Literature DB >> 22529851 |
Shubhra Sankar Ray1, Sukanya Halder, Stephanie Kaypee, Dhananjay Bhattacharyya.
Abstract
One of the important goals of most biological investigations is to classify and organize the experimental findings so that they are readily useful for deriving generalized rules. Although there is a huge amount of information on RNA structures in PDB, there are redundant files, ambiguous synthetic sequences etc. Moreover, a systematic hierarchical organization, reflecting RNA classification, is missing in PDB. In this investigation, we have classified all the available RNA structures from PDB through a programmatic approach. Hence, it would be now a simple assignment to regularly update the classification as and when new structures are released. The classification can further determine (i) a non-redundant set of RNA structures and (ii) if available, a set of structures of identical sequence and function, which can highlight structural polymorphism, ligand-induced conformational alterations etc. Presently, we have classified the available structures (2095 PDB entries having RNA chain longer than nine nucleotides solved by X-ray crystallography or NMR spectroscopy) into nine functional classes. The structures of same function and same source are mostly seen to be similar with subtle differences depending on their functional complexation. The web-server is available online at http://www.saha.ac.in/biop/www/HD-RNAS.html and is updated regularly.Entities:
Keywords: RNA classification; RNA crystal structures; RNA database; functional RNA; functional annotation; structure prediction
Year: 2012 PMID: 22529851 PMCID: PMC3329738 DOI: 10.3389/fgene.2012.00059
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1HD-RNAS homepage.
Special keywords and their corresponding classification for a RNA chain.
| Keywords | Significance |
|---|---|
| UTR, EXON, INTRON | mRNA |
| CODON, ANTIOCODON, ACCEPTOR tRNA-fragment, A-site, P-site, E-site, tRNA X-MER (e.g., tRNA 30-MER) | tRNA |
| OPAL or AMBER or OCHRE | tRNA – OPAL or AMBER or OCHRE |
| RIBONUCLEASE P, RNASE P | Ribonuclease |
| FMET, FME, INITIATOR, INI, PRIMER | tRNA |
| S-TURN, CATALYTIC RNA, HAMMERHEAD | Ribozyme |
| APTAMER | Riboswitch |
| 4.8S, 5S, 5.8S, 16S, 18S, 23S, 28S, 30S, 50S, 70S, 80S | rRNA |
| 4.5S, 7S, 7SL | SRP-RNA |
Figure 2Proper classification of individual RNA chains present in 2J00 in HD-RNAS.
Figure 3BLAST search results for E. coli tRNA.
Validation of BLAST search results.
| PDB | Chain | BLAST result | PDB description |
|---|---|---|---|
| 1ASY | R, S | BLAST gb M25168.1 YSCTRDCER S.cerevisiae Asp-tRNA 149 2e-34 | Yeast aspartyl-tRNA synthetase complexed with tRNA Asp |
| 1ASZ | R, S | BLAST gb M25168.1 YSCTRDCER S.cerevisiae Asp-tRNA 149 2e-34 | Yeast aspartyl-tRNA synthetase complexed with tRNA Asp |
| 1GID | A, B | BLAST emb V01416.1 Fragments of a Tetrahymena gene for 26s rRNA with… 311 8e-83 | Tetrahymena group I INTRON |
| 1HR2 | A, B | BLAST emb V01416.1 Fragments of a Tetrahymena gene for 26s rRNA with… 295 4e-78 | Tetrahymena group I INTRON |
| 1IVS | C, D | BLAST dbj AB080139.1 | Thermus thermophilus valyl-tRNA synthetase complexed with tRNA(VAL) |
| 1U9S | A | BLAST gb AE017221.1 | Thermus thermophilus |
| 1VTQ | A | BLAST gb M25168.1 YSCTRDCER S.cerevisiae Asp-tRNA 149 2e-34 | Yeast t-RNA-Asp |
| 2CSX | C, D | BLAST gb AE000657.1 Aquifex aeolicus VF5, complete genome 149 2e-34 | Aquifex aeolicus methionyl-tRNA synthetase complexed with tRNA(MET) |
| 2CT8 | C, D | BLAST gb AE000657.1 Aquifex aeolicus VF5, complete genome 147 1e-33 | Aquifex aeolicus methionyl-tRNA synthetase complexed with tRNA(MET) |
| 2CV0 | C, D | BLAST gb AE017221.1 | Glutamyl-tRNA synthetase from thermus thermophilus in complex with tRNA(GLU) |
| 2CV1 | C, D | BLAST gb AE017221.1 | Glutamyl-tRNA synthetase from thermus thermophilus in complex with tRNA(GLU) |
| 2CV2 | C, D | BLAST gb AE017221.1 | Glutamyl-tRNA synthetase from thermus thermophilus in complex with tRNA(GLU) |
| 2DXI | C, D | BLAST gb AE017221.1 | Glutamyl-tRNA synthetase from thermus thermophilus complexed with tRNA(GLU) |
| 2IHX | B | BLAST gb M21526.1 ALRSRCAC Rous sarcoma virus defective mutant PR2257… 149 2e-34 | Sequence occurs naturally in rous sarcoma virus (RSV) |
| 2NOQ | A | BLAST gb AF218039.1 AF218039 Cricket paralysis virus non-structural pol… 377 e-102 | Ribosome-bound cricket paralysis virus IRES RNA |
| 2ZNI | C, D | BLAST dbj AP008230.1 | Pyrrolysyl-tRNA synthetase-tRNA(PYL) complex from desulfitobacterium hafniense |
| 3AKZ | F, H, G, E | BLAST gb AE000512.1 | Thermotoga maritima non-discriminating glutamyl-tRNA synthetase in complex with tRNAGLN |
| 3AL0 | E | BLAST gb AE000512.1 | Glutamine transamidosome from thermotoga maritima in the glutamylation state |
| 3FOZ | D | BLAST gb CP002185.1 Escherichia coli W, complete genome 137 3e-3137 8e-31 | |
| 3Q50 | A | BLAST gb AE008691.1 | Based on the sequence of class I, type 1 PREQ1 riboswitch aptamer from T. tengcongensis |
| 3Q51 | A | BLAST gb AE008691.1 | Based on the sequence of class I, type 1 PREQ1 riboswitch aptamer from T. tengcongensis |
Subclasses where the number of PDB files are 10 or greater and their structural variation.
| Functional type | Organism | No. of PDB files | Representative structure | Average similarity | SD |
|---|---|---|---|---|---|
| 16S rRNA | 26 (25) | 3OFP_A | 0.9053 | 0.0403 | |
| 16S rRNA | 73 (69) | 2VQE_A | 0.8556 | 0.0579 | |
| 5S rRNA | 12 (12) | 2ZJR_Y | 0.6792 | 0.1558 | |
| 5S rRNA | 27 (24) | 3OFQ_B | 0.8896 | 0.0454 | |
| 5S rRNA | 61 (61) | 1VQO_9 | 0.9077 | 0.0451 | |
| 5S rRNA | 45 (44) | 2J01_B | 0.8349 | 0.0952 | |
| 23S rRNA | 23 (23) | 2ZJR_X | 0.6582 | 0.1306 | |
| 23S rRNA | 30 (30) | 3OFR_A | 0.742 | 0.2972 | |
| 23S rRNA | 65 (58) | 1VQO_0 | 0.9714 | 0.0128 | |
| 23S rRNA | 44 (41) | 2J01_A | 0.9516 | 0.03 | |
| tRNAfMet | 20 (20) | 2FMT_C/D | 0.8499 | 0.0943 | |
| tRNAGln | 14 (14) | 1ZJW | 0.7755 | 0.2094 | |
| tRNAPhe | 55 (55) | 3FOZ_D | 0.5269 | 0.2425 |
*Numbers in parentheses give the total no. of structures after removing outliers.
Function assignment to RNAs of the other-RNA class.
| Query | Length | Original function | Best match | Predicted function | Best score | Average | SD | Prediction score |
|---|---|---|---|---|---|---|---|---|
| 1FIR-A | 76 | tRNALys | 1EVV-A | tRNAPhe | 362.00 | 131.60 | 13.62 | 16.92 |
| 2B57-A | 65 | Riboswitch | 2XNZ-A | Riboswitch | 307.00 | 99.34 | 11.35 | 18.29 |
| 2ZZM-B | 88 | tRNALeu | 1WZ2-D | tRNALeu | 326.50 | 118.16 | 14.40 | 14.46 |
| 2ZZN-C | 75 | tRNACys | 3KFU-L | tRNAAsn | 304.00 | 128.54 | 12.98 | 13.52 |
| 2ZZN-D | 75 | tRNACys | 1F7U-B | tRNAArg | 309.00 | 135.00 | 13.32 | 13.06 |
| 3KIQ-a | 1504 | rRNA | 2VQE-A | 16S rRNA | 5904.50 | 2062.09 | 46.95 | 81.85 |
| 3KIQ-v | 77 | tRNA | 2FMT-D | tRNAFmet | 349.00 | 150.39 | 15.36 | 12.93 |
| 3KIR-A | 2848 | rRNA | 2XG0-A | rRNA fragment | 12839.00 | 3764.09 | 55.97 | 162.13 |
| 3KIR-B | 119 | rRNA | 2J01-B | 5S rRNA | 505.00 | 156.37 | 13.11 | 26.60 |
| 3KIS-a | 1504 | rRNA | 2VQE-A | 16S rRNA | 5904.50 | 2068.17 | 46.28 | 82.89 |
| 3KIS-v | 77 | tRNA | 2ZUE-B | tRNAArg | 342.00 | 146.26 | 11.99 | 16.32 |
| 3KIT-A | 2848 | rRNA | 2XG0-A | rRNA fragment | 12821.00 | 3779.68 | 61.28 | 147.53 |
| 3KIT-B | 119 | rRNA | 2J01-B | 5S rRNA | 523.00 | 148.25 | 13.61 | 27.54 |
| 3KNN-W | 75 | tRNA | 1F7U-B | tRNAArg | 279.00 | 129.06 | 12.13 | 12.36 |
| 3KNN-X | 77 | tRNAFmet | 2FMT-D | tRNAFmet | 331.00 | 147.96 | 14.51 | 12.62 |
| 1VFG-D | 75 | Primer tRNA | 2V0G-F | tRNALeu | 138.00 | 93.55 | 8.51 | 5.22 |
| 3KNN-Y | 75 | tRNA | 2XQD-W | tRNA | 208.00 | 146.06 | 11.57 | 5.35 |
| 3MOJ-A | 74 | 23S rRNA | 2XG0-A | rRNA fragment | 196.50 | 158.28 | 8.48 | 4.51 |
| 3NPB-A | 119 | Riboswitch | 2A64-A | Ribonuclease | 318.00 | 255.50 | 15.32 | 4.08 |
| 3R9X-C | 35 | 16S rRNA | 1M5O-E | Ribozyme | 142.00 | 83.44 | 13.25 | 4.42 |
| 3KIS-w | 77 | tRNA | 2XG0-A | rRNA fragment | 164.50 | 160.35 | 12.56 | 0.33 |
| 3KIQ-w | 77 | tRNA | 2XG0-A | rRNA fragment | 211.00 | 203.98 | 10.29 | 0.68 |
| 1P6V-B | 68 | tmRNA | 2XG0-A | rRNA fragment | 146.00 | 107.33 | 8.65 | 4.47 |
| 1P6V-D | 68 | tmRNA | 3IAB-R | Ribonuclease | 102.00 | 65.03 | 10.56 | 3.50 |
| 1S03-A | 47 | mRNA | 2ZNI-D | tRNALys | 178.50 | 127.49 | 9.00 | 5.67 |
| 1S03-B | 47 | mRNA | 2ZNI-D | tRNALys | 178.50 | 127.83 | 9.54 | 5.31 |
| 1VFG-C | 75 | Primer tRNA | 2XG0-A | rRNA fragment | 123.50 | 105.85 | 6.09 | 2.90 |
Function annotation to unannotated RNAs.
| Unannotated RNA | ||||||||
|---|---|---|---|---|---|---|---|---|
| Query | Length | Original function | Best match | Predicted function | Best score | Average | SD | Prediction score |
| 1JBR-D | 31 | 28S rRNA | 1Q96-B | 28S rRNA | 117.00 | 39.56 | 7.76 | 9.98 |
| 2HW8-B | 36 | mRNA | 1ZHO-H | mRNA | 180.00 | 66.01 | 8.86 | 12.86 |
| 2ZH1-B | 33 | tRNAPhe | 3AKZ-G | tRNAGln | 154.50 | 87.89 | 9.13 | 7.30 |
| 3DS7-A | 67 | Guanine riboswitch | 1U8D-A | mRNA (Riboswitch) | 317.00 | 109.10 | 10.63 | 19.56 |
| 3DS7-B | 67 | Guanine riboswitch | 1U8D-A | mRNA (Riboswitch) | 299.00 | 107.86 | 10.67 | 17.92 |
| 3HJW-D | 58 | H/ACA snoRNA | 1WZ2-D | tRNALeu | 197.00 | 105.06 | 11.43 | 8.04 |
| 3ICQ-D | 67 | tRNA | 1EVV-A | tRNAPhe | 212.00 | 102.10 | 10.20 | 10.78 |
| 3ICQ-E | 67 | tRNA | 2Y10-V | tRNATrp | 230.00 | 116.67 | 10.18 | 11.13 |
| 3LWO-D | 58 | H/ACA RNA | 1WZ2-D | tRNALeu | 197.00 | 105.99 | 13.15 | 6.92 |
| 3LWP-D | 58 | H/ACA snoRNA | 1WZ2-D | tRNALeu | 197.00 | 107.14 | 12.43 | 7.23 |
| 3LWQ-D | 58 | H/ACA snoRNA | 1WZ2-D | tRNALeu | 197.00 | 105.22 | 11.26 | 8.15 |
| 3LWR-D | 58 | H/ACA snoRNA | 1WZ2-D | tRNALeu | 197.00 | 107.21 | 11.17 | 8.04 |
| 3LWV-D | 58 | H/ACA RNA | 1WZ2-D | tRNALeu | 197.00 | 107.64 | 13.11 | 6.82 |
| 3OUY-C | 35 | tRNAIle | 3A3A-A | tRNASel | 157.00 | 95.52 | 8.96 | 6.86 |
| 3OV7-C | 34 | tRNAIle | 3AKZ-G | tRNAGln | 134.00 | 78.04 | 9.20 | 6.08 |
| 3OVA-C | 34 | tRNAIle | 3AKZ-G | tRNAGln | 159.50 | 88.13 | 9.76 | 7.31 |
| 3OVB-C | 35 | tRNAIle | 3AKZ-G | tRNAGln | 164.50 | 86.64 | 9.72 | 8.01 |
| 3OVB-D | 35 | tRNAIle | 3A3A-A | tRNASel | 175.00 | 104.03 | 10.02 | 7.09 |
| 3OVS-C | 34 | tRNAIle | 3A3A-A | tRNASel | 170.00 | 103.36 | 9.39 | 7.10 |
| 3OVS-D | 34 | tRNAIle | 3AKZ-G | tRNAGln | 142.00 | 75.11 | 9.84 | 6.80 |
| 3OV7-D | 34 | tRNAIle | 3A3A-A | tRNASel | 152.00 | 95.84 | 10.76 | 5.22 |
| 3OUY-D | 35 | tRNAIle | 3A3A-A | tRNASel | 139.00 | 88.53 | 9.62 | 5.25 |
| 2HVY-E | 65 | H/ACA | 3R8S-B | 5S rRNA | 196.00 | 109.93 | 12.66 | 6.80 |
| 3HAX-E | 63 | H/ACA | 1M5O-E | Ribozyme | 201.50 | 104.17 | 14.31 | 6.80 |
| 3P22-C | 40 | Guide RNA | 3IAB-R | Ribonuclease | 167.00 | 91.79 | 11.09 | 6.78 |
| 3P22-E | 40 | Guide RNA | 3IAB-R | Ribonuclease | 167.00 | 89.59 | 11.18 | 6.92 |
| 3P22-G | 40 | Guide RNA | 3IAB-R | Ribonuclease | 167.00 | 92.37 | 12.71 | 5.87 |
| 1DDY-A | 35 | Aptamer | 2A64-A | Ribonuclease | 94.00 | 68.71 | 9.19 | 2.75 |
| 1DDY-C | 35 | Aptamer | 2A64-A | Ribonuclease | 94.00 | 67.48 | 8.89 | 2.99 |
| 1DDY-E | 35 | Aptamer | 2A64-A | Ribonuclease | 94.00 | 67.40 | 8.49 | 3.14 |
| 1DDY-G | 35 | Aptamer | 2A64-A | Ribonuclease | 94.00 | 68.01 | 9.96 | 2.61 |
| 1KH6-A | 48 | IRES RNA (viral) | 3AM1-B | tRNA | 164.00 | 120.80 | 8.39 | 5.15 |
| 1XJR-A | 47 | s2m RNA (viral) | 3ADD-D | tRNASel | 125.50 | 98.22 | 9.22 | 2.96 |
| 3NVK-K | 34 | Box C/D snoRNA | 2XG0-A | rRNA Fragment | 73.50 | 71.68 | 5.87 | 0.31 |
| 3NVK-L | 34 | Box C/D snoRNA | 2XQD-W | tRNA | 68.00 | 59.05 | 6.88 | 1.30 |
| 3P22-A | 40 | Guide RNA | 1EUY-B | tRNAGln | 155.00 | 86.87 | 9.14 | 7.45 |
Figure 4Bootstrapping method for function prediction. Blue-random scores, red-best match score. (A) Chain v of 3KIS, (B) chain w of 3KIS. For chain v, the best score is significantly larger than random average score giving correct prediction, whereas for chain w, best match score lies within the range of random scores, leading to false prediction (Table 4).