| Literature DB >> 19483098 |
Emidio Capriotti1, Marc A Marti-Renom.
Abstract
Recent interest in non-coding RNA transcripts has resulted in a rapid increase of deposited RNA structures in the Protein Data Bank. However, a characterization and functional classification of the RNA structure and function space have only been partially addressed. Here, we introduce the SARA program for pair-wise alignment of RNA structures as a web server for structure-based RNA function assignment. The SARA server relies on the SARA program, which aligns two RNA structures based on a unit-vector root-mean-square approach. The likely accuracy of the SARA alignments is assessed by three different P-values estimating the statistical significance of the sequence, secondary structure and tertiary structure identity scores, respectively. Our benchmarks, which relied on a set of 419 RNA structures with known SCOR structural class, indicate that at a negative logarithm of mean P-value higher or equal than 2.5, SARA can assign the correct or a similar SCOR class to 81.4% and 95.3% of the benchmark set, respectively. The SARA server is freely accessible via the World Wide Web at http://sgu.bioinfo.cipf.es/services/SARA/.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19483098 PMCID: PMC2703911 DOI: 10.1093/nar/gkp433
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Composition of the different RNA datasets used in this work
| Datasets | Number of chains | Number of alignments | Number of different SCOR functions |
|---|---|---|---|
| RNA09 | 451 | 101 475 | |
| BgALI | 451 | 50 995 | |
| FSCOR | 419 | 168 | |
| R-FSCOR | 192 | 168 | |
| T-FSCOR | 227 | 88 |
Parameters for the extreme value distribution fitting
| PID | PSS | PSI | ||||
|---|---|---|---|---|---|---|
| μ | σ | μ | σ | μ | σ | |
| A | 75.4 | 630.4 | 444.2 | 519.7 | 644.3 | 779.4 |
| B | −0.569 | −1.132 | −0.869 | −1.148 | −0.727 | −1.059 |
| r | −0.915 | −0.947 | −0.985 | −0.946 | −0.986 | −0.934 |
A and B are the parameters that describe the power law function (Y = A × XB) and r is the associated correlation coefficient of the fitted data.
Figure 1.Accuracy of structure-based function assignment by the SARA program. (A) QCF, QSF and dataset coverage as a function of the mean logarithm of the P-values for PSI, PSS and PID scores for the leave-one-out benchmark using the FSCOR dataset. (B) Same representation as in panel A for the T-FSCOR benchmark dataset using the R-FSCOR dataset for searching.
Figure 2.User interface for the SARA server. (A) Pair-wise structure alignment. (B) Structure-based function assignment. Both panels include snapshots of the actual user interface as well as a flowchart of the actions taken by the back-end SARA program. User input and output are enclosed within the orange and green dashed areas, respectively.