| Literature DB >> 21799937 |
Xiaomin Ying1, Yuan Cao, Jiayao Wu, Qian Liu, Lei Cha, Wuju Li.
Abstract
BACKGROUND: Bacterial sRNAs are a class of small regulatory RNAs involved in regulation of expression of a variety of genes. Most sRNAs act in trans via base-pairing with target mRNAs, leading to repression or activation of translation or mRNA degradation. To date, more than 1,000 sRNAs have been identified. However, direct targets have been identified for only approximately 50 of these sRNAs. Computational predictions can provide candidates for target validation, thereby increasing the speed of sRNA target identification. Although several methods have been developed, target prediction for bacterial sRNAs remains challenging.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21799937 PMCID: PMC3142192 DOI: 10.1371/journal.pone.0022705
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Two-step model for hybridization of sRNAs and targets.
Figure 2ΔΔG distributions for true and pseudo seed regions in the training dataset.
ΔΔG includes contributions from both the hybridization energy and the energy required to make the sRNA and target binding sites accessible.
Figure 3Variation of the stability index based on the number of features analyzed.
The stability index represented the average of prediction accuracies over 1000 simulations with a training:test data partition ratio of 75%:25%.
Figure 4ROC curves of sTarPicker and three state-of-the-art prediction methods on the test dataset.
Results from sTarPicker were sorted based on sTarPicker probability (descending), ΔΔG of binding regions (ascending), and ΔΔG of seed regions (ascending). If an sRNA-target pair returned several binding sites, only the site with the smallest rank was selected. Results from IntaRNA were sorted in ascending order of the energies. Results from sRNATarget and sRNATarget2 were sorted in descending order of the prediction scores. Results from TargetRNA and TargetRNA2 were sorted in ascending order of the P-values. The ROC curve was generated from the rates of true and false predictions while varying the number of considered interactions in all of the test data.
Prediction accuracy of binding sites on seventeen validated sRNA-target pairs.
| sRNA-target | Sensitivity | PPV | ||||||
| sTarPicker | IntaRNA | TargetRNA | TargetRNA2 | sTarPicker | IntaRNA | TargetRNA | TargetRNA2 | |
| GcvB-sstT | 0.417 | 0.000 | - | - | 0.172 | 0.000 | - | - |
| MicF-ompF | 0.840 | 1.000 | 0.560 | 0.560 | 1.000 | 1.000 | 0.636 | 0.636 |
| MicA-phoP | 0.455 | 0.455 | 1.000 | - | 1.000 | 1.000 | 1.000 | - |
| OmrA-csgD | 0.737 | 0.684 | - | - | 1.000 | 1.000 | - | - |
| GcvB-livK | 0.625 | 0.625 | - | - | 0.652 | 0.652 | - | - |
| GcvB-oppA | 0.913 | 1.000 | 1.000 | - | 0.955 | 1.000 | 1.000 | - |
| InvR-nmpC | 0.927 | 0.000 | - | - | 0.704 | 0.000 | - | - |
| MicA-lamB | 1.000 | 1.000 | - | - | 0.885 | 0.821 | - | - |
| MicA-ompX | 1.000 | 0.900 | - | 1.000 | 0.714 | 1.000 | - | 0.769 |
| RybB-ompN | 1.000 | 1.000 | 1.000 | - | 1.000 | 1.000 | 1.000 | - |
| MicC-nmpC | 1.000 | 1.000 | - | - | 1.000 | 1.000 | - | - |
| PrrF1-sodB | 0.840 | 0.600 | - | - | 1.000 | 0.789 | - | - |
| Qrr1-hapR | 0.708 | 0.708 | - | - | 1.000 | 1.000 | - | - |
| Qrr1-luxO | 0.444 | 0.444 | - | - | 1.000 | 1.000 | - | - |
| MicX-VC0620 | 0.406 | 0.938 | 0.969 | - | 0.765 | 1.000 | 0.969 | - |
| Qrr1-luxR | - | 0.000 | - | - | - | 0.000 | - | - |
| LhrA-lmo0850 | 1.000 | 0.786 | - | - | 0.824 | 1.000 | - | - |
| Average on 17 pairs |
| 0.655 | 0.266 | 0.092 |
| 0.780 | 0.271 | 0.083 |
*Indicates that the interaction was in the training dataset. -Indicates that no interaction was predicted. The highest average sensitivity and PPV are shown in bold.
Figure 5ROC curves of sTarPicker and three state-of-the-art methods in genome-wide prediction of sRNA targets.
For sTarPicker, the results for each sRNA were sorted based on sTarPicker probability (descending), ΔΔG of binding regions (ascending), and ΔΔG of seed regions (ascending). If an sRNA-target pair returned several binding sites, only the site with the smallest rank was selected. For IntaRNA, results for each sRNA were sorted in ascending order of the energies. For sRNATarget and sRNATarget2, results for each sRNA were sorted in descending order of the prediction scores. For TargetRNA and TargetRNA2, results for each sRNA were sorted in ascending order of the P-values. Each ROC curve was generated by calculating sensitivity and specificity while varying the number of predicted interactions taken into consideration for each sRNA.
Predicted results for interaction of Yfr1 and six validated targets.
| Target | sTarPicker | IntaRNA | sRNATarget | ||||
| Prob | Rank | Rank | Energy | Rank | Prob | Rank | |
| PMM1119 (som) | 1 | 2 | 1 | −13.42 | 1 | 1 | 1 |
| PMM1121 (som) | 0.837 | 3 | 2 | −10.54 | 3 | 0 | - |
| PMM0494 (ppa) | 0.001 | 32 | 7 | −12.58 | 2 | 1 | 1 |
| PMM1697 (σ factor) | 0 | - | - | −9.03 | 4 | 0 | - |
| PMM0538 | 0 | - | - | −8.15 | 6 | 0 | - |
| PMM0050 (argJ) | 0 | - | - | −7.51 | 10 | 0 | - |
The Rank column shows the rank of the target among all predicted targets from the genome-wide prediction. For sTarPicker, results for each sRNA were sorted based on sTarPicker probability (descending), ΔΔG of binding regions (ascending), and ΔΔG of seed regions (ascending). If an sRNA-target pair returned several binding sites, only the site with the smallest rank was selected. For IntaRNA, results were sorted in ascending order of the energies. Only interactions at the interval of [−39, +19] were considered according to their paper. For sRNATarget, results were sorted in descending order of the sRNATarget scores. The first two som genes were experimentally validated and shown to be bona fide interaction. The remaining interactions were validated to be non-interaction. ‘-’indicates that no interaction was predicted.
indicates that the rank was obtained when target binding sites were confined at the interval of [−150, +100].
indicates that the rank was obtained when target binding sites were confined at the interval of [−39, +19].
indicates that the rank was obtained with default parameter settings.