| Literature DB >> 23200137 |
Wuju Li1, Xiaomin Ying, Qixuan Lu, Linxi Chen.
Abstract
Bacterial small RNAs (sRNAs) are an emerging class of regulatory RNAs of about 40-500 nucleotides in length and, by binding to their target mRNAs or proteins, get involved in many biological processes such as sensing environmental changes and regulating gene expression. Thus, identification of bacterial sRNAs and their targets has become an important part of sRNA biology. Current strategies for discovery of sRNAs and their targets usually involve bioinformatics prediction followed by experimental validation, emphasizing a key role for bioinformatics prediction. Here, therefore, we provided an overview on prediction methods, focusing on the merits and limitations of each class of models. Finally, we will present our thinking on developing related bioinformatics models in future.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23200137 PMCID: PMC5054197 DOI: 10.1016/j.gpb.2012.09.004
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Figure 1The action mechanisms of For cis-encoded sRNA-target mRNA interactions, there exists a perfect base pairing region and these genes overlap but are localized on different strands. Here the interaction GadY:gadX was provided to demonstrate such interaction, in which the blue color represents sRNA and the red color stands for the target mRNA. However, for trans-encoded sRNA-target mRNA interactions, there exists an imperfect base pairing region. These genes are separate from each other and therefore there is no overlap between them. The interaction MicC:ompC was shown as an example. Please see sRNATarBase for detailed information (http://ccb.bmi.ac.cn/srnatarbase/). The entry names for GadY:gadX and MicC:ompC are SRNAT00067 and SRNAT00015, respectively.
Main computational tool for prediction of bacterial sRNAs and their target mRNAs
| Comparative genomics- based models for sRNA prediction | QRNA | Sequence and secondary structure; suitable for two sequence alignment | ||
| RNAz | Sequence and secondary structure; suitable for multiple sequence alignment | |||
| EvoFold | Sequence, structure and evolution; suitable for multiple sequence alignment | |||
| SIPHT | Sequence and Rho-independent terminators | |||
| sRNAPredict | Sequence and Rho-independent terminators | |||
| NAPP | – | Phylogenetic profiling of nucleic acid fragments; cluster analysis | ||
| Machine learning-based models for sRNA prediction | Carter et al. | Nucleotide compositions and secondary structure; neural networks and support vector machines | ||
| Sætrom et al. | – | Sequence; genetic algorithm and boosting algorithm | ||
| PSoL | – | Sequence and secondary structure; support vector machine | ||
| Tran et al. | Sequence and secondary structure; neural network | |||
| Prediction models for general RNA–RNA interactions | RNAcofold | Extension of minimum energy folding algorithm to two sequences | ||
| RNAhybrid | Extension of minimum energy folding algorithm to two sequences; neglecting intra-molecular base-pairings and multi-loops | |||
| RNAplex | Extension of minimum energy folding algorithm to two sequences; running faster | |||
| RNAup | Consideration of accessibility of binding sites | |||
| inteRNA | Searching the joint structure of interacting RNAs with the minimum total free energy | |||
| piRNA | Computing the partition function over joint structures formed by two interacting nucleic acids | |||
| Rip | Computing the full partition function over joint structures formed by two interacting RNAs based on the combinatorial notion of ‘tight structures’ | |||
| RactIP | Prediction based on joint structures using integer programming | |||
| Ripalign | Prediction based on joint structures with consideration of both thermodynamic stability and sequence/structure covariation | |||
| PETcofold | Predicting interactions and secondary structures of two multiple alignments of RNA sequences | |||
| Prediction models for sRNA-target mRNA interactions | TargetRNA | Hybridization; not consider structures from sRNA or mRNA | ||
| sRNATarget | Sequence and RNA secondary structure profile; naïve Bayes method | |||
| IntaRNA | Accessibility of binding sites; user-specified seed | |||
| RNApredator | Target site accessibility; RNAup | |||
| sTarpicker | Thermodynamic stability; site accessibility of sRNA and targets; naïve Bayes method | |||
Note: The main features and properties of the related models were provided in column “Main features”. For example, for QRNA, both sequence and secondary structure information were applied, and the model was suitable for two sequence alignment.