| Literature DB >> 29099311 |
Amita Barik1, Santasabuj Das1,2.
Abstract
Small RNAs (sRNAs) in bacteria have emerged as key players in transcriptional and post-transcriptional regulation of gene expression. Here, we present a statistical analysis of different sequence- and structure-related features of bacterial sRNAs to identify the descriptors that could discriminate sRNAs from other bacterial RNAs. We investigated a comprehensive and heterogeneous collection of 816 sRNAs, identified by northern blotting across 33 bacterial species and compared their various features with other classes of bacterial RNAs, such as tRNAs, rRNAs and mRNAs. We observed that sRNAs differed significantly from the rest with respect to G+C composition, normalized minimum free energy of folding, motif frequency and several RNA-folding parameters like base-pairing propensity, Shannon entropy and base-pair distance. Based on the selected features, we developed a predictive model using Random Forests (RF) method to classify the above four classes of RNAs. Our model displayed an overall predictive accuracy of 89.5%. These findings would help to differentiate bacterial sRNAs from other RNAs and further promote prediction of novel sRNAs in different bacterial species.Keywords: Base-pairing propensity; base-pair distance; minimum free energy of folding; motifs; random forests; shannon entropy; small RNAs
Mesh:
Substances:
Year: 2017 PMID: 29099311 PMCID: PMC5785981 DOI: 10.1080/15476286.2017.1387709
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652