| Literature DB >> 19691855 |
Shiva Kumar1, Faraz A Ansari, Vinod Scaria.
Abstract
MicroRNAs (small approximately 22 nucleotide long non-coding endogenous RNAs) have recently attracted immense attention as critical regulators of gene expression in multi-cellular eukaryotes, especially in humans. Recent studies have proved that viruses also express microRNAs, which are thought to contribute to the intricate mechanisms of host-pathogen interactions. Computational predictions have greatly accelerated the discovery of microRNAs. However, most of these widely used tools are dependent on structural features and sequence conservation which limits their use in discovering novel virus expressed microRNAs and non-conserved eukaryotic microRNAs. In this work an efficient prediction method is developed based on the hypothesis that sequence and structure features which discriminate between host microRNA precursor hairpins and pseudo microRNAs are shared by viral microRNA as they depend on host machinery for the processing of microRNA precursors. The proposed method has been found to be more efficient than recently reported ab-initio methods for predicting viral microRNAs and microRNAs expressed by mammals.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19691855 PMCID: PMC2743665 DOI: 10.1186/1743-422X-6-129
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Illustrative summary of the process flow in the presented method for microRNA precursor prediction. The lines in red denote the process flow in prediction while the lines in dark blue denote the process flow during training.
Sensitivity and specificity measures of top 5 models with maximum specificity
| 1 | MODEL 031 | 68% | 87% |
| 2 | MODEL 034 | 69.7 | 85.32 |
| 3 | MODEL 076 | 69.3 | 86 |
| 4 | MODEL 001 | 77% | 78% |
| 5 | MODEL 100 | 67% | 78% |
Comparison of the number of viral microRNAs predicted by the 3 ab-initio prediction algorithms-BayesmiRNAfind, mir-abela, mirSVM and our method on a dataset of viral microRNA precursors derived from mirBase
| Epstein-Barr Virus | 21 | 18 | 21 | 22 |
| Cytomegalovirus | 10 | 5 | 7 | 10 |
| Kaposi Sarcoma Herpesvirus | 9 | 5 | 8 | 11 |
| Herpes Simplex Virus | 2 | 0 | 0 | 2 |
The numbers denote the total number of positive predictions and numbers in brackets following the organism name denotes the total number of microRNA hairpins in mirRbase and that following the positive predictions denote the percentage of total predicted. (*)KSHV expresses only 12 microRNA hairpins. The 13th sequence is one with a possible single nucleotide editing that was cloned. (**) For HSV, the sequences were derived from respective literature as the version of miRbase did not yet include the sequences.
Figure 2Average frequencies of the top 20 differentiating feature elements in experimentally validated microRNA precursor hairpins in comparison to pseudo microRNA hairpins.
Figure 3Average frequencies of the top 20 differentiating feature elements in pseudo microRNA precursor hairpins in comparison to experimentally validated microRNA hairpins.
Comparison of the prediction efficiency of TripletSVM and our method on eukaryotic microRNA hairpins derived from mirBase
| 72 | 68 | 71 | 73 | |
| 272 | 227 | 276 | 285 | |
| 76 | 78 | 86 | 74 | |
The numbers denote the total number of positive predictions. The number in brackets following the organism name denotes the total number of entries in miRbase and that following the number of positive predictions is the percentage positive predictions.