| Literature DB >> 27264539 |
Pandurang Kolekar1, Abhijeet Pataskar1, Urmila Kulkarni-Kale1, Jayanta Pal2, Abhijeet Kulkarni1.
Abstract
Cellular mRNAs are predominantly translated in a cap-dependent manner. However, some viral and a subset of cellular mRNAs initiate their translation in a cap-independent manner. This requires presence of a structured RNA element, known as, Internal Ribosome Entry Site (IRES) in their 5' untranslated regions (UTRs). Experimental demonstration of IRES in UTR remains a challenging task. Computational prediction of IRES merely based on sequence and structure conservation is also difficult, particularly for cellular IRES. A web server, IRESPred is developed for prediction of both viral and cellular IRES using Support Vector Machine (SVM). The predictive model was built using 35 features that are based on sequence and structural properties of UTRs and the probabilities of interactions between UTR and small subunit ribosomal proteins (SSRPs). The model was found to have 75.51% accuracy, 75.75% sensitivity, 75.25% specificity, 75.75% precision and Matthews Correlation Coefficient (MCC) of 0.51 in blind testing. IRESPred was found to perform better than the only available viral IRES prediction server, VIPS. The IRESPred server is freely available at http://bioinfo.net.in/IRESPred/.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27264539 PMCID: PMC4893748 DOI: 10.1038/srep27436
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The strategy used for training and test data set generation, model building and evaluation.
The optimum parameters employed in model building using training sets and performance evaluation using testing sets.
| Model | Optimum parameters used in model building* | Performance measures used in model evaluation§ | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| s | t | d | g | c | CV (%) | Acc (%) | Sn (%) | Sp (%) | Pr (%) | MCC | |
| 1 | 2 | 0 | 1 | 3.1192 | 0.0347 | 63.54 | 75.51 | 75.75 | 75.25 | 75.75 | 0.51 |
| 2 | 1 | 1 | 1 | 1.8022 | 0.0347 | 68.75 | 63.44 | 63.44 | 63.44 | 63.44 | 0.26 |
| 3 | 0 | 1 | 2 | 1.0050 | 1.0397 | 67.19 | 62.36 | 62.36 | 62.36 | 62.36 | 0.24 |
| 4 | 2 | 0 | 1 | 3.1192 | 0.0347 | 69.79 | 65.05 | 61.30 | 68.82 | 66.28 | 0.30 |
| 5 | 1 | 1 | 1 | 2.2181 | 0.0347 | 67.71 | 61.83 | 60.22 | 63.44 | 62.22 | 0.23 |
*s: SVM type, t: kernel type, d: degree, g: gamma, c: cost and CV: 10-fold cross validation accuracy. Parameters as specified by svm-train program in LibSVM3.12 package.
§Acc: accuracy, Sn: sensitivity, Sp: specificity, Pr: precision and MCC: Matthews correlation coefficient.
Figure 2The process flow of IRESPred web server.
The performance comparison of IRESPred and VIPS servers using positive and negative data sets compiled in the present study.
| Server | Accuracy (%) | Sensitivity (%) | Specificity (%) | Precision (%) | MCC |
|---|---|---|---|---|---|
| IRESPred | 70.89 | 69.84 | 71.95 | 71.35 | 0.41 |
| VIPS | 51.87 | 23.28 | 81.08 | 55.69 | 0.053 |