| Literature DB >> 27597968 |
Huan Yang1, Hua Tang2, Xin-Xin Chen1, Chang-Jian Zhang1, Pan-Pan Zhu3, Hui Ding1, Wei Chen4, Hao Lin1.
Abstract
Tuberculosis is killing millions of lives every year and on the blacklist of the most appalling public health problems. Recent findings suggest that secretory protein of Mycobacterium tuberculosis may serve the purpose of developing specific vaccines and drugs due to their antigenicity. Responding to global infectious disease, we focused on the identification of secretory proteins in Mycobacterium tuberculosis. A novel method called MycoSec was designed by incorporating g-gap dipeptide compositions into pseudo amino acid composition. Analysis of variance-based technique was applied in the process of feature selection and a total of 374 optimal features were obtained and used for constructing the final predicting model. In the jackknife test, MycoSec yielded a good performance with the area under the receiver operating characteristic curve of 0.93, demonstrating that the proposed system is powerful and robust. For user's convenience, the web server MycoSec was established and an obliging manual on how to use it was provided for getting around any trouble unnecessary.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27597968 PMCID: PMC4997101 DOI: 10.1155/2016/5413903
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Comparing the performance between different classifiers.
| Algorithm | Sn (%) | Sp (%) | AA (%) | AUC |
|---|---|---|---|---|
| SVM | 94.29 | 80.08 | 87.18 | 0.93 |
| Random Forest | 45.71 | 84.59 | 65.15 | 0.69 |
| Bayes Net | 82.86 | 55.64 | 69.25 | 0.66 |
| RBF Network | 85.71 | 36.09 | 60.90 | 0.59 |
Figure 1ROC curves achieved by SVM, Bayes Net, RBF Network, and Random Forest in discriminating secretory proteins from nonsecretory proteins of M. tuberculosis.
Figure 2Home page of MycoSec web server at http://lin.uestc.edu.cn/server/MycoSec/.