| Literature DB >> 27437396 |
Xin-Xin Chen1, Hua Tang2, Wen-Chao Li1, Hao Wu3, Wei Chen4, Hui Ding1, Hao Lin1.
Abstract
Owing to the abuse of antibiotics, drug resistance of pathogenic bacteria becomes more and more serious. Therefore, it is interesting to develop a more reasonable way to solve this issue. Because they can destroy the bacterial cell structure and then kill the infectious bacterium, the bacterial cell wall lyases are suitable candidates of antibacteria sources. Thus, it is urgent to develop an accurate and efficient computational method to predict the lyases. Based on the consideration, in this paper, a set of objective and rigorous data was collected by searching through the Universal Protein Resource (the UniProt database), whereafter a feature selection technique based on the analysis of variance (ANOVA) was used to acquire optimal feature subset. Finally, the support vector machine (SVM) was used to perform prediction. The jackknife cross-validated results showed that the optimal average accuracy of 84.82% was achieved with the sensitivity of 76.47% and the specificity of 93.16%. For the convenience of other scholars, we built a free online server called Lypred. We believe that Lypred will become a practical tool for the research of cell wall lyases and development of antimicrobial agents.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27437396 PMCID: PMC4942628 DOI: 10.1155/2016/1654623
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
The original values of nine physicochemical properties used in this study.
| Amino acids | Hydrophobicity | Hydrophilicity | Rigidity | Flexibility | Irreplaceability | Mass | pI | pK( | pK( |
|---|---|---|---|---|---|---|---|---|---|
| A | 0.62 | −0.5 | −1.338 | −3.102 | 0.52 | 15 | 6.11 | 2.35 | 9.87 |
| C | 0.29 | −1 | −1.511 | 0.957 | 1.12 | 47 | 5.02 | 1.71 | 10.78 |
| D | −0.9 | 3 | −0.204 | 0.424 | 0.77 | 59 | 2.98 | 1.88 | 9.6 |
| E | −0.74 | 3 | −0.365 | 2.009 | 0.76 | 73 | 3.08 | 2.19 | 9.67 |
| F | 1.19 | −2.5 | 2.877 | −0.466 | 0.86 | 91 | 5.91 | 2.58 | 9.24 |
| G | 0.48 | 0 | −1.097 | −2.746 | 0.56 | 1 | 6.06 | 2.34 | 9.6 |
| H | −0.4 | −0.5 | 2.269 | −0.223 | 0.94 | 82 | 7.64 | 1.78 | 8.97 |
| I | 1.38 | −1.8 | −1.741 | 0.424 | 0.65 | 57 | 6.04 | 2.32 | 9.76 |
| K | −1.5 | 3 | −1.822 | 3.950 | 0.81 | 73 | 9.47 | 2.2 | 8.9 |
| L | 1.06 | −1.8 | −1.741 | 0.424 | 0.58 | 57 | 6.04 | 2.36 | 9.6 |
| M | 0.64 | −1.3 | −1.741 | 2.484 | 1.25 | 75 | 5.74 | 2.28 | 9.21 |
| N | −0.78 | 0.2 | −0.204 | 0.424 | 0.79 | 58 | 10.76 | 2.18 | 9.09 |
| P | 0.12 | 0 | 1.979 | −2.404 | 0.61 | 42 | 6.3 | 1.99 | 10.6 |
| Q | −0.85 | 0.2 | −0.365 | 2.009 | 0.86 | 72 | 5.65 | 2.17 | 9.13 |
| R | −2.53 | 3 | 1.169 | 3.060 | 0.60 | 101 | 10.76 | 2.18 | 9.09 |
| S | −0.18 | 0.3 | −1.511 | 0.957 | 0.64 | 31 | 5.68 | 2.21 | 9.15 |
| T | −0.05 | −0.4 | −1.641 | −1.339 | 0.56 | 45 | 5.6 | 2.15 | 9.12 |
| V | 1.08 | −1.5 | −1.641 | −1.339 | 0.54 | 43 | 6.02 | 2.29 | 9.74 |
| W | 0.81 | −3.4 | 5.913 | −1.000 | 1.82 | 130 | 5.88 | 2.38 | 9.39 |
| Y | 0.26 | −2.3 | 2.714 | −0.672 | 0.98 | 107 | 5.63 | 2.2 | 9.11 |
Figure 1A heat map to show the overall accuracy in 5-fold cross-validation with different parameter groups (g, δ).
Comparison among the performances of different algorithms.
| Algorithm | Sn (%) | Sp (%) | MCC | OA (%) | AA (%) | auROC |
|---|---|---|---|---|---|---|
| SVM | 76.47 | 93.16 | 0.678 | 90.13 | 84.82 | 0.926 |
| Random Forest | 80.88 | 85.02 | 0.572 | 84.27 | 82.95 | 0.905 |
| Naïve Bayes | 76.47 | 83.06 | 0.512 | 81.87 | 79.77 | 0.881 |
| LibD3C | 66.18 | 88.60 | 0.515 | 84.53 | 77.39 | 0.859 |
Figure 2The ROC curve for the proposed model with the 63 optimal features in jackknife cross-validation using SVM.
Figure 3A semiscreenshot to show the home page of Lypred. Its website address is http://lin.uestc.edu.cn/server/Lypred/.