| Literature DB >> 23509793 |
Murtada Khalafallah Elbashir1, Yu Sheng, Jianxin Wang, Fangxiang Wu, Min Li.
Abstract
A β-turn is a secondary protein structure type that plays a significant role in protein configuration and function. On average 25% of amino acids in protein structures are located in β-turns. It is very important to develope an accurate and efficient method for β-turns prediction. Most of the current successful β-turns prediction methods use support vector machines (SVMs) or neural networks (NNs). The kernel logistic regression (KLR) is a powerful classification technique that has been applied successfully in many classification problems. However, it is often not found in β-turns classification, mainly because it is computationally expensive. In this paper, we used KLR to obtain sparse β-turns prediction in short evolution time. Secondary structure information and position-specific scoring matrices (PSSMs) are utilized as input features. We achieved Q total of 80.7% and MCC of 50% on BT426 dataset. These results show that KLR method with the right algorithm can yield performance equivalent to or even better than NNs and SVMs in β-turns prediction. In addition, KLR yields probabilistic outcome and has a well-defined extension to multiclass case.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23509793 PMCID: PMC3590576 DOI: 10.1155/2013/870372
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The architecture of the KLR method.
Q total and MCC for different values of selected vectors m.
| Number of selected vectors |
| MCC |
|---|---|---|
| 70 | 79.96 | 0.46 |
| 80 | 80.32 | 0.47 |
| 90 | 80.25 | 0.47 |
| 100 | 80.38 | 0.47 |
| 110 | 80.41 | 0.47 |
| 120 | 80.54 | 0.48 |
| 130 | 80.51 | 0.48 |
Figure 2The MCC in function of the number of the selected PVs.
Figure 3The Q total in function of the number of the selected PVs.
Comparison of KLR with other recent β-turns prediction methods on BT426 dataset.
| Method |
|
|
| Specificity | MCC |
|---|---|---|---|---|---|
| KLR | 80.7 | 58.98 | 65.25 | 85.34 | 0.50 |
| BTNpred [ | 80.9 | 62.7 | 55.6 | N/A | 0.47 |
| NetTurnP [ | 78.2 | 54.4 | 75.6 | 79.1 | 0.50 |
| BetaTPred2 [ | 75.5 | 49.8 | 72.3 | N/A | 0.43 |
| BTPRED [ | 74.9 | 55.3 | 48.0 | N/A | 0.35 |
| DEBT [ | 79.2 | 54.8 | 70.1 | N/A | 0.48 |
| SVM [ | 79.8 | 55.6 | 68.9 | N/A | 0.47 |
| BTSVM [ | 78.7 | 56.0 | 62.0 | N/A | 0.45 |
| E-SSpred [ | 80.9 | 63.6 | 49.2 | N/A | 0.44 |
| 1–4 & 2-3 correlation model [ | 59.1 | 32.4 | 61.9 | N/A | 0.17 |
Comparison of KLR with other recent β-turns prediction methods on BT547 and BT823 datasets.
| Method | Dataset |
|
|
| MCC |
|---|---|---|---|---|---|
| KLR | BT547 | 80.46 | 59.04 | 65.36 | 0.50 |
| BTNpred | 80.5 | 61.6 | 54.2 | 0.45 | |
| COUDES [ | 74.6 | 48.7 | 70.4 | 0.42 | |
| SVM [ | 76.6 | 47.6 | 70.2 | 0.43 | |
|
| |||||
| KLR | BT823 | 80.66 | 58.42 | 64.64 | 0.49 |
| BTNpred | 80.6 | 60.8 | 54.6 | 0.45 | |
| COUDES | 74.2 | 47.5 | 69.6 | 0.41 | |
| SVM [ | 76.8 | 53.0 | 72.3 | 0.45 | |
Comparison of the elapsed time in seconds between KLR, BTNpred, and E-SSpred.
| Dataset | KLR | BTNpred | E-SSpred |
|---|---|---|---|
| BT426 | 753.55 | 11077.185 | 13036.415 |
| BT547 | 940.55 | 13261.755 | 15726.2 |
| BT823 | 683.44 | 18183.256 | 24140.072 |
Figure 4Average execution time of the KLR model in function of the number of the training instances.
Figure 5ROC curve for the evaluation of the KLR model on the BT426 dataset.