| Literature DB >> 19958486 |
Yu-Cheng Liu1, Meng-Han Yang, Win-Li Lin, Chien-Kang Huang, Yen-Jen Oyang.
Abstract
BACKGROUND: Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. One category of predictors attempts to identify conformationally flexible regions through analysis of protein tertiary structures. Another category of predictors works completely based on analysis of the polypeptide sequences. As the availability of protein tertiary structures is generally limited, the design of predictors that work completely based on sequence information is crucial for advances of molecular biology research.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19958486 PMCID: PMC2788375 DOI: 10.1186/1471-2164-10-S3-S22
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1The overall structure of the proposed hybrid predictor.
Figure 2An illustration of the process employed to generate the feature vector of the residue of concern. (a) The feature vectors are derived from PSSM with window size set to 7. Rows corresponding to residue types that are neither charged nor polar are deleted. (b) Rows corresponding to residue types with charge are duplicated and one additional row is included to indicate whether the residue is at one end of the protein chain or not.
Performance of Boden' s predictor with different cut-off values of entropy
| Entropy Cuf-off | TP | FP | TN | FN | accuracy | sensitivity | specificity | precision | F-score | MCC |
|---|---|---|---|---|---|---|---|---|---|---|
| 0.05 | 5803 | 54661 | 113 | 0 | 0.098 | 1.000 | 0.002 | 0.096 | 0.175 | 0.014 |
| 0.10 | 5792 | 54364 | 410 | 11 | 0.102 | 0.998 | 0.007 | 0.096 | 0.176 | 0.020 |
| 0.15 | 5703 | 49720 | 5054 | 100 | 0.178 | 0.983 | 0.092 | 0.103 | 0.186 | 0.079 |
| 0.20 | 5572 | 46635 | 8139 | 231 | 0.226 | 0.960 | 0.149 | 0.107 | 0.192 | 0.093 |
| 0.25 | 5328 | 43024 | 11750 | 475 | 0.282 | 0.918 | 0.215 | 0.110 | 0.197 | 0.097 |
| 0.30 | 4903 | 38576 | 16198 | 900 | 0.348 | 0.845 | 0.296 | 0.113 | 0.199 | 0.092 |
| 0.35 | 4468 | 34265 | 20509 | 1335 | 0.412 | 0.770 | 0.374 | 0.115 | 0.201 | 0.088 |
| 0.40 | 4009 | 30101 | 24673 | 1794 | 0.473 | 0.691 | 0.450 | 0.118 | 0.201 | 0.084 |
| 0.45 | 3584 | 26400 | 28374 | 2219 | 0.528 | 0.618 | 0.518 | 0.120 | 0.200 | 0.080 |
| 0.50 | 3142 | 22971 | 31803 | 2661 | 0.577 | 0.541 | 0.581 | 0.120 | 0.197 | 0.073 |
| 0.55 | 2702 | 19722 | 35052 | 3101 | 0.623 | 0.466 | 0.640 | 0.120 | 0.191 | 0.064 |
| 0.60 | 2254 | 16500 | 38274 | 3549 | 0.669 | 0.388 | 0.699 | 0.120 | 0.184 | 0.055 |
| 0.65 | 1730 | 12617 | 42157 | 4073 | 0.724 | 0.298 | 0.770 | 0.121 | 0.172 | 0.047 |
| 0.70 | 1100 | 8011 | 46763 | 4703 | 0.790 | 0.190 | 0.854 | 0.121 | 0.148 | 0.036 |
| 0.75 | 684 | 5183 | 49591 | 5119 | 0.830 | 0.118 | 0.905 | 0.117 | 0.117 | 0.023 |
| 0.80 | 463 | 3443 | 51331 | 5340 | 0.855 | 0.080 | 0.937 | 0.119 | 0.095 | 0.020 |
| 0.85 | 272 | 2238 | 52536 | 5531 | 0.872 | 0.047 | 0.959 | 0.108 | 0.065 | 0.009 |
| 0.90 | 142 | 1339 | 53435 | 5661 | 0.884 | 0.024 | 0.976 | 0.096 | 0.039 | 0.000 |
| 0.95 | 71 | 664 | 54110 | 5732 | 0.894 | 0.012 | 0.988 | 0.097 | 0.022 | 0.000 |
Performance comparison between the hybrid predictor and Boden' s predictor
| Predictor | TP | FP | TN | FN | accuracy | sensitivity | specificity | precision | F-score | MCC |
|---|---|---|---|---|---|---|---|---|---|---|
| The hybrid predictor (high-sensitivity mode) | 4123 | 21492 | 33331 | 1684 | 0.618 | 0.710 | 0.608 | 0.161 | 0.262 | 0.189 |
| The hybrid predictor (high-specificity mode) | 2617 | 11682 | 43141 | 3190 | 0.755 | 0.451 | 0.787 | 0.183 | 0.260 | 0.165 |
| Boden' s predictor (3-class mode)with entropy threshold = 0.4 | 4009 | 30101 | 24673 | 1794 | 0.473 | 0.691 | 0.450 | 0.118 | 0.201 | 0.084 |
| Boden' s predictor (3-class mode)with entropy threshold = 0.65 | 1730 | 12617 | 42157 | 4073 | 0.724 | 0.298 | 0.770 | 0.121 | 0.172 | 0.047 |
| Boden' s predictor (8-class mode)with entropy threshold = 0.52 | 2388 | 27895 | 26879 | 976 | 0.503 | 0.710 | 0.491 | 0.079 | 0.142 | 0.094 |
| Boden' s predictor (8-class mode)with entropy threshold = 0.69 | 1198 | 11563 | 43211 | 2166 | 0.764 | 0.356 | 0.789 | 0.094 | 0.149 | 0.082 |
Performance comparison between the hybrid predictor and Kuznetsov' s predictor
| Predictor | TP | FP | TN | FN | accuracy | sensitivity | specificity | precision | F-score | MCC |
|---|---|---|---|---|---|---|---|---|---|---|
| The hybrid predictor with only the RVKDE based classifier enabled | 1236 | 9697 | 34877 | 1033 | 0.771 | 0.545 | 0.782 | 0.113 | 0.187 | 0.166 |
| The hybrid predictor with both the QUICKRBF and RVKDE based classifiers enabled | 813 | 4792 | 39782 | 1456 | 0.867 | 0.358 | 0.892 | 0.145 | 0.207 | 0.166 |
| Kuznetsov' s predictor With false positive rate = 20 | 1020 | 9205 | 35369 | 1249 | 0.777 | 0.450 | 0.793 | 0.100 | 0.163 | 0.126 |
| Kuznetsov' s predictor With false positive rate = 10 | 633 | 4676 | 39898 | 1636 | 0.865 | 0.279 | 0.895 | 0.119 | 0.167 | 0.118 |
Figure 3A case study. (a) Two conformations of protein Ap4A hydrolase are plotted by Chimera [36] with different colors. The one colored by yellow is with red-colored ligand ATP·MgFx, and the one colored by blue is without the ligand. (b) The conformationally ambivalent regions reported in [27,28] are plotted by Jmol [37] with colors yellow, blueviolet, darkblue, and greenyellow. (c) The conformationally ambivalent regions predicted by the proposed hybrid predictor are plotted by Jmol [37] with colors lawngreen, royalblue, and lightgreen.
Figure 4The schematic diagram of a RBF network.
Parameter settings of the proposed hybrid predictor for the experiment reported in Table 2
| QuickRBF | RVKDE | |||
|---|---|---|---|---|
| Number of hidden nodes | ||||
| 1400 | 1 | 4 | 30 | 200 |
Parameter settings of the proposed hybrid predictor for the experiment reported in Table 3
| QuickRBF | RVKDE | |||
|---|---|---|---|---|
| Number of hidden nodes | ||||
| 1400 | 1 | 5 | 25 | 190 |