| Literature DB >> 28155651 |
Nguyen-Quoc-Khanh Le1, Yu-Yen Ou2.
Abstract
BACKGROUND: Guanonine-protein (G-protein) is known as molecular switches inside cells, and is very important in signals transmission from outside to inside cell. Especially in transport protein, most of G-proteins play an important role in membrane trafficking; necessary for transferring proteins and other molecules to a variety of destinations outside and inside of the cell. The function of membrane trafficking is controlled by G-proteins via Guanosine triphosphate (GTP) binding sites. The GTP binding sites active G-proteins initiated to membrane vesicles by interacting with specific effector proteins. Without the interaction from GTP binding sites, G-proteins could not be active in membrane trafficking and consequently cause many diseases, i.e., cancer, Parkinson… Thus it is very important to identify GTP binding sites in membrane trafficking, in particular, and in transport protein, in general.Entities:
Keywords: GTP binding site; Position specific scoring matrix; Radial basis function network; Significant amino acid pairs; Transport protein
Mesh:
Substances:
Year: 2016 PMID: 28155651 PMCID: PMC5259906 DOI: 10.1186/s12859-016-1369-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Process of GTP binding sites in transport proteins
Fig. 2Whole architecture for predicting GTP binding sites in transport proteins
All 22 GTP binding proteins using in the proposed study
| Number of proteins | GTP binding sites | Non-GTP binding sites |
|---|---|---|
| 22 | 364 | 10434 |
The details of all 22 GTP binding proteins separated into independent dataset and cross-validation dataset
| Independent dataset | Cross-validation dataset | ||||
|---|---|---|---|---|---|
| Q9UTE0 | Q9ERI2 | Q5S006 | Q9H0F7 | Q57986 | P09527 |
| Q8IXI2 | Q9ULW5 | Q41009 | P33650 | O75695 | Q6IQ22 |
| P60953 | O35963 | P93042 | P42208 | P51157 | Q9UL25 |
| P20606 | Q9C0L9 | A8INQ0 | P62834 | ||
Fig. 3Methodology for identifying SAAP values in data set
Fig. 4Architecture of the RBF network
Fig. 5Composition of amino acid between GTP binding sites and non-GTP binding sites in data set
Predicting GTP binding sites in the transport proteins with different window sizes
| Window Size | True Positive | False Positive | True Negative | False Negative | Sens | Spec | Acc | MCC |
|---|---|---|---|---|---|---|---|---|
| WS13 | 259 | 334 | 8440 | 53 | 83 | 96.2 | 95.7 | 0.58 |
| WS15 | 260 | 348 | 8426 | 52 | 83.3 | 96 | 95.6 | 0.58 |
| WS17 | 249 | 409 | 8365 | 63 | 79.8 | 95.3 | 94.8 | 0.53 |
| WS19 | 261 | 348 | 8426 | 51 | 83.7 | 96 | 95.6 | 0.58 |
Fig. 6Sequence logo for 22 GTP binding proteins in transport proteins (generated from WebLogo)
Predicting GTP binding sites in the transport proteins with different feature sets
| Feature set | True Positive | False Positive | True Negative | False Negative | Sens | Spec | Acc | MCC | |
|---|---|---|---|---|---|---|---|---|---|
| 5-fold | BINARY | 261 | 1951 | 6823 | 51 | 83.7 | 77.8 | 78 | 0.26 |
| BLOSUM62 | 232 | 412 | 8362 | 80 | 74.4 | 95.3 | 94.6 | 0.49 | |
| PAM250 | 246 | 341 | 8433 | 66 | 78.8 | 96.1 | 95.5 | 0.56 | |
| PSSM | 260 | 351 | 8423 | 52 | 83.3 | 96 | 95.6 | 0.58 | |
| PSSM + SAAPs | 261 | 348 | 8426 | 51 | 83.7 | 96 | 95.6 | 0.58 | |
| Indept | BINARY | 49 | 100 | 1610 | 3 | 94.2 | 94.2 | 94.2 | 0.54 |
| BLOSUM62 | 49 | 98 | 1612 | 3 | 94.2 | 94.3 | 94.3 | 0.54 | |
| PAM250 | 49 | 71 | 1639 | 3 | 94.2 | 95.8 | 95.9 | 0.62 | |
| PSSM | 48 | 23 | 1687 | 4 | 92.3 | 98.7 | 98.5 | 0.78 | |
| PSSM + SAAPs | 49 | 20 | 1690 | 3 | 94.2 | 98.8 | 98.7 | 0.81 |
Fig. 7Comparison predictive performance between different classifiers with ROC Curve and AUC
The comparison of predicting GTP binding sites in the transport proteins between different classifiers
| Feature set | True Positive | False Positive | True Negative | False Negative | Sens | Spec | Acc | MCC | |
|---|---|---|---|---|---|---|---|---|---|
| 5-fold | kNN | 258 | 482 | 8287 | 54 | 82.7 | 94.5 | 94.1 | 0.51 |
| RandomForest | 225 | 420 | 8349 | 87 | 72.1 | 95.2 | 94.4 | 0.48 | |
| LibSVM | 251 | 505 | 8264 | 61 | 80.4 | 94.2 | 93.8 | 0.49 | |
| QuickRBF | 261 | 348 | 8426 | 51 | 83.7 | 96 | 95.6 | 0.58 | |
| Indept | kNN | 49 | 68 | 1641 | 3 | 94.2 | 96 | 96 | 0.61 |
| RandomForest | 40 | 41 | 1668 | 12 | 76.9 | 97.6 | 97 | 0.6 | |
| LibSVM | 43 | 112 | 1597 | 9 | 82.7 | 93.4 | 93.1 | 0.45 | |
| QuickRBF | 49 | 20 | 1690 | 3 | 94.2 | 98.8 | 98.7 | 0.81 |
Predicting GTP binding sites in the transport proteins with other studies
| Cross-validation | Independent | |||||||
|---|---|---|---|---|---|---|---|---|
| Feature set | Sens | Spec | Acc | MCC | Sens | Spec | Acc | MCC |
| Proposed method | 83.7 | 96 | 95.6 | 0.58 | 94.2 | 98.8 | 98.7 | 0.81 |
| GTPBinder | 66.8 | 99.1 | 96.3 | 0.75 | 82.7 | 79.9 | 80 | 0.26 |
| NsitePred | 47.3 | 99.1 | 96.8 | 0.56 | 60.4 | 98.8 | 96.9 | 0.64 |
| TargetSOS | 47.3 | 99.5 | 97.4 | 0.6 | 61.9 | 98.8 | 97.1 | 0.66 |
Predicting GTP binding sites in two newly discovered proteins
| Classifier | True Positive | False Positive | True Negative | False Negative | Sens | Spec | Acc | MCC |
|---|---|---|---|---|---|---|---|---|
| Proposed Method | ||||||||
| Q9H0F7 | 15 | 9 | 162 | 0 | 100 | 99 | 99 | 0.84 |
| A8INQ0 | 12 | 5 | 510 | 0 | 100 | 99 | 99 | 0.84 |
| TargetSOS | ||||||||
| Q9H0F7 | 13 | 7 | 164 | 2 | 86.7 | 95.9 | 95.2 | 0.73 |
| A8INQ0 | 10 | 14 | 501 | 2 | 83.3 | 97.3 | 97 | 0.58 |
| GTPBinder | ||||||||
| Q9H0F7 | 11 | 33 | 138 | 4 | 73.3 | 80.7 | 80.1 | 0.35 |
| A8INQ0 | 8 | 130 | 385 | 4 | 66.7 | 74.8 | 74.6 | 0.14 |