Tamara Vasylenko, Yi-Fan Liou, Po-Chin Chiou, Hsiao-Wei Chu, Yung-Sung Lai, Yu-Ling Chou, Hui-Ling Huang, Shinn-Ying Ho.
Abstract
BACKGROUND: Bacterial tyrosine kinases (BY-kinases), which play an important role in numerous cellular processes, are characterized as a separate class of enzymes and share no structural similarity with their eukaryotic counterparts. However, in silico methods for predicting BY-kinases have not yet been developed. Because these enzymes are involved in key regulatory processes and are promising targets for antibacterial drug design, a simple and easily interpretable predictor is desirable for gaining new insights into bacterial tyrosine phosphorylation. This study proposes a novel method, SCMBYK, for predicting and characterizing BY-kinases.
Keywords: BY-kinase; Dipeptide; Drug repurposing; Propensity scores; Scoring card method
Year: 2016 PMID: 28155663 PMCID: PMC5260027 DOI: 10.1186/s12859-016-1371-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1 Flowchart of the system design for the prediction and analysis of BY-kinases. BYKs denotes BY-kinases; non-BYKs denotes non-BY-kinases.
Summary of the training and test datasets
| Dataset | BYKP | Non-BYKP | Total |
|---|---|---|---|
| BYK-TRN1102 | 558 | 544 | 1102 |
| BYK-TST472 | 239 | 239 | 478 |
BLASTp performance on the established datasets at various E-value cut-offs
| E-value cut-off | Hit rate | ACC |
|---|---|---|
| 0.1 | 74% | 73% |
| 0.01 | 72% | 71% |
| 0.001 | 71% | 71% |
| 0.0001 | 70% | 69% |
| 0.00001 | 69% | 68% |
Comparison of the prediction accuracies (%) of BY-kinase predictors
| Classifier | Training accuracy | Test accuracy | Specificity | Sensitivity |
|---|---|---|---|---|
| SVM/DPC | 97.27% | 95.76% | 95.28% | 96.23% |
| SVM/AAC | 96.07% | 95.13% | 96.57% | 93.72% |
| SVM/AA-index | 94.56% | 94.07% | 94.85% | 93.31% |
| J48/DPC | 80.94% | 82.63% | 83.70% | 81.50% |
| J48/AAC | 86.48% | 89.62% | 87.00% | 92.30% |
| J48/AA-index | 88.75% | 88.35% | 90.40% | 86.30% |
| NB/DPC | 84.85% | 86.23% | 86.20% | 86.30% |
| NB/AAC | 77.22% | 78.18% | 67.80% | 88.80% |
| NB/AA-index | 76.50% | 71.19% | 90.00% | 51.90% |
| SCMBYK | 97.55% | 96.73% | 98.00% | 96.00% |
The performance of 10 independent runs using BYK-TRN1102
| Run | Fitness | Training ACC (%) | Test ACC (%) | MCC | Sen. | Spe. | Threshold |
|---|---|---|---|---|---|---|---|
| #1 | 99.21 | 97.36 | 96.27 | 0.93 | 0.97 | 0.95 | 474 |
| #2 | | | | | | | |
| #3 | 99.21 | 97.82 | 96.36 | 0.93 | 0.97 | 0.95 | 486 |
| #4 | 99.08 | 97.73 | 96.82 | 0.94 | 0.98 | 0.95 | 485 |
| #5 | 99.32 | 97.64 | 96.82 | 0.94 | 0.99 | 0.95 | 484 |
| #6 | 99.16 | 97.36 | 96.00 | 0.92 | 0.99 | 0.93 | 460 |
| #7 | 99.02 | 97.00 | 96.27 | 0.93 | 0.95 | 0.98 | 496 |
| #8 | 98.94 | 97.18 | 96.18 | 0.92 | 0.98 | 0.95 | 470 |
| #9 | 99.08 | 97.82 | 96.91 | 0.94 | 0.97 | 0.97 | 464 |
| #10 | | | | | | | |
| AVEG | 99.19 | 97.50 | 96.49 | 0.93 | 0.97 | 0.96 | 476.20 |
Bold values indicate the performance of SCMBYK.
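The accuracy (ACC), sensitivity (Sen.), specificity (Spe.), and Matthews correlation coefficient (MCC) columns in the tables above can be computed from confusion-matrix counts. The following is a minimal sketch using the standard definitions of these measures; the function name `binary_metrics` and the example counts are hypothetical and are not taken from the paper.

```python
import math


def binary_metrics(tp, tn, fp, fn):
    """Return ACC, sensitivity, specificity, and MCC from confusion counts.

    tp/fn count positives (here, BY-kinases) predicted correctly/incorrectly;
    tn/fp count negatives (non-BY-kinases) predicted correctly/incorrectly.
    Assumes both classes are present, so the Sen./Spe. denominators are nonzero.
    """
    total = tp + tn + fp + fn
    acc = (tp + tn) / total
    sen = tp / (tp + fn)  # true-positive rate
    spe = tn / (tn + fp)  # true-negative rate
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, MCC is set to 0 when the denominator vanishes.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {"acc": acc, "sen": sen, "spe": spe, "mcc": mcc}


# Hypothetical counts for illustration only (not data from the paper).
m = binary_metrics(tp=90, tn=80, fp=20, fn=10)
print(m)
```

MCC is reported alongside accuracy because it stays informative when the two classes are not the same size.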
Fig. 2 Heat map of the SCMBYK propensity scores of dipeptides
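To illustrate how dipeptide propensity scores and a decision threshold could be combined into a prediction, here is a minimal sketch of a scoring-card-style classifier. It assumes the usual scoring card formulation, in which a sequence score is the sum of each dipeptide's composition fraction multiplied by its propensity score, and a sequence is predicted positive when its score exceeds the threshold. The function names, the toy propensity table, and the use of a strict `>` comparison are assumptions for illustration; the actual SCMBYK scores are those shown in the heat map and are not reproduced here.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# All 400 ordered pairs of the 20 standard amino acids.
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(seq):
    """Fraction of each dipeptide among the overlapping pairs in seq.

    Pairs containing non-standard residue symbols are skipped.
    """
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    pairs = [p for p in pairs if p[0] in AMINO_ACIDS and p[1] in AMINO_ACIDS]
    counts = Counter(pairs)
    n = len(pairs)
    return {dp: (counts[dp] / n if n else 0.0) for dp in DIPEPTIDES}


def scm_score(seq, propensity):
    """Composition-weighted sum of dipeptide propensity scores."""
    comp = dipeptide_composition(seq)
    return sum(comp[dp] * propensity.get(dp, 0.0) for dp in DIPEPTIDES)


def predict_by_kinase(seq, propensity, threshold):
    """Predict positive when the score exceeds the threshold (assumed rule)."""
    return scm_score(seq, propensity) > threshold


# Toy propensity table (hypothetical): every dipeptide scores 500 except "GK".
toy_propensity = {dp: 500.0 for dp in DIPEPTIDES}
toy_propensity["GK"] = 1000.0

score = scm_score("GKGK", toy_propensity)  # pairs: GK, KG, GK
is_positive = predict_by_kinase("GKGK", toy_propensity, threshold=476.2)
print(score, is_positive)
```

Because the score is a plain weighted sum, each dipeptide's contribution to a prediction can be read off directly, which is what makes this kind of predictor easy to interpret.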
Fig. 3 The DP visualization of BY-kinase structures. (a) Visualization of the overall structure of the Etk kinase domain (PDB code 3CIO), and a close view of the high-score Walker B motif. (b) CapB2 DP visualization (PDB code 3BFV), and a close view of the high-score stretch between the Walker A’ and Walker B motifs. Red marks the positions of high-score dipeptides; low-score dipeptides are colored blue.
Fig. 4 The AA visualization of BY-kinase structures. (a) AA visualization of the overall structure of the Etk kinase domain (PDB code 3CIO). (b) CapB2 AA visualization (PDB code 3BFV). Red marks the positions of high-score amino acids; low-score amino acids are colored blue.
Fig. 5 Schematic organization of BY-kinases and their active sites. (a) Organization of BY-kinases in Proteobacteria and Firmicutes. Walker motifs (A, A’ and B), extracellular hairpin domains, and transmembrane spans are colored yellow, purple, and blue, respectively. In Proteobacteria, the extracellular loop and the intracellular domain are parts of the same protein, whereas in Firmicutes they are linked via specific protein-protein interactions. (b) Sequence logos of signature amino-acid sites of the top-30-scored BY-kinases. “GK” in Walker A, “DXDXR” in Walker A’, and “DXPPX” in Walker B are indicated by larger letters.
The putative BY-kinases and the potential drugs
| Drug ID | Drug name | Target protein | Organism | Score |
|---|---|---|---|---|
| DB00724 | Imiquimod | P76123 | | 478.39 |
| DB09242 | Moxonidine | P76123 | | 478.39 |
| DB00697 | Tizanidine | P76123 | | 478.39 |
| DB00336 | Nitrofural | O05031 | | 474.70 |
| DB03147 | Flavin adenine dinucleotide | O05031 | | 474.70 |
| DB01091 | Butenafine | Q92HC9 | | 472.28 |
| DB00857 | Terbinafine | Q92HC9 | | 472.28 |
| DB00735 | Naftifine | Q92HC9 | | 472.28 |