| Literature DB >> 24564381 |
Kai-Yao Huang, Cheng-Tsung Lu, Neil Bretaña, Tzong-Yi Lee, Tzu-Hao Chang.
Abstract
BACKGROUND: The phosphorylation of virus proteins by host kinases is linked to viral replication. This leads to an inhibition of normal host-cell functions. Further elucidation of phosphorylation in virus proteins is required in order to aid in drug design and treatment. However, only a few studies have investigated substrate motifs in identifying virus phosphorylation sites. Additionally, existing bioinformatics tool do not consider potential host kinases that may initiate the phosphorylation of a virus protein.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24564381 PMCID: PMC3853219 DOI: 10.1186/1471-2105-14-S16-S10
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Conceptual diagram of virus progression.
Data statistics of training set and independent testing set.
| Data set | pSer | pThr | pTyr | ||
|---|---|---|---|---|---|
| virPTM | Positive data | 233 | 54 | 14 | |
| Negative data | 2588 | 1170 | 65 | ||
| Balanced negative data | 233 | 54 | 14 | ||
| dbPTM | Positive data | 42 | 12 | 1 | |
| Negative data | 679 | 186 | 11 | ||
| Balanced negative data | 42 | 12 | 1 | ||
| UniProtKB | Positive data | 24 | 10 | - | |
| Negative data | 217 | 159 | - | ||
| Balanced negative data | 24 | 10 | - | ||
| Phospho.ELM | Positive data | 2 | - | 2 | |
| Negative data | 67 | - | 16 | ||
| Balanced negative data | 2 | - | 2 | ||
| Positive data | 42 | 12 | 2 | ||
| Negative data | 352 | 106 | 16 | ||
| Balanced negative data | 42 | 12 | 2 | ||
Figure 2The analytical flowchart of applying MDDLogo.
Figure 3The conceptual diagram of two-layeredSVMs trained with MDDLogo-identified motifs.
Five-fold cross validation results on pSer MDDLogo-clustered SVM models.
| SVM model | Number of positive data | Number of negative data | Cost value | Gamma value | Sn | Sp | Acc | MCC |
|---|---|---|---|---|---|---|---|---|
| All data | 233 | 233 | 0.5 | 0.125 | 0.76 | 0.72 | 0.74 | 0.48 |
| Subgroup S1 | 66 | 66 | 2 | 0.125 | 0.98 | 0.87 | 0.93 | 0.86 |
| Subgroup S2 | 54 | 54 | 8 | 0.03125 | 0.94 | 0.92 | 0.93 | 0.87 |
| Subgroup S3 | 34 | 34 | 0.5 | 0.03125 | 0.91 | 0.79 | 0.85 | 0.71 |
| Subgroup S4 | 20 | 20 | 2 | 0.125 | 0.90 | 0.80 | 0.85 | 0.70 |
| Subgroup S5 | 15 | 15 | 2 | 0.125 | 0.87 | 0.80 | 0.83 | 0.66 |
| Subgroup S6 | 44 | 44 | 0.5 | 0.03125 | 0.75 | 0.61 | 0.68 | 0.37 |
Five-fold cross validation results on pThr MDDLogo-clustered SVM models.
| SVM model | Number of positive data | Number of negative data | Cost value | Gamma value | Sn | Sp | Acc | MCC |
|---|---|---|---|---|---|---|---|---|
| All data | 54 | 54 | 2 | 0.125 | 0.70 | 0.70 | 0.70 | 0.40 |
| Subgroup T1 | 19 | 19 | 2 | 0.125 | 0.95 | 0.90 | 0.92 | 0.84 |
| Subgroup T2 | 19 | 19 | 2 | 0.03125 | 0.95 | 0.95 | 0.95 | 0.89 |
| Subgroup T3 | 16 | 16 | 0.5 | 0.125 | 0.68 | 0.75 | 0.72 | 0.44 |
Figure 4Comparison of independent testing performance. (A) Comparison of independent testing results between Single pSer SVM model and MDDLogo-clustered pSer SVM models. (B) Comparison of independent testing results between Single pThr SVM model and MDDLogo-clustered pThr SVM models.
Figure 5Comparison of independent testing performance between ViralPhos and other kinase-specific phosphorylation site prediction tools.
Figure 6User interface of ViralPhos.