| Literature DB >> 22759651 |
Jose M G Izarzugaza1, Angela del Pozo, Miguel Vazquez, Alfonso Valencia.
Abstract
BACKGROUND: Most of the many mutations described in human protein kinases are tolerated without significant disruption of the corresponding structures or molecular functions, while some of them have been associated to a variety of human diseases, including cancer. In the last decade, a plethora of computational methods to predict the effect of missense single-nucleotide variants (SNVs) have been developed. Still, current high-throughput sequencing efforts and the concomitant need for massive interpretation of protein sequence variants will demand for more efficient and/or accurate computational methods in the forthcoming years.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22759651 PMCID: PMC3303724 DOI: 10.1186/1471-2164-13-S4-S3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Performance of the classifier depending on the SVM classification thresholds applied using all kinase groups
| SVM threshold | Accuracy (%) | Precision (%) | F-score | MCC | |
|---|---|---|---|---|---|
| -1.00 | 74.3 | 46.5 | 89.1 | 61.1 | 0.6 |
| -0.75 | 80.9 | 56.1 | 79.8 | 65.9 | 0.6 |
| -0.25 | 80.6 | 60.0 | 66.1 | 62.9 | 0.5 |
| 0.00 | 82.3 | 58.6 | 47.2 | 52.3 | 0.4 |
Performance of the classifier depending on the SVM classification thresholds applied using all kinase groups.
Number of mutations in each of the groups in which UniProt divides the protein kinase superfamily
| Group | Disease | Neutral | Total |
|---|---|---|---|
| TK † | 496 | 565 | 1061 |
| TKL † | 172 | 151 | 323 |
| Atypical_PI3-PI4 † | 49 | 138 | 187 |
| CAMK † | 40 | 518 | 558 |
| Other | 36 | 411 | 447 |
| RGC † | 23 | 35 | 58 |
| CMGC † | 18 | 178 | 196 |
| AGC † | 16 | 190 | 206 |
| STE | 7 | 222 | 229 |
| Atypical_ADCK | 6 | 14 | 20 |
| Atypical_Alpha-type | 1 | 88 | 89 |
| CK1 | 1 | 52 | 53 |
| NEK | 0 | 45 | 45 |
| Atypical_RIO | 0 | 14 | 14 |
| Atypical_PDK-BCKDK | 0 | 5 | 5 |
| Atypical_FAST | 0 | 1 | 1 |
Number of mutations in each of the groups in which UniProt divides the protein kinase superfamily. The groups enriched in disease-associated mutations are highlighted by †.
Performance of the classifier depending on the SVM classification thresholds applied when using groups highly populated in disease mutations only
| SVM threshold | Accuracy (%) | Precision (%) | Recall (%) | MCC |
|---|---|---|---|---|
| -1.000 | 71.5 | 51.4 | 88.9 | 0.6 |
| -0.750 | 77.0 | 61.7 | 81.5 | 0.6 |
| -0.250 | 79.4 | 68.1 | 69.9 | 0.5 |
| 0.000 | 71.6 | 60.7 | 56.3 | 0.4 |
Performance of the classifier depending on the SVM classification thresholds applied when using groups highly populated in disease mutations only.
Performance of the classifier when the groups in which UniProt divides the protein kinase superfamily are considered individually
| Group | Accuracy (%) | Precision (%) | Recall (%) | MCC |
|---|---|---|---|---|
| CMGC † | 91.5 | 87.5 | 8.6 | 0.1 |
| TKL † | 68.7 | 70.5 | 70.9 | 0.4 |
| TK † | 71.3 | 69.7 | 68.3 | 0.4 |
| RGC † | 58.2 | 47.9 | 61.3 | 0.2 |
| Atypical_PI3-PI4 † | 70.6 | 47.1 | 100 | 0.8 |
| STE | 96.8 | 43.7 | 11.1 | 0.1 |
| AGC † | 90.8 | 43.3 | 61.1 | 0.5 |
| Other | 88.9 | 41.6 | 95.4 | 0.9 |
| CK1 | 97.7 | 33.3 | 22.2 | 0.2 |
| Atypical_Alpha-type | 89.9 | 9.1 | 88.9 | 0.8 |
| CAMK † | 55.5 | 8.3 | 51.9 | 0.1 |
| Atypical_ADCK | 70.0 | 0 | 0 | 0 |
| NEK | 100 | 0 | 0 | 0 |
| Atypical_RIO | 100 | 0 | 0 | 0 |
| Atypical_PDK-BCKDK | 100 | 0 | 0 | 0 |
| Atypical_FAST | 100 | 0 | 0 | 0 |
Performance of the classifier when the groups in which UniProt divides the protein kinase superfamily are considered individually. Groups enriched in disease-associated mutations are indicated by †.
Summary of the performance of other state-of-the-art classifiers of mutations, either general or kinase-specific
| Method | Scope | Accuracy (%) | Precision (%) | Recall (%) | MCC |
|---|---|---|---|---|---|
| Kinase† | 83.3 | 60.0 | 75.2 | 0.6 | |
| SNPs&GO [ | Kinase† | 82.3 | 62.8 | 77.5 | 0.6 |
| Torkamani [ | Kinase | 77.0 | - | - | 0.5 |
| MutationAssessor [ | Kinase† | 53.8 | 41.6 | 95.6 | 0.5 |
| SNAP [ | Kinase† | 49.4 | 34.0 | 93.1 | 0.4 |
| SIFT [ | Kinase† | 77.6 | 37.8 | 27.9 | 0.2 |
| SNPs&GO [ | Genome-wide | 82.0 | 83.0 | 78.0 | 0.6 |
| MutationAssessor [ | Genome-wide | 79.0 | - | - | - |
| SNAP [ | Genome-wide | 78.2 | 76.7 | 80.2 | |
| SIFT [ | Genome-wide | 68.3 | 66.1 | 56.5 | 0.3 |
Summary of the performance of other state-of-the-art classifiers of mutations, either general or kinase-specific. Performance was measured in terms of overall accuracy recall and the Matthews correlation coefficient. General methods with which the prediction corresponds to our dataset are marked with †. The remaining results for the classifiers displayed here were taken directly from their original publications