| Literature DB >> 20537135 |
Boris Sobolev1, Dmitry Filimonov, Alexey Lagunin, Alexey Zakharov, Olga Koborova, Alexander Kel, Vladimir Poroikov.
Abstract
BACKGROUND: The knowledge about proteins with specific interaction capacity to the protein partners is very important for the modeling of cell signaling networks. However, the experimentally-derived data are sufficiently not complete for the reconstruction of signaling pathways. This problem can be solved by the network enrichment with predicted protein interactions. The previously published in silico method PAAS was applied for prediction of interactions between protein kinases and their substrates.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20537135 PMCID: PMC3098073 DOI: 10.1186/1471-2105-11-313
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Local similarity estimation. The diagonal corresponds to the shift value h providing the best match between the region of sequence Q and sequence D. Ais the summarized similarity of superposed areas of sequences Q and D terminated at qand d, respectively. Thus, the score R= A- A, presents the highest similarity score being found for the selected region of sequence Q. Finally, the similarity score Stakes the maximal values from Rscores.
Designations and descriptions of kinases whose substrates were included into the training set
| Kinase type | Description | ||
|---|---|---|---|
| ABL1 | Proto-oncogene tyrosine-protein kinase | 178 | 1271 |
| ATM | Ataxia telangiectasia mutated | 118 | 3056 |
| AURORA_A | Serine/threonine-protein kinase 6 (STK6) | 136 | 1863 |
| AURORA_B | Serine/threonine-protein kinase 12 (AURKB) | 136 | 923 |
| CAM_KII_group | Calcium/calmodulin-dependent protein kinase II | 52 | 5037 |
| CAM_KII_alpha | Calcium/calmodulin-dependent protein kinase II alpha | 52 | 4967 |
| CDK1 | Cell division control protein 2 homolog (Cyclin-dependent kinase 1) | 107 | 4684 |
| CDK2 | Cell division protein kinase 2 | 119 | 1971 |
| CDKgroup | Cyclin-dependent kinases | 149 | 1863 |
| CK1alpha | Casein kinase 1, alpha | 140 | 911 |
| CK1group | Casein kinases 1 | 195 | 2843 |
| CK2group | Casein kinase 2 | 98 | 2346 |
| DNA_PK | DNA-dependent protein kinase catalytic subunit | 270 | 4128 |
| EGFR | Epidermal growth factor receptor (Receptor tyrosine-protein kinase ErbB-1) | 76 | 1291 |
| ERK2 | Mitogen-activated protein kinase 1 | 196 | 2225 |
| ERK1 | Mitogen-activated protein kinase 3 | 168 | 2749 |
| FYN | Proto-oncogene tyrosine-protein kinase Fyn | 164 | 2758 |
| GSK3beta | Glycogen synthase kinase 3 beta | 164 | 2470 |
| GSKgroup | Glycogen synthase kinases 3 | 157 | 1914 |
| INS_R | Insulin receptor | 132 | 1382 |
| JNK1 | c-Jun N-terminal kinase 1 | 196 | 1242 |
| JNK2 | c-Jun N-terminal kinase 2 | 196 | 1075 |
| LCK | Lymphocyte-specific protein tyrosine kinase | 220 | 2472 |
| LKB1 | Serine/threonine kinase 11 (LKB1) | 433 | 1263 |
| LYN | Tyrosine-protein kinase Lyn | 202 | 1827 |
| MAPKAPK2 | mitogen-activated protein kinase-activated protein kinase 2 | 168 | 1807 |
| MAPKgroup | P38, JNK and ERK | 136 | 1914 |
| PAK1 | Serine/threonine-protein kinase PAK1 | 89 | 2647 |
| PDK-1 | 3-phosphoinositide dependent protein kinase 1 | 268 | 1374 |
| PKAalpha | Protein kinase, cAMP-dependent, catalytic, alpha | 52 | 2749 |
| PKAgroup | cAMP-dependent protein kinase | 30 | 5037 |
| PKBgroup | Protein kinases B | 130 | 5890 |
| PKCalpha | Protein kinase C, alpha type | 72 | 2441 |
| PKCbeta | Protein kinase C, beta 1 | 149 | 1531 |
| PKCdelta | Protein kinase C, delta type | 187 | 2414 |
| PKCgroup | Protein kinase | 30 | 2442 |
| PKCzeta | Protein kinase C, zeta type | 147 | 1242 |
| PKGgroup | cGMP-dependent protein kinases | 90 | 5037 |
| PLK1 | Polo like kinase 1 | 163 | 3418 |
| ROCKgroup | Rho-associated, coiled-coil containing protein kinases | 309 | 737 |
| RSKgroup | Ribosomal protein S6 kinases | 198 | 2647 |
| SGKgroup | Serum/glucocorticoid regulated kinase | 341 | 3144 |
| SRC | Proto-oncogene tyrosine-protein kinase Src | 101 | 4544 |
| SYK | Tyrosine-protein kinase SYK (Spleen tyrosine kinase). | 113 | 1290 |
| P38alpha | mitogen-activated protein kinase 14 | 168 | 902 |
Lmin and Lmax are the minimal and maximal values of the sequence length of proteins referred to the given class.
Figure 2Intersection of the kinase substrate classes.
IAP values obtained by LOO CV for the training set
| Kinase type | Number of substrates | Frame values | |||||||
|---|---|---|---|---|---|---|---|---|---|
| 10 | 15 | 20 | 25 | 30 | 35 | 40 | 45 | ||
| ABL1 | 32 | 0.652 | 0.665 | 0.672 | 0.661 | 0.675 | 0.685 | 0.683 | 0.664 |
| ATM | 30 | 0.787 | 0.780 | 0.786 | 0.779 | 0.785 | 0.781 | 0.782 | 0.786 |
| AURORA_A | 12 | 0.747 | 0.732 | 0.743 | 0.792 | 0.784 | 0.773 | 0.769 | 0.744 |
| AURORA_B | 14 | 0.857 | 0.840 | 0.819 | 0.858 | 0.871 | 0.879 | 0.871 | 0.876 |
| CAM_II_group | 40 | 0.689 | 0.708 | 0.699 | 0.707 | 0.680 | 0.692 | 0.693 | 0.703 |
| 21 | 0.616 | 0.592 | 0.591 | 0.528 | 0.519 | 0.529 | |||
| CDK1 | 69 | 0.640 | 0.645 | 0.641 | 0.648 | 0.656 | 0.657 | 0.662 | 0.658 |
| CDK2 | 28 | 0.767 | 0.747 | 0.754 | 0.761 | 0.753 | 0.748 | 0.730 | 0.725 |
| CDKgroup | 47 | 0.693 | 0.715 | 0.702 | 0.682 | 0.664 | 0.663 | 0.667 | 0.670 |
| CK1alpha | 11 | 0.578 | 0.553 | 0.575 | 0.609 | 0.625 | 0.642 | 0.594 | 0.560 |
| CK1group | 18 | 0.642 | 0.644 | 0.639 | 0.637 | 0.627 | 0.630 | 0.660 | 0.662 |
| CK2group | 122 | 0.745 | 0.740 | 0.735 | 0.746 | 0.742 | 0.737 | 0.742 | 0.748 |
| 11 | 0.492 | 0.506 | 0.458 | 0.563 | 0.537 | 0.545 | |||
| EGFR | 27 | 0.840 | 0.843 | 0.883 | 0.861 | 0.893 | 0.887 | 0.891 | 0.888 |
| ERK2 | 71 | 0.714 | 0.700 | 0.695 | 0.697 | 0.698 | 0.700 | 0.696 | 0.702 |
| ERK1 | 61 | 0.655 | 0.639 | 0.634 | 0.632 | 0.632 | 0.631 | 0.622 | 0.634 |
| FYN | 25 | 0.687 | 0.697 | 0.695 | 0.696 | 0.651 | 0.627 | 0.619 | 0.632 |
| GSK3beta | 26 | 0.654 | 0.661 | 0.690 | 0.688 | 0.706 | 0.718 | 0.716 | 0.725 |
| GSKgroup | 20 | 0.650 | 0.640 | 0.664 | 0.616 | 0.589 | 0.592 | 0.557 | 0.566 |
| INS_R | 13 | 0.709 | 0.641 | 0.643 | 0.685 | 0.668 | 0.632 | 0.623 | 0.569 |
| JNK1 | 20 | 0.762 | 0.773 | 0.754 | 0.766 | 0.783 | 0.770 | 0.777 | 0.779 |
| JNK2 | 10 | 0.672 | 0.631 | 0.605 | 0.653 | 0.632 | 0.599 | 0.638 | 0.655 |
| LCK | 29 | 0.813 | 0.820 | 0.831 | 0.824 | 0.826 | 0.834 | 0.838 | 0.835 |
| LKB1 | 16 | 0.996 | 0.995 | 0.994 | 0.994 | 0.993 | 0.989 | 0.990 | 0.991 |
| LYN | 26 | 0.751 | 0.771 | 0.743 | 0.722 | 0.705 | 0.706 | 0.711 | 0.712 |
| MAPKAPK2 | 17 | 0.618 | 0.637 | 0.629 | 0.641 | 0.571 | 0.542 | 0.511 | 0.522 |
| MAPKgroup | 36 | 0.677 | 0.664 | 0.676 | 0.676 | 0.666 | 0.668 | 0.665 | 0.669 |
| 21 | 0.500 | 0.517 | 0.570 | 0.575 | 0.576 | 0.541 | |||
| PDK-1 | 24 | 0.957 | 0.958 | 0.956 | 0.957 | 0.955 | 0.948 | 0.949 | 0.950 |
| 22 | 0.367 | 0.356 | 0.356 | 0.424 | 0.420 | 0.398 | |||
| PKAgroup | 206 | 0.658 | 0.660 | 0.669 | 0.668 | 0.672 | 0.648 | 0.646 | 0.648 |
| PKBgroup | 63 | 0.663 | 0.676 | 0.661 | 0.655 | 0.640 | 0.630 | 0.637 | 0.637 |
| PKCalpha | 81 | 0.663 | 0.653 | 0.656 | 0.643 | 0.646 | 0.649 | 0.656 | 0.653 |
| 10 | 0.294 | 0.376 | 0.350 | 0.475 | 0.481 | 0.427 | |||
| 17 | 0.418 | 0.472 | 0.449 | 0.463 | 0.479 | 0.493 | |||
| PKCgroup | 145 | 0.733 | 0.744 | 0.754 | 0.757 | 0.756 | 0.724 | 0.724 | 0.725 |
| PKCzeta | 11 | 0.643 | 0.626 | 0.668 | 0.701 | 0.736 | 0.746 | 0.743 | 0.733 |
| 10 | 0.492 | 0.505 | 0.553 | 0.606 | 0.587 | 0.548 | |||
| PLK1 | 18 | 0.678 | 0.628 | 0.670 | 0.718 | 0.731 | 0.721 | 0.688 | 0.704 |
| ROCKgroup | 12 | 0.828 | 0.852 | 0.862 | 0.856 | 0.866 | 0.872 | 0.880 | 0.889 |
| RSKgroup | 18 | 0.592 | 0.592 | 0.618 | 0.658 | 0.645 | 0.626 | 0.620 | 0.640 |
| SGKgroup | 11 | 0.738 | 0.749 | 0.699 | 0.695 | 0.699 | 0.699 | 0.699 | 0.683 |
| SRC | 92 | 0.731 | 0.732 | 0.740 | 0.742 | 0.742 | 0.738 | 0.746 | 0.745 |
| SYK | 21 | 0.741 | 0.729 | 0.752 | 0.766 | 0.775 | 0.764 | 0.744 | 0.750 |
| P38alpha | 24 | 0.726 | 0.720 | 0.723 | 0.737 | 0.741 | 0.728 | 0.705 | 0.717 |
| Average | 0.678 | 0.678 | 0.681 | 0.689 | 0.689 | 0.687 | 0.683 | 0.681 | |
IAP values obtained by 20-fold multiple splitting
| Kinase type | All species | Human | ||||||
|---|---|---|---|---|---|---|---|---|
| No | LOO CV | M | SD | No | LOO CV | M | SD | |
| ABL1 | 32 | 0.661 | 0.685 | 0.072 | 24 | 0.600 | 0.600 | 0.106 |
| ATM | 30 | 0.779 | 0.751 | 0.086 | 29 | 0.771 | 0.762 | 0.075 |
| AURORA_A | 12 | 0.792 | 0.777 | 0.130 | - | |||
| AURORA_B | 14 | 0.858 | 0.857 | 0.114 | 11 | 0.808 | 0.815 | 0.115 |
| CAM_II_group | 40 | 0.707 | 0.657 | 0.084 | 19 | 0.651 | 0.681 | 0.090 |
| CAM_KII_alpha | 21 | 0.531 | 0.515 | 0.120 | 16 | 0.475 | 0.494 | 0.111 |
| CDK1 | 69 | 0.648 | 0.641 | 0.049 | 62 | 0.679 | 0.653 | 0.064 |
| CDK2 | 28 | 0.761 | 0.740 | 0.066 | 21 | 0.659 | 0.666 | 0.092 |
| CDKgroup | 47 | 0.682 | 0.653 | 0.065 | 30 | 0.546 | 0.534 | 0.080 |
| CK1alpha | 11 | 0.609 | 0.595 | 0.134 | 10 | 0.548 | 0.535 | 0.097 |
| CK1group | 18 | 0.637 | 0.622 | 0.158 | 10 | 0.487 | 0.508 | 0.182 |
| CK2group | 122 | 0.746 | 0.734 | 0.041 | 87 | 0.680 | 0.670 | 0.046 |
| DNA_PK | 11 | 0.508 | 0.466 | 0.121 | - | |||
| EGFR | 27 | 0.861 | 0.808 | 0.090 | 21 | 0.723 | 0.728 | 0.092 |
| ERK1 | 71 | 0.697 | 0.624 | 0.048 | 54 | 0.621 | 0.637 | 0.054 |
| ERK2 | 61 | 0.632 | 0.673 | 0.038 | 52 | 0.662 | 0.656 | 0.048 |
| FYN | 25 | 0.696 | 0.726 | 0.083 | 19 | 0.625 | 0.664 | 0.096 |
| GSK3beta | 26 | 0.688 | 0.671 | 0.087 | 20 | 0.606 | 0.588 | 0.125 |
| GSKgroup | 20 | 0.616 | 0.599 | 0.113 | 13 | 0.648 | 0.620 | 0.183 |
| INS_R | 13 | 0.685 | 0.583 | 0.133 | - | |||
| JNK1 | 20 | 0.766 | 0.763 | 0.093 | 15 | 0.786 | 0.759 | 0.121 |
| JNK2 | 10 | 0.653 | 0.634 | 0.125 | - | |||
| LCK | 29 | 0.824 | 0.795 | 0.058 | 24 | 0.787 | 0.792 | 0.072 |
| LKB1 | 16 | 0.994 | 0.993 | 0.003 | 15 | 0.995 | 0.995 | 0.005 |
| LYN | 26 | 0.722 | 0.745 | 0.090 | 20 | 0.699 | 0.691 | 0.118 |
| MAPKAPK2 | 17 | 0.641 | 0.652 | 0.082 | 15 | 0.590 | 0.596 | 0.081 |
| MAPKgroup | 36 | 0.676 | 0.661 | 0.063 | 31 | 0.625 | 0.614 | 0.089 |
| PAK1 | 21 | 0.573 | 0.574 | 0.098 | 16 | 0.527 | 0.514 | 0.092 |
| PDK-1 | 24 | 0.957 | 0.956 | 0.038 | 19 | 0.942 | 0.933 | 0.079 |
| PKAalpha | 22 | 0.409 | 0.466 | 0.108 | - | |||
| PKAgroup | 206 | 0.668 | 0.655 | 0.028 | 138 | 0.595 | 0.593 | 0.037 |
| PKBgroup | 63 | 0.655 | 0.650 | 0.051 | 55 | 0.625 | 0.624 | 0.038 |
| PKCalpha | 81 | 0.643 | 0.627 | 0.054 | 68 | 0.648 | 0.630 | 0.053 |
| PKCbeta | 10 | 0.364 | 0.383 | 0.155 | - | |||
| PKCdelta | 17 | 0.49 | 0.507 | 0.111 | 16 | 0.382 | 0.439 | 0.115 |
| PKCgroup | 145 | 0.757 | 0.755 | 0.036 | 84 | 0.691 | 0.656 | 0.046 |
| PKCzeta | 11 | 0.701 | 0.670 | 0.207 | 10 | 0.646 | 0.584 | 0.170 |
| PKGgroup | 10 | 0.551 | 0.642 | 0.187 | - | |||
| PLK1 | 18 | 0.718 | 0.661 | 0.163 | 17 | 0.699 | 0.669 | 0.119 |
| ROCKgroup | 12 | 0.856 | 0.839 | 0.155 | - | |||
| RSKgroup | 18 | 0.658 | 0.644 | 0.095 | 14 | 0.471 | 0.511 | 0.146 |
| SGKgroup | 11 | 0.695 | 0.685 | 0.182 | 10 | 0.600 | 0.543 | 0.159 |
| SRC | 92 | 0.742 | 0.717 | 0.043 | 65 | 0.656 | 0.627 | 0.052 |
| SYK | 21 | 0.766 | 0.709 | 0.158 | 17 | 0.689 | 0.615 | 0.179 |
| p38alpha | 24 | 0.737 | 0.700 | 0.083 | 23 | 0.767 | 0.786 | 0.065 |
| Average | 0.689 | 0.677 | 0.096 | 0.654 | 0.648 | 0.094 | ||
The calculation was performed for the whole training set (1021 substrate proteins) and subset, including only human proteins (709). The numbers of the substrates in each class (No) are presented. IAP values were calculated by LOO CV procedure. The IAP values, averaged on 20 rounds of multiple splitting (M) and their standard deviations (SD), are presented. The kinase substrate classes containing less than 10 proteins (for human) are excluded.
The confirmation of TRANSPATH® interaction data with the PAAS prediction
| Substrate Accession No in UniProt | Substrate Name in TRANSPATH® database | Kinase type | IAP* | |
|---|---|---|---|---|
| O15169 | Axin | CK1group | - | 0.637 |
| O15169 | Axin | GSK3beta | 0.844 | 0.688 |
| O15169 | Axin | Cdk* | 0.031 | 0.682 |
| P24941 | Cdk2 | Lyn | 0.525 | 0.722 |
| P17302 | Connexin-43 | Src | 0.754 | 0.742 |
| P17302 | Connexin-43 | PKCgroup* | 0.863 | 0.757 |
| P17302 | Connexin-43 | PKCalpha | 0.342 | 0.643 |
| Q13158 | FADD | PKCgroup* | 0.446 | 0.757 |
| Q13158 | FADD | CK1alpha | - | 0.609 |
| P05230 | FGF-1 | CK2group | - | 0.746 |
| P43694 | GATA-4 | ERK2 | 0.915 | 0.697 |
| P43694 | GATA-4 | GSK3beta | 0.688 | 0.688 |
| Q16665 | HIF-1alpha | ERK1 | 0.038 | 0.632 |
| Q16665 | HIF-1alpha | ERK2 | - | 0.697 |
| Q01344 | IL-5Ralpha | Lyn | 0.153 | 0.722 |
| P17535 | JunD | ERK2 | 0.749 | 0.697 |
| P17535 | JunD | JNK2 | 0.686 | 0.653 |
| P17535 | JunD | JNK1 | 0.642 | 0.766 |
| Q13233 | MEKK1 | ABL1l | 0.044 | 0.661 |
| Q13233 | MEKK1 | PKCgroup | - | 0.757 |
| Q13233 | MEKK1 | GSKgroup* | 0.371 | 0.616 |
| O15151 | Mdm4 | CK1alpha | 0.183 | 0.609 |
| O15151 | Mdm4 | ATM | 0.818 | 0.779 |
| P27361 | ERK1 | Lck | - | 0.824 |
| P27361 | ERK1 | MAPKgroup | 0.627 | 0.824 |
| Q16539 | p38alpha | p38aplha | 0.482 | 0.737 |
| Q13469 | NF-AT1 | JNK1 | 0.301 | 0.766 |
| Q13469 | NF-AT1 | CK1group | 0.664 | 0.637 |
| Q13469 | NF-AT1 | PKCzeta | 0.193 | 0.701 |
| P16234 | PDGFRalpha | ABL1 | 0.305 | 0.661 |
| P09619 | PDGFRbeta | ABL1 | 0.468 | 0.661 |
| P53350 | Plk1 | Cdk1 | - | 0.648 |
| P53350 | Plk1 | PKAgroup | - | 0.668 |
| P28749 | p107 | CDKgroup* | 0.636 | 0.682 |
| Q13309 | Skp2 | Cdk2 | 0.249 | 0.761 |
| Q9Y6H5 | Synphilin-1 | GSK3Beta | 0.709 | 0.688 |
| Q9Y6H5 | Synphilin-1 | CK2group | 0.429 | 0.746 |
| Q93038 | DR3 | ERK2 | 0.617 | 0.697 |
| P10276 | RAR-alpha | MAPKgroup* | 0.845 | 0.676 |
| P10276 | RAR-alpha | PKCgroup | - | 0.757 |
| P23771 | GATA-3 | MAPKgroup* | 0.536 | 0.676 |
| P29353 | Shc-1 | Src | 0.881 | 0.742 |
| P29353 | Shc-1 | ABL1 | 0.456 | 0.661 |
| P29353 | Shc-1 | JNK1 | 0.126 | 0.766 |
| P29353 | Shc-1 | MAPKgroup | 0.028 | 0.676 |
| P29353 | Shc-1 | Lyn | 0.013 | 0.722 |
| P29353 | Shc-1 | RSKgroup | - | 0.658 |
| P35228 | NOS2 | ERK1 | - | 0.632 |
| Q07812 | Bax | PKBgroup* | 0.850 | 0.655 |
| Q07812 | Bax | JNK1 | 0.619 | 0.766 |
| Q07812 | Bax | MAPKgroup | 0.124 | 0.676 |
| Q13009 | Tiam-1 | PKCgroup | 0.315 | 0.757 |
| P05771 | PKCgroup | PDK-1 | 0.521 | 0.957 |
| P05129 | PKCgamma | PDK-1 | 0.456 | 0.957 |
| P28482 | ERK2 | PDK-1 | 0.690 | 0.957 |
*IAP values were calculated for the kinase type (specificity class) at training procedure.
Figure 3Signal transduction cascade from TNF ligands to transcription factors reconstructed by ExPlain™ system. TNF ligand is depicted as orange triangle. Transcription factors (TFs, diamonds) are identified by promoter analysis of up-regulated genes upon TNF-alpha stimulation of HUVEC cell line. Dashed arrows represent the novel predicted kinase-substrate interactions helping to connect TNF ligands with TFs through cascades of phosphorylation events. All other arrows represent signal transduction interactions known in TRANSPATH®. The up-regulated molecules are red. The down-regulated molecules are green. Two underlined TFs can be reached from TNF ligands in less than 6 steps with the help of the novel kinase-substrate interactions only.
Figure 4Binding sites for MEF-2A and STAT6 transcription factors. These binding sites are closely situated in promoters of three highly up-regulated genes upon TNF-alpha treatment. TF sites are found with ExPlain™ and position weight matrices (PWMs) from TRANSFAC® database. Sites are shown as arrows above the sequences of promoters. The names of PWMs are shown together with the obtained site score (shown in the brackets).