| Literature DB >> 25949811 |
Hisayoshi Yoshizaki1, Shujiro Okuda2.
Abstract
BACKGROUND: Protein phosphorylation is a post-translational modification that is essential for a wide range of eukaryotic physiological processes, such as transcription, cytoskeletal regulation, cell metabolism, and signal transduction. Although more than 200,000 phosphorylation sites have been reported in the human genome, the physiological roles of most remain unknown. In this study, we provide some useful datasets for the assessment of functional phosphorylation signaling using a comparative genome analysis of phosphorylation motifs.Entities:
Keywords: Comparative evolutionary analysis; Kinase; Phosphorylation motif
Mesh:
Substances:
Year: 2015 PMID: 25949811 PMCID: PMC4422407 DOI: 10.1186/s13742-015-0057-6
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1Scatterplot of the CIs of known phosphosites against those of all phosphorylation residues in the human genome. CIs of motifs with known phosphosites and all serine/threonine/tyrosine residues were plotted: the correlation coefficient is 0.865. The solid line indicates y = x. See also Data2-1, Data2-2, and Data2-3 in Additional file 2.
Figure 2Histogram of phosphomotifs, including R-x-x-S/T and S/T-P patterns based on CIs. Frequency distribution of CIs for the phosphomotifs including S/T-P and basic amino acid residues at the N-terminals was calculated (see Additional file 3). Each class of the frequency distribution includes a successive group of ten phosphomotifs ordered by CI (rank of CI). Average CIs in a class are presented as plots in the upper panel. The lower panel shows the number of phosphomotifs including R/K-x-x-S/T, R/K-x-x-x-S/T, or R/K-x-S/T (black) and S/T-P (white) patterns for ranked CI. See also Additional file 3.
Relationships between phosphorylation motifs and protein kinases
| Motif id | Motif sequence | Number of substrates | Kinase classification | 1st | 2nd | 3rd | |||
|---|---|---|---|---|---|---|---|---|---|
| 12 | S/T-P | 3671 | Group | CMGC | 84% | AGC | 6% | ATYPICAL | 4% |
| Family | MAPK | 41% | CDK | 35% | GSK | 6% | |||
| 166 | R-X-S/T | 1873 | Group | AGC | 65% | CAMK | 10% | Other | 10% |
| Family | PKA | 33% | PKC | 19% | AUR | 7% | |||
| 173 | R-X-X-S/T | 2908 | Group | AGC | 61% | CAMK | 21% | CMGC | 8% |
| Family | PKA | 24% | PKC | 13% | AKT | 10% | |||
The top three fractions of kinase groups and families are shown.
Figure 3Evolutionary expansion and conservation of kinases and respective substrates. (A) Evolutionary conservation of substrates of kinase groups: CIs for each kinase group defined in PhosphoSitePlus [13] were calculated using the same method as the CIs of phosphomotifs (see the section of the definition of phosphomotif CIs). See also Data5-1 in Additional file 5. (B) Evolutionary expansions of kinase groups: 371 kinases were classified into nine kinase groups and were extracted from PhosphoSitePlus and KEGG BRITE databases. Orthologs of the kinases in the genomes hsa, mmu, dre, dme, and cel were assigned by the KEGG SSDB homology database. See also Additional file 6. Fractions of kinase groups for each genome obtained from KEGG SSDB were calculated. In addition, the fraction that was calculated for the kinases classified in WormBase used in the study by Lehmann et al. is indicated by an asterisk [28,29]. The blue line indicates the number of kinase genes. The assignment of kinase groups was performed using PhosphoSitePlus [13,14]. hsa, Homo sapiens; mmu, Mus musculus; dre, Danio rerio; dme, Drosophila melanogaster; cel, Caenorhabditis elegans.