| Literature DB >> 12225582 |
Mitch Kostich1, Jessie English, Vincent Madison, Ferdous Gheyas, Luquan Wang, Ping Qiu, Jonathan Greene, Thomas M Laz.
Abstract
BACKGROUND: Eukaryotic protein kinases (EPKs) constitute one of the largest recognized protein families represented in the human genome. EPKs, which are similar to each other in sequence, structure and biochemical properties, are important players in virtually every signaling pathway involved in normal development and disease. Near completion of projects to sequence the human genome and transcriptome provide an opportunity to identify and perform sequence analysis on a nearly complete set of human EPKs.Entities:
Mesh:
Substances:
Year: 2002 PMID: 12225582 PMCID: PMC126868 DOI: 10.1186/gb-2002-3-9-research0043
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Variability tolerated at key residue positions within the EPK catalytic domain
| Residue | G1 | G2 | G3 | K | E | D | N | D | E | D |
| Subdomain | I | I | I | II | III | V1B | V1B | VII | VIII | IX |
| Number not conserved* | 52/473 | 29/473 | 135/473 | 16/482 | 19/483 | 29/493 | 23/493 | 23/496 | 34/493 | 22/493 |
| % conserved | 89% | 94% | 71% | 97% | 96% | 94% | 95% | 95% | 93% | 95% |
| Amino-acid substitution† | A/14; S/10 | A/5; D/4 | A/55; S/53 | C/4; R/3, N/3 | A/7; D/3, R/3 | N/17; S/5 | (S/T)/8; (K/R)/8 | E/6; G/5 | D/14; N/10 | N/7; A, G, T/3 each |
| SCOP (structure)‡ | ||||||||||
| Number not conserved | 0/27 | 0/27 | 4/27 | 0/27 | 0/27 | 0/27 | 0/27 | 1/27 | 1/27 | 0/27 |
| Amino-acid substitution | S/3, A/1 | E/1 | N/1 |
The partial alignment (see Additional data files) was used to examine conservation among previously identified [52] highly conserved residues in EPK subdomains I-IX. The amino acids listed along the top of the table correspond to residues of PKA-Calpha (NP_002721): G1 = G51, G2 = G53, G3 = G56, K73, E92, D167, N172, D185, E209, D221. *The number of residues in Table 1 that differed from the consensus residue are listed first, along with the number of sequences informative at that residue. †The top two most commonly substituted amino acids at the given position and the number of occurrences of the substituted amino acid at that position. ‡The 27 EPK structural families defined in SCOP were searched for examples of EPKs with non-canonical residues at any of these ten positions. G53 is substitued three times with S (CK2, Cal, Phk) and once with A (Dap). Residue D in subdomain VII is substituted with E in titin. E in subdomain VIII is substituted with N in CK1.
Figure 1Dendrogram summarizing sequence groupings and branching patterns of EPKs derived from the phenogram. The program TreeExplorer was used to edit the main phenogram (see Additional data files), collapsing branches composed entirely of sequences that cluster with one another. Branch nomenclature was guided by previously accepted subfamily nomenclature, but differs from the classical subfamily nomenclature where justified by the structure of the phenogram. Sequences that do not clearly belong to a particular cluster were left in the figure as singletons. Because no branch lengths were specified for partial sequences present in the phenogram, and because only a portion of the catalytic domain of these sequences is available for comparison, assigning partial sequences to clusters would have been subject to significant error, and was not attempted. Partial sequences are therefore not included in this figure. Subfamilies are labeled in red, while singletons are labeled in blue. The relationship between each EPK, the assigned branch names, and the classical subfamily nomenclature is presented in the EPK data table (see Additional data files).