| Literature DB >> 17668044 |
Emmanuel J Chang1, Rashida Begum, Brian T Chait, Terry Gaasterland.
Abstract
Protein phosphorylation, mediated by a family of enzymes called cyclin-dependent kinases (Cdks), plays a central role in the cell-division cycle of eukaryotes. Phosphorylation by Cdks directs the cell cycle by modifying the function of regulators of key processes such as DNA replication and mitotic progression. Here, we present a novel computational procedure to predict substrates of the cyclin-dependent kinase Cdc28 (Cdk1) in the Saccharomyces cerevisiae. Currently, most computational phosphorylation site prediction procedures focus solely on local sequence characteristics. In the present procedure, we model Cdk substrates based on both local and global characteristics of the substrates. Thus, we define the local sequence motifs that represent the Cdc28 phosphorylation sites and subsequently model clustering of these motifs within the protein sequences. This restraint reflects the observation that many known Cdk substrates contain multiple clustered phosphorylation sites. The present strategy defines a subset of the proteome that is highly enriched for Cdk substrates, as validated by comparing it to a set of bona fide, published, experimentally characterized Cdk substrates which was to our knowledge, comprehensive at the time of writing. To corroborate our model, we compared its predictions with three experimentally independent Cdk proteomic datasets and found significant overlap. Finally, we directly detected in vivo phosphorylation at Cdk motifs for selected putative substrates using mass spectrometry.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17668044 PMCID: PMC1924601 DOI: 10.1371/journal.pone.0000656
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Position-specific scoring matrix representing the Cdk phosphorylation motif
| −1 | 0 | 1 | 2 | 3 | |
|
| 0.052 | 0 | 0 | 0.049 | 0.015 |
|
| 0.046 | 0 | 0 | 0.056 | 0.015 |
|
| 0.032 | 0 | 0 | 0.007 | 0 |
|
| 0.04 | 0 | 0 | 0.021 | 0 |
|
| 0.055 | 0 | 0 | 0.035 | 0.015 |
|
| 0.066 | 0 | 0 | 0.021 | 0.015 |
|
| 0.052 | 0 | 0 | 0.056 | 0.029 |
|
| 0.029 | 0 | 0 | 0.07 | 0.015 |
|
| 0.057 | 0 | 0 | 0.15 | 0.59 |
|
| 0.052 | 0 | 0 | 0.049 | 0.015 |
|
| 0.06 | 0 | 0 | 0.091 | 0.015 |
|
| 0.049 | 0 | 0 | 0.021 | 0.015 |
|
| 0.02 | 0 | 1 | 0.007 | 0.029 |
|
| 0.075 | 0 | 0 | 0.007 | 0.015 |
|
| 0.08 | 0 | 0 | 0.14 | 0.15 |
|
| 0.04 | 0.5 | 0 | 0.028 | 0.015 |
|
| 0.046 | 0.5 | 0 | 0.063 | 0.015 |
|
| 0.04 | 0 | 0 | 0.056 | 0.015 |
|
| 0.057 | 0 | 0 | 0.028 | 0.015 |
|
| 0.052 | 0 | 0 | 0.035 | 0.015 |
Figure 1Analysis of Canonical Cdk Motif Clustering in Yeast and Mock Proteomes.
(A) The number of proteins having a given score decreases as that score increases. Yeast proteins are represented in navy, and mock proteins are represented in magenta. At low score, (i.e. less than ∼4), yeast and mock are similar– the ratio of mock to yeast, shown by black squares, approximates unity (B). However at higher scores (i.e. 5 and above), yeast proteome contains substantially more proteins than mock (A), and the ratio of mock/yeast approaches zero (B). All proteins from the yeast proteome scoring 5 or higher are considered candidate substrates.
Bioinformatic screen for candidates Cdc28 substrates
| Protein Name | Candidate (Reg Expr) | Candidate (PSSM) | Borderline (PSSM) | In Vivo | In Vitro | Cyclin |
| Rad9 | 9 | 8.14 | x | |||
| Lte1 | 8 | 7.48 | x | Clb2 | ||
| Swi5 | 8 | 7.97 | Yes | x | ||
| Yer041w | 8 | 5.61 | ||||
| Ace2 | 7 | 7.14 | Yes | x | Clb3 | |
| Ase1 | 7 | 5.53 | x | |||
| Ash1 | 7 | 6.41 | x | |||
| Sli15 | 7 | 6.86 | ||||
| Bud4 | 6 | 5.22 | x | |||
| Cdh1 | 6 | 4.42 | Yes | x | Cln2/Clb3 | |
| Fir1 | 6 | 5.91 | x | |||
| Orc2 | 6 | 6.08 | Yes | x | Clb5** | |
| Zrg8 | 6 | 5.47 | ||||
| Bck1 | 5 | 4.85 | x | |||
| Bem3 | 5 | 6.52 | x | Cln2 | ||
| Boi1 | 5 | 4.30 | ||||
| Bud3 | 5 | 4.03 | x | Clb2 | ||
| Caf120 | 5 | 5.86 | x | |||
| Cdc15 | 5 | 3.43 | ||||
| Cdc6 | 5 | 3.96 | Yes | Clb2 | ||
| Exo84 | 5 | 3.86 | x | |||
| Fin1 | 5 | x | ||||
| Hcm1 | 5 | 3.80 | x | |||
| Hpr5 | 5 | 4.78 | x | |||
| Lre1 | 5 | 4.62 | x | |||
| Mcm3 | 5 | 4.46 | x | |||
| Mse1 | 5 | 3.93 | ||||
| Pak1 | 5 | 4.89 | x | |||
| Pkc1 | 5 | 5.70 | x | |||
| Pms1 | 5 | 5.31 | ||||
| Rga2 | 5 | 4.86 | x | |||
| Sfi1 | 5 | x | ||||
| Sir4 | 5 | 5.24 | x | |||
| Sld2 | 5 | 5.69 | Yes | |||
| Smc4 | 5 | 3.32 | x | |||
| Stb1 | 5 | 4.75 | Yes | x | ||
| Ste20 | 5 | 4.33 | Yes | Cln2 | ||
| Ypl267w | 5 | 4.47 | x | Cln2 | ||
| Bni4 | 5.86 | |||||
| Iqg1 | 4.62 | |||||
| Orc6 | 4.44 | Yes | x | Clb5** | ||
| Plm2 | 4.96 | |||||
| Rpo21 | 4.45 | |||||
| Ssn2 | 5.12 | |||||
| Yjl051w | 4.88 | |||||
| Ymr124w | 4.67 | |||||
| Acc1 | 3.30 | |||||
| Bni1 | 4.36 | x | ||||
| Chd1 | 3.86 | |||||
| Dal81 | 3.27 | x | ||||
| Dna2 | 3.45 | x | ||||
| Far1 | 3.46 | Yes | x | Cln2/Clb5 | ||
| Fun30 | 3.67 | x | ||||
| Fun31 | 3.32 | |||||
| Gac1 | 3.58 | |||||
| Hpc2 | 3.50 | |||||
| Inp52 | 3.75 | |||||
| Kel1 | 3.36 | x | Clb2 | |||
| Leu1 | 3.45 | |||||
| Mds3 | 4.06 | Clb3 | ||||
| Mlp1 | 3.39 | x | ||||
| Mps2 | 3.32 | |||||
| Mpt1 | 3.52 | |||||
| Msb1 | 3.34 | x | ||||
| Myo3 | 3.28 | x | ||||
| Ndd1 | 3.46 | Yes | x | |||
| Net1 | 3.38 | Yes | x | |||
| Nup60 | 3.50 | x | ||||
| Pds1 | 3.28 | Yes | x | |||
| Pkh2 | 3.59 | |||||
| Rim15 | 3.38 | |||||
| Sac3 | 3.36 | |||||
| Spa2 | 3.61 | x | ||||
| Swi4 | 3.29 | |||||
| Tfg1 | 3.63 | |||||
| Tra1 | 4.29 | |||||
| Tus1 | 3.39 | x | ||||
| Ubp2 | 3.70 | |||||
| Ulp2 | 3.25 | x | ||||
| Ycr033w | 4.19 | |||||
| Ydl239c | 3.38 | |||||
| Ygr271w | 3.50 | |||||
| Yhr080c | 3.24 | |||||
| Yil112w | 3.81 | |||||
| Yjl084w | 3.43 | x | ||||
| Ynr047w | 3.45 | |||||
| Yor066w | 4.22 | x | ||||
| Yor129c | 3.32 | |||||
| Yor177c | 3.34 | |||||
| Yox1 | 3.29 | |||||
| Zip1 | 3.37 | x |
see Supplementary Table S2.
Reference [47]
Reference[48]
Phosphorylation confirmed via mass spectrometry
Figure 2Analysis of kinetic-derived PSSM motif clustering.
The number of proteins having a given score decreases as that score increases. Yeast proteins are represented in navy, and mock proteins are represented in magenta. At low score, (I.e. less than ∼3.2), yeast and mock are similar– the ratio of mock to yeast, shown by black squares, approximates unity. (Proteins are grouped in bins 0.4 units wide) However at higher scores (I.e. 4.4 and above), yeast proteome contains substantially more proteins than mock, and the ratio of mock/yeast approaches zero. All proteins from the yeast proteome scoring 4.4or higher are considered candidate substrates. The region between 3.2 and 4.4 is considered a transition region and yeast proteins with these scores are considered borderline candidate substrates