| Literature DB >> 21985771 |
Amit Shrestha1, Garth Hamilton2, Eric O'Neill2, Stefan Knapp1, Jonathan M Elkins3.
Abstract
Bacterial over-expression of kinases is often associated with high levels of auto-phosphorylation resulting in heterogeneous recombinant protein preparations or sometimes in insoluble protein. Here we present expression systems for nine kinases in Escherichia coli and, for the most heavily phosphorylated, the characterisation of factors affecting auto-phosphorylation. Experiments showed that the level of auto-phosphorylation was proportional to the rate of expression. Comparison of phosphorylation states following in vitro phosphorylation with phosphorylation states following expression in E. coli showed that the non-physiological 'hyper-phosphorylation' was occurring at sites that would require local unfolding to be accessible to a kinase active site. In contrast, auto-phosphorylation on unphosphorylated kinases that had been expressed in bacteria overexpressing λ-phosphatase was only observed on distinct exposed sites. Remarkably, the Ser/Thr kinase PLK4 auto-phosphorylated on a tyrosine residue (Tyr177) located in the activation segment. The results give support to a mechanism in which auto-phosphorylation occurs before or during protein folding. In addition, the expression systems and protocols presented will be a valuable resource to the research community. Copyright ÂEntities:
Mesh:
Substances:
Year: 2011 PMID: 21985771 PMCID: PMC3445812 DOI: 10.1016/j.pep.2011.09.012
Source DB: PubMed Journal: Protein Expr Purif ISSN: 1046-5928 Impact factor: 1.650
Expression constructs. Vector information has been recently published [17].
| Target | GenBank ID | Residue Range | Vector | Purification Tag | Protease for tag cleavage | Antibiotic resistance |
|---|---|---|---|---|---|---|
| MAPK13 | 4506085 | 15–359 | pLIC-SGC1 | N-terminal His6 | TEV | Ampicillin |
| MAPK3 | 38257141 | 1–379 | pLIC-SGC1 | N-terminal His6 | TEV | Ampicillin |
| MAPK8 | 4506095 | 1–363 | pLIC-SGC1 | N-terminal His6 | TEV | Ampicillin |
| PLK4 | 21361433 | 1–341 | pNIC-CH | C-terminal His6 | – | Kanamycin |
| MAP2K2 | 13489054 | 46–400 | pNIC-CTHF | C-terminal His6 | TEV | Kanamycin |
| OSR1 | 4826878 | 1–309 | pNIC28-Bsa4 | N-terminal His6 | TEV | Kanamycin |
| STK24 | 20070158 | 31–301 | pNIC28-Bsa4 | N-terminal His6 | TEV | Kanamycin |
| MAPK9 | 21237736 | 1–380 | pNIC-CH | C-terminal His6 | – | Kanamycin |
| CHEK2 | 6005850 | 185–543 | pNIC28-Bsa4 | N-terminal His6 | TEV | Kanamycin |
Protein purification and mass spectrometry analysis. Gel filtration experiments were performed on four separate S200 16/60 columns and the observed retention volumes correspond to the molecular weights of the monomers. Deviations in retention volume are within the range expected for monomeric proteins of the molecular weights indicated. By comparison with molecular weight standards, a dimeric protein of 40 kDa MW would be expected to give a retention volume <75 ml under equivalent conditions. For the mass spectrometry analysis, the range is the minimum and maximum number of phosphorylations observed by mass spectrometry, and the modal value is the most highly populated phosphorylation state.
| Target | Expected molecular mass (Da) | Gel filtration retention volume (ml) | Protein yield (mg/l culture) | Observed mass (Da) | Additional peaks | Interpretation/range of phosphorylations |
|---|---|---|---|---|---|---|
| MAPK13 | 42405.7 | 87 | 9.0 | 42408.0 | None | Correct |
| MAPK3 | 45688.4 | 85 | 5.0 | 45690.9 | 1× Phosphorylation | Correct/range: 0–1 |
| MAPK8 | 44464.5 | 85 | 14.7 | 44472.4 | 1× Phosphorylation | Correct/range: 0–1 |
| PLK4 | 39130 | 95 | 27.0 | 39639.6 | 9–16× Phosphorylation | Loss of N-terminal Met/range: 8–16 |
| MAP2K2 | 42515.7 | 80 | 2.4 | 42518.8 | None | Correct |
| OSR1 | 37168.9 | 83 | 20.3 | 37171.1 | None | Correct |
| STK24 | 33257.1 | 84 | 4.0 | 33339.4 | 2–6× Phosphorylation | Correct/range: 1–6 |
| MAPK9 | 44615.5 | 80 | 36.7 | 44487.5 | None | Loss of N-terminal Met |
| CHEK2 | 40835.1 | 85 | 25.8 | 40917.2 | 2–4× Phosphorylation | Correct/range: 1–4 |
Fig. 1Domains present in each of the nine proteins in this study. For each protein the residue range covered by the expression construct in this study is illustrated above the domain diagram. Domain ranges were taken from analysis against the Pfam database [23].
Fig. 2SDS–PAGE gel of protein samples following gel filtration chromatography. Standard molecular weight markers were Precision Plus from BioRad (M); the numbers in the vertical scale on the left show the mass in kDa. Each sample lane was loaded with 2 μg of protein from the pooled gel filtration chromatography fractions that had been boiled in the presence of sodium dodecyl sulphate. The gel was visualised by Coomassie blue staining.
Fig. 3Example mass spectra of PLK4 following expression in E. coli. (A) Following expression without co-expressed λ-phosphatase. The peaks correspond to a range of phosphorylations from 8 to 16. (B) Following co-expression with λ-phosphatase. The peak corresponds to the molecular weight of the intact protein, minus an N-terminal methionine, with no phosphorylations.
Fig. 4Phosphorylation sites of PLK4 and CHEK2 mapped onto the available crystal structures of these proteins (PDB IDs 3COK and 3I6 W). (A) Phosphorylation sites of PLK4 following expression in E. coli. (B) Phosphorylation sites of PLK4 following in vitro auto-phosphorylation. (C) Phosphorylation sites of PLK4 following in vitro de-phosphorylation of hyper-phosphorylated protein from E. coli expression. (D) Phosphorylation sites of CHEK2 following expression in E. coli. (E) Phosphorylation sites of CHEK2 following in vitro de-phosphorylation of hyper-phosphorylated protein from E. coli expression.
Comparison of phosphorylation status under different expression conditions. The range is the minimum and maximum number of phosphorylations observed by mass spectrometry, and the modal value is the most highly populated phosphorylation state.
| Temp. at induction [IPTG] | 37 °C | 20 °C | ||
|---|---|---|---|---|
| 0.05 mM | 0.5 mM | 0.05 mM | 0.5 mM | |
| PLK4 | Range: 5–10 | Range: 8–16 | Range: 3–9 | Range: 4–9 |
| Modal value: 8 | Modal value: 13 | Modal value: 6 | Modal value: 7 | |
| CHEK2 | Range: 2–5 | Range: 2–5 | Range: 0–4 | Range: 0–5 |
| Modal value: 4 | Modal value: 4 | Modal value: 1 | Modal value: 3 | |
| STK24 | Range: 0–3 | Range: 1–5 | Range: 0–2 | Range: 1–5 |
| Modal value:1 | Modal value: 2 | Modal value: 0 | Modal value: 2 | |
Result from Table 2, the protein from the 50 ml scale expression under these conditions did not give a high quality mass spectrum.
Phosphorylation state following in vitro auto-phosphorylation of protein co-expressed with λ-phosphatase. The range is the minimum and maximum number of phosphorylations observed by mass spectrometry, and the modal value is the most highly populated phosphorylation state.
| Auto-phosphorylation with ATP and Mg2+ | Auto-phosphorylation with ATP, Mg2+ and Mn2+ | |
|---|---|---|
| PLK4 | Range: 0–5 | Range: 1–4 |
| Modal value: 1 | Modal value: 3 | |
| CHEK2 | Range: 0 | Range: 0 |
| Modal value: 0 | Modal value: 0 | |
| STK24 | Range: 0 | Range: 0 |
| Modal value: 0 | Modal value: 0 | |
Locations of phosphorylation sites in PLK4 as determined by proteolytic digestion followed by mass spectrometry. Underlined sites are in parts of the protein visible in the available structures.
| PLK4 | Auto-phosphorylation in | Auto-phosphorylation | Remaining sites after dephosphorylation |
|---|---|---|---|
| Trypsin digest | |||
| Tyr169 | |||
| Thr170 | Thr170 | ||
| Thr174 | Thr174 | ||
| Tyr177 | Tyr177 | Tyr177 | |
| Ser179 | Ser179 | ||
| Thr184 | |||
| Ser284 | |||
| Thr323 | |||
| Chymotrypsin digest | |||
| Thr184 | |||
| Ser186 | |||
Phosphorylation state following in vitro dephosphorylation with λ-phosphatase of protein expressed in E. coli without phosphatase co-expression. The range is the minimum and maximum number of phosphorylations observed by mass spectrometry, and the modal value is the most highly populated phosphorylation state.
| Dephosphorylation with λ-phosphatase | |
|---|---|
| PLK4 | Range: 0–6 |
| Modal value: 1 | |
| CHEK2 | Range: 0–2 |
| Modal value: 0 | |
| STK24 | Range: 1–4 |
| Modal value: 3 | |
Mass spectrometry analysis of proteins after removal of the hexahistidine tag. The range is the minimum and maximum number of phosphorylations observed by mass spectrometry, and the modal value is the most highly populated phosphorylation state. The hexahistidine tag for PLK4 and MAPK9 was non-removable so these proteins do not feature in this table.
| Target | Expected molecular mass (Da) | Observed mass (Da) | Additional peaks | Interpretation/range of phosphorylations |
|---|---|---|---|---|
| MAPK13 | 39940.1 | 39942.3 | None | Correct |
| MAPK3 | 43222.8 | 43225.1 | 1× Phosphorylation | Correct/range: 0–1 |
| MAPK8 | 41998.9 | 42001.6 | 1× Phosphorylation | Correct/range: 0–1 |
| MAP2K2 | 40610.8 | 40613.6 | None | Correct |
| OSR1 | 34703.3 | 34705.2 | None | Correct |
| STK24 | 30791.5 | 30793.2 | 1–3× Phosphorylation | Correct/range: 0–3 |
| CHEK2 | 38369.5 | 38451.2 | 2–4× Phosphorylation | Correct/range: 1–4 |
Locations of phosphorylation sites in CHEK2 as determined by proteolytic digestion followed by mass spectrometry. Underlined sites are in parts of the protein visible in the available structures.
| CHEK2 | Auto-phosphorylation in | Auto-phosphorylation | Remaining sites after dephosphorylation |
|---|---|---|---|
| Trypsin digest | |||
| Ser228 | Ser228 | ||
| Ser260 | Ser260 | ||
| Ser506 | |||
| Thr507 | |||
| Ser516 | |||
| Thr517 | |||
| Ser518 | |||
| Thr532 | |||
| Thr533 | |||
| Chymotrypsin digest | Ser260 | ||
| Thr272 | |||
| Thr383 | |||