| Literature DB >> 22393408 |
Monica Totir1, Nathaniel Echols, Max Nanao, Christine L Gee, Alisa Moskaleva, Scott Gradia, Anthony T Iavarone, James M Berger, Andrew P May, Chloe Zubieta, Tom Alber.
Abstract
Structural biology and structural genomics projects routinely rely on recombinantly expressed proteins, but many proteins and complexes are difficult to obtain by this approach. We investigated native source proteins for high-throughput protein crystallography applications. The Escherichia coli proteome was fractionated, purified, crystallized, and structurally characterized. Macro-scale fermentation and fractionation were used to subdivide the soluble proteome into 408 unique fractions of which 295 fractions yielded crystals in microfluidic crystallization chips. Of the 295 crystals, 152 were selected for optimization, diffraction screening, and data collection. Twenty-three structures were determined, four of which were novel. This study demonstrates the utility of native source proteins for high-throughput crystallography.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22393408 PMCID: PMC3290569 DOI: 10.1371/journal.pone.0032498
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Proteome fractionation and purification flow chart.
Approximately 500 g of E. coli cells were lysed at pH 7 using a microfluidizer and the cell debris pelleted. The supernatant was applied to a tangential flow column with a nominal molecular weight cut off of 500 kDa, generating 2 fractions (retentate and flow through). The fraction above 500 kDa (retentate) was further purified via sucrose gradients, size exclusion, and ion exchange chromatography prior to crystallization trials. The fraction less than 500 kDa was applied to multiple affinity and ion exchange columns followed by phenyl sepharose, ion exchange, and size exclusion prior to crystallization trials in microfluidic chips.
Figure 2E. coli proteome predicted and experimental characterization.
(A) Predicted size distribution of all ORFs in the E. coli proteome. (B) Size exclusion chromatograph of crude E. coli lysate with the largest peak at approximately 100 kDa. (C) Final step ion exchange (MonoQ) purification in a typical fractionation experiment. Peaks marked with a star were sent for downstream crystallization trials.
Figure 3Crystallization of native source E. coli proteins.
(A) Capillary electrophoresis of purified protein fractions. White stars indicate samples successfully crystallized and black stars represent solved structures. (B) Crystals of 5-keto-4-deoxyuronate isomerase crystallized from fractions of varying purity. Crystal quality was not always correlated with sample purity. (C) Resolution of the data collected versus percent purity of the starting sample based on quantification of protein concentrations by capillary gel electrophoresis with the Caliper system. Sample purity did not correlate with higher resolution data.
Crystallization conditions and data collection statistics for previously deposited structures.
| PDB ID | Resolution this study | Resolution PDB | Rsym | I/sigI | R-free | Crystallization conditions | Protein description |
| 1IPW | 3.2 | 2.3 | 0.179 | 3.6 | 0.41 | 15–20% PEG 1000, 100 mM MES pH 6.0, 300–400 mM KOAc | Inorganic pyrophosphatase |
| 1XRU | 3.0 | 1.94 | 0.065 | 10.1 | 0.31 | 1.5 M Ammonium sulfate, 0.1 M MES, pH 6.5 | 5-keto-4-deoxyuronate isomerase |
| 1N57 | 2.8 | 1.6 | 0.122 | 15.5 | 0.27 | 0.1 M MgCl2, 0.1 M Tris pH 8.5, 20% PEG 10,000, 0.5% ethyl acetate | Hsp31 |
| 1R2K | 2.0 | 2.1 | 0.054 | 17.4 | 0.29 | 1.4 M sodium citrate tribasic dihydrate, 0.1 M Hepes, pH 7.5 | MoaB (molybdopterin biosynthesis protein B) |
| 1BJN | 2.2 | 2.3 | 0.052 | 20.0 | 0.29 | 0.2 M MgCl2, 0.1 M HEPES pH 7.5, 15% PEG 3350 | phosphoserine aminotransferase |
| 1GG9 | 2.5 | 1.89 | 0.126 | 7.2 | 0.28 | 200 mM NaCl, 100 mM Tris pH 8.5, 20% PEG 3350 | Catalase HPII |
| 1PKY | 3.2 | 2.5 | 0.08 | 20.5 | 0.34 | 0.05 M Ammonium sulfate, 0.05 M Bis-Tris, pH 6.5, 30% pentaerythritol ethoxylate | Pyruvate kinase I |
| 1CS1 | 3.25 | 1.5 | 0.109 | 5.7 | 0.36 | 45% polypropylene glycol 400, 0.1 M Bis-Tris, pH 6.5 | Cystathione gamma-synthase |
| 1TJ7 | 3.0 | 2.44 | 0.166 | 21.6 | 0.29 | 0.1 M Bis-Tris pH 6.5, 1.6 to 0.8 M Ammonium sulfate. | Ariginosuccinate lyase |
| 1BBW | 3.4 | 2.7 | 0.142 | 9.4 | 0.35 | 0.1 M Tris pH 8.5, 25% PEG 3350, 3% isopropanol | Lysyl-tRNA synthetase (constitutive) |
| 1NXG | 3.3 | 2.5 | 0.077 | 8.4 | 0.32 | 0.2 M Lithium sulfate monohydrate, 25% PEG 3350, 0.1 M Tris, pH 8.5 | Citrate synthase |
| 1DHP | 3.0 | 2.3 | 0.182 | 5.6 | 0.29 | 0.1 M HEPES pH 7.5, 10–25% PEG 3350 | Dihydrodipicolinate synthase |
| 1YAC | 2.3 | 1.8 | 0.106 | 6.8 | 0.25 | 0.1 M Bis-Tris pH 6.5, 45% Polypropylene glycol P 400 | ycaC gene product |
| 1×12 | 4.0 | 2.0 | 0.129 | 5.3 | 0.42 | 0.2 M NaFormate, 20% PEG 3350, 3% MeOH | Nicotinamide nucleotide transhydrogenase domain I |
| 1YE9 | 2.5 | 2.8 | 0.174 | 11.4 | 0.33 | 0.1 M MgFormate, 18% PEG 3350 | Catalase HPII (truncated) |
| 1HOT | 2.2 | 2.4 | 0.134 | 16 | 0.34 | 0.056 M sodium phosphate monobasic monohydrate, 1.344 M potassium phosphate dibasic, pH 8.2 | Glucosamine 6-phosphate deaminase |
| 1CG1 | 2.7 | 2.5 | 0.051 | 26.6 | 0.35 | 0.1 M TRIS pH 8.5, 25% PEG 3350 | Adenylosuccinate synthetase |
| 2PWZ | 1.6 | 2.2 | 0.07 | 27 | 0.42 | 1.1 M Sodium malonate in 0.1 M HEPES pH 7, 0.5% Jeffamine ED-2001 | Malate dehydrogenase |
| 1PMO | 2.0 | 2.3 | 0.087 | 11.9 | 0.32 | 0.2 M Ammonium citrate tribasic pH 7, 20% PEG3350 | Glutamate decarboxylase |
Figure 4Structures of previously deposited proteins solved during the pipeline.
All proteins were oligomers as shown above. Proteins are from top left - inorganic pyrophosphatase (1IPW), 5-keto-4-deoxyuronate isomerase (1XRU), Hsp31 (1N57), pyruvate kinase (1PKY), phosphoserine aminotransferase (1BJN), Citrate synthase (1NXG), ycaC gene product (1YAC), Cystathione gamma-synthase (1CS1), Dihydrodipicolinate synthase (1DHP), Arginosuccinate lyase (1TJ7), Nicotinamide nucleotide transhydrogenase domain I (1×12), MoaB (molybdopterin biosynthesis protein B) (1R2K), Catalase HPII (1GG9), Lysyl-tRNA synthetase (constitutive) (1BBW), Glutamate decarboxylase (1PMO), Glucosamine 6-phosphate deaminase (IHOT), Malate dehydrogenase (2PWZ), Adenylosuccinate synthetase (1CG1), catalase HPII truncated (1YE9).
Data collection and refinement statistics for methylglyoxal reductase (YghZ).
| YghZ | |
|
| |
| Space group | P1 |
| Cell dimensions | |
|
| 91.70, 98.06, 98.26 |
| α, β, ã (°) | 90.3, 93.0, 106.1 |
| Resolution (Å) | 98–1.8 (1.87–1.8) |
|
| 0.06(0.56) |
|
| 9.4(1.6) |
| Completeness (%) | 96.2 (93.3) |
| Redundancy | 3.9(3.9) |
|
| |
| Resolution (Å) | 94–1.8 |
| No. reflections | 277925 |
|
| 17.1/20.8 |
| No. atoms | |
| Protein | 39762 |
| Ligand/ion | 9 |
| Water | 2644 |
|
| |
| Protein | 28.0 |
| Ligand/ion | 20.0 |
| Water | 40.2 |
| R.m.s. deviations | |
| Bond lengths (Å) | 0.019 |
| Bond angles (°) | 1.534 |
*Values in parentheses are for highest-resolution shell.
Figure 5YghZ tetramer and active site.
Left, the YghZ tetramer viewed along the four-fold axis. Putative active site residues are depicted as ball-and-stick and colored by atom with the active site of one monomer outlined by a gray box. Right, close -up view of the active site with putative active site residues colored by atom and labeled.
Data collection and refinement statistics for Glucose-6-phosphate isomerase (pGI).
| pGI | |
|
| |
| Space group | P1 |
| Cell dimensions | |
|
| 69.8, 72.9, 181.9 |
| α, β, ã (°) | 92.5, 97.8, 114.6 |
| Resolution (Å) | 179.–2.05 (2.1–2.05) |
|
| 0.097(0.34) |
|
| 7.3 (1.2) |
| Completeness (%) | 95.0 (95.0) |
| Redundancy | 2.0(2.1) |
|
| |
| Resolution (Å) | 89–2.05 |
| No. reflections | 186581 |
|
| 17.2/23.0 |
| No. atoms | |
| Protein | 25984 |
| Ligand/ion | 1 |
| Water | 3262 |
|
| |
| Protein | 23.1 |
| Ligand/ion | 27.5 |
| Water | 30.3 |
| R.m.s. deviations | |
| Bond lengths (Å) | 0.011 |
| Bond angles (°) | 1.300 |
*Values in parentheses are for highest-resolution shell.
Figure 6PGI dimer and putative active site.
Left, the pGI dimer. Right, close -up view of the active site with putative active site residues colored by atom and labeled. The active site is formed at the dimer interface and has contributions from both monomers.
Data collection and refinement statistics for 6-phospho-beta-glucosidase (BglA).
| BglA | |
|
| |
| Space group | P1 |
| Cell dimensions | |
|
| 73.7, 79.4, 98.6 |
| α, β, ã (°) | 100.0, 107.2, 102.8 |
| Resolution (Å) | 22.9–2.3 (2.42–2.3) |
|
| 0.143(0.53) |
|
| 7.9 (2.2) |
| Completeness (%) | 92.7 (66.3) |
| Redundancy | 4.0(3.9) |
|
| |
| Resolution (Å) | 22.9–2.3 |
| No. reflections | 82815 |
|
| 23.7/17.1 |
| No. atoms | |
| Protein | 16976 |
| Ligand/ion | 19 |
| Water | 1651 |
|
| |
| Protein | 20.0 |
| Ligand/ion | 72.5 |
| Water | 23.78 |
| R.m.s. deviations | |
| Bond lengths (Å) | 0.003 |
| Bond angles (°) | 0.704 |
*Values in parentheses are for highest-resolution shell.
Figure 7BglA dimer and putative active site.
Left, BglA dimer with the putative active site outlined in a gray box. Right, close up of the active site with glucose-6-phosphate modeled based of the position of the sulfate ion from crystallization. Active site residues are depicted as ball-and-stick. Putative hydrogen bonds to the substrate are drawn as dashed lines.
Data collection and refinement statistics for glutamate dehydrogenase (GDH).
| GDH | |
|
| |
| Space group | P212121 |
| Cell dimensions | |
|
| 101.9 151.6 170.0 |
| α, β, ã (°) | 90.0 90.0 90.0 |
| Resolution (Å) | 40.0–3.2 (3.32–3.20) |
|
| 0.164 (0.515) |
|
| 7.9 (2.3) |
| Completeness (%) | 99.8 (99.7) |
| Redundancy | 3.7 (3.6) |
|
| |
| Resolution (Å) | 38–3.2 |
| No. reflections | 43835 |
|
| 0.2286/0.2653 |
| No. atoms | |
| Protein | 19913 |
| Ligand/ion | 8 |
| Water | 44 |
|
| |
| Protein | 64.6 |
| Ligand/ion | 48.3 |
| Water | 29.5 |
| R.m.s. deviations | |
| Bond lengths (Å) | 0.001 |
| Bond angles (°) | 0.424 |
Figure 8GDH hexamer from E. coli.
The protein forms a hexamer (dimer of trimers). Left, view of the GDH hexamer along the two-fold axis. Right, view of the GDH hexamer along the three-fold axis.