| Literature DB >> 24312350 |
Mark A Arbing1, Sum Chan, Liam Harris, Emmeline Kuo, Tina T Zhou, Christine J Ahn, Lin Nguyen, Qixin He, Jamie Lu, Phuong T Menchavez, Annie Shin, Thomas Holton, Michael R Sawaya, Duilio Cascio, David Eisenberg.
Abstract
The expression of heteroligomeric protein complexes for structural studies often requires a special coexpression strategy. The reason is that the solubility and proper folding of each subunit of the complex requires physical association with other subunits of the complex. The genomes of pathogenic mycobacteria encode many small protein complexes, implicated in bacterial fitness and pathogenicity, whose characterization may be further complicated by insolubility upon expression in Escherichia coli, the most common heterologous protein expression host. As protein fusions have been shown to dramatically affect the solubility of the proteins to which they are fused, we evaluated the ability of maltose binding protein fusions to produce mycobacterial Esx protein complexes. A single plasmid expression strategy using an N-terminal maltose binding protein fusion to the CFP-10 homolog proved effective in producing soluble Esx protein complexes, as determined by a small-scale expression and affinity purification screen, and coupled with intracellular proteolytic cleavage of the maltose binding protein moiety produced protein complexes of sufficient purity for structural studies. In comparison, the expression of complexes with hexahistidine affinity tags alone on the CFP-10 subunits failed to express in amounts sufficient for biochemical characterization. Using this strategy, six mycobacterial Esx complexes were expressed, purified to homogeneity, and subjected to crystallization screening and the crystal structures of the Mycobacterium abscessus EsxEF, M. smegmatis EsxGH, and M. tuberculosis EsxOP complexes were determined. Maltose binding protein fusions are thus an effective method for production of Esx complexes and this strategy may be applicable for production of other protein complexes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24312350 PMCID: PMC3843698 DOI: 10.1371/journal.pone.0081753
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic of Esx complex coexpression strategies.
Bicistronic operons encoding the Esx complex genes, separated by a naturally occurring intergenic region of variable length, were cloned into three different expression vectors. (A) The two subunits are coexpressed from a single bicistronic transcript with an N-terminal His6 tag and TEV protease site on the CFP-10 homolog. (B) The two subunits are coexpressed from a single bicistronic transcipt with an N-terminal MBP fusion with His6 tag and TEV protease site on the CFP-10 homolog. (C) The two subunits are coexpressed from a single bicistronic transcript with an N-terminal MBP fusion with TEV protease site on the CFP-10 homolog and a C-terminal His6 tag on the ESAT-6 homolog. Concurrent expression of TEV protease (TEVp) cleaves the MBP moiety from the CFP-10 homolog intracellularly at the TEV protease site positioned between the MBP C-terminus and CFP-10 N-terminus.
Figure 2Expression and affinity purification of mycobacterial Esx complexes.
(A) SDS-PAGE analysis (Any-kD TGX gel, Bio-Rad) of small-scale expression and affinity purification of His6–tagged proteins: lanes 1–3, EsxGHms complex; lanes 4–6, EsxOPmt complex; lanes 7–9, EsxTUmt complex; and lanes 10–12, EsxEFmt complex. The first lane for each complex is expression of the complex from the pMA507 vector with the His6 tag alone on the N-terminus of the CFP-10 homolog, the second lane is the complex expressed from pMA510 which expresses the CFP-10 homolog with an N-terminal MBP-His6 fusion (indicated by an asterisk), and the third lane is the complex expressed from the pMAPLe3 vector which allows proteolytic cleavage of the MBP moiety in vivo and purification of the complex via a C-terminal His6 tag on the ESAT-6 homolog. Arrows indicate the presence of Esx complex subunits. (B) SDS-PAGE analysis (Any-kD TGX gel, Bio-Rad) of concentrated Esx complex samples from large-scale purification (∼3 µg per lane): 1, EsxOPmt; 2, EsxEFmt; 3, EsxGHms; 4, EsxGHma; 5, EsxEFma; and 6, EsxTUma.
Data collection statistics for M. abscessus EsxEF, M. smegmatis EsxGH, and M. tuberculosis EsxOP complexes.
| EsxEFma (native) | EsxEFma (SeMet) | EsxGHms | EsxOPmt (form I) | EsxOPmt (form II) | ||
| Beamline | 24-ID-C | 24-ID-C | 24-ID-E | 24-ID-C | 24-ID-C | |
|
| Peak | Inflection | ||||
| Space group | P1 | P31 | P63 | P3221 | P3221 | C2221 |
| Wavelength (Å) | 0.9791 | 0.9793 | 0.9792 | 0.9792 | 0.9794 | 1.282 |
| Unit cell constants (Å) at -173°C | 47.6 74.1 84.4 (114.6° 103.3° 95.5°) | 46.4 46.4 71.4 (90° 90° 120°) | 105.6 105.6 71.3 (90° 90° 120°) | 66.4 66.4 162.6 (90° 90° 120°) | 66.4 66.4 162.5 (90° 90° 120°) | 76.1 112.1 95.3 (90 90 90) |
|
| ||||||
| Resolution range [Å] | 20.00–1.96 (2.01–1.96) | 30.00–3.00 (3.08–3.00) | 80.00–2.70 (2.77–2.70) | 28.31 – 2.55 (2.75–2.55) | 100.00–2.55 (2.64–2.55) | 52.53–2.55 (2.62–2.55) |
| Mean redundancy | 1.96 | 2.98 | 3.70 | 5.50 | 5.55 | 5.56 |
| Number of unique reflections | 67097 | 6796 | 46063 | 25291 | 25637 | 13340 |
| Completeness of data [%] | 93.0 (90.0) | 98.5 (99.5) | 99.0 (99.0) | 96.5 (100) | 98.3 (100) | 98.1 (95.3) |
| Mean I/ | 13.7 | 7.5 | 8.0 | 19.9 | 15.4 | 14.9 |
Numbers in parentheses refer to the high resolution data shell.
Refinement statistics for M. abscessus EsxEF, M. smegmatis EsxGH, and M. tuberculosis EsxOP complexes.
| EsxEFma (native) | EsxGHms | EsxOPmt (form I) | EsxOPmt (form II) | |
| PDB Accession | 4I0X | 3Q4H | 3OGI | 4GZR |
|
| ||||
| Space group | P1 | P63 | P3221 | C2221 |
| Resolution range [Å] | 20.00–1.96 (1.98–1.96) | 80.00–2.70 (2.77–2.70) | 28.31 – 2.55 (2.75–2.55) | 38.05–2.55 (2.64–2.55) |
| R | 15.9 (26.6) | 21.0 (24.1) | 21.6 (27.7) | 19.9 (25.2) |
| R | 21.3 (33.3) | 26.8 (27.7) | 26.0 (31.5) | 23.3 (28.6) |
| Test set size [%], selection | 10, random | 5, random | 5, random | 10, random |
|
| ||||
| RMSD Bond lengths [Å] | 0.012 | 0.008 | 0.008 | 0.014 |
| RMSD Bond angles [°] | 1.16 | 0.87 | 1.07 | 1.23 |
| Ramachandran plot (%) | ||||
| Most favored | 98.4 | 97.1 | 94.9 | 96.7 |
| Additionally allowed | 1.6 | 2.5 | 4.3 | 3.3 |
| Generously allowed | 0 | 0 | 0.9 | 0 |
| Disallowed | 0 | 0.3 | 0 | 0 |
|
| ||||
| Protein residues | EsxEma: A2–A76, C2–C76, E3–E69, G2–G69, I1–I76, K2–78; EsxFma: B8–B90, D9–D91, F7–F45/F51–F96, H12–H87, J10–J46/J50–J88, L9–L90 | EsxGms: A5–A94, C5–C91; EsxHms: B7–B86, D1–D83. | EsxOmt: A13–A39/A49–A69, C13–C68; EsxPmt: B5–B41/B46–B97, D7–D38/D56–D97 | EsxOmt: A12–A69; C9–C69; EsxPmt: B4–B38/B54–B95, D3–D38/D51–D94 |
| Waters | 337 | 33 | 35 | 31 |
| Ligands | SO4 2− (1), glycerol (1), β-mercaptoethanol (1) | – | – | SO4 2− (1) |
Numbers in parentheses refer to the high resolution data shell.
Figure 3Ribbon representations of the structures of the Esx complexes determined in this study.
The CFP-10 homologs are colored red and the ESAT-6 homologs are colored blue. (A) the EsxEFma complex; (B), the EsxGHms complex; and (C), the EsxOPmt complex. The N- and C-termini of individual chains are labeled and the disordered loop region of EsxOmt that connects its two α-helices is indicated by a dashed line. The tyrosine and acidic residues of the secretion signals of the EsxGHms and EsxOPmt complexes are shown in stick representation.
Figure 4Surface characteristics of Esx complexes (I).
The complexes are shown in the same orientation as in Figure 3 with the ESAT-6 homolog subunit facing the viewer. (A) the EsxEFma with the surface colored by electrostatic potential (first column), hydrophobicity (second column), and sequence identity (third column). (B) the EsxGHms complex colored as in (A). (C) the EsxOPmt complex colored as in (A). Colored bars under each column indicate: column 1, electrostatic surface potentials of +/− 5 kT calculated at an ionic strength of 150 mM; column 2, hydrophobicity with a gradient of red (most hydrophobic) to blue (least hydrophobic); and column 3, the degree of sequence conservation with variable regions in teal, highly conserved regions in burgundy, and the regions where the degree of conservation could not be assigned with confidence in yellow. Sequence conservation was calculated using alignments of 15 (EsxEFma), 21 (EsxGHms), and 27 (EsxOPmt) pairs of homologous sequences (listed in Table S2).
Figure 5Surface characteristics of Esx complexes (II).
View of the Esx complex surfaces with the complexes rotated 180° versus the orientations in Figure 3 to show the CFP-10 homolog side of the complex. With the exception of the rotation the parameters are the same as in Figure 4 with EsxEFma in (A), EsxGHms complex in (B), and the EsxOPmt complex in (C).
Structural homologs of the mycobacterial Esx complexes described in this study. Non-redundant targets in the first 200 results of the DALI search results are listed.
| Organism (Protein) | PDBid-chain | Z score | RMSD | Number aligned | Sequence identity (%) |
|
| |||||
|
| 4IOE-B | 9.8 | 2.2 | 75 (82) | 21 |
|
| 2G38-D | 8.1 | 1.9 | 75 (173) | 20 |
|
| 2VS0-B | 8.0 | 2.4 | 74 (84) | 7 |
|
| 4I0X-B | 7.7 | 2.4 | 74 (83) | 22 |
|
| 3ZBH-D | 7.5 | 2.4 | 75 (93) | 5 |
|
| 3GWK-C | 7.4 | 2.8 | 73 (98) | 14 |
|
| 3Q4H-A | 7.4 | 2.8 | 73 (90) | 14 |
|
| |||||
|
| 3FAV-D | 9.0 | 2.7 | 74 (78) | 19 |
|
| 3FX7-B | 8.8 | 2.1 | 77 (87) | 16 |
|
| 3FAV-A | 8.7 | 1.9 | 72 (74) | 19 |
|
| 3Q4H-C | 8.3 | 2.2 | 75 (87) | 13 |
|
| 4IOG-C | 7.4 | 3.6 | 79 (91) | 10 |
|
| 2G38-A | 7.3 | 2.4 | 72 (77) | 7 |
|
| |||||
|
| 4IOE-B | 8.9 | 2.2 | 81(82) | 16 |
|
| 3FAV-D | 8.3 | 3.1 | 76(78) | 14 |
|
| 4I0X-B | 8.2 | 2.3 | 78(83) | 12 |
|
| 4I0X-I | 8.1 | 2.2 | 75(76) | 12 |
|
| 2GTS-A | 8.1 | 2.6 | 76(77) | 11 |
|
| 2VS0-A | 8.0 | 2.9 | 81(83) | 16 |
|
| 3ZBH-B | 7.7 | 3.4 | 87(92) | 9 |
|
| 2G38-B | 6.2 | 3.6 | 85 (173) | 11 |
|
| 3O9O-A | 6.2 | 4.1 | 86 (92) | 10 |
|
| |||||
|
| 3FAV-D | 10.0 | 1.5 | 74 (78) | 24 |
|
| 4I0X-I | 8.8 | 2.8 | 76 (76) | 14 |
|
| 2VS0-A | 8.3 | 2.9 | 76 (83) | 11 |
|
| 3FX7-B | 8.2 | 2.6 | 78 (87) | 8 |
|
| 4I0X-L | 7.9 | 2.6 | 77 (82) | 8 |
|
| 4IOG-D | 7.8 | 2.7 | 78 (86) | 12 |
|
| 3Q4H-C | 7.6 | 3.5 | 79 (87) | 10 |
|
| 3OGI-D | 7.5 | 2.9 | 70 (74) | 11 |
|
| 2G38-B | 7.4 | 2.8 | 78 (173) | 15 |
|
| |||||
|
| 4I0X-L | 5.9 | 3.3 | 55 (82) | 7 |
|
| |||||
|
| 3O9O-B | 9.3 | 2.7 | 78 (96) | 13 |
|
| 3ZBH-F | 9.1 | 2.4 | 78 (93) | 17 |
|
| 2VRZ-A | 9.0 | 2.9 | 78 (98) | 10 |
|
| 4I0X-I | 8.9 | 1.9 | 70 (76) | 7 |
|
| 4IOG-B | 8.8 | 3.0 | 77 (93) | 19 |
|
| 3Q4H-B | 7.5 | 2.9 | 74 (80) | 11 |
|
| 2G38-A | 7.4 | 2.0 | 69 (77) | 12 |
In addition to protein name whether the structurally similar chain is of a CFP-10 or ESAT-6 homolog is noted. Alternatively, if the similar structure is of a homodimeric Esx complex or of a PE protein this information is provided.
The number in parentheses is the total length of the similar structure.
Figure 6Structural similarity of mycobacterial Esx complexes to ferritin-like proteins.
The Esx complexes are oriented and colored as in Figure 3. For clarity only the N-termini of the CFP-10 homologs and C-termini of the ESAT-6 homologs are labeled. Ferritin-like proteins are colored orange and N- and C-termini are labeled. (A) Stereo view of the superposition of EsxEFma with a single subunit of the Sulfolobus solfataricus DPS-like dodecamer assembly (PDBid 2CLB, chain A; Z-score of 10.8 with an RMSD of 2.7 Å for the superposition of 126 amino acids with a sequence identity of 5%). (B) Stereo view of the superposition of EsxGHms complex with a single subunit of the E. coli YciE ferritin-like dimer (PDBid 3OGH, chain A; Z-score of 10.5 with an RMSD of 3.0 Å for the superposition of 134 amino acids with a sequence identity of 6%). (C) Stereo view of the superposition of EsxOPmt complex with a single subunit of the Bacillus anthracis BA_0993 hypothetical ferritin-like protein dodecamer (PDBid 2QQY, chain A; Z-score of 9.6 with an RMSD of 3.0 Å for the superposition of 114 amino acids with a sequence identity of 4%).
Top five highest non-redundant results for the DALI search using the two chains of the Esx complexes as a single chain search model.
| Rank | Organism; Protein name; PFAM family/superfamily; oligomeric state | PDBid-chain | Z score | RMSD | Number aligned | Sequence identity (%) |
|
| ||||||
| 1 |
| 2CLB-A | 10.8 | 2.7 | 126 (169) | 5 |
| 2 |
| 1TK6-C | 10.6 | 2.8 | 128 (175) | 12 |
| 3 |
| 4ERU-B | 10.6 | 2.7 | 129 (158) | 9 |
| 4 |
| 2FJC-N | 10.6 | 2.6 | 121 (150) | 11 |
| 5 |
| 3HIU-A | 10.6 | 3.0 | 131 (142) | 7 |
|
| ||||||
| 1 |
| 3OGH-A | 10.5 | 3.0 | 134 (145) | 6 |
| 2 |
| 3HIU-B | 10.2 | 3.0 | 135 (146) | 7 |
| 3 |
| 1TK6-B | 9.7 | 3.4 | 132 (175) | 8 |
| 4 |
| 2GS4-A | 9.2 | 3.1 | 128 (159) | 16 |
| 5 |
| 2VXX-B | 8.9 | 3.5 | 131 (173) | 8 |
|
| ||||||
| 1 |
| 2QQY-A | 9.6 | 3.0 | 114 (139) | 4 |
| 2 |
| 3M6J-B | 9.1 | 2.6 | 96 (126) | 3 |
| 3 |
| 3BT5-A | 9.0 | 3.3 | 115 (151) | 10 |
| 4 |
| 3HIU-A | 9.0 | 3.1 | 112 (142) | 10 |
| 5 | Thermosynechococcus elongatus; Dps family DNA-binding stress response protein; ferritin/ferritin-like superfamily; dodecamer | 2C41-L | 8.9 | 2.9 | 112 (155) | 9 |
| 7 |
| 2WTL-F | 8.9 | 3.0 | 111 (161) | 6 |
If known.
The number in parentheses is the total length of the similar structure.