Literature DB >> 25203216

A distinct MaoC-like enoyl-CoA hydratase architecture mediates cholesterol catabolism in Mycobacterium tuberculosis.

Meng Yang1, Kip E Guja, Suzanne T Thomas, Miguel Garcia-Diaz, Nicole S Sampson.   

Abstract

The Mycobacterium tuberculosis (Mtb) igr operon plays an essential role in Mtb cholesterol metabolism, which is critical for pathogenesis during the latent stage of Mtb infection. Here we report the first structure of a heterotetrameric MaoC-like enoyl-CoA hydratase, ChsH1-ChsH2, which is encoded by two adjacent genes from the igr operon. We demonstrate that ChsH1-ChsH2 catalyzes the hydration of a steroid enoyl-CoA, 3-oxo-4,17-pregnadiene-20-carboxyl-CoA, in the modified β-oxidation pathway for cholesterol side chain degradation. The ligand-bound and apoenzyme structures of ChsH1-ChsH2(N) reveal an unusual, modified hot-dog fold with a severely truncated central α-helix that creates an expanded binding site to accommodate the bulkier steroid ring system. The structures show quaternary structure shifts that accommodate the four rings of the steroid substrate and offer an explanation for why the unusual heterotetrameric assembly is utilized for hydration of this steroid. The unique αβ heterodimer architecture utilized by ChsH1-ChsH2 to bind its distinctive substrate highlights an opportunity for the development of new antimycobacterial drugs that target a pathway specific to Mtb.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25203216      PMCID: PMC4245171          DOI: 10.1021/cb500232h

Source DB:  PubMed          Journal:  ACS Chem Biol        ISSN: 1554-8929            Impact factor:   5.100


The resurgence of tuberculosis disease (TB) in the 1990s highlighted that tuberculosis is no longer a disease of the past. HIV/AIDS infection, multidrug resistant TB (MDR-TB), and extensively drug resistant TB (XDR-TB) have made the problem more complex.[1] A quarter of HIV infected deaths are caused by TB, and MDR-TB is present all over the world.[2] The drugs currently used to treat tuberculosis require long treatment times and have limited efficacy and significant toxicities. The first line drugs, including isoniazid, rifampicin, and pyrazinamide, require more than half a year’s treatment. Existing drugs only target a small fraction of essential growth pathways, and it is not difficult for Mycobacterium tuberculosis (Mtb) to overcome their inhibition by activating compensatory pathways. In 2010, 8.8 million people were infected with TB and 1.4 million died from TB.[2] New drugs based on novel pathways or targets that can shorten the therapy and eliminate drug-resistant strains are urgently needed. In order to develop new strategies against TB, a better understanding of the virulence and pathogenesis of Mtb’s infection lifecycle is needed. Macrophages are responsible for defending against invasion by foreign pathogens. However, Mtb can reside in macrophages and replicate within them. Mtb successfully assimilates nutrition from the macrophage and hides from attack by the immune system. As an intracellular pathogen, the first challenge Mtb faces is how to obtain nutrients. Mtb preferentially metabolizes host-derived lipids during infection.[3,4] Among a total of 4000 Mtb genes identified through genome sequencing, at least 250 genes are potentially involved in lipid metabolism,[5] which is 5 times more lipid-metabolism genes than Escherichia coli. Many of the lipid-metabolism genes are induced by intracellular growth and infection.[5] Knowing the functions of genes and their corresponding products involved in lipid-utilization pathways in Mtb is critical for understanding Mtb’s infection biology and may provide new targets to develop anti-TB drugs. In the process of defining genes required for survival of Mtb in macrophages by transposon library screening, several experiments highlighted the importance of the intracellular growth operon (igr operon), comprised of Rv3540c to Rv3545c, for survival in macrophages.[6,7] Deletion of the operon (H37Rv:Δigr) results in attenuated growth in both macrophages and mice.[6] The genes of the igr operon were predicted to encode lipid β-oxidation enzymes.[8] However, the operon is not required for growth on even or odd chain fatty acids.[6] During the course of Mtb infection, cholesterol has been demonstrated to serve as a nutrient and is significant for Mtb persistence. Many steroid-utilizing genes have been shown to be critical both in vitro and in vivo.(9) The location of the igr operon in the 82-gene cholesterol catabolism cluster suggested a potential function in cholesterol metabolism,[10] and the operon was found to be required for growth of Mtb in vitro in the presence of cholesterol.[11] Metabolite profiling of the H37Rv:Δigr strain using isotopically labeled cholesterol resulted in accumulation of a metabolite. The structure of the metabolite led to the hypothesis that the igr operon is required for β-oxidation of C20–C22 in the cholesterol side chain[12] (Figure 1a,b).
Figure 1

Biochemical function of igr operon and gene constructs to express ChsH1-ChsH2, ChsH1-ChsH2N. (a) The igr operon in Mtb; chsH1 (Rv3541c) and chsH2 (Rv3542c) are colored in cyan and pink, respectively. Other genes are shown in gray. (b) The biochemical function of the gene products encoded by the igr operon. (1) 3-oxo-4-pregnene-20-carboxyl-CoA (3-OPC-CoA); (2) 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA); (3) 17-hydroxy-3-oxo-4-pregnene-20-carboxyl-CoA (17-HOPC-CoA); (4) androstenedione. (c) Constructs for expression of ChsH1-ChsH2 and ChsH1-ChsH2N. An N-terminal 6× His tag is encoded at the 5′ end of the first gene of each construct. The ltp2 gene following ChsH1 is not shown for clarity. The number of residues encoded by each gene is labeled. The factor Xa cleavage sequence is introduced after the Arg187 codon in the chsH2 gene. (d) SDS-PAGE analysis of system I isolated by IMAC and of system II isolated by IMAC and then subjected to factor Xa cleavage. Protein identities were confirmed by tryptic digest and MALDI-TOF MS fingerprinting.

Biochemical function of igr operon and gene constructs to express ChsH1-ChsH2, ChsH1-ChsH2N. (a) The igr operon in Mtb; chsH1 (Rv3541c) and chsH2 (Rv3542c) are colored in cyan and pink, respectively. Other genes are shown in gray. (b) The biochemical function of the gene products encoded by the igr operon. (1) 3-oxo-4-pregnene-20-carboxyl-CoA (3-OPC-CoA); (2) 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA); (3) 17-hydroxy-3-oxo-4-pregnene-20-carboxyl-CoA (17-HOPC-CoA); (4) androstenedione. (c) Constructs for expression of ChsH1-ChsH2 and ChsH1-ChsH2N. An N-terminal 6× His tag is encoded at the 5′ end of the first gene of each construct. The ltp2 gene following ChsH1 is not shown for clarity. The number of residues encoded by each gene is labeled. The factor Xa cleavage sequence is introduced after the Arg187 codon in the chsH2 gene. (d) SDS-PAGE analysis of system I isolated by IMAC and of system II isolated by IMAC and then subjected to factor Xa cleavage. Protein identities were confirmed by tryptic digest and MALDI-TOF MS fingerprinting. The six genes in the igr operon are annotated as a lipid transfer protein (Ltp2/Rv3540c), MaoC-like hydratases (Rv3541c, Rv3542c), acyl-CoA dehydrogenases (Rv3543c, Rv3544c), and a cytochrome P450 (cyp125/Rv3545c).[11] Enzymatic and structural studies demonstrated that Cyp125 oxidizes cholest-4-en-3-one or cholesterol at C-26 to cholest-4-en-3-one-26-oic acid.[13] However, in the H37Rv strain employed in the igr studies, loss of this activity is compensated by Cyp142.[14] Biophysical characterization of the acyl-CoA dehydrogenases Rv3543c-Rv3544c, named ChsE1-ChsE2, showed that they form an obligate α2β2 heterotetramer with two active sites and two FAD cofactor binding sites.[12,15] ChsE1-ChsE2 catalyzes the C17–C20 dehydrogenation of steroid acyl-CoA thioester 3-oxo-4-pregnene-20-carboxyl-CoA.[15] Earlier pull down experiments demonstrated that Rv3541c and Rv3542c form a protein complex.[12] All of the above information established the importance of the igr operon for cholesterol metabolism by Mtb and highlighted the possibility that a second heteromeric structural motif may be encoded in the operon. Based on a protein sequence BLAST search against the nonredundant protein sequence database (blastp), the igr operon encodes two potential MaoC-like enoyl-CoA hydratases, Rv3541c and Rv3542c. However, their role in cholesterol side chain degradation was unclear since known fatty acid β-oxidation cycles in bacteria, as well as eukaryotes, utilize a crotonase family member to catalyze hydration of fatty enoyl-CoA metabolites. MaoC-like enoyl-CoA hydratases are structurally distinct from crotonases, and in bacteria, they are employed in polyhydroxyalkanoate energy storage pathways instead of for energy metabolism.[16] In eukaryotes, MaoC-like enoyl-CoA hydratases are employed in peroxisomal β-oxidation of di- and trihydroxy-5β-cholestanic acid to chenodeoxycholic acid and cholic acid, respectively. However, these hydratases are part of a multifunctional enzyme (MFE) that includes a fused 3-hydroxy-acyl-CoA dehydrogenase domain.[17,18] Both the bacterial and peroxisomal MaoC-like enoyl-CoA hydratases characterized to date are all homodimers,[16,19] although the peroxisomal enoyl-CoA domains are larger than their bacterial homologues. Sequence similarities indicated that both Rv3541c and Rv3542c have MaoC-like hydratase domains, and in addition Rv3542c includes a DUF35/DUF35_N domain. Here, we demonstrate that Rv3541c and Rv3542c form a heterodimer, two of which further dimerize to form an α2β2 heterotetramer. The unique architecture of this heterotetramer is in stark contrast to previously characterized structural homologues, for example, peroxisomal and bacterial enzymes, which form homodimers. We also demonstrate that Rv3541c-Rv3542c catalyzes the hydration of a steroid substrate, 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA). We now refer to Rv3541c-Rv3542c as ChsH1-ChsH2 to emphasize its function in cholesterol side chain enoyl-CoA hydration. Our data suggest that Mtb metabolizes sterol side chains through a unique assembly of β-oxidation enzymes. The structures described here highlight opportunities for the design of inhibitors specifically targeting cholesterol catabolism in actinobacteria, particularly Mycobacterium tuberculosis.

Results and Discussion

Enzymes that degrade the core structure of steroids like cholesterol are not present in humans. On the other hand, Mtb has the ability to metabolize completely steroids of mammalian origin and utilize these catabolic intermediates for both energy and other processes related to virulence and pathogenesis. Despite the fact that many of these steroid-utilizing genes have been shown to be critical both in vitro and in vivo, no enzyme crystal structure to date has been elucidated of a steroid utilizing β-oxidation enzyme.[9] Given the unclear structural and functional homologies of ChsH1 and ChsH2, as well as their association, we undertook investigation of the structure and activity of the ChsH1-ChsH2 complex in order to understand its role in Mtb cholesterol metabolism.

ChsH1-ChsH2 Forms an Obligate α2β2 Heterotetramer and Preferentially Catalyzes Hydration of Steroid Enoyl-CoA

Initially, ChsH1 and ChsH2 were expressed individually as N-terminally His6-tagged proteins in E. coli. However, both ChsH1 and ChsH2 precipitated directly after elution from an ion affinity chromatography (IMAC) column indicating that ChsH1 and ChsH2 are not stable under these expression conditions. Construct pigr3(12) (Table 1) was expressed heterologously to provide ChsH1-ChsH2 that was isolated by IMAC and further purified using gel filtration chromatography. SDS-PAGE gel analysis confirmed the coisolation of ChsH1 and ChsH2 (Figure 1c,d). Analytical gel filtration chromatography demonstrated that ChsH1-ChsH2 forms a stable oligomeric complex in solution (Figure 2a) with an approximate 1:1 stoichiometry. Further analysis by analytical ultracentrifugation (AUC) equilibrium experiments established that it forms a complex with a molecular weight of 99 kDa (Figure 2b). Thus, given the relative stoichiometry and the individual molecular weights of ChsH1 (14 kDa) and ChsH2 (36 kDa), the ChsH1-ChsH2 complex is an α2β2 heterotetramer. To confirm the stoichiometry of the complex, the assembly was analyzed by LC/UV/MS (Figure 2d). Integration of the ChsH1 and ChsH2 peaks and calculation of their relative molar concentration provided a relative molar stoichiometry of one ChsH1 to one ChsH2, consistent with formation of an α2β2 heterotetramer.
Table 1

Expression Constructs Used in This Work

construct namegenesrestriction sitespurified enzymesource/ref
pET28b   Novagen
pChsH1Rv3541cNdeI/XhoIN-His6-ChsH1this worka
pChsH2Rv3542cNdeI/NotIN-His6-ChsH2this worka
pigr3Rv3542c, Rv3541c, Rv3540cNdeI/HindIIN-His6-ChsH2-ChsH1(12)
pChsH1ChsH2NRv3542c, Rv3541c, Rv3540cNdeI/HindIIN-His6-ChsH2-ChsH1this worka
pigr3H1D29ARv3542c, Rv3541c, Rv3540cNdeI/HindIIN-His6-ChsH2-ChsH1D29Athis worka
pigr3H1H4ARv3542c, Rv3541c, Rv3540cNdeI/HindIIN-His6-ChsH2-ChsH1H34Athis worka
pCasIRo05822NdeI/XhoIN-His6-CasIthis workb

Genes were cloned from H37Rv genomic DNA and ligated into pET28b with the indicated restriction sites to include an N-terminal His6 fusion tag.

The Rhodococcus jostii RHA1 casI (Ro05822) gene was synthesized and ligated into pET15b with the indicated restriction sites to include an N-terminal His6 fusion tag.

Figure 2

ChsH1-ChsH2 and ChsH1-ChsH2N are heterotetramers in solution and prefer to catalyze hydration of a steroid enoyl-CoA substrate. (a) ChsH1-ChsH2 and ChsH1-ChsH2N were analyzed by analytical size exclusion chromatography on a Superdex 75 column. (b) ChsH1-ChsH2 and (c) ChsH1-ChsH2N were centrifuged at speeds of 10000, 20000, and 30000 rpm at 20 °C in an analytical ultracentrifuge. A representative fit for each sample is shown. The solid line shows the fit of the data to the ideal species model, and the residuals of the fit are graphed below the fit. The best global fit for each protein provided molecular weights of 98.6 and 65.2 kDa for ChsH1-ChsH2 and ChsH1-ChsH2N, respectively. (d, e) Reverse phase LC/UV/MS chromatograms of ChsH1-ChsH2 and ChsH1-ChsH2N. Peaks A, B, and C were identified as ChsH2, ChsH1, and ChsH2N, respectively, by deconvolution of multiple charged states in the corresponding ESI+ MS spectra. The absorbance of peaks were integrated, and relative concentrations were determined from the calculated extinction coefficients of ChsH2, ChsH1, and ChsH2N [ε280(ChsH2) = 59993 M–1 cm–1, ε280(ChsH1) = 16960 M–1 cm–1, and ε280(ChsH2N) = 33523 M–1cm–1]. (f) Specific activities of ChsH1-ChsH2 and ChsH1-ChsH2N. ChsH1-ChsH2 and ChsH1-ChsH2N were assayed with octenoyl-CoA, decenoyl-CoA, and 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA).

Genes were cloned from H37Rv genomic DNA and ligated into pET28b with the indicated restriction sites to include an N-terminal His6 fusion tag. The Rhodococcus jostii RHA1 casI (Ro05822) gene was synthesized and ligated into pET15b with the indicated restriction sites to include an N-terminal His6 fusion tag. ChsH1-ChsH2 and ChsH1-ChsH2N are heterotetramers in solution and prefer to catalyze hydration of a steroid enoyl-CoA substrate. (a) ChsH1-ChsH2 and ChsH1-ChsH2N were analyzed by analytical size exclusion chromatography on a Superdex 75 column. (b) ChsH1-ChsH2 and (c) ChsH1-ChsH2N were centrifuged at speeds of 10000, 20000, and 30000 rpm at 20 °C in an analytical ultracentrifuge. A representative fit for each sample is shown. The solid line shows the fit of the data to the ideal species model, and the residuals of the fit are graphed below the fit. The best global fit for each protein provided molecular weights of 98.6 and 65.2 kDa for ChsH1-ChsH2 and ChsH1-ChsH2N, respectively. (d, e) Reverse phase LC/UV/MS chromatograms of ChsH1-ChsH2 and ChsH1-ChsH2N. Peaks A, B, and C were identified as ChsH2, ChsH1, and ChsH2N, respectively, by deconvolution of multiple charged states in the corresponding ESI+ MS spectra. The absorbance of peaks were integrated, and relative concentrations were determined from the calculated extinction coefficients of ChsH2, ChsH1, and ChsH2N [ε280(ChsH2) = 59993 M–1 cm–1, ε280(ChsH1) = 16960 M–1 cm–1, and ε280(ChsH2N) = 33523 M–1cm–1]. (f) Specific activities of ChsH1-ChsH2 and ChsH1-ChsH2N. ChsH1-ChsH2 and ChsH1-ChsH2N were assayed with octenoyl-CoA, decenoyl-CoA, and 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA). We assayed the ChsH1-ChsH2 enzyme complex with three different substrates: octenoyl-CoA, decenoyl-CoA, and 3-oxo-4,17-pregnadiene-20-carboxyl-CoA (3-OPDC-CoA). The hydration process was monitored spectrophotometrically, and the formation of the hydrated products was confirmed by matrix-assisted laser desorption ionization (MALDI)-time of flight (TOF) mass spectrometry. Although ChsH1-ChsH2 catalyzed the hydration of all three substrates, the highest turnover activity was observed with 3-OPDC-CoA (Figure 2f). Therefore, ChsH1-ChsH2 has the ability to bind bulky substrates and preferentially catalyzes the hydration of steroid enoyl-CoA compared with aliphatic enoyl-CoAs, thus establishing a role in cholesterol metabolism for this enzyme.

A Truncated Form of ChsH1-ChsH2N Is Sufficient for Catalytic Activity

In the process of crystallizing full length ChsH1-ChsH2, diffraction quality crystals were obtained. The crystals diffracted to 1.52 Å resolution (Table 2), and the structure was solved using single wavelength anomalous dispersion. The crystals contained four αβ heterodimers (two α2β2 heterotetramers) in the asymmetric unit (Figure 3a, Supplementary Figure 1 and Note, Supporting Information). The electron density was of excellent quality and allowed a model of full length ChsH1 and the N-terminal domain of ChsH2 (180 amino acids out of 311 amino acids) to be built. The absence of further density for the C-terminal domain of ChsH2 prompted us to analyze the protein content of our crystals. Further analysis by SDS-PAGE and MALDI-TOF mass spectrometry confirmed that around a 130 amino acid C-terminal fragment of ChsH2 was lost through proteolysis prior to crystallization. We henceforth refer to this fragment as ChsH2C.
Table 2

Data Collection and Refinement Statistics

 ChsH1-ChsH2NChsH1-ChsH2N/3-OPC-CoA
Data Collection
space groupP21P6122
cell dimensions  
a, b, c (Å)51.8, 134.1, 90.451.9, 51.9, 436.2
α, β, γ (deg)90, 91.6, 9090, 90, 120
resolution (Å)37.6–1.52 (1.50–1.54)a72.70–1.76 (1.768–1.762)a
Rmerge0.033 (0.434)0.068 (0.605)
Rmeas0.038 (0.585)0.073 (0.642)
Rpim0.026 (0.408)0.024 (0.208)
CC1/20.999 (0.886)0.999 (0.852)
II19.2 (2.2)18.8 (8.8)
completeness (%)99.2 (95.7)100 (100)
multiplicity13.4 (11.2)8.0 (8.8)
Wilson B-factor17.0421.19
Refinement
resolution (Å)37.6–1.5272.70–1.76
no. reflns180 56540 369
Rwork/Rfree0.1768/0.20460.1895/0.2322
CC*1.0 (0.976)1.0 (0.963)
no. atoms  
total10,1032,607
protein8,9202,286
ligand 76
water1163263
B-factors  
protein24.034.8
ligand 41.0
water33.740.6
rms deviations  
bond lengths (Å)0.0090.016
bond angles (deg)1.151.57
PDB ID4W784W7B

Values in parentheses are for the highest-resolution shell.

Figure 3

Overall architecture and topology of ChsH1-ChsH2N. (a) Two α2β2 heterotetramers of ChsH1-ChsH2N in one asymmetric unit (ASU). Surface representation of the structure corresponds to chain color. (b, c) ChsH1-ChsH2N adopts a heterotetrameric assembly with two ChsH1 chains and two ChsH2N chains. One ChsH1 chain and one ChsH2N chain fold into a heterodimer and a pair of heterodimers assembles into a tetramer. The identities of the chains are labeled. The structure in panel c is obtained by a 90° rotation of the structure in panel b around the Y-axis. (d) Schematic representation of one heterodimer from the tetramer. Each chain possesses a conserved “hot-dog” fold. α-Helices and β-strands from ChsH1 and ChsH2N are shown as ribbons and arrows, respectively; C-termini and N-termini are labeled. 310-Helices are labeled η. (e) Topology representation of one heterodimer. The color-coding is the same as in panel b. Secondary structure was assigned using DSSP.

Values in parentheses are for the highest-resolution shell. Overall architecture and topology of ChsH1-ChsH2N. (a) Two α2β2 heterotetramers of ChsH1-ChsH2N in one asymmetric unit (ASU). Surface representation of the structure corresponds to chain color. (b, c) ChsH1-ChsH2N adopts a heterotetrameric assembly with two ChsH1 chains and two ChsH2N chains. One ChsH1 chain and one ChsH2N chain fold into a heterodimer and a pair of heterodimers assembles into a tetramer. The identities of the chains are labeled. The structure in panel c is obtained by a 90° rotation of the structure in panel b around the Y-axis. (d) Schematic representation of one heterodimer from the tetramer. Each chain possesses a conserved “hot-dog” fold. α-Helices and β-strands from ChsH1 and ChsH2N are shown as ribbons and arrows, respectively; C-termini and N-termini are labeled. 310-Helices are labeled η. (e) Topology representation of one heterodimer. The color-coding is the same as in panel b. Secondary structure was assigned using DSSP. In order to reliably obtain the protein complex ChsH1-ChsH2N and to reproduce the crystallization, an expression plasmid, pChsH1ChsH2 (Table 1), was constructed that introduced a factor Xa cleavage site followed by four glycine residues after Arg187 of ChsH2 (Figure 1c). The ChsH1-ChsH2 protein complex was obtained upon cistronic expression using the construct pChsH1ChsH2. The full-length protein was purified using IMAC and gel filtration chromatography and then cleaved with factor Xa to provide ChsH1-ChsH2N as confirmed by SDS-PAGE gel and analytical gel filtration (Figure 1d). The molecular weight of ChsH1-ChsH2N was determined to be 65 kDa by AUC (Figure 2c) consistent with the predicted molecular weights of ChsH1 and His6 tagged ChsH2N (14 kDa and 23 kDa, respectively). The relative molar stoichiometry of the subunits remained 1:1 based on LC/UV/MS analysis (Figure 2e). Therefore, ChsH1-ChsH2N still forms an α2β2 heterotetramer. The C-terminal fragment of ChsH2, referred to as ChsH2C, precipitated during factor Xa cleavage. In order to test whether ChsH1-ChsH2N is still catalytically active, we analyzed its ability to catalyze hydration of the substrates described above. Importantly, ChsH1-ChsH2N retained catalytic activity, indicating that ChsH2C is not required for catalysis of hydration (Figure 2f). Furthermore, we were able to obtain crystals of the new protein complex ChsH1-ChsH2N. These crystals diffracted to 1.54 Å and yielded a structure identical to that originally obtained.

The Structure of ChsH1-ChsH2N Reveals a Unique MaoC-like Enoyl-CoA Hydratase Architecture

Consistent with annotations based on BLAST alignments, the crystal structure shown here establishes that ChsH1-ChsH2N belongs to the MaoC-like enoyl-CoA hydratase family. This family contains homodimeric MaoC-like enoyl-CoA hydratases that are comprised of a β-sheet that wraps around a central α-helix, which is known as a hot-dog fold. A second structural motif in MaoC-like hydratases is a three helix segment, referred to as an active site housing segment, and this motif is usually fused with the hot-dog fold domain in the family.[20] Surprisingly, rather than the prototypical homodimeric structure, ChsH1-ChsH2N forms a dimer of a unique αβ heterodimeric assembly comprising one standard and one nonstandard MaoC-like hydratase. In the ChsH1-ChsH2N tetramer, one ChsH1 chain and one ChsH2N chain form an intimate heterodimer. Two of these heterodimers combine to form the heterotetramer (Figure 3b,c,d). As far as we know, ChsH1-ChsH2 is the first example of an α2β2 heteromeric MaoC-like enoyl-CoA hydratase. The interaction between the two heterodimers involves helices α1′ from ChsH1 and α2 from ChsH2N. The helices from one heterodimer form a compact interface with the other heterodimer through salt bridges and hydrogen bonding interactions (Supplementary Figure 2a,b,c, Supporting Information). Arg37 from ChsH1 in one heterodimer forms a double salt bridge with Asp38 from the second ChsH1. Because of the 2-fold rotational symmetry in the dimer–dimer interface, four salt bridges are formed. Six hydrogen bonds are contributed from helices α2 of the ChsH2N chains (Supplementary Figure 2e,f, Supporting Information). Importantly, this heterodimer–heterodimer interface is clearly divergent with respect to other MaoC-like hydratase homologues. A DALI[21] search revealed homologues from several species, ranging from bacteria (Aeromonas caviae; AcRH) and yeast (Candida tropicalis; CtRH) to mammals (Homo sapiens; HuRH). Interestingly, the ChsH1-ChsH2N heterotetramer is comparable to a CtRH homodimer and a HuRH homodimer. The two monomers in CtRH or HuRH dimerize through a similar four-helix bundle (Supplementary Figure 2d, Supporting Information). However, in those cases, the helices from one monomer interact with the helices from the other monomer orthogonally[22] in a very different structural arrangement from ChsH1-ChsH2 (Supplementary Figure 2b,d, Supporting Information).

The ChsH1-ChsH2N Heterodimer Is Structurally Distinct from Its MaoC-like Hydratase Homologues

In the ChsH1-ChsH2N heterodimer, five β-strands (β1′−β3′−β4′−β5′−β2′) from ChsH1 and four β-strands (β1−β4−β3−β2) from ChsH2N form an antiparallel nine-stranded β-sheet. Helix α3′ from ChsH1 and α4 from ChsH2N are above the antiparallel β-sheet in the viewpoint shown in Figure 3d. Helices α2−η1−α3 from ChsH2N and α1′−η1′−α2′ from ChsH1 form a third layer on top of α3′ and α4 and are almost 2-fold symmetric (Figure 3d,e). This architecture and the extensive interface between the monomers imply that ChsH1 and ChsH2N form an obligate heterodimer. The overall structure of the ChsH1-ChsH2N heterodimer from Mtb is disparate from its homologues across different species. The ChsH1-ChsH2N heterodimer has two hot-dog folds in total. Interestingly, a similar architecture is present in other structural homologues, but the two hot-dog folds are present as a single monomer (CtRH or HuRH) or a homodimer (AcRH) (Supplementary Table 1, Supporting Information). The fold of ChsH1 is most similar to the C-terminal domain of the CtRH or HuRH monomer and corresponds to the structure of a single chain of AcRH (Supplementary Table 1, Supporting Information). Surprisingly, ChsH2N, although similar to the N-terminal domains of CtRH and HuRH monomers, deviates from the standard hot-dog fold of AcRH. In addition, both the structure and sequence have low similarity in the active site housing segment (Figure 5c, Supplementary Figure 3, Supporting Information).
Figure 5

Structure comparison and secondary structure sequence alignment between ChsH1 and ChsH2N. (a, b) ChsH1 possesses a standard hot-dog fold, and ChsH2N adopts a nonstandard hot-dog fold. The cartoon representations are colored by secondary structure. The secondary structures are labeled as in Figure 3d,e. N-termini and C-termini are labeled. In ChsH1, the five-stranded β-sheet (β1′-β3′-β4′-β5′-β2′) wraps around α3′, forming a hot-dog fold, and the proposed active site housing segment is comprised of α1′, η1′, and α2′. In ChsH2N, a four-stranded β-sheet (β1-β4-β3-β2) wraps around α4, assembling into a nonstandard hot-dog fold; α2, η1, and α3 fold to form a potential active site housing segment. The potential active site residues are colored blue, and the simulated annealing omit maps of those residues are calculated to reduce the effects of model bias. Electron density is shown as mesh (σ = 3). (c) Cartoon representation of superimposed ChsH1 (yellow) and ChsH2N (magenta). Structurally similar parts are rendered transparent, and the variations are marked. The structure on the right is obtained by rotating the structure on the left 90° about the X-axis. Potential active site residues are shown as sticks. (d) Secondary structural sequence alignment between ChsH1 and ChsH2N. Identical residues are highlighted in black. The secondary structure elements from ChsH1 (yellow) and from ChsH2N (magenta) correspond to panels a and b. Residues that constitute active site housing segments are highlighted in the green box. The central α-helices from ChsH1 and ChsH2N are highlighted in the red box.

The ChsH1-ChsH2N Heterotetramer Forms a Complex with 3-Oxo-4-pregnene-20-carboxyl CoA

In order to investigate the structural basis of preferential hydration of a steroid enoyl-CoA instead of aliphatic enoyl-CoA’s, we cocrystallized ChsH1-ChsH2N with a substrate analog, 3-oxo-4-pregnene-20-carboxyl CoA (3-OPC-CoA) (Figure 4). The crystals diffracted to a resolution of 1.76 Å (Table 2), and molecular replacement was used to solve the ChsH1-ChsH2N/3-OPC-CoA structure using the apoprotein structure as the template.
Figure 4

ChsH1-ChsH2N complexed with 3-oxo-4-pregnene-20-carboxyl-CoA (3-OPC-CoA). (a) Overall atomic picture of ChsH1-ChsH2N/3-OPC-CoA. The holo-heterotetramer is in the same orientation as Figure 3b. A single heterotetramer is in one asymmetric unit (ASU). ChsH1 chains are colored blue, and ChsH2N chains are colored magenta. Ligand 3-OPC-CoA is rendered as sticks and colored by atom. Molecular surfaces are rendered transparent. (b) Superimposed heterodimer apo-ChsH1-ChsH2N structure and ChsH1-ChsH2N/3-OPC-CoA structure. Superimposed ChsH1 chains are colored in blue, and their surface is shown. The ChsH2N chains from Apo-ChsH1-ChsH2N and ChsH1-ChsH2N/3-OPC-CoA are shown in pink and magenta, respectively. The variations occur in the segments of the ChsH2N chains that are labeled. (c) The difference in the opening of the binding pocket before and after 3-OPC-CoA binding. (d) A simulated annealing Fo – Fc omit map of 3-OPC-CoA contoured at 2.5σ was calculated to reduce the effects of model bias. A chemical structure of 3-OPC-CoA is shown and colored by atom. (e) Ligand-binding interactions in the ChsH1-ChsH2N heterodimer. The hydrogen bonds are shown as black dashes. Residues that interact with 3-OPC-CoA are labeled. 3-OPC-CoA fits between β1 from ChsH2N and β2′from ChsH1.

ChsH1-ChsH2N complexed with 3-oxo-4-pregnene-20-carboxyl-CoA (3-OPC-CoA). (a) Overall atomic picture of ChsH1-ChsH2N/3-OPC-CoA. The holo-heterotetramer is in the same orientation as Figure 3b. A single heterotetramer is in one asymmetric unit (ASU). ChsH1 chains are colored blue, and ChsH2N chains are colored magenta. Ligand 3-OPC-CoA is rendered as sticks and colored by atom. Molecular surfaces are rendered transparent. (b) Superimposed heterodimer apo-ChsH1-ChsH2N structure and ChsH1-ChsH2N/3-OPC-CoA structure. Superimposed ChsH1 chains are colored in blue, and their surface is shown. The ChsH2N chains from Apo-ChsH1-ChsH2N and ChsH1-ChsH2N/3-OPC-CoA are shown in pink and magenta, respectively. The variations occur in the segments of the ChsH2N chains that are labeled. (c) The difference in the opening of the binding pocket before and after 3-OPC-CoA binding. (d) A simulated annealing Fo – Fc omit map of 3-OPC-CoA contoured at 2.5σ was calculated to reduce the effects of model bias. A chemical structure of 3-OPC-CoA is shown and colored by atom. (e) Ligand-binding interactions in the ChsH1-ChsH2N heterodimer. The hydrogen bonds are shown as black dashes. Residues that interact with 3-OPC-CoA are labeled. 3-OPC-CoA fits between β1 from ChsH2N and β2′from ChsH1. The atomic picture of the ChsH1-ChsH2N/3-OPC-CoA complex established that one binding site exists per ChsH1-ChsH2N heterodimer. In each heterodimer, the binding pocket is located at the ChsH1-ChsH2N interface and intrudes into ChsH2N (Figure 4a). Binding of 3-OPC-CoA results in a large quaternary structural change compared with the apo-ChsH1-ChsH2N complex (Figure 4b,c). The major changes occur in the ChsH2N chain. Movements of α1, α5, loop I, and loop II (6–7 Å) and the smaller movements of β1−β4−β3−β2 generate enough space to accommodate the 3-OPC-CoA ligand in the tunnel (Figure 4b,c). We posit that the heterodimer interface serves to enable the large structural change required for binding the polycyclic steroid system, which could not occur in a monomeric binding site like that seen in CtRH or HuRH. The ligand is in a boomerang conformation, which is similar to the orientation of 3-hydroxy-octanoyl-CoA in CtRH.[19] The “boomerang” sits between the ChsH1 chain and the ChsH2N chain with its elbow, cysteamine, positioned across β1 and β2′ (Figure 4e). The whole boomerang bends toward ChsH2N with the coenzyme A moiety exposed to solvent and the four-ring steroid system buried inside ChsH2N (Figure 4a,c; Supplementary Figure 4, Supporting Information). Binding interactions with ChsH2N anchor the coenzyme A moiety to the protein. The amine group (N1″) is hydrogen bonded to Asn106 in β1, N2″ is hydrogen bonded to Arg163 in β5, and O3″ is stabilized by Ala137 from loop II. The elbow of the boomerang is stabilized through a hydrogen bond between N4nitrogen and Gly81 from β2′ in ChsH1. The steroid part of the “boomerang” has fewer specific interactions with the enzyme compared with the coenzyme A moiety, consistent with the steroid’s hydrophobicity. The thioester O5″ is stabilized through a hydrogen bonding interaction with Ala104-ChsH2N. No polar interactions were found that stabilize the O3–C3 ketone (Figure 4d, 4e).

ChsH1 possesses a canonical MaoC-like hydratase hot-dog fold providing the active site, but ChsH2N possesses a modified hot-dog fold that confers the ability to bind bulky substrates

ChsH1 includes a complete hot-dog fold and an active site housing segment (Figure 5a). Five antiparallel β strands (β1′-β3′-β4′-β5′-β2′) act as a bun, wrapping around the sausage-like central helix α3′ to form a standard hot-dog fold (Figure 5a), characterized by the four-turns in the central α-helix.[19] The region from α1′, η1′ to α2′ comprises the active site housing segment. The strictly conserved active site residues, Asp29 and His34 (Supplementary Figure 5a), located in the active site housing segment (α1′−η1′−α2′) in ChsH1 (Figure 5a), are almost in identical positions with respect to the active sites in AcRH, CtRH and HuRH (Figure 6b). From the ChsH1-ChsH2N:3-OPCCOA structure, the C17–C20 bond, which is the site of hydration, sits in apposition to the active site residues Asp29/His34 from ChsH1 (Figure 4d, 4e).
Figure 6

Comparisons of MaoC-like enoyl-CoA hydratases across species. (a, b) Three-dimensional structural conservation between AcRH monomer (gray), C-terminal domain of CtRH (C-CtRH) (residue numbers 163–280; purple), C-terminal domain of HuRH (C-HuRH) (residue numbers 174–298; yellow), and ChsH1 chain (cyan). The structural elements are labeled following Figures 3d and 5a. The conserved active sites are enlarged in the box. (c) Three-dimensional structural comparison between AcRH monomer (gray), N-terminal domain of CtRH (N-CtRH) (residue numbers 1–135; purple), N-terminal domain of HuRH (NHuRH) (residue numbers 1–146; yellow), and ChsH2N chain (pink). Active site residues from the other chain of AcRH, C-terminal domains of CtRH and HuRH, or ChsH1 are shown as sticks. The central α-helices are colored red. Molecular surfaces are rendered transparent. The length of the central α-helices correspond to the 3-D structures above them. For simplicity, 310 helices and α-helices are both shown as cylinders. Protein sequences shown underneath are the amino acids comprising the central helices. The sizes of the central helices dictate the substrate preferences. The preferred substrates correspond to the structure above them. AcRH (PDB code 1iq6); CtRH (PDB code 1pn2); HuRH (PDB code 1s9c).

Structure comparison and secondary structure sequence alignment between ChsH1 and ChsH2N. (a, b) ChsH1 possesses a standard hot-dog fold, and ChsH2N adopts a nonstandard hot-dog fold. The cartoon representations are colored by secondary structure. The secondary structures are labeled as in Figure 3d,e. N-termini and C-termini are labeled. In ChsH1, the five-stranded β-sheet (β1′-β3′-β4′-β5′-β2′) wraps around α3′, forming a hot-dog fold, and the proposed active site housing segment is comprised of α1′, η1′, and α2′. In ChsH2N, a four-stranded β-sheet (β1-β4-β3-β2) wraps around α4, assembling into a nonstandard hot-dog fold; α2, η1, and α3 fold to form a potential active site housing segment. The potential active site residues are colored blue, and the simulated annealing omit maps of those residues are calculated to reduce the effects of model bias. Electron density is shown as mesh (σ = 3). (c) Cartoon representation of superimposed ChsH1 (yellow) and ChsH2N (magenta). Structurally similar parts are rendered transparent, and the variations are marked. The structure on the right is obtained by rotating the structure on the left 90° about the X-axis. Potential active site residues are shown as sticks. (d) Secondary structural sequence alignment between ChsH1 and ChsH2N. Identical residues are highlighted in black. The secondary structure elements from ChsH1 (yellow) and from ChsH2N (magenta) correspond to panels a and b. Residues that constitute active site housing segments are highlighted in the green box. The central α-helices from ChsH1 and ChsH2N are highlighted in the red box. Comparisons of MaoC-like enoyl-CoA hydratases across species. (a, b) Three-dimensional structural conservation between AcRH monomer (gray), C-terminal domain of CtRH (C-CtRH) (residue numbers 163–280; purple), C-terminal domain of HuRH (C-HuRH) (residue numbers 174–298; yellow), and ChsH1 chain (cyan). The structural elements are labeled following Figures 3d and 5a. The conserved active sites are enlarged in the box. (c) Three-dimensional structural comparison between AcRH monomer (gray), N-terminal domain of CtRH (N-CtRH) (residue numbers 1–135; purple), N-terminal domain of HuRH (NHuRH) (residue numbers 1–146; yellow), and ChsH2N chain (pink). Active site residues from the other chain of AcRH, C-terminal domains of CtRH and HuRH, or ChsH1 are shown as sticks. The central α-helices are colored red. Molecular surfaces are rendered transparent. The length of the central α-helices correspond to the 3-D structures above them. For simplicity, 310 helices and α-helices are both shown as cylinders. Protein sequences shown underneath are the amino acids comprising the central helices. The sizes of the central helices dictate the substrate preferences. The preferred substrates correspond to the structure above them. AcRH (PDB code 1iq6); CtRH (PDB code 1pn2); HuRH (PDB code 1s9c). To establish that Asp29 and His34 correspond to the active site of the ChsH1-ChsH2 complex, Asp29 and His34 were mutated to alanine in independent constructs (Table 1). The purified protein complexes were analyzed by SDS-PAGE and by analytical size exclusion chromatography. They both had the same elution profile as wild-type ChsH1-ChsH2. Therefore, ChsH1D29A-ChsH2 and ChsH1H34A-ChsH2 still form α2β2 heterotetramers. The activities of these two mutant protein complexes were tested with octenoyl-CoA, decenoyl-CoA, and 3-OPDC-CoA. ChsH1D29A-ChsH2 retained 2%–4% of the wild-type activity. Moreover mutation of His34 to alanine abolished the catalytic activity within detection limits (less than 0.0062% activity) (Supplementary Figure 5b, Supporting Information). In striking contrast to ChsH1, the structure of ChsH2N varies substantially with respect to the standard MaoC-like hydratase structure, for example, ChsH1, AcRH, or the C-terminal domains of CtRH or HuRH (Figures 5b,c,d and 6). Only four antiparallel β-strands wrap around the central helix α4. Furthermore, helix α4 contains two turns, and one turn is a 310 helix (Figure 5b,d). The short central helix (α4) renders the hot-dog fold incomplete (Figure 5c). This short, modified helix results in an open binding pocket formed by the four-stranded β-sheet. Helix α1 and α5 are two new flexible elements (Figures 3d and 5c) that are not present in other known MaoC-like hydratase structures. Even though the active site housing segment (α2– η1−α3) is structurally conserved, the active site histidine is replaced with a tyrosine (Figure 5b,d). The distinctively short central α-helix (α4) is a source of another major difference (Figures 5c and 6c) and has the most important functional implication. The short helix α4 and the following long loop generate a bigger space in the hot-dog fold compared with other MaoC-like hydratases (Figure 6c), and the presence of this space is critical to allow the ChsH1-ChsH2N heterodimer to bind the steroid substrate. Consistently, comparison of this structural feature in the structures of ChsH1-ChsH2N and its homologues (Figure 6c) reveals that as the central α-helix of the hot-dog fold becomes shorter, the binding cavity becomes wider and sufficiently spacious to accommodate increasingly larger and bulkier substrates. It appears that upon shifting from linear alkyl chain and branched alkyl chain substrates to the rigid rings of a steroid system, the protein architecture transitions from small monomer to extended monomer, and finally to a heterodimer (Figure 6c). For example, in the structure of N-CtRH complexed with its physiologic product, (3R)-hydroxy-octanoyl-CoA, the ligand is accommodated in the interface between the nonstandard N-terminal hot-dog fold and the complete C-terminal hot-dog fold.[19] The pantothenate and adenosine triphosphate of the ligand are exposed to solvent, and the ten-carbon acyl chain points toward the shorter central α-helix in the N-terminal nonstandard hot-dog fold.[19] Interestingly, the ChsH1-ChsH2N dimer has a similar, yet distinct, substrate binding mode; ChsH1 provides the active site and ChsH2N provides the binding pocket (Figure 4e). The available space in the suggested binding tunnel of the hot-dog fold is restricted by the rigid central helix, so the length of the helix influences substrate preferences (Figure 6c; Supplementary Table 1, Supporting Information). The existence of a nonstandard hot-dog fold with shorter central helices in the N-terminal domains of CtRH, HuRH, and ChsH2N chain suggests that one active site has evolved into a binding site to accommodate bulky substrates concomitant with the sacrifice of one set of active site catalytic residues (Figure 6c; Supplementary Table 1, Supporting Information). The standard central helix of 16 amino acids in the complete hot-dog fold confers specificity for four to six carbon enoyl-CoAs.[23] In CtRH, the bent central helix of 13 amino acids in the N-terminal hot-dog fold enables accommodation of long-chain enoyl-CoAs (C10–C22) in the binding site.[19] The shorter central α-helix of 11 amino acids and bigger pocket in the N-terminal hot-dog fold of HuRH extends the accepted enoyl-CoA ester chain length up to C26 and methyl-branched enoyl-CoA’s. The central helix α4 of eight amino acids and adjacent flexible loop in ChsH2N generates an even more flexible active site with a larger pocket. The enlarged pocket in ChsH1-ChsH2N makes it possible to accommodate steroid CoA thioesters as substrates, consistent with its function in cholesterol degradation in Mtb(12) (Figure 6c; Supplementary Table 1, Supporting Information).

Phylogenetic Relationships of ChsH1-ChsH2

Our characterization of the unusual ChsH1-ChsH2 architecture prompted us to assess whether the heteromeric assembly might also exist in other organisms, and if so, whether those organisms were known to metabolize cholesterol. The presence of a hot-dog motif is insufficient to identify MaoC-like hydratase candidates because it is present in other enzyme families.[20] However, sequences in the active site housing segment are highly conserved and, in combination with the hot-dog fold, clearly define MaoC-like hydratases (Supplementary Figure 5a, Supporting Information). By comparing the MaoC-like hydratase sequences from human, fungi, and bacteria, we identified a generalized motif [VIL]-[AVI]-[SA]-X-[AY]-[ILRA]-[AL]-[ST]-[RGEN]-D-[FYWR]-[NQEF]-[PDN]-[VLG]-H-[ILH]-[PRK]-[ANDE]-X-A, which is located in the active site housing segments (Figure 6a; Supplementary Figure 5a, Supporting Information). We used this motif to identify additional family members through individual BLAST queries with the protein sequences of ChsH1 and ChsH2 against the nonredundant protein sequence database. Then we examined their genomic environment to assess the prevalence of the heterotetrameric MaoC-like hydratase motif.[24] We found five different genomic contexts for ChsH1 and ChsH2 homologues (Figure 7a). Importantly, inclusion of genomic proximity in the search for homologues allowed us to identify three different motifs for encoding structural homologues of ChsH1 and ChsH2. All of these homologues are found in bacterial families known to catabolize steroids.[25]
Figure 7

Phylogenetic relationships of ChsH1-ChsH2. (a) Homologues of ChsH1-ChsH2 were identified in five different genomic contexts by BLAST. The arrow lengths are scaled to gene lengths. SHD (cyan), single hot-dog fold; DHD (cyan), double hot-dog; SHD+DUF35 (magenta), single hot-dog fold fused with DUF35/DUF35_N domain; ACAD (gray), acyl-CoA dehydrogenase. (b) Phylogenetic tree for five different genomic contexts of chsH1-chsH2 gene homologues. (1) Domain Opisthoknot; (2) Domain Bacteria; (3) Phylum Actinobacteria; (4) Phylum Proteobacteria; (5) Suborder Corynebacterineae; (6) Class Gammaproteobacteria. Representative organisms were selected for each category. The Roman numerals in parentheses after each organism correspond to the gene organization in panel a.

Phylogenetic relationships of ChsH1-ChsH2. (a) Homologues of ChsH1-ChsH2 were identified in five different genomic contexts by BLAST. The arrow lengths are scaled to gene lengths. SHD (cyan), single hot-dog fold; DHD (cyan), double hot-dog; SHD+DUF35 (magenta), single hot-dog fold fused with DUF35/DUF35_N domain; ACAD (gray), acyl-CoA dehydrogenase. (b) Phylogenetic tree for five different genomic contexts of chsH1-chsH2 gene homologues. (1) Domain Opisthoknot; (2) Domain Bacteria; (3) Phylum Actinobacteria; (4) Phylum Proteobacteria; (5) Suborder Corynebacterineae; (6) Class Gammaproteobacteria. Representative organisms were selected for each category. The Roman numerals in parentheses after each organism correspond to the gene organization in panel a. The two most common classes of homologues are single hot-dog fold and double hot-dog fold MaoC-like hydratases encoded without any neighboring MaoC-like enoyl-CoA hydratase genes. Typically, they form homodimers like AcRH or CtRH as described above. These are evolutionarily most distant from chsH1 and chsH2 and represent the previously identified MaoC-like hydratases. The third category contains two homologous genes in the same adjacent context as chsH1, a single hot-dog fold, and chsH2, a single hot-dog fold fused to a DUF35/DUF35_N domain, which is hypothesized to bind and deliver acyl-CoA moieties in acyl-CoA-utilization processes[26,27] (Supplementary Note). This arrangement is predominantly in Corynebacteria, which includes Mycobacteria (Figure 7b). In less closely related Actinobacteria, the chsH1-chsH2 motif exists as a single fused gene. The sequence alignments suggest that these proteins will form a homodimer that is structurally similar to the ChsH1-ChsH2 heterotetramer, that is, a Rosetta fusion protein.[28,29] Whether these homologues catalyze the hydration of a steroid enoyl-CoA or a structurally simpler steroid metabolite remains to be determined. The fifth category is found in Proteobacteria and is comprised of operons that encode a chsH2 homologue separated from a chsH1 homologue by an acyl-CoA dehydrogenase (ACAD/FadE) homologue. Interestingly, in Agrobacterium tumefaciens strain C58, these three proteins are encoded as a single Rosetta fusion protein suggesting that the enoyl-CoA hydratase uses the acyl-CoA dehydrogenase product as a substrate, as is the case for the igr-encoded enzymes (Figure 1a,b). However, the organization of the igr operon is different; the acyl-CoA dehydrogenase genes (chsE1 and chsE2) are adjacent to chsH2 (Figure 1a). In addition, we have found no evidence for a tight association between the ChsH1-ChsH2 enzyme and the ChsE1-ChsE2 enzyme in our work.[15] Within Mtb, 14 proteins are annotated as possessing hot-dogs folds and belong to the MaoC-like-hydratase family.[30−32] Analysis of their sequences suggests that they either assemble as single hot-dog folds or double hot-dog folds and form homodimers like AcRH and CtRH, respectively. We found no other Mtb homologues like ChsH1 and ChsH2, which would be predicted to form a dimer of heterodimers. This unique utilization of a heterotetrameric MaoC-like hydratase is in contrast to the presence of six cholesterol-regulated heterotetrameric acyl-CoA dehydrogenases in Mtb.[25] We conclude that in Mtb, this heterotetrameric enoyl-CoA hydratase is only employed in the final step of cholesterol side chain β-oxidation.

Conclusion

The enzymes encoded by the igr operon diverge structurally from the β-oxidation enzymes typically found in bacteria or eukaryotes. Oxidation of 3-OPC-CoA is catalyzed by a heterotetrameric acyl-CoA dehydrogenase (ChsE1-ChsE2) in place of the typical homotetrameric or homodimeric ACAD assembly. Hydration of the product 3-OPDC-CoA is catalyzed by an α2β2 heterotetrameric MaoC-like enoyl-CoA hydratase instead of the usual mitochondrial or bacterial crotonase family member or homodimeric peroxisomal multifunctional enzyme-2 (MFE-2) family member. The heterotetrameric architectures of these two Mtb enzymes appear to have evolved in order to accommodate the large steroid-CoA substrates. The structures of ChsH1-ChsH2 elucidated in this work provide insight into binding site cavities that differ substantially from its homologues and will allow discrimination between host and pathogen enzymes by inhibitors in the future.

Methods

Materials, Strains, Media, and General Methods

Ferricenium hexafluorophosphate was purchased from Sigma-Aldrich. Coenzyme A was purchased from MP Biomedicals. Isopropyl β-d-1-thiogalactopyranoside was from Denville Scientific. Tryptone, 4-(2-hydroxyethyl)-1-piperazineethanesulfonic acid (HEPES), and tris(hydroxymethyl)aminomethane (TRIS) were purchased from Fisher Scientific. Kanamycin is from IBI Scientific. Yeast extract was purchased from Research Products International Co. iProof DNA polymerase was from Bio-Rad. Restriction endonucleases, T4 DNA ligase, T4 polynucleotide kinase, factor Xa, and protein ladder were from New England Biolabs. HisTrap FF columns and Superdex 200 HiLoad 16/60 and 10/300 GL columns were from GE Healthcare Biosciences Corp. Oligonucleotides were from IDT Inc. Total genomic DNA of M. tuberculosis H37Rv was obtained from the TB Research Materials Facility at Colorado State University (NIAD NO1-AI40091). MALDI mass spectra were acquired on a Bruker Autoflex II TOF/TOF. Big Dye DNA sequencing (Applied Biosystems; performed by the Stony Brook University Sequencing Facility) was used to verify the coding sequence of the expression plasmids. BL21(DE3) E. coli was obtained from BioRad. The 2× YT media is composed of 16 g of tryptone, 10 g of yeast extract, and 5 g of NaCl per liter. Buffer A consists of 20 mM Tris-HCl buffer, pH 8.0, supplemented with 300 mM NaCl and 10 mM imidazole. Buffer B consists of 20 mM Tris-HCl buffer, pH 8.0, supplemented with 300 mM NaCl and 500 mM imidazole. Buffer C consists of 50 mM Tris-HCl buffer, pH 8.0, supplemented with 200 mM NaCl. The casI (Ro05822) gene sequence was synthesized by GenScript USA Inc. Primary crystallization screens were from Hampton research.

Expression Plasmid Construction

ChsH1 (Rv3541c) and ChsH2 (Rv3542c) were amplified from M. tuberculosis H37Rv total genomic DNA by PCR using forward and reverse primers. The PCR product was digested with the appropriate restriction endonuclease and ligated into similarly digested pET28b (Table 1). DNA sequencing of the plasmids confirmed that the sequence was correct and that no mutations were introduced during PCR. Active site mutants were prepared using Quick Change site-directed mutagenesis.[33] The mutations were confirmed by DNA sequencing. To produce the C-terminal truncation of ChsH2, a factor Xa cleavage site, followed by four glycine residues, was introduced at Arg187 of ChsH2. Insertion was conducted by PCR with primers 5′-ATC GAC GGG CGT GGT GGT GGT GGT CGT CCC TCG TCG TCG CGG GAC-3′ and 5′-CAT CAT AGC GTC GGG ATC CAA ATC GTC AGG-3′ to give construct pChsH1ChsH2.

Protein Expression and Purification

To express ChsH1 or ChsH2 individually, construct pChsH1 or pChsH2 was transformed into BL21(DE3) E. coli. Single colonies were selected on LB plates containing 30 μg/mL kanamycin and cultured in 2× YT media at 37 °C. Expression was induced at OD600 ≈ 0.6–0.8 by the addition of 50 μM to 1 mM IPTG, and cells were grown 20 h at 16–25 °C. Purified proteins were analyzed by reducing SDS-PAGE. CasI was expressed as described for ChsH1 or ChsH2 using construct pCasI and 100 μg/mL ampicillin in the media.

ChsH1-ChsH2 Complex Was Obtained Using Construct pigr3(12)

Expression was induced at OD600 ≈ 0.6–0.8 by the addition of 1 mM IPTG, and cells were grown 20 h at 25 °C. Cells were lysed by French press or sonication in buffer A, and cellular debris was removed by centrifugation at 125 000g for 1 h. ChsH1-ChsH2 was purified by IMAC, with HisTrap FF column and buffers A and B. Protein was further purified by size exclusion chromatography on a Superdex 200 HiLoad 16/60 column equilibrated with buffer C. Active site mutants were expressed and purified following the method for pigr3 using constructs pigr3H1 and pigr3H1. ChsH1-ChsH2N was expressed as described for pigr3 and isolated by IMAC. ChsH2 was cleaved by factorXa at 23 °C in buffer C supplemented with 2 mM CaCl2. Factor Xa protease reaction was monitored by SDS-PAGE, and cleaved ChsH1-ChsH2N was purified by size exclusion chromatography on a Superdex 200 HiLoad 16/60 column equilibrated with buffer C.

Solution-State Biophysical Analysis of ChsH1-ChsH2 and ChsH1-ChsH2N

ChsH1-ChsH2 (10 mg mL–1) and ChsH1-ChsH2N (7 mg mL–1) were analyzed by analytical gel-filtration on a Superdex 200 (10/300 GL) column (GE Healthcare). The column was equilibrated in buffer C. Samples were eluted isocratically in buffer C, monitoring at 220 and 280 nm. Several standard proteins were analyzed under the same conditions to generate standard curves to estimate molecular weights of analyzed proteins. Molecular weights were determined using analytical ultracentrifugation sedimentation equilibrium (Beckman Optima XL-A). ChsH1-ChsH2 (5.7, 2.8, and 1.4 μM) and ChsH1-ChsH2N (7.3, 3.7, and 1.8 μM) were centrifuged at speeds of 10000, 20000, and 30000 rpm at 20 °C. Scans were acquired after 18 and 20 h of centrifugation at each speed monitoring at 280 nm. The protein partial-specific volume of 0.7359 for ChsH1-ChsH2 and 0.7379 for ChsH1-ChsH2N and a solvent density 1.0079 for buffer C were calculated using SEDNTERP. Data were fit globally to the ideal, single species model using Heteroanalysis to determine the molecular weight. Protein complex stoichiometries of ChsH1-ChsH2 and ChsH1-ChsH2N were confirmed by LC/UV/MS. Samples were separated on a XBridge BEH 300 C4 3.5 μm column (2.1 mm × 100 mm) at 40 °C with a linear gradient from 95% A to 95% B over 15 min, where A is 5% isopropanol/0.1% trifluoroacetic acid and B is 99.9% isopropanol/0.1% trifluoroacetic acid. MS spectra were collected in ESI positive ion mode with a cone voltage of 40 V, a capillary voltage of 4.5 kV, and source temperature of 150 °C. MS spectra were deconvoluted using ESIprot 1.0,[34] and peaks in the UV 280 nm chromatograms were integrated using R. The integrated peak areas of each protein were divided by the corresponding molar extinction coefficient for the protein to yield the molar concentrations. Protein stoichiometries were determined from the ratio of the molar concentrations.

Synthesis of Enoyl-CoAs

trans-2-Decenoyl-CoA and trans-2-octenoyl-CoA were synthesized using the mixed anhydride method.[35] Briefly, the mixed anhydride was prepared by mixing the corresponding acid (0.2 mmol) with ethyl chloroformate (0.4 mmol) in 4 mL of dry THF in the presence of TEA (56 μL). After approximately 25 min, the acid was completely converted into the corresponding anhydride according to TLC. The mixed anhydride was filtered though glass wool in a disposable Pasteur pipet into newly prepared lithium CoA solution (55 mg) dissolved in 5 mL of H2O and THF (3:2 v/v) (pH, 8). The reaction was stirred at rt for 2 days. The pH of the reaction mixture was adjusted to 3, and the unreacted acid was removed by extraction with ether. The enoyl-CoAs were purified by HPLC using 20 mM ammonium bicarbonate, pH 8, with a linear gradient from 0 to 90% methanol in 20 mM ammonium bicarbonate.

3-Oxo-4-pregnadiene-20-carboxylic acid

3-Oxo-4-pregnadiene-20-carboxylic acid was synthesized following a previously reported method starting from stigmasta-4,22-diene-3-one.[12,36]

3-Oxo-4-pregnene-20-carboxyl CoA, 3-OPC-CoA

3-Oxo-4-pregnene-20-carboxyl CoA was synthesized following the previously reported CoA synthetase assay. 3-Oxo-4-pregnene-20-carboxylic acid (3-OPC) was dissolved to 50 mM in 94% ethanol containing 60 mM NaOH. The thioesterification was performed for 4 h at 22 °C in a 1 mL reaction volume containing 5 mM 3-OPC, 10 μM CasI (Ro05822), 100 mM HEPES (pH 7.4), 5 mM MgCl2, 2.5 mM ATP, and 1 mM CoASH.[37,38] The reaction was quenched by adding 1 mL of MeOH to precipitate the enzyme. The supernatant was isolated by centrifugation and filtration. 3-Oxo-4-pregnadiene-20-carboxyl CoA was purified by HPLC using 100 mM ammonium acetate, pH 4.5, with a linear gradient from 0 to 90% MeOH in 100 mM ammonium acetate to yield 3-OPC-CoA.

3-Oxo-pregna-4,17-diene-20-carboxyl-CoA, 3-OPDC-CoA

3-Oxo-4-pregnene-20-carboxyl CoA was redissolved to 2.8 mM in H2O. The oxidation was performed for 3 h at 20 °C in 1 mL of 0.5 mM 3-OPC-CoA, 3 μM ChsE1-ChsE2 (FadE28-FadE29), 100 mM N-tris[hydroxymethyl]methyl-3-aminopropanesulfonic acid (TAPS) (pH 8.5), and 1 mM ferricenium hexafluorophosphate.[15] The product was purified by HPLC in 100 mM ammonium acetate, pH 4.5, with a linear gradient of 90% MeOH. MeOH was removed under reduced pressure, and H2O was removed by lyophilization to yield 3-OPDC-CoA.

Hydratase Assay

ChsH1-ChsH2, ChsH1-ChsH2N, ChsH1D29A-ChsH2, and ChsH1H34A-ChsH2 were assayed for hydratase activity with substrates octenoyl-CoA, decenoyl-CoA, and 3-oxo-pregna-4,17-diene-20-carboxyl CoA in 100 mM HEPES, pH 7.4, buffer and 100 nM (50 nM for 3-oxo-pregna-4,17-diene-20-carboxyl CoA) enzyme at 25 °C. Reactions were monitored at 263 nm, and initial velocities were determined. Product formation was quantified using ε263nm = 6700 M–1 cm–1, which corresponds to the α,β-unsaturation of the enoyl-CoA substrates. Hydration product formation was confirmed by analysis of the assay mixture by MALDI-TOF mass spectrometry.

Crystallization

The ChsH1-ChsH2N apoenzyme crystals were obtained by hanging drop vapor diffusion at rt. Briefly, 1 μL of 15 mg mL–1 protein was mixed 1:1 with a reservoir solution of 20%–24% PEG 3350, 20 mM CaCl2, 20 mM CdCl2, and 20 mM CoCl2 at pH 7 or pH 6.5 and equilibrated against 500 μL of the reservoir solution. The crystals were then harvested and transferred to a cryoprotectant solution containing the mother liquor and 17% glycerol. For co-crystallization, ChsH1-ChsH2N was mixed with 3-OPC-CoA. Crystals were obtained at 4 °C by hanging drop vapor diffusion, in mother liquor containing 1 mg mL–1 3-OPC-CoA, 20 mM CaCl2, 20 mM CdCl2, 20 mM CoCl2, and 200 mM NaCl at pH 7.0 and 25% PEG 3350. The crystals were harvested and transferred to a cryoprotectant solution containing the mother liquor supplemented with 1 mg mL–1 3-OPC-CoA and 17% ethylene glycol. All crystals were flash cooled in liquid N2 before data collection.

X-ray Data Collection and Structure Determination

Diffraction data for ChsH1-ChsH2N were collected on beamline X25 of the National Synchrotron Light Source at Brookhaven National Laboratory (Upton, NY) at a wavelength of 1.7 Å, and for ChsH1-ChsH2N/3-OPC-CoA, data were collected on beamline 23ID-D of the Advanced Photon Source at Argonne National Laboratory (Argonne, IL) at a wavelength of 1.075 Å. All data sets were processed using XDS[39] and Aimless[40] as implemented in autoPROC.[41] Four cadmium sites in ChsH1-ChsH2N were located with SHELX,[42] phases were calculated to 1.54 Å via single-wavelength anomalous dispersion[43] using SHARP,[44] and automated modeling building with ARP/wARP[45] produced a starting model consisting of 85% of ChsH1 and 90% of ChsH2N. For the ChsH1-ChsH2N/3-OPC-CoA structure, phases were calculated via molecular replacement with Phaser, using ChsH1-ChsH2N as a search model. Six cadmium sites were identified in the ChsH1-ChsH2N/3OPC-CoA structure; four of them are located in the tetramer–tetramer packing interface, and two of them coordinate with Asp29/His34 in ChsH2N, blocking the active site (Supplementary Figure 5b and Note, Supporting Information). For both structures, manual model building was carried out in Coot[46] followed by refinement with Refmac[47] and Phenix.[48] Final model quality was assessed using MolProbity.[49] Data collection and refinement statistics are shown in Table 2.
  45 in total

1.  Detecting protein function and protein-protein interactions from genome sequences.

Authors:  E M Marcotte; M Pellegrini; H L Ng; D W Rice; T O Yeates; D Eisenberg
Journal:  Science       Date:  1999-07-30       Impact factor: 47.728

2.  Pathway profiling in Mycobacterium tuberculosis: elucidation of cholesterol-derived catabolite and enzymes that catalyze its metabolism.

Authors:  Suzanne T Thomas; Brian C VanderVen; David R Sherman; David G Russell; Nicole S Sampson
Journal:  J Biol Chem       Date:  2011-11-01       Impact factor: 5.157

3.  Crystal structure of the (R)-specific enoyl-CoA hydratase from Aeromonas caviae involved in polyhydroxyalkanoate biosynthesis.

Authors:  Tamao Hisano; Takeharu Tsuge; Toshiaki Fukui; Tadahisa Iwata; Kunio Miki; Yoshiharu Doi
Journal:  J Biol Chem       Date:  2002-10-29       Impact factor: 5.157

4.  Structure and function of Rv0130, a conserved hypothetical protein from Mycobacterium tuberculosis.

Authors:  Patrik Johansson; Alina Castell; T Alwyn Jones; Kristina Bäckbro
Journal:  Protein Sci       Date:  2006-09-08       Impact factor: 6.725

5.  Persistence of Mycobacterium tuberculosis in macrophages and mice requires the glyoxylate shunt enzyme isocitrate lyase.

Authors:  J D McKinney; K Höner zu Bentrup; E J Muñoz-Elías; A Miczak; B Chen; W T Chan; D Swenson; J C Sacchettini; W R Jacobs; D G Russell
Journal:  Nature       Date:  2000-08-17       Impact factor: 49.962

6.  Spiropentaneacetic acid as a specific inhibitor of medium-chain acyl-CoA dehydrogenase.

Authors:  K Y Tserng; S J Jin; C L Hoppel
Journal:  Biochemistry       Date:  1991-11-05       Impact factor: 3.162

7.  Mycobacterium tuberculosis utilizes a unique heterotetrameric structure for dehydrogenation of the cholesterol side chain.

Authors:  Suzanne T Thomas; Nicole S Sampson
Journal:  Biochemistry       Date:  2013-04-18       Impact factor: 3.162

8.  Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence.

Authors:  S T Cole; R Brosch; J Parkhill; T Garnier; C Churcher; D Harris; S V Gordon; K Eiglmeier; S Gas; C E Barry; F Tekaia; K Badcock; D Basham; D Brown; T Chillingworth; R Connor; R Davies; K Devlin; T Feltwell; S Gentles; N Hamlin; S Holroyd; T Hornsby; K Jagels; A Krogh; J McLean; S Moule; L Murphy; K Oliver; J Osborne; M A Quail; M A Rajandream; J Rogers; S Rutter; K Seeger; J Skelton; R Squares; S Squares; J E Sulston; K Taylor; S Whitehead; B G Barrell
Journal:  Nature       Date:  1998-06-11       Impact factor: 49.962

9.  How good are my data and what is the resolution?

Authors:  Philip R Evans; Garib N Murshudov
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2013-06-13

10.  The Hotdog fold: wrapping up a superfamily of thioesterases and dehydratases.

Authors:  Shane C Dillon; Alex Bateman
Journal:  BMC Bioinformatics       Date:  2004-08-12       Impact factor: 3.169

View more
  17 in total

Review 1.  The Minimal Unit of Infection: Mycobacterium tuberculosis in the Macrophage.

Authors:  Brian C VanderVen; Lu Huang; Kyle H Rohde; David G Russell
Journal:  Microbiol Spectr       Date:  2016-12

2.  The steroid side-chain-cleaving aldolase Ltp2-ChsH2DUF35 is a thiolase superfamily member with a radically repurposed active site.

Authors:  Rebecca Aggett; Evan Mallette; Stephanie E Gilbert; Melody A Vachon; Kurt L Schroeter; Matthew S Kimber; Stephen Y K Seah
Journal:  J Biol Chem       Date:  2019-06-16       Impact factor: 5.157

3.  Deletion of a dehydratase important for intracellular growth and cording renders rough Mycobacterium abscessus avirulent.

Authors:  Iman Halloum; Séverine Carrère-Kremer; Mickael Blaise; Albertus Viljoen; Audrey Bernut; Vincent Le Moigne; Catherine Vilchèze; Yann Guérardel; Georges Lutfalla; Jean-Louis Herrmann; William R Jacobs; Laurent Kremer
Journal:  Proc Natl Acad Sci U S A       Date:  2016-07-06       Impact factor: 11.205

4.  Comparative Analysis of Bile-Salt Degradation in Sphingobium sp. Strain Chol11 and Pseudomonas stutzeri Strain Chol1 Reveals Functional Diversity of Proteobacterial Steroid Degradation Enzymes and Suggests a Novel Pathway for Side Chain Degradation.

Authors:  Franziska Maria Feller; Phil Richtsmeier; Maximilian Wege; Bodo Philipp
Journal:  Appl Environ Microbiol       Date:  2021-09-01       Impact factor: 4.792

5.  Mycobacterium tuberculosis Exploits a Heterohexameric Enoyl-CoA Hydratase Retro-Aldolase Complex for Cholesterol Catabolism.

Authors:  Tianao Yuan; Meng Yang; Kalle Gehring; Nicole S Sampson
Journal:  Biochemistry       Date:  2019-10-07       Impact factor: 3.162

6.  Characterization of an Aldolase Involved in Cholesterol Side Chain Degradation in Mycobacterium tuberculosis.

Authors:  Stephanie Gilbert; LaChae Hood; Stephen Y K Seah
Journal:  J Bacteriol       Date:  2017-12-20       Impact factor: 3.490

7.  Genome-wide bioinformatics analysis of steroid metabolism-associated genes in Nocardioides simplex VKM Ac-2033D.

Authors:  Victoria Y Shtratnikova; Mikhail I Schelkunov; Victoria V Fokina; Yury A Pekov; Tanya Ivashina; Marina V Donova
Journal:  Curr Genet       Date:  2016-02-01       Impact factor: 3.886

Review 8.  Cholesterol and fatty acids grease the wheels of Mycobacterium tuberculosis pathogenesis.

Authors:  Kaley M Wilburn; Rachael A Fieweger; Brian C VanderVen
Journal:  Pathog Dis       Date:  2018-03-01       Impact factor: 3.166

9.  Post-translational Succinylation of Mycobacterium tuberculosis Enoyl-CoA Hydratase EchA19 Slows Catalytic Hydration of Cholesterol Catabolite 3-Oxo-chol-4,22-diene-24-oyl-CoA.

Authors:  Amber C Bonds; Tianao Yuan; Joshua M Werman; Jungwon Jang; Rui Lu; Natasha M Nesbitt; Miguel Garcia-Diaz; Nicole S Sampson
Journal:  ACS Infect Dis       Date:  2020-07-27       Impact factor: 5.084

10.  Proteome, Bioinformatic, and Functional Analyses Reveal a Distinct and Conserved Metabolic Pathway for Bile Salt Degradation in the Sphingomonadaceae.

Authors:  Franziska M Feller; Lars Wöhlbrand; Johannes Holert; Vanessa Schnaars; Lea Elsner; William W Mohn; Ralf Rabus; Bodo Philipp
Journal:  Appl Environ Microbiol       Date:  2021-09-10       Impact factor: 4.792

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.