Literature DB >> 35811857

A Structural and Bioinformatics Investigation of a Fungal Squalene Synthase and Comparisons with Other Membrane Proteins.

Satish R Malwal¹, Na Shang², Weidong Liu², Xian Li³, Lilan Zhang³, Chun-Chi Chen³, Rey-Ting Guo³, Eric Oldfield¹.

Abstract

There is interest in the development of drugs to treat fungal infections due to the increasing threat of drug resistance, and here, we report the first crystallographic structure of the catalytic domain of a fungal squalene synthase (SQS), Aspergillus flavus SQS (AfSQS), a potential drug target, together with a bioinformatics study of fungal, human, and protozoal SQSs. Our X-ray results show strong structural similarities between the catalytic domains in these proteins, but, remarkably, using bioinformatics, we find that there is also a large, highly polar helix in the fungal proteins that connects the catalytic and membrane-anchoring transmembrane domains. This polar helix is absent in squalene synthases from all other lifeforms. We show that the transmembrane domain in AfSQS and in other SQSs, stannin, and steryl sulfatase, have very similar properties (% polar residues, hydrophobicity, and hydrophobic moment) to those found in the "penultimate" C-terminal helical domain in squalene epoxidase, while the final C-terminal domain in squalene epoxidase is more polar and may be monotopic. We also propose structural models for full-length AfSQS based on the bioinformatics results as well as a deep learning program that indicate that the C-terminus region may also be membrane surface-associated. Taken together, our results are of general interest given the unique nature of the polar helical domain in fungi that may be involved in protein-protein interactions as well as being a future target for antifungals.

Entities: Chemical

Year: 2022 PMID： 35811857 PMCID： PMC9260892 DOI： 10.1021/acsomega.2c01924

Source DB: PubMed Journal: ACS Omega ISSN： 2470-1343

Introduction

There is a need for novel antifungal drugs with novel mechanisms of action to combat antibiotic resistance. The sterol biosynthesis pathway is of interest in this context since the end-product is ergosterol (1; Figure ), not the cholesterol (2) found in humans, and many current antifungals target ergosterol biosynthesis. The first step in sterol (both ergosterol and cholesterol) biosynthesis is the condensation of two molecules of farnesyl diphosphate (FPP, 3) by squalene synthase (SQS) to form presqualene diphosphate (4), followed by a rearrangement and NADH/NADPH-mediated reductive step to form squalene (5), which is then converted to squalene 2,3-epoxide (6) by squalene epoxidase followed by cyclization to lanosterol (7), catalyzed by oxidosqualene cyclase.

Figure 1

Structures of compounds involved in sterol biosynthesis. Squalene synthase (SQS) catalyzes the conversion of 3 to 5 via 4.

Structures of compounds involved in sterol biosynthesis. Squalene synthase (SQS) catalyzes the conversion of 3 to 5 via 4. After numerous additional steps, cholesterol (or ergosterol) is formed. To be useful as an antifungal drug, it would be desirable that the target be found only in the pathogen of interest—although this is not mandatory since, for example, squalene epoxidase is used in both cholesterol and ergosterol biosynthesis and is the target for antifungals such as terbinafin,[1] and a cytochrome P450 14α-demethylase is the target for the azole antifungals such as clotrimazole and posaconazole, the latter also being of interest as an antiprotozoal drug against Trypanosoma cruzi, the causative agent of Chagas disease. Squalene synthase has been the target of several drug discovery programs aimed at the development of cholesterol-lowering drugs,[2] though these were not commercialized. Since in previous work, it has been shown that there are functional differences between fungal and animal SQSs, there is interest learning more about the structure of fungal SQSs. For example, in a very early work, Kribii et al.[3] investigated SQSs from the plant, Arabidopsis thaliana, finding that there was a sequence of ∼30 amino acids present in the C-terminal region of two yeasts: Saccharomyces cerevisiae and Schizosaccharomyces pombe, which were absent in plant, rat, mouse, and human SQSs and that “squalene-synthesized by Arabidopsis SQS1 in the yeast endoplasmic reticulum could not be used as a substrate for the following enzymes of the pathway”.[3] However, they found that a chimeric derivative of A. thaliana SQS1 containing a yeast C-terminus did produce squalene 2,3-epoxide and lanosterol (from FPP) in microsomal fractions from yeast transformed with the chimeric enzyme, leading to the suggestion that there might be an important interaction between SQS and squalene epoxidase, consistent with earlier work[4] in which it was shown that, in yeast microsomes, exogenous squalene was a poor substrate for a yeast squalene epoxidase (SQLE)—in contrast to endogenous squalene from farnesyl diphosphate—the suggestion being that there is an SQS/SQLE complex formed in microsomes. More recently, Linscott et al.[5] carried out an extensive series of studies of a fungal SQS using a wide variety of chimeric enzymes and proposed that there might be a “hinge domain” in the C-terminus of fungal SQSs that might be of importance in interacting with other proteins involved in ergosterol biosynthesis. Here, we report the first crystallographic structure of the catalytic domain of a fungal SQS, Aspergillus flavus SQS (AfSQS), together with results of bioinformatics investigations of the non-catalytic C-terminal domains in AfSQS as well as other SQSs and some comparisons with the membrane-binding domains in three other proteins: squalene epoxidase, stannin, and a steryl sulfatase.

Results and Discussion

Crystallographic Structure of the A. flavus SQS Catalytic Domain in Apo- and FSPP-Bound Forms

The structure of the catalytic domain of AfSQS with (PDB ID code 7WGH, cyan) or without (PDB ID code 7WGI, green) the FPP analog FSPP (S-thiolo-farnesyl diphosphate) is shown in Figure a. The structures are very similar with a Cα root mean squared deviation (rmsd) over ∼330 residues of ∼0.2 Å since the ligand-bound structure was obtained via soaking. Full data acquisition and structure refinement details are given in Table .

Figure 2

Table 1

Data Collection and Refinement Statistics for AfSQS Crystalsa

	AfSQS	AfSQS + FSPP
data collection
space group	F 2 3	F 2 3
unit-cell
a, b, c [Å]	206.46, 206.46, 206.46	208.34, 208.34, 208.34
α/β/γ (°)	90/90/90	90/90/90
resolution (Å)	25.00–2.50 (2.59–2.50)	25.00–2.36 (2.44–2.36)
unique reflections	25,222 (2508)	30,779 (3031)
redundancy	6.9 (6.9)	7.5 (7.5)
completeness (%)	100 (100)	100 (100)
average I/σ(I)	31.1 (2.07)	38.5 (2.9)
CC 1/2	0.922 (0.626)	0.946 (0.765)
refinement
R_work (95% data)	0.203 (0.307)	0.193 (0.257)
R_free (5% data)	0.223 (0.325)	0.223 (0.295)
rmsd bonds (Å)	0.004	0.008
rmsd angles (°)	0.66	0.98
dihedral angles
most favored (%)	98.56	99.14
allowed (%)	1.44	0.86
disallowed (%)	0.00	0.00
no. of non-H atoms/average B [Å2]
protein	2864/63.76	2871/54.86
water	76/56.41	104/56.15
ligand	14/85.99	52/79.86
PDB ID code	7WGI	7WGH

Values in parentheses are for the outer-most resolution shells.

Structures of AfSQS and comparisons with other proteins. (a) Superposition of the catalytic domain of AfSQS with (PDB ID code 7WGH, cyan) or without (PDB ID code 7WGI, green) the FPP analog, FSPP (the two ligands are shown in stick form). (b) Surface view of the two FSPP ligands in AfSQS showing the open pocket structure. Diphosphate groups in the two FSPP ligands are shown as red spheres. (c, d) Two views of the superposition of the AfSQS/2FSPP structure (green; PDB ID code 7WGH) with that of TcSQS/FSPP/2Mg2+ (cyan; PDB ID code 3WCA; Chain B). Mg2+ is shown as magenta spheres. (e, f) Two views of the superposition of AfSQS/2FSPP (PDB ID code 7WGH, cyan) with S. aureus CrtM (PDB ID code 3W7F, green) structures. The A and B helices are only seen in SQS. The arrows indicate the last residue in the catalytic domain, F384, adjacent to a proline. Values in parentheses are for the outer-most resolution shells. The SQS mechanism of action involves two reactions. The first involves FPP ionizing in the so-called S1 site to form a primary carbocation, which then moves down to react with the C2,3 double bond in the FPP in the S2 site to form (after H+ abstraction) PSPP, which is then reduced by NAD(P)H.[6] Since FPP is expected to be reactive, we used the less reactive S-thiolo analog FSPP for crystallization and the structure of the FSPP ligand-bound protein is shown as a surface view (colored yellow) in Figure b with the ligands shown as red/cyan spheres. The structure is an “open” one in which two FSPP ligands (seen also in Figure a) are bound, one binding to the S1 site and the other to the S2 site. Both structures are very similar to the structures of human SQS (HsSQS) as well as T. cruzi SQS (TcSQS) with a ∼1.2 Å Cα rmsd over ∼330 residues (as determined by using the PDBefold/SSM server, http://www.ebi.ac.uk/msd-srv/ssm). These structures are the first of the catalytic domain of any fungal SQS, and we show in Figure c,d superpositions of the AfSQS/2FSPP structure (PDB ID code 7WGH) with that of TcSQS/FSPP/2Mg2+ (PDB ID code 3WCA)[7] showing the strong similarity. A comparison with human SQS (PDB ID code 1EZF, cyan)[8] is shown in Figure a. In each structure, the polar headgroups bind close to the two highly conserved (Figure S1) DxxED domains involved in Mg2+ binding and catalysis.

Figure 3

AfSQS and HsSQS structures. (a) AfSQS (PDB ID code 7WGH, green) and HsSQS (PDB ID code 1EZF, cyan). (b) HsSQS (PDB ID code 3VJA) displayed using a B-factor putty view. The red helix X is generally not seen in other SQS structures. (c) Superposition of the AfSQS/FSPP structure (green; PDB ID code 7WGH) with that of the HsSQS/2FSPP/3Mg2+ structure (cyan; PDB ID code 3WEG) showing similar ligand and catalytic Asp/Glu residue conformations. The magenta spheres are the three Mg2+ seen in the HsSQS structure. S1 = allylic binding site; S2 = homoallylic binding site. The structures in the ligand-binding regions are similar to those found in other head-to-head prenylsynthases such as dehydrosqualene synthase from Staphylococcus aureus (SaCrtM)[9] and from Enterococcus hirae (PDB ID code 5IYS) and are also similar to other bacterial proteins such as Bacillus subtilis Yisp (PDB ID code 3WE9)[10] and HpnC (15-hydroxysqualene synthase; PDB ID code 4HD1) from Alicyclobacillus acidocaldarius. A superposition of the AfSQS (green) and SaCrtM (PDB ID code 3W7F,[11] cyan) structures is shown in Figure e,f in which it can be seen that there is a large domain into which ligands bind in both the bacterial and fungal structures as well as three additional helices in AfSQS, here called X, A, and B, which are more distant from the ligands. The “X” helical region is generally not observable in most X-ray structures of the human protein, HsSQS, although it is present in at least one structure as a helix with a large B-factor (Figure b) (PDB ID code 3VJA).[12] In our AfSQS/FSPP structure, we did not observe any Mg2+, and in the TcSQS structure, there are four chains but none have the three Mg2+ and two FSPP ligands seen in the human SQS structure (PDB ID code 3WEG(6)), with three Mg2+ and two (FPP) ligands being required for product formation. What is interesting to note here then is that—even in the absence of Mg2+—there is a strong similarity in the ligand conformations as well as the conformations of the active site Asp and Glu residues, as shown in Figure c, making it less likely that selective inhibitors that bind to this site in AfSQS will be found. The A and B helices are present in AfSQS, TcSQS, and HsSQS, but ∼40% of the C-terminal region that is seen in the B-helix in AfSQS and TcSQS is absent in HsSQS, as shown in Figure a, due to the truncated species used for crystallization. In the AfSQS and TcSQS proteins, there are ∼4 additional residues present in the C-terminus region, as shown for example in Figure a. Helices A and B are essential for catalytic activity, at least in TcSQS and HsSQS. More specifically, a 31-370 residue HsSQS (full-length 416 residues) had 200% the activity of a rat 1-416 (full length) clone, while 31-319 and 31-335 truncated HsSQSs (corresponding to loss of X, A, and B helices) had <5% the activity of the full in length rat protein.[13] Likewise, in TcSQS, removal of half of the C-terminal helix (of the catalytic domain) did not result in any active protein, while a truncation at the end of the B-helix did result in active protein.[14] It thus appears that both the A and B helices in AfSQS, TcSQS, and HsSQS are essential for catalytic activity and that these helices may play a role in stabilizing the protein as for example the catalytically inactive β-domain in αβ prenylcyclases is thought to stabilize the structures of these proteins.[15] The 26-residue “hinge” domain investigated earlier by Linscott et al.[5] is now seen to be composed of the loop connecting the A and B helices, together with the B-helix, and is found in both the fungal and protozoal SQSs, so it is not kingdom-specific, though it is essential for catalytic activity. The role of the X-helix in all three proteins is unclear, but given its disorder in most structures, one possibility is that it is involved in closing the open form of the protein (in which Mg2+ and ligands are visible in the X-ray structures, e.g., Figure b) to the catalytically competent, closed form (not yet observed crystallographically in any such prenylsynthase). That is, the X-helix might adopt multiple conformations in some systems—though is clearly seen in AfSQS as well as in TcSQS.

A Comparison between Fungal, Human and Protozoal SQS Structures

The question then arises as to what might be the structure and function of the additional domains in the C-terminus of the fungal, human, and protozoal SQSs since, based on the truncation work, they are not required for catalytic activity. Clearly, based on the earlier work, it appears that there is a transmembrane domain in all SQSs that is thought to anchor SQS to the endoplasmic reticulum, with membrane insertion being a highly complex process involving chaperones (binding to these “TM domains”) as well as the endoplasmic reticulum membrane protein complex (EMC).[16] As noted previously, in early work, Kribii et al. reported the presence of a sequence of ∼30 amino acids in two yeast SQSs that was absent in plant and animal SQSs. This domain is also absent in protozoal SQSs, e.g., in TcSQS, and in fungi it could be of importance in protein–protein interactions. To try and determine what the structure of the C-terminal region in the fungal SQSs might be, we first used the JPRED4 server (http://www.compbio.dundee.ac.uk/jpred4)[17] to determine the helix content and whether residues were buried (i.e., not solvent-exposed) in full-length AfSQS and then compared these results with the crystallographic structure of the catalytic domain. Figure shows helix-prediction results for the C-terminal regions in AfSQS (the top two panels; Figure a,b), as well as for the C-terminal regions in Candida albicans SQS (CaSQS; Figure c), HsSQS (Figure d), and TcSQS (Figure e). Note that, in Figure a, residues that are shown in the bold black font (the top line) correspond to residues seen in the X-ray structures reported here. A Clustal Omega[18] sequence alignment of AfSQS, CaSQS, HsSQS, and TcSQS is shown in the Supporting Information, Figure S1, in which the highly conserved DxxED domains in the first and second aspartate-rich domains in the catalytic site (Figure c) are shown in boxes. The boxed P residues in Figure S1 are either immediately adjacent to or are one residue removed from the last residue found in the crystal structures: Q368 in TcSQS (PDB 3WCA); N370 in HsSQS (PDB 1EZF); F384 in AfSQS (PDB 7WGH) and define the end of the B-helix. For the proteins whose structures are known, we found remarkably good accord between the helix predictions (shown as red “Hs” in Figure ) and those seen in the X-ray structures, the loop positions typically being predicted with a 1–2 residue error, giving confidence in use of the JPRED4 program to detect helical (and non-helical) regions.

Figure 4

JPRED4 results for the C-terminal regions of four SQSs. The codes are H = helix, B = buried, bold font = residues in the catalytic domain, cyan = polar residues, and yellow = non-polar residues. The numbers indicate the reliability of the prediction (9 is best). (a) AfSQS (primarily the C-terminal end of the catalytic domain). (b) AfSQS (polar and TM helical regions). (c) C. albicans SQS (C-terminal end of the catalytic domain and polar and TM helical regions). (d) Human SQS (C-terminal end of the catalytic domain and TM helical regions). (e) T. cruzi SQS (C-terminal end of the catalytic domain and TM helical regions). The abbreviations are Jnet = final secondary structure prediction; jhmm = Jnet hmm profile prediction; jpssm = Jnet PSIBLAST pssm profile prediction; Jnet_25 = Jnet prediction of burial, less than 25% solvent accessibility; Jnet_5 = Jnet prediction of burial, less than 5% exposure; Jnet_0 = Jnet prediction of burial, 0% exposure; and Jnet Rel = Jnet reliability of prediction accuracy, ranges from 0 to 9, bigger is better. Next, we observed that the C-terminus in the AfSQS crystal structure (F384; the end of the B-helix) is connected via a proline-containing ∼4 residue (non-helical) loop to a ∼28 residue, polar, solvent-exposed (i.e., not buried, shown by a “B” in Figure a) primarily helical region, called here the C-helical domain, to a TM domain, called here the D-helix. These P are found in all four proteins (Figure S1) and are the last conserved residues before the highly variable C, D, and E-domains. To give a perhaps better graphical illustration of the polar (hydrophilic) versus the hydrophobic domains, we show in Figure charged residues (D, E, K, R, and H) highlighted in cyan while hydrophobic residues (A, V, L, J, F, and Y) are highlighted in yellow. As can also be seen, e.g., in Figure b, there is a highly hydrophobic helical region (418–445) in AfSQS that is essentially completely buried (B = buried). In addition, there is a small extra-membrane helix E, attached to a solvent-exposed non-helical region. In another fungal SQS, from C. albicans, the overall sequence is shorter than that in AfSQS since the E-helix is absent (Figure c). In HsSQS (Figure d), there is again good accord with the helix (and loop) predictions in the catalytic domain, and interestingly, the last residue in the B-helix (N370) seen in the X-ray structure (the last one in the truncated protein) also corresponds to the end of the helix predicted computationally and is followed by a ∼14 residue non-helical, non-buried segment, then the buried, helical TM region. Thus, as expected, HsSQS lacks the long, polar, solvent-exposed residues and it appears that the catalytic domain is directly connected to the TM domain (helix D) via the linker region (Figure d). Results with TcSQS are generally similar, with a linker connecting the catalytic domain to a TM helix (Figure e). To help confirm the conclusion drawn above—that there is a large, polar, primarily helical domain connecting the catalytic and TM regions in AfSQS—we next used the HeliQuest program (http://heliquest.ipmc.cnrs.fr)[21] that computes hydrophobicity ⟨H⟩ and hydrophobic moment ⟨μH⟩ values[22] for helical sequences, in addition to providing useful graphical representations of polar/non-polar residues, on helical wheels. A large positive ⟨H⟩ (e.g., >1) means a very hydrophobic helix, and a negative value means a hydrophilic helix. A large ⟨μH⟩ means that the helix is amphiphilic perpendicular to its axis. The sequences that are of most interest here (since their structures have not been determined) are the polar helices seen in the fungi and the TM domains. We show some typical HeliQuest results in Figure .

Figure 5

HeliQuest results for % polar residues, hydrophobicity ⟨H⟩, hydrophobic moment ⟨μH⟩, and charge in SQSs. (a) Polar helix C in AfSQS. (b) TM helix D in AfSQS. (c) TM helix D in HsSQS. (d) TM helix D in TcSQS. For the proposed exposed, hydrophilic C-helical domain in AfSQS, as can be seen in Figure a, ∼78% of the polar residues (and Gly) map to an 18-residue α-helical wheel with a hydrophobicity ⟨H⟩ of −0.361 and a hydrophobic moment ⟨μH⟩ of 0.085. A very polar or hydrophilic helix. In the TM domain, the region (of 18 amino acids) with the least (11%) polar residues, the hydrophobicity ⟨H⟩ is 1.301 and the hydrophobic moment ⟨μH⟩ is 0.226 (Figure b), consistent with a TM domain. Similar results are obtained with CaSQS (not shown), with 61% polar residues in the polar helix C, a ⟨H⟩ of 0.213 and a hydrophobic moment ⟨μH⟩ of 0.226, and in the TM domain D, 17% polar residues, a ⟨H⟩ of 1.228 and a hydrophobic moment ⟨μH⟩ of 0.241 (for the last 18 residues in the protein; there is no E-domain). In the TM domain C in HsSQS (Figure c), there are 17% polar residues, a ⟨H⟩ of 1.097 and a hydrophobic moment ⟨μH⟩ of 0.111, while in the TM domain in TcSQS (Figure d), there are 33% polar residues, a ⟨H⟩ of 0.656 and a hydrophobic moment ⟨μH⟩ of 0.229. While there is a little variability in the values above, depending on where the sequences of the 18 residues that are scanned begin and there are typically 23 residues in a TM helix, the TM domain in the trypanosome SQS is clearly more polar than found in the fungal and human SQSs and may bind in a somewhat different membrane-associated manner. The same may be said of the N-terminal residues in SQS whose removal facilitates expression of soluble, active (and crystallizable) AfSQS, HsSQS, and TcSQS, but we found no evidence for N-terminal canonical TM helical domains in these proteins. Thus, our results indicate that SQSs are so-called “single-pass” membrane proteins. Based on the results discussed above, it thus appears that the fungal protein AfSQS and the protozoal protein TcSQS have very similar catalytic domain structures with well-defined X, A, and B helices that (at least in TcSQS) are essential for activity. Notably, the A and B helices are not present in the homologous bacterial protein, CrtM, which is not a membrane protein. In AfSQS, there is, therefore, a large, hydrophilic helical domain, the C-domain, which is not found in any non-fungal SQSs. Based on the X-ray and bioinformatics results, we thus propose that the fungal SQSs are organized as illustrated in Figure a. What is attractive about the model is that the presence of the polar, non-buried (solvent-exposed) C-domain would appear to offer strong possibilities for SQS interacting with not only the membrane surface (acting as an in-plane membrane anchor) but also with other proteins involved in sterol biosynthesis, via electrostatic interactions.

Figure 6

Proposed structural arrangement of the polar and non-polar (trans-membrane) helices in AfSQS based on JPRED4, HeliQuest, and RoseTTAFold results. (a) AfSQS model based on JPRED4 and HeliQuest results. (b) AfSQS model based on RoseTTAFold prediction, illustrated as an electrostatic charge surface computed using the PyMOL program. Blue = positively charged region, red = negatively charged region, and white = neutral (hydrophobic residues) region. Four other models are shown in Figure S2. (c) Error estimates in Ångström units for the model in panel (b) computed in the RoseTTAFold program. We next attempted to see to what extent it might be possible to predict the 3D structure of AfSQS. We first used the Phyre2 program.[23] There was an excellent prediction of the catalytic domain but no prediction of the other regions. We thus next used the deep learning-based modeling method, RoseTTAFold.[24] There were five models predicted, and these are shown in Figure S2 together with their error estimates. We show the predicted structures as electrostatic potential surfaces (calculated in PyMOL[25]) in which red represents a negatively charged region, blue is a positively charged region, and white is a neutral (hydrophobic) region. The large catalytic domain was well predicted (1.08 Å/314 Cα atoms) and is connected via the polar helical domain to the (vertical, white) transmembrane region. The E-domain is helical and is connected to the C-terminus. All five predicted models have a very similar “L-shaped” TM/chain terminus arrangement, as proposed (Figure a) using the other computational methods, and one model is shown in Figure b (error estimates shown in Figure c) for comparison with the model shown in Figure a. There are large error estimates for the C-domain, the TM domain, and the C-terminus, presumably because the catalytic domain can rotate, as seen in Figure S2. It should also be noted that there is no common structure (such as a pocket) in the polar C-domain. That notwithstanding, there is general accord with the model shown in Figure a, with the L-shaped TM/E-helix region perhaps being an intrinsic feature that may facilitate binding to a membrane surface.

Comparisons with Other Membrane Proteins

The question then arises as to how similar are the non-catalytic domains found in AfSQSs (as well as the other SQSs) to those found in other membrane proteins. There are only a small number of membrane protein structures with what appear to be primarily structural (i.e., non-catalytic) TM or helical domains. One example is that of estrone sulfatase (ES; 562 amino acids), responsible for maintaining high levels of estrogens in breast tumor cells, which as with SQS is associated with the ER.[19] Another is stannin, shown to mediate neuronal cell apoptosis mediated by Me3SnCl[20] and more recently to inhibit the entry of human papillomavirus into the trans-Golgi network.[26] The structures of both proteins are shown in Figure a,b (estrone sulfatase PDB ID code 1P49;[19] stannin PDB ID code 1ZZA(20)), and the HeliQuest results are shown in Figure c–e. Clearly, the ⟨H⟩ and ⟨μH⟩ values are very similar to those found with AfSQS with ⟨H⟩ ∼1.1–1.3 and ⟨μH⟩ ∼0.1–0.2. In stannin, there is also a membrane surface-associated domain that has ⟨H⟩ ∼0.3 and ⟨μH⟩ ∼0.2.

Figure 7

Structures and HeliQuest results for two other membrane proteins with TM helices. (a) Estrone sulfatase (ES) structure (PDB ID code 1P49). (b) Stannin structure (PDB ID code 1ZZA). (c) HeliQuest result for the cyan helix in ES. (d) HeliQuest result for the yellow helix in ES. (e) HeliQuest result for the TM helix in stannin. As we noted above, one of the potential partners for interacting with SQS would be squalene epoxidase, SQLE, as suggested in a very early work.[3,4] In this context, it is of interest to note that the X-ray structure of a catalytically active N-terminally truncated human SQLE (118–574) was recently reported[27] and reviewed.[28] The protein contains ligand-binding domains (as shown in green in Figure a (PDB ID code 6C6P);[27] the FAD and NB-598 ligands are omitted for clarity) together with two “membrane-associated” helical domains, suggested to both be buried in the endoplasmic reticulum.[28]

Figure 8

Polarity, hydrophobicity, and hydrophobic moments for the two C-terminal helices in human squalene epoxidase and a structural model for binding to a cell membrane. (a) HsSQLE (PDB ID 6C6P). (b) HeliQuest results for the penultimate (yellow) helix. (c) HeliQuest results for the terminal (cyan) helix. (d) Charged residues present in the two C-terminal helices. (e) Proposed model for membrane-binding based on rotations of the C-terminal helices. Interestingly, the penultimate C-terminal helix is very hydrophobic (11% polar residues) and has a ⟨H⟩ of 1.147 and a ⟨μH⟩ of 0.154 (Figure b), very similar to what we find for the D-helix in AfSQS and the other TM proteins (∼10% polar residues, hydrophobicity ⟨H⟩ ∼1.3, and hydrophobic moment ⟨μH⟩ ∼0.2). The actual C-terminal helix in HsSQLE has 33% polar residues, a net charge of +2, a ⟨H⟩ of 0.691, and a ⟨μH⟩ of 0.394 (Figure c), very similar to what we find with the D-helix in TcSQS (33% polar residues, a net charge of +2, a ⟨H⟩ of 0.656, and a ⟨μH⟩ of 0.229). There are a total of six charged residues in the HsSQLE terminal helix and turn region (Figure d), suggesting that this region might be membrane surface-associated (Figure e), in which the putative TM domain (in yellow) has been rotated to interact with the membrane bilayer. While this is a speculative proposal, the % polar residues and ⟨H⟩ and ⟨μH⟩ values in the HsSQLE penultimate helix are very similar to those seen in the single AfSQS TM domain, helix D (and in the other TM proteins) while the terminal helix is far less hydrophobic and indeed has multiple charged residues (Figure d). However, it is also possible that the C-terminal helix shown in Figure e might be rotated “up” and be a TM helix since five of the six charged residues are at either end of the helix and could be at the solvent-exposed membrane surfaces. This arrangement, though unusual, might then help explain the nature of the more polar TM helix in TcSQS. Finally, it should be noted that it is not yet clear whether SQS is a good target for antifungal development, at least as a monotherapy. Although inhibition of ergosterol biosynthesis and cell growth in Candida glabrata by depletion of its SQS gene (ERG9) is observed,[29] cell growth is rescued by cholesterol, both in vitro and in mice. Related partial rescue effects in vitro are also seen in A. fumigatus with azole antifungals,[30] and cholesterol-dependent C. glabrata has been found in clinical specimens.[31] On the other hand, compounds such as the bisphosphonate that inhibits both ASfSQS as well as human SQS[32] could result in lowering of both host cholesterol as well as pathogen ergosterol. In addition, it has been found that inhibiting SQS in neutrophils leads to formation of neutrophil extracellular traps (NETS),[33] and NETS have antifungal activity.[34]

Conclusions

The results we have presented above are of interest for several reasons. First, we have obtained the structure of a fungal squalene synthase catalytic domain, both as the apo- and as a ligand (FSPP) bound form. Second, the structure obtained is very similar to that of T. cruzi SQS and contains highly superimposable X, A, and B helices. Third, the structure is of interest when compared with human SQS, HsSQS, which appears (based on computational predictions) to have a shorter B-helix, something that might lead to differences in ligand (inhibitor or protein) binding between human and pathogen proteins. Fourth, we identified a very polar, solvent-exposed (i.e., non-buried) long (∼35 residue) highly helical domain, the C-helical domain, that is unique to the fungal proteins. This is of interest since the fungal SQSs are thought to interact with other proteins (such as squalene epoxidase) involved in ergosterol biosynthesis, and this domain may be a target for inhibitor development. Fifth, using a deep learning program, we find a similar L-shaped TM/E-helix structure to that proposed using simpler modeling methods, suggesting that the E-helix/chain terminus may be involved in binding to the membrane surface. Sixth, we find that the transmembrane domains in the fungal and human SQSs have very similar properties (% polar residues and ⟨H⟩ and ⟨μH⟩ values) to the TM domains in estrone sulfatase and stannin as well as to the penultimate helix in HsSQLE, leading to proposed models for SQLE in which the two C-terminal helices seen in the crystal structure may form one or two transmembrane helices. Overall, the crystallographic and computational results provide new perspectives on the structures of the fungal squalene synthases and should encourage renewed efforts in experimental structure determinations.

Materials and Methods

Cloning, Expression, and Purification

The gene encoding the AfSQS gene (GenBank accession number: KOC07597.1) from A. flavus was chemically synthesized by GENE ray Biotech Co. (Shanghai, China), ligated into the pET46 vector, and transformed into Escherichia coli strain BL21(DE3). Cells were grown in the LB medium at 37 °C to an OD600 of ∼0.8 and then induced by 0.6 mM isopropyl β-d-thiogalactopyranoside (IPTG) at 16 °C for 20 h. Cells were then harvested by centrifugation at 5000g for 15 min and then resuspended in lysis buffer containing 25 mM Tris–HCl, pH 8.0, 150 mM NaCl, and 20 mM imidazole followed by disruption with a French Press. Cell debris was removed by centrifugation at 17,000g for 1 h. The supernatant was then applied to a Ni-NTA column with an FPLC system (GE Healthcare). AfSQS was eluted at ∼150 mM imidazole when using a 20–250 mM imidazole gradient and was then dialyzed against a buffer containing 25 mM Tris–HCl, pH 8.0, passed through a DEAE column pre-equilibrated with the same buffer, and then eluted with 25 mM Tris–HCl, pH 8.0, 250 mM NaCl. Purity was checked by SDS-PAGE analysis and was >95%. The purified protein was then concentrated to 10 mg/mL for crystallization screening.

Crystallization, Data Collection, Structure Determination, and Refinement

All crystallization experiments were conducted at 25 °C using the sitting-drop vapor-diffusion method. In general, 1 μL of AfSQS containing solution (25 mM Tris, 150 mM NaCl; 40 mg/mL) was mixed with 1 μL of reservoir solution in 48-well Cryschem Plates and then equilibrated against 100 μL of the reservoir solution. The optimized crystallization conditions were 1.1 M NaH2PO4 and 0.5 M K2HPO4. Within 2–3 days, the crystals reached a size suitable for X-ray diffraction. The AfSQS crystals in complex with FSPP were obtained by soaking with mother liquor containing 10 mM FSPP for 72 h. All of the X-ray diffraction data sets were collected at beam line BL15A1 at the National Synchrotron Radiation Research Center (NSRRC). The crystals were mounted in a cryo-loop and soaked with cryoprotectant solution (1.1 M NaH2PO4, 0.5 M K2HPO4, 10% glycerol) prior to data collection at 100 K. The diffraction images were processed by using HKL2000.[35] The crystal structure of AfSQS was solved by using the molecular replacement method with the Phaser program[36] from the Phenix[37] suite, using the structure of HsSQS (PDB code 1EZF) as a search model. Further refinement was carried out by using the programs phenix.refine[38] and Coot.[39] Prior to structure refinement, 5% randomly selected reflections were set aside for calculating Rfree as a monitor. All figures were prepared by using the PyMOL program.[25]

Computational Aspects

We used the JPRED program[17] for predicting helical, sheet, disordered, and solvent-exposed residues; the Clustal Omega program[18] for sequence alignments; the HeliQuest program[21] for ⟨H⟩ and ⟨μH⟩ predictions; and the RoseTTAFold program[24] for 3D structure predictions.

39 in total

1. Mechanism of action and inhibition of dehydrosqualene synthase.

Authors: Fu-Yang Lin; Chia-I Liu; Yi-Liang Liu; Yonghui Zhang; Ke Wang; Wen-Yih Jeng; Tzu-Ping Ko; Rong Cao; Andrew H-J Wang; Eric Oldfield
Journal: Proc Natl Acad Sci U S A Date: 2010-11-23 Impact factor: 11.205

2. Use of helical wheels to represent the structures of proteins and to identify segments with helical potential.

Authors: M Schiffer; A B Edmundson
Journal: Biophys J Date: 1967-03 Impact factor: 4.033

3. Processing of X-ray diffraction data collected in oscillation mode.

Authors: Z Otwinowski; W Minor
Journal: Methods Enzymol Date: 1997 Impact factor: 1.600

4. Structure, dynamics, and membrane topology of stannin: a mediator of neuronal cell apoptosis induced by trimethyltin chloride.

Authors: Bethany A Buck-Koehntop; Alessandro Mascioni; Jarrod J Buffy; Gianluigi Veglia
Journal: J Mol Biol Date: 2005-09-30 Impact factor: 5.469

5. Truncation of human squalene synthase yields active, crystallizable protein.

Authors: J F Thompson; D E Danley; S Mazzalupo; P M Milos; M E Lira; H J Harwood
Journal: Arch Biochem Biophys Date: 1998-02-15 Impact factor: 4.013

6. Allylamine derivatives: new class of synthetic antifungal agents inhibiting fungal squalene epoxidase.

Authors: G Petranyi; N S Ryder; A Stütz
Journal: Science Date: 1984-06-15 Impact factor: 47.728

7. Structural and functional analysis of Bacillus subtilis YisP reveals a role of its product in biofilm production.

Authors: Xinxin Feng; Yumei Hu; Yingying Zheng; Wei Zhu; Kai Li; Chun-Hsiang Huang; Tzu-Ping Ko; Feifei Ren; Hsiu-Chien Chan; Mulugeta Nega; Shannon Bogue; Daniel López; Roberto Kolter; Friedrich Götz; Rey-Ting Guo; Eric Oldfield
Journal: Chem Biol Date: 2014-10-09

8. Structure of human estrone sulfatase suggests functional roles of membrane association.

Authors: Francisco G Hernandez-Guzman; Tadayoshi Higashiyama; Walter Pangborn; Yoshio Osawa; Debashis Ghosh
Journal: J Biol Chem Date: 2003-03-25 Impact factor: 5.157

9. JPred4: a protein secondary structure prediction server.

Authors: Alexey Drozdetskiy; Christian Cole; James Procter; Geoffrey J Barton
Journal: Nucleic Acids Res Date: 2015-04-16 Impact factor: 16.971

10. Accurate prediction of protein structures and interactions using a three-track neural network.

Authors: Minkyung Baek; Frank DiMaio; Ivan Anishchenko; Justas Dauparas; Sergey Ovchinnikov; Gyu Rie Lee; Jue Wang; Qian Cong; Lisa N Kinch; R Dustin Schaeffer; Claudia Millán; Hahnbeom Park; Carson Adams; Caleb R Glassman; Andy DeGiovanni; Jose H Pereira; Andria V Rodrigues; Alberdina A van Dijk; Ana C Ebrecht; Diederik J Opperman; Theo Sagmeister; Christoph Buhlheller; Tea Pavkov-Keller; Manoj K Rathinaswamy; Udit Dalwadi; Calvin K Yip; John E Burke; K Christopher Garcia; Nick V Grishin; Paul D Adams; Randy J Read; David Baker
Journal: Science Date: 2021-07-15 Impact factor: 47.728