Md Munan Shaik1,2, Hanqin Peng1, Jianming Lu3,4, Sophia Rits-Volloch1, Chen Xu5, Maofu Liao6, Bing Chen7,8. 1. Division of Molecular Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA. 2. Department of Pediatrics, Harvard Medical School, Boston, MA, USA. 3. Codex BioSolutions, Gaithersburg, MD, USA. 4. Xiamen Amoytop Biotech, Xiamen, China. 5. Cryo-EM Core Facility, University of Massachusetts Medical School, Worcester, MA, USA. 6. Department of Cell Biology, Harvard Medical School, Boston, MA, USA. 7. Division of Molecular Medicine, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA. bchen@crystal.harvard.edu. 8. Department of Pediatrics, Harvard Medical School, Boston, MA, USA. bchen@crystal.harvard.edu.
Abstract
HIV-1 envelope glycoprotein (Env), which consists of trimeric (gp160)3 cleaved to (gp120 and gp41)3, interacts with the primary receptor CD4 and a coreceptor (such as chemokine receptor CCR5) to fuse viral and target-cell membranes. The gp120-coreceptor interaction has previously been proposed as the most crucial trigger for unleashing the fusogenic potential of gp41. Here we report a cryo-electron microscopy structure of a full-length gp120 in complex with soluble CD4 and unmodified human CCR5, at 3.9 Å resolution. The V3 loop of gp120 inserts into the chemokine-binding pocket formed by seven transmembrane helices of CCR5, and the N terminus of CCR5 contacts the CD4-induced bridging sheet of gp120. CCR5 induces no obvious allosteric changes in gp120 that can propagate to gp41; it does bring the Env trimer close to the target membrane. The N terminus of gp120, which is gripped by gp41 in the pre-fusion or CD4-bound Env, flips back in the CCR5-bound conformation and may irreversibly destabilize gp41 to initiate fusion. The coreceptor probably functions by stabilizing and anchoring the CD4-induced conformation of Env near the cell membrane. These results advance our understanding of HIV-1 entry into host cells and may guide the development of vaccines and therapeutic agents.
HIV-1 envelope glycoprotein (Env), which consists of trimeric (gp160)3 cleaved to (gp120 and gp41)3, interacts with the primary receptor CD4 and a coreceptor (such as chemokine receptor CCR5) to fuse viral and target-cell membranes. The gp120-coreceptor interaction has previously been proposed as the most crucial trigger for unleashing the fusogenic potential of gp41. Here we report a cryo-electron microscopy structure of a full-length gp120 in complex with soluble CD4 and unmodified human CCR5, at 3.9 Å resolution. The V3 loop of gp120 inserts into the chemokine-binding pocket formed by seven transmembrane helices of CCR5, and the N terminus of CCR5 contacts the CD4-induced bridging sheet of gp120. CCR5 induces no obvious allosteric changes in gp120 that can propagate to gp41; it does bring the Env trimer close to the target membrane. The N terminus of gp120, which is gripped by gp41 in the pre-fusion or CD4-bound Env, flips back in the CCR5-bound conformation and may irreversibly destabilize gp41 to initiate fusion. The coreceptor probably functions by stabilizing and anchoring the CD4-induced conformation of Env near the cell membrane. These results advance our understanding of HIV-1 entry into host cells and may guide the development of vaccines and therapeutic agents.
HIV-1 envelope glycoprotein (Env) fuses viral and cell membranes, allowing entry of the virus into host cells. Its precursor, gp160, trimerizes to (gp160)3, which is cleaved into two noncovalently associated fragments: gp120 (receptor-binding) and gp41 (fusion)[1]. Three copies of each fragment constitute the mature viral spike (gp120/gp41)3. Sequential binding of gp120 to primary receptor CD4 and a coreceptor (e.g. CCR5 or CXCR4) are believed to induce conformational changes that trigger likely dissociation of gp120 and refolding of gp41[1]. Structural rearrangements in gp41 bring the two membranes together, promoting membrane fusion. We have detailed structural information for the interaction of HIV-1 Env with CD4[2-6], but no molecular picture for its interaction with a coreceptor, which continues to be the subject of speculative molecular modeling[7-10]The chemokine receptors, CCR5 and CXCR4, were identified as the HIV-1 coreceptors in 1996[11]. They are G protein-coupled receptors (GPCRs) with seven transmembrane-spanning segments (7TMs). Choice of coreceptor is the major determinant for viral tropism[11]. Viruses using CCR5 (R5 viruses) are generally responsible for viral transmission; those using CXCR4 (X4 viruses) or both (dual-tropic; R5X4 viruses) emerge later during disease progression[12,13]. Both CCR5 and CXCR4 have an extracellular N-terminal segment, three extracellular loops (ECL), three intracellular loops (ICL) and a cytoplasmic C-terminal tail. Crystal structures have been reported for a C-terminally truncated CXCR4 containing stabilizing mutations and a T4 lysozyme fusion, in complex with different ligands, and for a similarly modified CCR5 containing a rubredoxin fusion in complex with either an anti-HIV drug, maraviroc or a modified chemokine, [5P7]CCL5[7,10,14,15]. These structures show a typical 7TM helical bundle topology seen for other GPCRs (Extended Data Fig. 1). Consistent with the so-called two-site model[16], the N-terminal segment of CXCR4 or CCR5 forms “chemokine recognition site 1” (CRS1) to interact with the globular core domain of chemokine, while their 7TM bundle forms a binding pocket (“chemokine recognition site 2”; CRS2) that accommodates the N-terminus of the chemokine. While they elucidate chemokine receptor function, these structures do not explain how CCR5 and CXCR4 function as HIV-1 coreceptors.
Extended Data Figure 1.
Known structures of CCR5 and CXCR4.
CCR5 and CXCR4 were identified as the coreceptors for HIV-1 entry in 1996[71-77]. (a)-(b) Crystal structures of a modified CCR5 (ΔC224-N226→rubredoxin; ΔF320-L352; and point mutations C58Y, G163N, A233D, K303E) in complex with HIV entry inhibitor maraviroc (pdb ID: 4MBS; ref[7]) and a modified chemokine [5P7]CCL5 (an antagonist; pdb ID: 5UIW; ref[10]). CCR5 is shown in ribbon diagram in blue, the internally fused rubredoxin in magenta and the ligands in yellow. N-terminus (N), C-terminus (C) and the second extracellular loop (ECL2) are indicated. (c)-(e) Crystal structures of an engineered CXCR4 in complex with a viral chemokine antagonist vMIP-II (pdb ID: 4RWS; ref[15]), a small molecule antagonist IT1t (pdb ID: 3ODU; ref[14]) and a cyclic peptide antagonist CVX15 (pdb ID: 3OE0; ref[14]). CXCR4 is shown in green, the fused T4 lysozyme in magenta and the ligands in yellow. N, N-terminus; C, C-terminus; and ECL2, the second extracellular loop.
Mutagenesis studies have mapped the gp120 binding site to the N-terminal segment and ECL2 for CCR5; and to the N-terminal segment, ECL2, and ECL3 for CXCR4[17,18]. The footprint of the coreceptor on gp120 probably includes the V3 loop and the bridging sheet, a structure accessible only after CD4 binding[19,20]. The N-terminus of the coreceptor may contact the gp120 bridging sheet, while the tip of the V3 loop may insert into the coreceptor CRS2[20]. Interactions of V3 with CCR5 or CXCR4 have also been modeled by molecular dynamics simulations and free energy calculations[7-10]. Tyrosine sulfation near the N-terminus only enhances HIV-1 entry for CCR5, but not for CXCR4[21,22]. The C-terminal tail, containing palmitoylation and phosphorylation sites, is required for efficient cell signaling, but not for HIV-1 coreceptor function[23]. Chemokines such as, MIP-1α, MIP-1β, CCL5/RANTES and CXCL12/SDF-1 block gp120 binding and prevent viral infection[11].
Results
To understand how a coreceptor functions, we have determined the structure of an unmodified CCR5 in complex with a full-length HIV-1 gp120 and a 4-domain soluble CD4 by single-particle cryo-electron microscopy (cryo-EM).
Purification of the CCR5 complex and structure determination
To produce functional CCR5, we generated HEK293T or Expi293F cell lines stably expressing wildtype human CCR5 by our published protocol[24]. CCR5 on these cells was fully active as a chemokine receptor (Extended Data Fig. 2a), and it also formed a tight complex with HIV-1 gp120 in the presence of soluble CD4 (Extended Data Fig. 2b). Moreover, these cells fused efficiently with HIV-1 Env-expressing cells only when activated by soluble CD4 (Extended Data Fig. 2c), suggesting that the expressed CCR5 is a fully functional HIV-1 coreceptor. Finally, neither gp120 alone nor the gp120-CD4 complex could activate the G-protein mediated signaling pathways in these cells (Extended Data Fig. 2d and 2e).
Extended Data Figure 2.
Characterization of stable 293 cell lines expressing wildtype human CCR5.
(a) Chemokine receptor assay. 293T and 293T-CCR5 (stable) cells were treated with different concentrations of CCL5/RANTES. Ft/F0 is a fluorescence-signal ratio proportional to that of intracellular cAMP concentration at 40 min post CCL5/RANTES-activation and time 0. The dose response curves were plotted for both 293T (black) and 293T-CCR5 (red) cells. The experiment was carried out in quadruplicate and repeated at least three times with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function. (b) Flow cytometry histograms of HIV-1 gp120 binding to CCR5 expressed on the cell surfaces in the absence (orange) or presence (red) of soluble CD4. 293T cells (black), CCR5-expressing cells only (gray) and CCR5-expressing cells with soluble CD4 only (blue) were negative controls. The experiment was repeated independently at least twice with similar results. (c) HIV-1 Env mediated cell-cell fusion. 293T cells stably transfected with CCR5 were mixed with HIV-1 Env (gp160) expressing cells in the absence or presence of soluble CD4. The CCR5 cells fuse with CD4-triggered Env cells very efficiently and form large syncytia that almost cover the entire well. The experiment was repeated independently twice with similar results. (d) Chemokine receptor assay by various ligands. As in (a), Expi293F and Expi293F-CCR5 (stable) cells were treated with CCL5/RANTES, gp120, CD4 or the complex of gp120 and CD4. The dose response curves were plotted for both Expi293F (control, left) and Expi293F-CCR5 (right) cells with different ligands as indicated. The experiment was carried out in quadruplicate and repeated at least three times with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function. (e) Left, kinetic curves of 5 representative wells of 293T-CCR5 cells treated with 5 different ligands as indicated. ATP activates the endogenous Gq coupled GPCR - P2Y receptor, as a positive control. Ratio, fluorescence intensity/baseline intensity. Right, dose response curve of each ligand. The y-axis is a background-subtracted ratio (peak fluorescent intensity ratio - 1). We conclude that our gp120 and gp120-CD4 do not activate G-protein mediated calcium flux at the concentrations tested here. The experiment was carried out in quadruplicate and repeated twice with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function.
To purify the wildtype CCR5 in its Env-bound conformation, we isolated the CD4-gp120-CCR5 complex from the CCR5-expressing cells (Extended Data Fig. 3a and 3b). The purified CD4-gp120-CCR5 complex eluted from a size-exclusion column as a single sharp peak at the expected volume, confirming its stability and conformational homogeneity (Extended Data Fig. 3c). Coomassie-stained SDS-PAGE and negative stain EM of the peak fractions showed all three components in a stoichiometry of 1:1:1 (Extended Data Figs. 3c and 4a-c). The absence of any G proteins in the purified complex is consistent with the notion that gp120, unlike chemokines, does not require G protein coupling for high affinity binding to CCR5[25].
Extended Data Figure 3.
Purification of the CD4-gp120-CCR5 complex.
(a) Schematic representation of expression constructs for HIV-1 gp120, human CCR5 and CD4. Segments of gp120 are designated as follows: C1-C5, conserved regions 1-5; V1-V5, variable regions 1-5; and His-tag, a six-histidine tag. Tree-like symbols represent glycans. Those for CCR5 include: N, N-terminus; TM1-7, transmembrane helices 1-7; ECL1-3, extracellular loop 1-3; ICL3, intracellular loop 1-3; and CT, cytoplasmic tail. For CD4, they are: D1-4, immunoglobulin (Ig) domain 1-4; and strep tag, a purification tag. TM (transmembrane segment) and CT (cytoplasmic tail) in gray are truncated in the expression construct. (b) Unmodified human CCR5 in complex with HIV-1 gp120 and 4D CD4 was purified by the following steps: 1) complex formation, HIV-1 gp120 (green) and strep-tagged, 4 domain CD4 (light blue) were incubated with CCR5 (magenta) expressed cells to allow formation of the CD4-gp120-CCR5 complex on cell surfaces; 2) strep-tag purification, the CCR5 complex, as well as some CD4-gp120 complex, were captured to strep-tactin resin via the strep-tagged CD4 (strep tag in yellow). They were eluted by D-desthiobiotin under mild conditions; 3) negative selection by an anti-V3 antibody to remove the CD4-gp120 complex. The CCR5 complex was further purified by size-exclusion chromatography. (c) The purified CD4-gp120-CCR5 complex was resolved by gel-filtration chromatography on a Superose 6 column in the presence of detergent LMNG. The molecular weight standards include thyoglobulin (670 kDa), ferritin (440 kDa), γ-globulin (158 kDa) and ovalbumin (44 kDa). The expected size of the CCR5 complex is ~310 kDa (120 kDa for gp120, 50 kDa for 4D CD4, 40 kDa for CCR5 and ~100 kDa for LMNG micelle). Peak fractions were analyzed by Coomassie stained SDS-PAGE (lanes 1-3). Labeled bands were confirmed by western blot and protein sequencing. The experiment was repeated independently at least 15 times with similar results.
Extended Data Figure 4.
Characterization of the CD4-gp120-CCR5 complex by EM.
(a) Representative image of the CD4-gp120-CCR5 complex in negative stain. The experiment was repeated independently at least 4 times with similar results. (b) 2D averages of the negatively stained CD4-gp120-CCR5 complex. The box size of 2D averages is ~330Å. (c) 3D reconstruction of the negatively stained CD4-gp120-CCR5 complex, fitted with a gp120 structure containing an extended V3 loop (pdb ID: 2QAD; ref[20]), 4D CD4 (pdb ID: 1WIO) and CCR5 (pdb ID: 4MBS). (d) A representative cryo-EM image of the 4D CD4-gp120-CCR5 complex. The scale bar represents 25 nm. Five independent large data sets were collected with similar results. (e) 2D averages of the cryo-EM particle images show secondary structural features for both gp120 and CCR5.
We acquired cryo-EM images on a Titan Krios electron microscope with a K2 Summit direct detector (Extended Data Fig. 4d) and used RELION[26] for image processing. Two-dimensional (2D) class averages of the particle images showed secondary structural features for both gp120 and CCR5 (Extended Data Fig. 4e). Three-dimensional (3D) classification of the particles was performed (Extended Data Fig. 5a), using the low-pass filtered 3D reconstruction of the complex in negative stain as an initial model (Extended Data Fig. 4c). The particles from the classes with defined structural features were combined and refined to generate a map at 4.6Å resolution, showing all three components (Extended Data Fig. 5a). The structure was determined by rounds of 3D classification, 3D classification with signal subtraction and masked refinement, as described in Methods and Extended Data Fig. 5a-d. The final resolution was 3.9Å, when the last two domains of CD4 were excluded
Extended Data Figure 5.
Single-particle cryo-EM analysis of the CD4-gp120-CCR5 complex.
(a) Data processing workflow for the CD4-gp120-CCR5 complex. (b) 3D reconstructions of the CD4-gp120-CCR5 complex refined with no mask at an overall resolution of 4.5Å (left) and with a mask to exclude the last two domains of CD4 at a resolution of 3.9Å (right) are colored according to local resolution estimated by RELION. (c) Angular distribution of the cryo-EM particles used in the reconstruction was also shown in respect to both the side and top views of the EM map. (d) Gold standard FSC curves of the unmasked and masked EM reconstructions shown in (b).
Overall structure of the CD4-gp120-CCR5 complex
The atomic structures of 4D CD4, the gp120 core complexed with CD4 and the modified CCR5, none of which was used in image processing, all fit as rigid bodies remarkably well to the EM density of the CD4-gp120-CCR5 at 4.5Å resolution (ref[10,27,28]; Fig. 1a and 1b and Extended Data Fig. 5b). The gp120 core, containing inner and outer domains and the bridging sheet[2], occupied excellent density, indicating that it has the same rigid structure in this ternary complex as it does in the CD4-induced conformation[2,28]. Several N-linked glycans were marked by protruding densities from the protein surfaces, as expected for a fully-glycosylated gp120. All four domains of soluble CD4 were visible, with density for first two (D1-D2) slightly stronger than for last two (D3-D4) domains. There was also density for a N-linked glycan at N271 in D3 (Fig. 1b), suggesting that the hinge region between D2 and D3 may not be very flexible[27]. All 7TM helices of CCR5 were well resolved in detergent micelle. Its palmitoylated cytoplasmic tail, which presumably interacts with membrane, was disordered in this detergent-solubilized complex. Extra densities between CCR5 and gp120 that cannot be explained by the existing structures define the details of CCR5-gp120 interaction.
Figure 1.
Cryo-EM structure of the CD4-gp120-CCR5 complex.
(a) Cryo-EM map of the complex containing HIV-1 gp120 (cyan), CCR5 (red), 4D CD4 (green; D1-4, domain 1-4), and detergent micelle (gray). (b) Fit of structures of gp120 (pdb ID: 5VN328), CCR5 (pdb ID: 5UIW[10]) and 4D CD4 (pdb ID: 1WIO[27]) into the EM map in (a). N271 of CD4 (green), N234, N262 and N362 of gp120 in cyan are N-linked glycosylation sites. (c) The structure of the CD4-gp120-CCR5 complex was modeled based on a 3.9Å density map. (d) Overall structure of the 4D CD4-gp120-CCR5 complex shown in ribbon diagram. N, N-terminus; C, C-terminus; ECL2, extracellular loop 2; I, II, III, IV, V, VI, VII, transmembrane helices (TM) 1-7.
Interfaces between gp120 and CCR5
The map from the masked refinement showed good density for the gp120 and CCR5 interfaces (Fig. 1c and Extended Data Fig. 6). As postulated previously[7,14], there are indeed two major contacting interfaces between gp120 and CCR5 (Fig. 1d). The V3 loop of gp120 inserts into the CRS2 of CCR5 and contacts with all the 7TM helices. The CCR5 N-terminal segment adopts an extended conformation with several sharp turns and contacts the surface of the bridging sheet of gp120, which forms only after CD4 binding[2,28].
Extended Data Figure 6.
Gallery of representative density for the CD4-gp120-CCR5 complex.
Representative density in gray mesh from the 3.9Å resolution EM map is shown for TM1-7, the N-terminus of CCR5, ELC3 near TM6, Tys10, Tys14 and Tyr15 (red model); two V3 regions; and for helix α1, N-terminus, V3 loop, the bridging sheet and N-linked glycan at Asn262 of gp120 (cyan model).
gp120 V3 loop and CCR5 CRS2.
The conserved 310GPGR(Q)313 motif at the tip of the V3 loop penetrates by approximately 1/3 of the thickness of lipid bilayer into the CRS2 pocket with the residue P311 reaching most deeply (Fig. 2a). Residues 309-316 of the V3 loop adopt a structure, including a one-turn helix, similar to that of the N-terminus (residues 1-8) of [5P7]CCL5 (ref[10]; Extended Data Fig. 7a). Both structures have a proline residue (P311 of gp120 and P3 of [5P7]CCL5) that reaches the bottom of the CRS2, packing against the side chains of W86 and Y108 in CCR5 (Fig. 2a and Extended Data Fig. 7a). R313 in V3 appears to be sandwiched between Y251 and E283 of CCR5. The CRS2 of chemokine receptors has minor and major subpockets, formed by TM helices I-III, VII; and III-VII, respectively[10]. The V3 loop of this HIV-1 strain occupies mainly the minor subpocket and just part of the major subpocket (Fig. 3a), leaving room in the CRS2 to accommodate different V3 sequences from other viral isolates. Like [5P7]CCL5, different V3 loops may also have water-mediated interactions with CCR5[10]. The ECL2 of CCR5 forms a nearly semi-circular grip as it wraps around V3, making contacts with residues in both the V3 stem and crown[29]. Particularly, E172 in ECL2 and R304 in V3 probably form a salt bridge (Fig. 2a). Overall, the gp120 V3 makes extensive contacts with the CRS2 of CCR5, contributing to the high affinity of HIV-1 Env for its coreceptor.
Figure 2.
Interfaces between gp120 and CCR5.
(a) Interactions between the V3 loop of gp120 (cyan) and the CRS2 of CCR5 (red). Left, ribbon diagram of V3 inserting into the CRS2. The GPGR motif of V3 is in stick model. Right, major contacts between residues Pro311, Arg313 and Arg304 of gp120 in cyan and those from CCR5. (b) Interactions between the CCR5 N-terminus (red) and the bridging sheet of gp120 (cyan). Left, overall view of the CCR5 N-terminus attaching to the four-stranded bridging sheet formed by the V1V2 stem and β21-β22 of gp120. Residues Ser7, Pro8, sulfated Tyr10 and Tyr14, Tyr15, Pro19, the O-linked glycan at Ser7, the disulfide between Cys20 and Cys269 of CCR5 are in stick model. Right, major contacts between sulfated Tyr10 and Tyr14, as well as Tyr15 of CCR5 and residues from gp120.
Extended Data Figure 7.
Comparison of conformations of V3 loop and [5P7]CCL5 in complex with CCR5, as well as of gp120-bound CCR5 and G protein-bound β2 adrenergic receptor (β2AR).
(a) The structures of the CD4-gp120-CCR5 and [5P7]CCL5-CCR5 complexes are superposed on CCR5 (red). Gp120 V3 loop with its Pro311 in stick model is in cyan and [5P7]CCL5 with its Pro3 in stick model in yellow. Residues 309-316 of the V3 loop and residues 1-8 of [5P7]CCL5 adopt a very similar structure are highlighted in a rectangular box. (b) Superposition of the structures of the N-terminus of the gp120-bound CCR5 (red) and the CDR H3 loop of antibody 412d in complex with gp120 core (green). The EM density of the CD4-gp120-CCR5 is shown in gray. The positions of the sulfated tyrosine (Tys) residues, including Tys10 and Tys14 from CCR5; Tys100 and Tys100c from 412d, are indicated. (c) A model for interactions of three CD4 and three CCR5 with the SOSIP Env trimer. The side and bottom views of a composite structure of the CD4-CCR5-SOSIP Env trimer complex are shown. The model was generated using the CD4-bound SOSIP trimer (pdb ID:5VN3) and the structure of the CD4-gp120-CCR5 complex from this study. All the structures were aligned based on the gp120 core region. CCR5 is shown in red, CD4 in green, gp120 in blue and gp120 of SOSIP dark blue; gp41 of SOSIP gray. The crystallographic dimer of CCR5 was also shown on left only in a rectangular box using pdb ID: 4MBS. The observed crystallographic dimer of CCR5 or the TM5-mediated dimer by modeling does not seem to be relevant to binding to either monomeric or trimeric gp120[7,78]. (d) Superposition of the structures of the gp120-bound CCR5 (red) and the Gs protein-bound β2 adrenergic receptor (blue). The position of TM6, critical for GPCR activation, is indicated.
Figure 3.
Conformational differences between gp120-bound and other liganded CCR5s.
(a) The section of the CCR5 CRS2 (divided into a major and a minor subpocket) is shown in surface representation for the [5P7]CCL5, gp120 and maraviroc complexes, respectively. Interacting residues, including Pro3 from [5P7]CCL5, Pro311 from the V3 loop, and compound maraviroc, are in stick model. (b) Superposition of the structures of the gp120-CCR5 complex (red) and the maraviroc-CCR5 complex (blue). N-terminus, ELC2 and seven TM helices (I, II, III, IV, V, VI, VII) are indicated. (c) Superposition of the structures of the gp120-CCR5 complex (red) and the [5P7]CCL5-CCR5 complex (blue).
gp120 bridging sheet and CCR5 N-terminus.
The second major interface between Env and its coreceptor is formed by the N-terminus of CCR5 and the bridging sheet of gp120. Helices I and VII of CCR5 are locked by a disulfide (Cys20-Cys269), and the N-terminal segment (residues 1-19), connected to the helix I, is largely disordered in the crystal structures of CCR5 complexed with other ligands[7,10]. In the CD4-gp120-CCR5 complex, the N-terminal segment adopts an extended conformation with several sharp turns, attaching to the surface of the bridging sheet, formed by the V1V2 stem and the β20-β21 hairpin upon CD4 binding (Fig. 2b). Three tyrosine residues, Tyr10, Tyr14, and Tyr15, of CCR5, all of which can be sulfated[21], make the most intimate contacts with gp120. There appears to be density and room for sulfate groups on Tyr10 and Tyr14, but not on Tyr15 (Extended Data Fig. 6). The aromatic ring of sulfated Tyr10 packs against the side chain of Arg326 in gp120 probably by a cation-π interaction[30], placing the putative sulfate group near Lys416 and Arg414 of gp120. Sulfated Tyr14 wedges between the bases of both the V3 stem and bridging sheet, positioning its sulfate group near the side chains of Arg 298 and Lys 435 of gp120. The side chain of Tyr15 is close to Lys207 at the base of the V1V2 stem, potentially forming another cation-π interaction. Sulfated Tyr10 and Tyr14 mimic the interactions with gp120 of two sulfated tyrosines Tyr100 and Tyr100c in antibody 412d (ref[20]; Extended Data Fig. 7b). There also appears to be an O-linked glycan at Ser7, a glycosylation site identified previously[31], and the carbohydrate may help maintain the configuration of the N-terminal region of CCR5.One unexpected feature of the complex structure is the orientation of CD4 relative to the 7TM helices of CCR5. The long axis of CD4 lies almost perpendicular to the axis of the CCR5 TM helices and roughly parallel to the plane of lipid bilayer, raising the possibility that formation of the ternary complex could induce a local bend in the membrane. Although binding of an Env trimer to three copies each of CD4 and CCR5 simultaneously is stereochemically possible (Extended Data Fig. 7c), it may be an inefficient process if such a stoichiometry is needed to activate gp41.
Conformational differences between gp120-bound and other liganded CCR5s
There is no published structure for unliganded CCR5, which may be conformationally flexible, sampling a range of conformations even in its native membrane environment[32]. In a complex with CCR5 stabilized by specific mutations, maraviroc binds in a hydrophobic pocket near the bottom of CRS2; it has been thought to inhibit HIV-1 infection by stabilizing a conformation that Env cannot recognize[33]. We find, however, that the overall dimension and shape of the gp120-bound CRS2 pocket from the wildtype CCR5 are not very different from those of other liganded CCR5s with stabilizing modifications (Fig. 3a). Comparison of the structures of the CCR5-maraviroc and CCR5-gp120 complexes indicates that the parts of all TM helices near the intracellular side, including TM6, which is critical for GPCR activation[34], show few differences, indicating that the gp120-bound CCR5 also adopts an inactive conformation (Extended Data Fig. 7d). Indeed, binding of the gp120-CD4 complex did not activate the G protein-signaling pathways (Extended Data Fig. 2d and 2e). The parts near the extracellular surface move outwards by 1-3Å to accommodate the inserted V3 loop (Fig. 3b). The largest changes are in the long β-hairpin of ECL2, where part of its body moves outwards by 2-3Å while its tip moves inwards by >5Å tightening the grip on the V3 loop. These conformational differences are unlikely to block V3 access to the CRS2 pocket, however. In addition, the binding site of maraviroc partially overlaps with that of the V3 tip, suggesting that the drug blocks gp120 binding by direct competition, not by restricting conformational availability. Moreover, the antagonist ([5P7]CCL5) bound CCR5 with stabilizing mutations also adopts a conformation very similar to the gp120-bound wildtype CCR5, suggesting that the conformational freedom of CCR5 TM helices is limited, at least in the liganded, inactive forms (Fig. 3c).
Conformational differences between CD4- and CCR5-bound gp120s
The most unexpected aspect of the CD4-gp120-CCR5 structure is lack of obvious allosteric changes that can propagate from the CCR5 binding site to gp41, as previously hypothesized. While CD4 and/or CD4i antibody-induced conformational changes in the SOSIP-based Env trimer have been described in great detail recently[28], comparison of the CD4- and CCR5-bound gp120 with the CD4-bound gp120 shows no major differences in the gp120 core region (Fig. 4a and Extended Data Fig. 8a). In particular, a ~50Å zone between the CCR5 binding site and the gp120-gp41 interface, including inner and outer domains and bridging sheet, remains almost invariant, suggesting that CCR5 does not induce any major structural changes affecting gp41. There are, however, some differences between the two structures. First, when CCR5 binds, the V3 loop reconfigures in order to fit into the CRS2 pocket (Fig. 3a). This conformation, partially mimicking that of the N-terminus of [5P7]CCL5, has not been seen for either unliganded or antibody-bound V3 loops (ref[20,35-38]; Extended Data Fig. 8b). Second, a more significant difference is in the region including the N- and C-termini of gp120 near its interface with gp41. In the prefusion SOSIP trimer structure, the gp120 termini are surrounded by a so-called “4-helix collar” of gp41, which closes up by insertion of the side chain of Met530 on one of the helices (α6) into a hydrophobic clasp formed by three tryptophan residues on two other helices (α8 and α9)[5]. CD4 binding induces a shift of α6, allowing the fusion peptide to pack directly against the gp120 termini (ref[28]; Fig. 4b). In the CD4-gp120-CCR5 complex, the N- and C-termini bend back at pivot regions containing the highly conserved [40]GVP[42] and 489PLG491 sequences, respectively. In particular, the N-terminus rotates almost by ~180° to pack against the surface of gp120 and occupies the space of the fusion peptide in the Env trimer (Fig. 4b). There is no obvious density for the rest of the C-terminus, which is probably disordered, although the histidine tag could have influenced its conformation.
Figure 4.
Conformational differences between CD4-bound and CCR5-bound gp120s.
(a) Comparison of structures of gp120 in the unliganded SOSIP Env trimer (pdb ID: 4ZMJ35; purple), in the CD4-bound SOSIP trimer (pdb ID:5VN3[28]; blue) and in complex with CD4 and CCR5 (cyan). A 50Å-circle marks the gp120 core region. (b) Superposition of structures of the CD4-gp120-CCR5 complex and the CD4-bound SOSIP trimer. Left, the two structures are superposed by the gp120 core region and the first two domains of CD4. CCR5-bound gp120 is in cyan, CCR5 in red, CD4 in green; one of the gp120s from the trimer in blue, the corresponding gp41 in yellow, except for its fusion peptide in magenta, the rest of the SOSIP trimer in gray. The distances between the fusion peptide and the TM domains of CD4 and CCR5 are 160Å and 70Å, respectively. Right, close-up views of the gp120 N- and C-terminal region. Four helices (α6, α7, α8 and α9) of gp41 forming the 4-helix collar are indicated. The N-terminus of the CCR5-bound gp120 overlaps with the fusion peptide in the CD4-bound trimer.
Extended Data Figure 8.
Comparison of conformations of different monomeric gp120s and various V3 loops.
(a) Comparison of structures of an unliganded gp120 core (pdb ID:4OLV; purple), a CD4-bound monomeric gp120 core with the V3 loop (pdb ID:2QAD; blue) and gp120 in complex with CD4 and CCR5 from this study (cyan). The gp120 core region is marked by a circle with a diameter of 50Å. N- and C-termini, V1V2 stem, V3 stem or loop and bridging sheet are indicated. (b) Representative conformations that an HIV-1 V3 loop can adopt. From left to right: V3 loop in the unliganded SOSIP BG505 Env trimer (pdb ID: 4ZMJ); the first V3-containing gp120 core in complex with CD4 and antibody X5 (pdb ID: 2B4C; ref[29]); CD4- and 412d-bound monomeric gp120 core with V3 (pdb ID: 2QAD); CCR5-bound intact gp120 (this study); and V3 peptide in complex with antibody 447-52D (pdb ID: 3GHB; ref[36]); antibody 268-D (pdb ID: 3GO1; ref[37]); antibody 2557 (pdb ID: 3MLV; ref[37]); antibody 10A37 (pdb ID: 5V6L; ref[38]). The root-mean-square deviation (RMSD) of each structure, except for 5V6L, relative to the CCR5-bound gp120 monomer is shown at the bottom in parenthesis.
A model for how CCR5 functions as an HIV-1 coreceptor
Since CCR5 binding does not appear to induce any allosteric changes that can unleash gp41 to fuse membranes, how does it function as an essential coreceptor? Based on the known structures of the HIV-1 Env trimer[3-5,28,39], the prefusion gp41, which wraps around the N- and C-termini of gp120, is no longer stable and likely to enter an irreversible refolding process once gp120 dissociates. Thus, gp120 dissociation may be the crucial “trigger” that initiates gp41 refolding events, including insertion of its fusion peptide into the target membrane and formation of the six-helix bundle postfusion conformation. In the prefusion conformation, the gp120 N- and C-termini are gripped by the 4-helix collar of gp41[5]. CD4 binding leads to a large shift of the C-terminus of helix α6 away from the gp120 termini, creating a pocket, filled by the fusion peptide[28], which packs against the pivot region ([40]GVP[42] ) of the gp120 N-terminus (Fig. 4b). When intrinsic conformational dynamics cause the fusion peptide to dissociate from the pocket, it will open up one side of the gp41 grip, and the gp120 N-terminus can then bend back to adopt the conformation observed in the CCR5-bound structure (Extended Data Fig. 9). The rearrangements of the gp120 termini, probably independent of CCR5 binding, can prevent the fusion peptide from reoccupying the pocket and effectively weaken gp120-gp41 interactions, possibly leading to complete dissociation. Indeed, spontaneous or CD4-induced gp120 shedding from Env trimers are well documented for many HIV-1 isolates[40,41], indicating that gp120 is prone to dissociation from gp41 even in the absence of a coreceptor. We note that the impact of the membranes and Env trimer organization remains unknown and will require further investigation.
Extended Data Figure 9.
Model for HIV-1 Env activation to induce membrane fusion.
A hypothesis of how cellular receptors CD4 and CCR5 trigger HIV-1 Env trimer to induce membrane fusion and viral entry. Left, virus attaches to the target cell by gp120 (cyan) binding to CD4 (green). Helix collar of gp41, the 4-helix collar gripping the N- and C-termini of gp120. Right, immediate binding by CCR5 (red) prevents rapid dissociation between gp120 and CD4, stabilizes the CD4-induced conformational changes within the Env trimer and also brings the trimer close to the cell membrane. Simultaneous binding of gp120 to both CD4 and CCR5 may require bending in the cell membrane. The fusion peptide (magenta) of gp41 (gray) flips out due to intrinsic conformational dynamics, allowing bending back of the N- and C-termini of gp120, which blocks the fusion peptide from resuming its original position in the trimer. The movements of the fusion peptide and gp120 termini effectively weakens the non-covalent association between the two subunits and may lead to partial or complete dissociation of gp120 and a series of refolding events in gp41 to adopt the prehairpin intermediate conformation with the fusion peptides inserting into the target cell membrane. Extended helix in gp41, three helices in the fusion intermediate conformation of gp41.
If the rearrangement of gp120 termini to activate gp41 does not depend on CCR5 binding, then why would a coreceptor still be needed? First, premature gp120 dissociation, in absence of a coreceptor, would be non-productive; for a virion attached to the target cell surface only through an Env trimer-CD4 contact, the distance between the fusion peptide and membrane surface can be ~160Å (Fig. 4b). If gp120 dissociates, the fusion peptide would be too far from reaching the target membrane. Binding of gp120 to CCR5 can bring the fusion peptide of gp41 within 70Å (Fig. 4b), consistent with the distance needed for the fusion peptide to translocate and reach the target membrane[5]. Second, gp120-CD4 association, measured by single-molecule force spectroscopy with infectious virions and live host cells, is unstable and rapidly reversible unless CCR5 binding immediately follows[42,43]. CCR5 is therefore needed to stabilize the CD4-induced conformational changes, which are already competent for promoting fusion. In particular, tucking away V3 by CCR5 will prevent the Env trimer from moving back to the prefusion conformation and help shift the equilibrium towards the irreversible step - gp41 refolding. Third, membrane fusion may require more than one Env trimer to induce fusion pore formation[44], as shown for other viral fusion proteins[45]. Since the number of Env trimers on the virion is low (~14/virion; ref[46]), a long lifetime of Env-receptor complex would be important for recruiting additional, CD4- and coreceptor-primed trimers. Thus our structure shows how coreceptor can be essential for membrane fusion, even though it does not actively induce gp41 refolding.
Coreceptor switch
The switch from CCR5 to CXCR4 is often associated with an accelerated increase in viral load and decrease in CD4+ T cells, as well as with faster disease progression[12,13]. Our structure supports a model for how simple mutations in Env can achieve this seemingly complicated transition. First, the coreceptor is required only to stabilize the CD4-induced conformation, not to trigger additional allosteric changes in Env through specific interactions. The switch can thus be accomplished if the V3 loop gains sufficient affinity for CXCR4 and does not release the coreceptor, as no specific mutations in Env are needed to make it “triggerable” by CXCR4. Second, the overall dimensions of the CRS2 pocket in the gp120-bound CCR5 and the liganded CXCR4 are very similar (ref[14,15]; Extended Data Fig. 1), suggesting that changes of the surface-exposed residues in V3 to make it compatible with CXCR4 binding would be sufficient. Use of both coreceptors by R5X4 isolates further underscores the similarities between the two[11]. Third, the main contacts between gp120 and the CCR5 N-terminus are electrostatic. CXCR4 has seven acidic residues in its N-terminus (before the first disulfide-forming Cys residue) while CCR5 has four, in addition to two or three sulfotyrosines. No additional mutations in the bridging sheet region would be needed if the extra acidic residues in CXCR4 can replace the two critical sulfated tyrosines in CCR5. Finally, X4 V3 loops generally have more positive charges than those of R5 viruses, consistent with a more negatively charged CRS2 in CXCR4 than CCR5. Evolution from CCR5 to CXCR4 usage indeed can be achieved by multiple mutational pathways, but often with gaining net positive charges in the V3 region[47-49].
CCR5 antagonist-based therapeutics
The V3 binding site only partially overlaps with the minor subpocket of the maraviroc binding site (Fig. 3a). Maraviroc-resistant viruses can emerge either in infected individuals under the treatment or by in vitro selection. Major changes in the escaped viruses map to the V3 region, some with a three-residue deletion, but they show no consistent patterns[50-52]. Some resistant viruses can infect by recognizing the drug-bound CCR5[50,53]. In addition, replication-competent HIV-2 viruses have been selected that lack the entire V3 loop[54]. They can use both coreceptors for entry by gaining additional positively charged residues near the bridging sheet to enhance interactions with the coreceptor N-terminus. These data indicate that no specific structural determinants in V3 are required for the coreceptor to function, fully consistent with our conclusion that CCR5 does not actively trigger gp41 through specific interactions with gp120Our structure also suggests a general strategy for how to improve maraviroc-like therapeutics. Since the V3 loop mainly overlaps with maraviroc in the minor subpocket, which is primarily occupied by the triazole group of the compound, additional groups may be added to the triazole ring to enhance its competing power with the V3 loop and increase the barrier to drug resistance.
Constructs and stable cell lines
The gene of the intact human C-C chemokine receptor type 5 (CCR5; NCBI Reference Sequence: NP_000570.1) was cloned into pCMV-IRES-puro vector (Codex BioSolutions, Inc, Gaithersburg, MD). Genes of HIV-1 gp120 (residues 1-507) from the isolate 92BR020 with a C-terminal 6xhistidine tag, and of 4 domain CD4 (residues 1-388) with a C-terminal twin strep tag [(GGGGS)2WSHPQFEK(GGGGS)2WSHPQFEK)] were synthesized by GenScript (Piscataway, NJ) and cloned into pCMV-IRES-puro vector. 293T cell lines (Thermo Fisher Scientific, Waltham, MA) stably transfected with these constructs were generated either in-house or at Codex Biosolutions. Briefly, 8×105 HEK-293T cells in 2 ml of DMEM containing 10% FBS and no antibiotics were seeded on a 6 well-plate and incubated for overnight. The cells were then transfected with the expression constructs using DNA-In® 293 Transfection Reagent (MTI-GlobalStem, Gaithersburg, MD), following a protocol recommended by manufacturer. 24 hours post-transfection, the cells were transferred to a medium containing DMEM, 10% FBS and 1 μg/ml puromycin for selection. Single colonies were picked in 2-3 weeks, and transferred to 24-well plates in the same selective medium. Protein expression was confirmed by both western blot and a fluorescence-activated cell sorting (FACS) assay (see below). Positive clones were expanded, frozen and stored in liquid nitrogen. To grow cells in large-scale in suspension, we also generated stable cell lines expressing CCR5 with Expi293F cells (Thermo Fisher Scientific). Hybridoma cells for production of an anti-V3 antibody 447-52D[55] was kindly provided by Dr. Susan Zolla-Pazner, New York University.
HIV-1 gp120 of the isolate 92BR020.
Cells expressing C-terminal his-tagged 92BR020 gp120 were grown in 250 ml roller bottles with DMEM containing 10% FBS and 1 μg/ml puromycin. The protein was purified by affinity chromatography using Ni-NTA agarose (Qiagen, Hilden, Germany), followed by gel filtration chromatography, as described previously[56,57]. The peak fractions were pooled and concentrated to 10 mg/ml using a 10 kDa MWCO Millipore filter (MilliporeSigma, Burlington, MA).
Soluble CD4.
Cells expressing strep-tagged CD4 were grown in 250 ml roller bottles with DMEM containing 10% FBS and 1 μg/ml puromycin. Once the cells reached ~70% confluence, the medium was replaced with 293T serum free expression medium. After 5 days, cell supernatants were harvested and loaded onto a Strep-Tactin Sepharose (IBA Lifesciences, Goettingen, Germany) column. The column was then washed with 100 mM Tris-HCl, pH 8.0 and 150 mM NaCl. The protein was eluted with 100 mM Tris-HCl, pH 8.0, 150 mM NaCl and 2.5 mM desthiobiotin (IBA). Eluted fractions were analyzed by SDS-PAGE, and peak fractions containing CD4 were pooled and dialyzed against 100 mM Tris-HCl, pH 8.0 and 150 mM NaCl using a dialysis tubing with 6-8 kDa MWCO (Spectrum Laboratories, Inc., Rancho Dominguez, CA). The protein was further purified by gel filtration chromatography using a Superdex 200 column (GE healthcare, Chicago, IL) in a buffer containing 30 mM Tris-HCl, pH 8.0, and 150 mM NaCl. The peak fractions were pooled and concentrated to 10 mg/ml using 10 kDa MWCO Millipore filters.
Anti-V3 antibody 447-52D.
The hybridoma cells expressing anti-V3 antibody 447-52D were gown in roller bottles with RPMI medium supplemented with ultra low IgG FBS (Thermo Fisher Scientific). The antibody was purified by affinity chromatography using GammaBind Plus Sepharose (GE Healthcare) as described previously[57,58]. The eluted fractions were analyzed by SDS-PAGE and those containing the antibody were pooled, concentrated, frozen in liquid nitrogen and stored at -80 °C.
Western blot and flow cytometry
Western blot was performed using an anti-CCR5 antibody following a protocol described previously[24]. To test CCR5 cell-surface expression and its binding to gp120, flow cytometry was carried out as described previously[24,59]. Briefly, CCR5 was detected by PE Mouse antihuman CD195 (BD Biosciences, San Jose, CA) and bound-gp120 was stained with Anti-His tagged PE conjugated Mouse IgG (R&D Systems, Minneapolis, MN). Extensive washing of the gp120-bound cells with PBS or leaving them in PBS up to 4 hours did not change the fluorescence signals, suggesting that gp120 dissociates very slowly from CCR5 on the cell surfaces. Control experiments were carried out to ensure the binding specificity and all experiments were repeated at least three times with almost identical results.
CCR5 functional assays
When CCR5 is activated by its chemokine ligands, its intracellular regions can interact mainly with the inhibitory heterotrimeric G protein Gαi and coupling of CCR5 to Gαi reduces the intracellular concentration of the secondary messenger cyclic AMP (cAMP)[60]. We measured changes in the intracellular cAMP concentration in these cells when CCR5 was activated by chemokine RANTES using ACTOne technology (ref[61]; Codex Biosolutions). Briefly, 293 cells or CCR5 expressing cells were transfected with pcDNA3.2-cyclic nucleotide-gated (CNG) channel gene using Lipofectamine 3000 reagent (Thermo Fisher Scientifics). ~24 hours posttransfection, the cells were transferred to a 384-well black clear plate at the density of 1.2×104 cells/well in 20 μl culture medium. On day 3, the cells were treated with different concentrations of CCL5/RANTES, CD4, gp120 or the gp120-CD4 complex in the presence of 25 μM Ro20-1724 (4-(3-Butoxy-4-methoxybenzyl)imidazolidin-2-one; a phosphodiesterase 4 inhibitor) and 500 nM 5’-N-Ethylcarboxamidoadenosine (NECA; an endogenous adenosine A2b receptor agonist that increases cellular cAMP levels). Changes in the cAMP concentration in live cells directly correlate with the ion flux through CNG channel, which was measured by a fluorescent plate reader Hamamatsu FDSS 7000 after staining with the ACTOne membrane potential dye (Codex Biosolutions). Similar results were obtained for both the 293T-CCR5 and Expi293F-CCR5 stable cell lines.It has been reported that HIV-1 gp120 or viral particles can induce CCR5 or CXCR4 cell signaling measured as calcium flux in primary unstimulated CD4+ T cells[62]. To test calcium flux in our 293T-CCR5 stable cells in response to gp120, we seeded the cells in a 384-well black clear plate at the density of 1.5×104 cells/well in 20 μl culture medium. On the second day, 20 μl of 1x Non-Wash Calcium Dye solution (Codex BioSolutions) was added to each well. The plate was then incubated in a CO2 incubator at 37°C for one hour. The different concentrations of various ligands were prepared in 1x Hank’s balanced salt solution (HBSS) with 20 mM HEPES (pH 7.4) (at the 5x of the final concentration). The fluorescent intensity of each well was recorded on the FDSS 7000 reader at a rate of 1 image/sec with an excitation wavelength of 480 nM and an emission wavelength of 540 nM. The base line of each well was recorded for 10 seconds before addition of 10 μl of a prepared ligand at 5x of the final concentration. Fluorescent intensity was recorded for additional 390 seconds.
Purification of the CD4-gp120-CCR5 complex
We first screened many different HIV-1 gp120s for binding to the cell-surface-expressed CCR5 in the presence of soluble CD4 by flow cytometry and small-scale purification in detergent. The 92BR020 gp120 was chosen based on not only the stability of its CCR5 complex on the cell surfaces, but also the stability of the purified complex in detergent. For large-scale protein purification, we grew Expi293F stable cells stably transfected with the CCR5 expression construct in suspension. The cells were harvested at the density of ~5-7×106/ml by centrifugation (4,000 rpm at 4°C for 30 min). The cell pellets were washed with PBS, re-suspended in PBS supplemented with 1% BSA, and incubated with the preformed CD4-gp120 complex for one and half hour at 4°C. The cells were then washed three times with PBS, resuspended in a lysis buffer containing100 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1 mM EDTA, 1% lauryl maltose neopentyl glycol (w/v) (LMNG, Anatrace, Maumee, OH), 0.2 % cholesteryl hemisuccinate (w/v) (CHS, Anatrace), EDTA-free complete protease inhibitor cocktail (Roche, Basel, Switzerland), and incubated at 4°C for one hour. The supernatants were collected after centrifugation (18,000 rpm at 4°C for 60 min), and then loaded on a strep-tactin column equilibrated with the lysis buffer. The column was washed with 5 column volumes of a washing buffer (100 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1 mM EDTA, 0.5% LMNG (w/v), 0.04 % CHS (w/v)), followed by 50 column volumes of another washing buffer (100 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1 mM EDTA, 0.001% LMNG (w/v), 0.1% n-Dodecyl-β-D-Maltopyranoside (DDM, Anatrace) (w/v), 0.04 % CHS (w/v)). The CCR5 complex was eluted by an elution buffer containing 100 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1 mM EDTA, 0.001% LMNG (w/v), 0.1% DDM (w/v), 0.04 % CHS (w/v), and 5 mM desthiobiotin. Elution fractions were analyzed by SDS-PAGE and those containing the CCR5 complex were pooled.The anti-V3 antibody 447-52D only binds the CD4-gp120 complex with the V3 loop exposed, but not the CD4-gp120-CCR5 complex with the V3 protected by possible interactions with the coreceptor. To remove extra CD4-gp120 complex, 447-52D was loaded onto the GammaBind Plus column, which was subsequently washed with 50 column volumes of PBS to remove unbound antibody. The column was then equilibrated with the elution buffer for the strep-tactin column. The pooled fractions containing the CCR5 complex from the strep-tag purification were loaded onto the 447-52D column. The flowthrough was collected and then concentrated using an Amicon Ultra Centrifugal Filter (MWCO 100 kDa). The complex was further purified by size-exclusion chromatography on a Sepharose 6 10/300 column (GE Healthcare) in a buffer containing 100 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1 mM EDTA, 0.001% LMNG (w/v), 0.025% DDM (w/v), and 0.04 % CHS (w/v). Eluted fractions containing the CCR5 complex were pooled and concentrated. The complex could be concentrated at least 3 mg/ml without any aggregation on a size-exclusion column.
Negative stain EM
To prepare grids, 2.5 μl of the freshly purified CCR5 complex was adsorbed to a glow-discharged carbon-coated copper grid, washed with deionized water, and stained with freshly prepared 0.75% uranyl formate. Images were recorded at room temperature with a magnification of 52,000x and a defocus value of 1.5 μm following low-dose procedures, using a Philips Tecnai F20 electron microscope (Thermo Fisher Scientific) equipped with a Gatan US4000 CCD camera and operated at voltage of 200 kV. Particles were picked manually and 2D class averages were generated by using EMAN2 software package[63]. These 2D averages were used to generate a 3D initial model starting from a density of isotropic Gaussian distribution[64].
Cryo-EM sample preparation and data collection
To prepare cryo grids, 3 μl of the freshly purified CD4-gp120-CCR5 complex at 1.0 mg/ml was applied to a 1.2/1.3 Quantifoil grid (Quantifoil Micro Tools GmbH, Großlöbichau, Germany), which had been glow-discharged for 90 sec at 20 mA. Grids were immediately plunge-frozen in liquid ethane using a Vitrobot (Thermo Fisher Scientific) with a blotting time of 4 sec. The grids were first screened for ice thickness and particle distribution using a Talos Arctica transmission electron microscope (Thermo Fisher Scientific) operated at 200 kV and equipped with a K2 Summit direct detector (Gatan). For data collection, images were acquired with selected grids using a Titan Krios transmission electron microscope (Thermo Fisher Scientific) operated at 300 kV with a K2 detector. Automated data collection was carried out using SerialEM[65] at a magnification of 130,000× and the K2 detector in super-resolution counting mode (pixel size: 0.529Å) at a dose rate of ~6 electrons/physical pixels/s. Each movie had a total accumulated exposure of ~46 e/Å[2] fractionated in 35 frames of 200 ms. Total of four datasets were acquired in different sessions using a defocus range of 1.0-2.8 μm.
Image processing and 3D reconstructions
For cryo-EM data, drift correction was performed using MotionCor2[66] and images were binned 2×2 by Fourier cropping to a pixel size of 1.059Å. The contrast transfer function (CTF) was estimated by CTFFIND4[67] using motion-corrected sums without dose-weighting. Motion-corrected sums with dose-weighting were used for all other image processing. RELION 2.1[26] was used for particle picking, 2D classification, 3D classification and refinement procedures. Around 2,000 particles were manually picked and classified by 2D classification to generate the templates for automatic particle picking. After the manual inspection of auto-picked particles, a total of 1,707,675 particles were extracted from 9,776 selected images (out of 10,530 movie stacks in total collected in four sessions). These particles were subjected to 2D classification in three groups, giving a total of 1,546,032 particles. The low-resolution negative stain reconstruction of the complex was low-pass filtered to 60Å resolution, and used as the initial model for 3D classification. A total of 691,508 particles from 3D classes that show clear structural features were combined and subjected to 3D refinement which led to a reconstruction at 4.6Å resolution. Local refinement with a mask to exclude the last two domains (D3-D4) of CD4 improved the overall resolution to 4.0Å. Cryo-EM particles were then subjected to 3D classification with signal subtraction, focusing on gp120, the first domain of CD4 and half of the CCR5. The best class with the highest resolution contained 307,346 particles and 3D refinement produced a map at 4.5Å resolution. Further local refinement using a mask to exclude the last two domains of CD4 led to a final map at 3.9Å resolution.Reported resolutions are based on the gold-standard Fourier shell correlation (FSC) using the 0.143 criterion, calculated with “SAM (Simplified Application Managing Utilities for EM Labs) scripts of the Liao lab (https://liao.hms.harvard.edu/samuel). All density maps were corrected for the modulation transfer function (MTF) of the K2 summit direct detector and then sharpened by applying temperature factor that was estimated using post-processing in RELION. Local resolution was determined using LocalRes in RELION with half-reconstructions as input maps.
Model building
The initial model of gp120 was a homology model calculated by I-TASSER[68], using the cryo-EM structure of gp120 from the CD4-bound SOSIP trimer (pdb ID: 5VN3) as a template. The crystal structures of CD4 (pdb ID: 1WIO) and CCR5 (pdb ID: 5WIU) were also used as initial templates for model building. Several rounds of manual building were performed in coot[69]. The model was finalized by refinement in Phenix[70] against the 3.9Å cryoEM map. The refinement statistics are summarized in Extended Data Table 1.
Extended Data Table 1.
Cryo-EM data collection, refinement and validation statistics.
CCR5 and CXCR4 were identified as the coreceptors for HIV-1 entry in 1996[71-77]. (a)-(b) Crystal structures of a modified CCR5 (ΔC224-N226→rubredoxin; ΔF320-L352; and point mutations C58Y, G163N, A233D, K303E) in complex with HIV entry inhibitor maraviroc (pdb ID: 4MBS; ref[7]) and a modified chemokine [5P7]CCL5 (an antagonist; pdb ID: 5UIW; ref[10]). CCR5 is shown in ribbon diagram in blue, the internally fused rubredoxin in magenta and the ligands in yellow. N-terminus (N), C-terminus (C) and the second extracellular loop (ECL2) are indicated. (c)-(e) Crystal structures of an engineered CXCR4 in complex with a viral chemokine antagonist vMIP-II (pdb ID: 4RWS; ref[15]), a small molecule antagonist IT1t (pdb ID: 3ODU; ref[14]) and a cyclic peptide antagonist CVX15 (pdb ID: 3OE0; ref[14]). CXCR4 is shown in green, the fused T4 lysozyme in magenta and the ligands in yellow. N, N-terminus; C, C-terminus; and ECL2, the second extracellular loop.
Characterization of stable 293 cell lines expressing wildtype human CCR5.
(a) Chemokine receptor assay. 293T and 293T-CCR5 (stable) cells were treated with different concentrations of CCL5/RANTES. Ft/F0 is a fluorescence-signal ratio proportional to that of intracellular cAMP concentration at 40 min post CCL5/RANTES-activation and time 0. The dose response curves were plotted for both 293T (black) and 293T-CCR5 (red) cells. The experiment was carried out in quadruplicate and repeated at least three times with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function. (b) Flow cytometry histograms of HIV-1 gp120 binding to CCR5 expressed on the cell surfaces in the absence (orange) or presence (red) of soluble CD4. 293T cells (black), CCR5-expressing cells only (gray) and CCR5-expressing cells with soluble CD4 only (blue) were negative controls. The experiment was repeated independently at least twice with similar results. (c) HIV-1 Env mediated cell-cell fusion. 293T cells stably transfected with CCR5 were mixed with HIV-1 Env (gp160) expressing cells in the absence or presence of soluble CD4. The CCR5 cells fuse with CD4-triggered Env cells very efficiently and form large syncytia that almost cover the entire well. The experiment was repeated independently twice with similar results. (d) Chemokine receptor assay by various ligands. As in (a), Expi293F and Expi293F-CCR5 (stable) cells were treated with CCL5/RANTES, gp120, CD4 or the complex of gp120 and CD4. The dose response curves were plotted for both Expi293F (control, left) and Expi293F-CCR5 (right) cells with different ligands as indicated. The experiment was carried out in quadruplicate and repeated at least three times with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function. (e) Left, kinetic curves of 5 representative wells of 293T-CCR5 cells treated with 5 different ligands as indicated. ATP activates the endogenous Gq coupled GPCR - P2Y receptor, as a positive control. Ratio, fluorescence intensity/baseline intensity. Right, dose response curve of each ligand. The y-axis is a background-subtracted ratio (peak fluorescent intensity ratio - 1). We conclude that our gp120 and gp120-CD4 do not activate G-protein mediated calcium flux at the concentrations tested here. The experiment was carried out in quadruplicate and repeated twice with similar results. Error bars indicate the standard deviation calculated by the Excel STDEV function.
Purification of the CD4-gp120-CCR5 complex.
(a) Schematic representation of expression constructs for HIV-1 gp120, human CCR5 and CD4. Segments of gp120 are designated as follows: C1-C5, conserved regions 1-5; V1-V5, variable regions 1-5; and His-tag, a six-histidine tag. Tree-like symbols represent glycans. Those for CCR5 include: N, N-terminus; TM1-7, transmembrane helices 1-7; ECL1-3, extracellular loop 1-3; ICL3, intracellular loop 1-3; and CT, cytoplasmic tail. For CD4, they are: D1-4, immunoglobulin (Ig) domain 1-4; and strep tag, a purification tag. TM (transmembrane segment) and CT (cytoplasmic tail) in gray are truncated in the expression construct. (b) Unmodified human CCR5 in complex with HIV-1 gp120 and 4D CD4 was purified by the following steps: 1) complex formation, HIV-1 gp120 (green) and strep-tagged, 4 domain CD4 (light blue) were incubated with CCR5 (magenta) expressed cells to allow formation of the CD4-gp120-CCR5 complex on cell surfaces; 2) strep-tag purification, the CCR5 complex, as well as some CD4-gp120 complex, were captured to strep-tactin resin via the strep-tagged CD4 (strep tag in yellow). They were eluted by D-desthiobiotin under mild conditions; 3) negative selection by an anti-V3 antibody to remove the CD4-gp120 complex. The CCR5 complex was further purified by size-exclusion chromatography. (c) The purified CD4-gp120-CCR5 complex was resolved by gel-filtration chromatography on a Superose 6 column in the presence of detergent LMNG. The molecular weight standards include thyoglobulin (670 kDa), ferritin (440 kDa), γ-globulin (158 kDa) and ovalbumin (44 kDa). The expected size of the CCR5 complex is ~310 kDa (120 kDa for gp120, 50 kDa for 4D CD4, 40 kDa for CCR5 and ~100 kDa for LMNG micelle). Peak fractions were analyzed by Coomassie stained SDS-PAGE (lanes 1-3). Labeled bands were confirmed by western blot and protein sequencing. The experiment was repeated independently at least 15 times with similar results.
Characterization of the CD4-gp120-CCR5 complex by EM.
(a) Representative image of the CD4-gp120-CCR5 complex in negative stain. The experiment was repeated independently at least 4 times with similar results. (b) 2D averages of the negatively stained CD4-gp120-CCR5 complex. The box size of 2D averages is ~330Å. (c) 3D reconstruction of the negatively stained CD4-gp120-CCR5 complex, fitted with a gp120 structure containing an extended V3 loop (pdb ID: 2QAD; ref[20]), 4D CD4 (pdb ID: 1WIO) and CCR5 (pdb ID: 4MBS). (d) A representative cryo-EM image of the 4D CD4-gp120-CCR5 complex. The scale bar represents 25 nm. Five independent large data sets were collected with similar results. (e) 2D averages of the cryo-EM particle images show secondary structural features for both gp120 and CCR5.
Single-particle cryo-EM analysis of the CD4-gp120-CCR5 complex.
(a) Data processing workflow for the CD4-gp120-CCR5 complex. (b) 3D reconstructions of the CD4-gp120-CCR5 complex refined with no mask at an overall resolution of 4.5Å (left) and with a mask to exclude the last two domains of CD4 at a resolution of 3.9Å (right) are colored according to local resolution estimated by RELION. (c) Angular distribution of the cryo-EM particles used in the reconstruction was also shown in respect to both the side and top views of the EM map. (d) Gold standard FSC curves of the unmasked and masked EM reconstructions shown in (b).
Gallery of representative density for the CD4-gp120-CCR5 complex.
Representative density in gray mesh from the 3.9Å resolution EM map is shown for TM1-7, the N-terminus of CCR5, ELC3 near TM6, Tys10, Tys14 and Tyr15 (red model); two V3 regions; and for helix α1, N-terminus, V3 loop, the bridging sheet and N-linked glycan at Asn262 of gp120 (cyan model).
Comparison of conformations of V3 loop and [5P7]CCL5 in complex with CCR5, as well as of gp120-bound CCR5 and G protein-bound β2 adrenergic receptor (β2AR).
(a) The structures of the CD4-gp120-CCR5 and [5P7]CCL5-CCR5 complexes are superposed on CCR5 (red). Gp120 V3 loop with its Pro311 in stick model is in cyan and [5P7]CCL5 with its Pro3 in stick model in yellow. Residues 309-316 of the V3 loop and residues 1-8 of [5P7]CCL5 adopt a very similar structure are highlighted in a rectangular box. (b) Superposition of the structures of the N-terminus of the gp120-bound CCR5 (red) and the CDR H3 loop of antibody 412d in complex with gp120 core (green). The EM density of the CD4-gp120-CCR5 is shown in gray. The positions of the sulfated tyrosine (Tys) residues, including Tys10 and Tys14 from CCR5; Tys100 and Tys100c from 412d, are indicated. (c) A model for interactions of three CD4 and three CCR5 with the SOSIP Env trimer. The side and bottom views of a composite structure of the CD4-CCR5-SOSIP Env trimer complex are shown. The model was generated using the CD4-bound SOSIP trimer (pdb ID:5VN3) and the structure of the CD4-gp120-CCR5 complex from this study. All the structures were aligned based on the gp120 core region. CCR5 is shown in red, CD4 in green, gp120 in blue and gp120 of SOSIP dark blue; gp41 of SOSIP gray. The crystallographic dimer of CCR5 was also shown on left only in a rectangular box using pdb ID: 4MBS. The observed crystallographic dimer of CCR5 or the TM5-mediated dimer by modeling does not seem to be relevant to binding to either monomeric or trimeric gp120[7,78]. (d) Superposition of the structures of the gp120-bound CCR5 (red) and the Gs protein-bound β2 adrenergic receptor (blue). The position of TM6, critical for GPCR activation, is indicated.
Comparison of conformations of different monomeric gp120s and various V3 loops.
(a) Comparison of structures of an unliganded gp120 core (pdb ID:4OLV; purple), a CD4-bound monomeric gp120 core with the V3 loop (pdb ID:2QAD; blue) and gp120 in complex with CD4 and CCR5 from this study (cyan). The gp120 core region is marked by a circle with a diameter of 50Å. N- and C-termini, V1V2 stem, V3 stem or loop and bridging sheet are indicated. (b) Representative conformations that an HIV-1 V3 loop can adopt. From left to right: V3 loop in the unliganded SOSIP BG505 Env trimer (pdb ID: 4ZMJ); the first V3-containing gp120 core in complex with CD4 and antibody X5 (pdb ID: 2B4C; ref[29]); CD4- and 412d-bound monomeric gp120 core with V3 (pdb ID: 2QAD); CCR5-bound intact gp120 (this study); and V3 peptide in complex with antibody 447-52D (pdb ID: 3GHB; ref[36]); antibody 268-D (pdb ID: 3GO1; ref[37]); antibody 2557 (pdb ID: 3MLV; ref[37]); antibody 10A37 (pdb ID: 5V6L; ref[38]). The root-mean-square deviation (RMSD) of each structure, except for 5V6L, relative to the CCR5-bound gp120 monomer is shown at the bottom in parenthesis.
Model for HIV-1 Env activation to induce membrane fusion.
A hypothesis of how cellular receptors CD4 and CCR5 trigger HIV-1 Env trimer to induce membrane fusion and viral entry. Left, virus attaches to the target cell by gp120 (cyan) binding to CD4 (green). Helix collar of gp41, the 4-helix collar gripping the N- and C-termini of gp120. Right, immediate binding by CCR5 (red) prevents rapid dissociation between gp120 and CD4, stabilizes the CD4-induced conformational changes within the Env trimer and also brings the trimer close to the cell membrane. Simultaneous binding of gp120 to both CD4 and CCR5 may require bending in the cell membrane. The fusion peptide (magenta) of gp41 (gray) flips out due to intrinsic conformational dynamics, allowing bending back of the N- and C-termini of gp120, which blocks the fusion peptide from resuming its original position in the trimer. The movements of the fusion peptide and gp120 termini effectively weakens the non-covalent association between the two subunits and may lead to partial or complete dissociation of gp120 and a series of refolding events in gp41 to adopt the prehairpin intermediate conformation with the fusion peptides inserting into the target cell membrane. Extended helix in gp41, three helices in the fusion intermediate conformation of gp41.Cryo-EM data collection, refinement and validation statistics.
Authors: B J Doranz; Z H Lu; J Rucker; T Y Zhang; M Sharron; Y H Cen; Z X Wang; H H Guo; J G Du; M A Accavitti; R W Doms; S C Peiper Journal: J Virol Date: 1997-09 Impact factor: 5.103
Authors: Maolin Lu; Xiaochu Ma; Nick Reichard; Daniel S Terry; James Arthos; Amos B Smith; Joseph G Sodroski; Scott C Blanchard; Walther Mothes Journal: J Virol Date: 2020-08-17 Impact factor: 5.103
Authors: Carlos A Rico; Yamina A Berchiche; Mizuho Horioka; Jennifer C Peeler; Emily Lorenzen; He Tian; Manija A Kazmi; Alexandre Fürstenberg; Hubert Gaertner; Oliver Hartley; Thomas P Sakmar; Thomas Huber Journal: Biophys J Date: 2019-08-02 Impact factor: 4.033
Authors: Yi Wang; Pavanjeet Kaur; Zhen-Yu J Sun; Mostafa A Elbahnasawy; Zahra Hayati; Zhi-Song Qiao; Nhat N Bui; Camila Chile; Mahmoud L Nasr; Gerhard Wagner; Jia-Huai Wang; Likai Song; Ellis L Reinherz; Mikyung Kim Journal: Proc Natl Acad Sci U S A Date: 2019-10-17 Impact factor: 11.205
Authors: Chandan Upadhyaya; Xuanmao Jiao; Anthony Ashton; Kishan Patel; Andrew V Kossenkov; Richard G Pestell Journal: Adv Cancer Res Date: 2020-01-10 Impact factor: 6.242
Authors: Ze Li; Wenwei Li; Maolin Lu; Julian Bess; Cara W Chao; Jason Gorman; Daniel S Terry; Baoshan Zhang; Tongqing Zhou; Scott C Blanchard; Peter D Kwong; Jeffrey D Lifson; Walther Mothes; Jun Liu Journal: Nat Struct Mol Biol Date: 2020-06-29 Impact factor: 15.369