Takahide Kouno1, Elizabeth M Luengas2, Megumi Shigematsu3, Shivender M D Shandilya4, JingYing Zhang3, Luan Chen3, Mayuko Hara3, Celia A Schiffer4, Reuben S Harris2, Hiroshi Matsuo3. 1. 1] Biochemistry, Molecular Biology and Biophysics Department, Institute for Molecular Virology, University of Minnesota, Minneapolis, Minnesota, USA. [2] Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, USA. 2. Biochemistry, Molecular Biology and Biophysics Department, Masonic Cancer Center, Center for Genome Engineering, Institute for Molecular Virology, University of Minnesota, Minneapolis, Minnesota, USA. 3. Biochemistry, Molecular Biology and Biophysics Department, Institute for Molecular Virology, University of Minnesota, Minneapolis, Minnesota, USA. 4. Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, Massachusetts, USA.
Abstract
The human APOBEC3G (A3G) DNA cytosine deaminase restricts and hypermutates DNA-based parasites including HIV-1. The viral infectivity factor (Vif) prevents restriction by triggering A3G degradation. Although the structure of the A3G catalytic domain is known, the structure of the N-terminal Vif-binding domain has proven more elusive. Here, we used evolution- and structure-guided mutagenesis to solubilize the Vif-binding domain of A3G, thus permitting structural determination by NMR spectroscopy. A smaller zinc-coordinating pocket and altered helical packing distinguish the structure from previous catalytic-domain structures and help to explain the reported inactivity of this domain. This soluble A3G N-terminal domain is bound by Vif; this enabled mutagenesis and biochemical experiments, which identified a unique Vif-interacting surface formed by the α1-β1, β2-α2 and β4-α4 loops. This structure sheds new light on the Vif-A3G interaction and provides critical information for future drug development.
The humanAPOBEC3G (A3G) DNA cytosine deaminase restricts and hypermutates DNA-based parasites including HIV-1. The viral infectivity factor (Vif) prevents restriction by triggering A3G degradation. Although the structure of the A3G catalytic domain is known, the structure of the N-terminal Vif-binding domain has proven more elusive. Here, we used evolution- and structure-guided mutagenesis to solubilize the Vif-binding domain of A3G, thus permitting structural determination by NMR spectroscopy. A smaller zinc-coordinating pocket and altered helical packing distinguish the structure from previous catalytic-domain structures and help to explain the reported inactivity of this domain. This soluble A3G N-terminal domain is bound by Vif; this enabled mutagenesis and biochemical experiments, which identified a unique Vif-interacting surface formed by the α1-β1, β2-α2 and β4-α4 loops. This structure sheds new light on the Vif-A3G interaction and provides critical information for future drug development.
APOBEC3 proteins are single-stranded DNA cytosine deaminases that are part of a larger network of innate immune effector proteins that serve to limit the replication of many diverse parasitic elements including the AIDS virusHIV-1[1-3]. Up to seven family members, APOBEC3A, B, C, D, F, G, and H, can be expressed in a single human cell[4]. A combination of overexpression, knockdown, knockout, and separation-of-function studies have shown that A3D, A3F, A3G, and A3H contribute to HIV-1 restriction and hypermutation in CD4+ T lymphocytes (recent publications[5-7] and references therein). A3D, A3F, and A3H catalyze C-to-U deamination in 5′-TC dinucleotide motifs, whereas A3G catalyzes deamination in 5′-CC motifs. These viral cDNA deamination events lead to G-to-A mutations in hallmark GA and GG dinucleotide motifs respectively. A3G has two structural domains, an N-terminal pseudo-catalytic (NTD) and C-terminal catalytic (CTD) domain[8,9]. The CTD alone dictates the observed local dinucleotide preference[10,11,12]. The NTD is catalytically inert but still strongly binds RNA and single-stranded DNA and forms a ribonucleoprotein complex with the viral Gag protein, enabling encapsidation [e.g.
[13-17]].HIV-1 encodes Vif as a counter-defense mechanism against A3 mediated restriction[18,19]. Vif interacts with core binding factor β (CBF-β), CUL5, ELOB, ELOC, and RBX2 proteins to form an E3 ubiquitin ligase complex[20,21]. As a consequence, A3G and other potentially restrictive A3 enzymes are polyubiquitinated and degraded by proteasome before being encapsidated [e.g.
[18,19,22-25]]. The interaction between Vif and A3G has been studied extensively, and key regions for the interaction have been identified, which include 126-FWDPDYQ-132 of A3G and 22-KSLVK-26, 40-YRHHY-44, 69-YWGL-72 of Vif[26-29]. However, insolubility has limited structural information of the A3G-Vif complex.A crystal structure of the Vif-CBF-β-ELOBC-CUL5 complex was reported recently[30]. Vif has two structural domains called alpha/beta and alpha. A3G-binding residues, including K22, K26, 40-YRHHY-44 and W70, are located in the alpha/beta domain[30]. Although the structure of catalytically active A3 proteins A3A and A3C, as well as the catalytic domains A3GCTD and A3F CTD, were solved by NMR and X-ray crystallography[12,31-38], a structure of the N-terminal pseudo-catalytic, Vif-binding domain of A3G has thus far proven elusive. To address this major gap in knowledge, we have generated a soluble A3G NTD variant (sNTD) that binds the Vif-CBF-β-ElOBC complex. The sNTD has 80% amino acid sequence identity to wildtype A3G NTD. In this paper, we report the three-dimensional structure of sNTD and mutationally interrogate the Vif-binding regions.
RESULTS
Engineering a soluble A3G Vif-binding domain
The wildtype A3G NTD was insoluble in E. coli. To overcome this technical hurdle, we compared the amino acid sequence of A3G NTD with A3A, A3B CTD, and A3GCTD, which were soluble and amenable to structural studies[12,31-34,37]. A3A, A3B CTD, and A3GCTD were catalytically active, but did not interact with Vif, and belong to the same phylogenetic subgroup, i.e., the Z1 group[39] (Fig. 1a). An alignment of A3A, A3B CTD, and A3GCTD allowed us to generate a consensus Z1 amino acid sequence (Supplementary Fig. 1a). We expressed and purified the consensus Z1 protein from E. coli, and found that it was soluble and folded. Wildtype A3G NTD amino acid residues were used to replace consensus Z1 residues one at a time, and all of the resulting constructs were tested for protein expression, solubility, and stability. We used NMR to evaluate soluble derivatives for signal dispersion and homogeneity. Ultimately, 80% amino acid sequence identity to wildtype NTD was achieved without substantially compromising solubility or quality of NMR spectra (Fig. 1b, c). Additional wildtype A3G amino acids were not tolerated (Supplementary Fig. 1b, c). We henceforth refer to this soluble A3G NTD construct as sNTD.
Figure 1
Generation of soluble N-terminal Vif-binding domain of A3G. (a) Amino acid sequence similarities of APOBEC3 proteins. Amino acid sequences of APOBEC3 proteins are aligned by ClustalW[60], and the branch length represents similarity of amino acid sequences. A3 proteins are grouped, as described previously[39]. A3C, A3F CTD and A3G NTD bind HIV-1 Vif, and all of them belong to the Z2 group. (b) Elution pattern of sNTD by size-exclusion chromatography. Elution peaks of Aldolase (158 kDa), Ovalbumine (44kDa) and RNase (14kDa) are shown as markers of retention times. (c) sNTD was purified to a single band in an SDS-PAGE gel. Supplementary Data Set 1 provides uncropped form of the SDS-PAGE gel.
Solution structure of sNTD
We generated a series of 15N, 13C, and/or 2H isotope-labeled protein samples to determine the solution structure of sNTD using state-of-the-art NMR techniques combined with amino acid-specific labeling techniques (representative NMR spectra are shown in Supplementary Fig. 2)[31,40]. Each sample was subjected to size-exclusion chromatography for purification, and the monomeric status was evaluated by the elution profile. sNTD eluted with a major peak corresponding to a molecular weight of 17 kDa and a minor peak corresponding to 47 kDa (Fig. 1b); the theoretical molecular weight of sNTD is 21 kDa. sNTD exhibited fair quality of NMR spectra with most amide proton signals resolved in a transverse relaxation-optimized spectroscopy (TROSY)[41] spectrum, although signal intensities were inhomogeneous, suggesting diverse dynamics throughout sNTD (Supplementary Fig. 3). Using conventional triple resonance NMR spectra[40], we assigned backbone signals (1HN, 15N, 1Hα, 13Cα and 13C′) for 146 residues (81.1%) of sNTD. 24 residues were missing from TROSY spectra due to severe line broadening, suggesting conformational exchange. Nuclear Overhauser effect (NOE) signals and a 1H/2H-exchange experiment depicted sNTD to contain a β-sheet of five β-strands arranged in the order of β2_(54–56)-β1_(36–42)-β3_(85–92)-β4_(113–120)-β5_(148–152) with β2-β1-β3 in an anti-parallel orientation, and β3-β4-β5 in a parallel orientation (Supplementary Fig. 4). 1H, 1H-NOE patterns indicated that β2-β1 form an antiparallel β-sheet, however, only Y37 NH from β1 showed slow 2H exchange with solvent, whereas F54 NH and G56 NH from β2 exchanged completely with solvent within one hour (Supplementary Fig. 4 and data not shown). Thus, hydrogen bonds between F54 NH and V39 CO and G56 NH and Y37 CO were weaker than that between Y37 NH and G56 CO, indicating that the structure and motions of β 2 is likely to be less restrained compared to the other β strands.The sNTD solution structure was determined with 1609 distance and 106 dihedral angle restraints (structural statistics summarized in Table 1). The structure of sNTD has six alpha-helices, including α1_(16–19), α2_(68–75), α3_(98–108), α4_(131–140), α5_(154–164), and α6_(178–194), organized around the β-strands described above (Fig. 2). Secondary structures were well defined, although α1, β2 and α2 were less converged compared to other secondary structures (Fig. 2a). The poorer convergence for β2 was indicative of weaker hydrogen bonding with β1. The loop between β1 and β2 (β1-β2 loop) was disordered because of insufficientNOE restraints. Residues located in the β1-β2 loop (E38, V39, K40, K42, G43, S45, R46) and the α2-β3 loop (Q84, E85, Y86) each showed two weak signals in TROSY spectra (Supplementary Fig. 3). Since the β1-β2 and α2-β3 loops face each other in the sNTD structure, these two loops may exchange between two conformations (Fig. 2b).
Table 1
NMR and refinement statistics for protein structures
Protein
NMR distance and dihedral constraints
Distance constraints
Total NOE
1543
Intra-residue
676
Inter-residue
Sequential (|i – j| = 1)
338
Medium-range (|i – j| < 4)
183
Long-range (|i – j| > 5)
346
Intermolecular
0
Hydrogen bonds
33 (66)
Total dihedral angle restraints
ϕ
106
ψ
0
Structure statistics
Violations (mean ± s.d.)
Distance constraints (Å)
0.07 ± 0.06
Dihedral angle constraints (°)
0.90 ± 0.77
Max. dihedral angle violation (°)
3.70
Max. distance constraint violation (Å)
0.30
Deviations from idealized geometry
Bond lengths (Å)
0.004546 ± 0.000087
Bond angles (°)
0.6737 ± 0.0076
Impropers (°)
0.4877 ± 0.0153
Average pairwise r.m.s. deviation** (Å)
Heavy
1.57 ± 0.17
Backbone
0.82 ± 0.17
Pairwise r.m.s. deviation was calculated among ## refined structures.
Total RDCs
Qfree (%)
Q and Qfree is recommended in the case where there are many RDCs, often though there are not enough to make this statistically meaningful
Figure 2
Solution structure of the Vif-binding domain of A3G. (a) Stereoview of the ten lower-energy calculated sNTD structures superimposed using the backbone atoms (N, Cα, and C) of all residues except those absent in TROSY spectra, including 19–24, 48–49, 55–56, 67–69, 81–86, 112–118. These residues are not shown in this figure. (b) Stereoview of a cartoon representation of the energy-minimized average structure from ten lowest energy calculated sNTD structures. A zinc ion (purple), side chain atoms of zinc coordinating residues, including H65, C97, and C100, and of a conserved glutamic acid, E67, are shown with ball-and-stick representation. (c) Schematic of sNTD secondary structure with helices and strands colored orange and blue, respectively.
The overall structure of sNTD was similar to other A3 proteins, including A3A, A3C, A3F CTD, and A3GCTD[12,31-38] (Fig. 3a). However, the sNTD structure also had several unique features. First, the start of α2 is residue 68 of sNTD, one residue after the conserved glutamic acid residue (E67), whereas in α2’s of catalytically active A3 domains the helix starts one residue prior to the conserved glutamic acid. This structural difference substantially affected the spatial positioning of E67, locating the residue too far from the Zn2+ ion to participate in catalysis (Fig. 3b). Second, the α2 N-terminus was tilted toward the C-terminus of α3, an orientation not seen in any reported A3 catalytic domain structures (Fig. 3c). These two features provided a likely explanation for prior studies that assigned catalytic activity exclusively to the C-terminal domain[8,9,42]. Third, sNTD lacked a bulge structure present in the β2 region of A3A and A3GCTD (Fig. 3d). The β2 strand of sNTD was shorter than in other A3s, and located in the position of β2′ of the β2-bulge-β2′ structure of A3A and A3GCTD (Fig. 3d)[12,31,32,34,37].
Figure 3
Structure comparison of sNTD with other APOBEC3 proteins. (a) Cartoon representations of structures of the energy-minimized average sNTD (this study), A3A (PDB code, 2M65)[37], A3C (3VOW)[38], A3F CTD (4IOU)[35] and A3G CTD (3IR2)[34] with zinc displayed in purple. (b) Enlarged view of the APOBEC3 zinc-binding sites from (a). The side-chains of zinc coordinating residues are rendered with ball-and-stick representation. For each structure, the top faces to the protein interior, and bottom is accessible to solvent. Zinc is shown in gray sphere. (c) Comparison of the α2 and α3 helices. α2 and α3 of sNTD (green) are superimposed to that of another APOBEC3 protein, including A3A (2M65)[37], A3C (3VOW)[38], A3F CTD (4IOU)[35] and A3G CTD (3IR2)[34]. Zinc is displayed in green (sNTD) or gray (other A3s). (d) The β1-bulge-β2 region of sNTD is compared with other APOBEC3 proteins. Each image shows sNTD (green) superimposed on another A3 domain protein (gray).
We substituted 20% of wildtype residues to solubilize NTD. Two thirds of the substituted residues are located in loops, including Δ(M1-R11), Y13D, R14P, Y22N, L62D, Δ(R78), A109Q, D110P, P111T, K112H, F126A, K141A, R142G, Δ(D143-R146), R169G, E170A, L171P, E173Q, N176D and N177G, whereas the remaining substitution are in helices, including F71L, H72S, W73L and F74V in α2, T101A in α3, C139A in α4, P179D, K180E, Y181H, Y182S, I183Q, L184A, H186S, I187G and M189R in α6, with the exception of M149I in β2.
sNTD is catalytically inactive in vitro
Because our solublization strategy started with the consensus sequence of several active deaminases, we next asked if sNTD possessed catalytic activity using one-dimensional 1H NMR spectra[11,12]. As a positive control, catalytically active A3GCTD variant (CTD-2K3A)[31] was incubated with substrate DNA (5′-ATTCCCAATT-3′)[12], and a signal corresponding to the uridine H5 proton emerged within 30 minutes at 5.72 ppm (Supplementary Fig. 5d). This result indicated that CTD-2K3A had catalyzed the first deamination reaction (5′-CCC to 5′-CCU). Within 24 hours, CTD-2K3A completed the second deamination reaction, 5′-CCU to 5′-CUU, resulting two H5 proton signals, one at 5.74ppm and the other at 5.81 ppm (Supplementary Fig. 5e). By contrast, sNTD did not cause changes to the NMR spectrum of the substrate DNA even after 24 hours (Supplementary Fig. 5f). Thus, sNTD lacks detectable deaminase activity, consistent with prior studies demonstrating the catalytic inactivity of this domain[8,9,42].
sNTD binds Vif
We used GST-pull down experiments to monitor interaction between sNTD and Vif-CBF-β-ELOBC (VCBC) co-expressed in E. coli. sNTD bound to the VCBC complex (Fig. 4a, lane 10). H43A and Y44A substitutions in Vif reduced binding to undetectable levels (Fig. 4a, lane 11). Since F126 of wildtype NTD had been substituted to an alanine in sNTD in order to increase solubility, we reversed this mutation and tested binding to the VCBC complex. sNTD(F126) bound more strongly to the VCBC complex than the original sNTD construct (Fig. 4a, lane 8), and this interaction became undetectable when Vif(H43A Y44A) was used instead of wildtype Vif (Fig. 4a, lane 9). The location of A126 appeared to be in the middle of the loop between β4 and α4, where it was exposed to solvent (Fig. 4b). This surface location was important because the A126F substitution was unlikely to alter either secondary or tertiary structural elements.
Figure 4
Functional assays of sNTD. (a) E. coli co-expression and GST-pull down assay of sNTD and the Vif, CBFβ, elongin B, elongin C complex (VCBC complex). Samples are GST-fused A3G CTD (lanes 6 and 7), GST-fused sNTD(F126) (lanes 8 and 9) or GST-fused sNTD (lanes 10 and 11) expressed together with untagged CBF-β, elonginBC and wild type HIV-1 Vif or Vif(H43A Y44A). Lanes 1–5 show reference bands for the expressed protein. (b) Locations of D128 and A126 (displayed in green) in the sNTD structure. (c) Vif-mediated degradation assay using wildtype A3G (A3G) (lanes 1–6), A3G with D128K substitution [A3G(D128K)] (lanes 7–12), sNTD fused with wild type CTD (sNTD-CTD) (lanes 13–18), sNTD with A126F substitution fused with wild type CTD [sNTD(F126)-CTD] (lanes 19–24) and sNTD with A126F and D128K double-substitutions fused with wild type CTD [sNTD(F126 D128K)-CTD] (lanes 25–30). Alpha-isoform of tubulin (α-tubulin) was used as a loading control. Vif (wt) is wildtype Vif, and Vif (H43A Y44A) contains alanine substitutions of H43 and Y44. (d) Single cycle infectivity experiment comparing wildtype A3G and full-length variants with sNTD(F126)-CTD and sNTD-CTD. The histogram reports the infectivity of Vif-deficient (Vifxx) or Vif-proficient (WT) HIV-1 IIIB produced in 293T cells expressing the indicated A3G-myc construct (0, 25, 50, 100 ng) (graph shows mean values ± s. d. n = 3 independent cell cultures). The corresponding immunoblots show expression levels of relevant proteins in cell lysates (anti-myc for A3G-myc, anti-Vif, and anti-Tubulin) and virus-like particles (VLPs; anti-myc for A3G-myc and anti-p24). alpha-isoform of tubulin (α-tubulin) was used as a loading control.
F126 is important for Vif-mediated degradation
We created a construct in which sNTD was fused to wildtype CTD (A3G sNTD-CTD) and tested whether this construct behaves similarly to wildtype full-length A3G in a Vif-mediated degradation assay (Fig. 4c). Wildtype A3G was used as a positive control and, as expected, demonstrated reduced protein levels in 293T cells when co-expressed with Vif (Fig. 4c, lane 3) compared to expression alone (Fig. 4c, lane 1). The A3G protein level was restored by addition of protease inhibitor MG132 (Fig. 4c lane 4). We used a double mutation of H43A and Y44A in Vif (Fig. 4c lane 5) and D128K in A3G (Fig. 4c lane 9) as negative controls because these mutations have been shown to impair Vif-mediated degradation of A3G[18,26,43-47]. Interestingly, A3G sNTD-CTD protein levels were not reduced when co-expressed with Vif (Fig. 4c lane 15). Because sNTD(F126) bound to the VCBC complex more strongly than sNTD in vitro (Fig. 4a lane 8), we prepared a construct with sNTD(F126) fused to wildtype CTD [sNTD(F126)-CTD] and tested its protein stability in a Vif-mediated degradation assay. sNTD(F126)-CTD exhibited reduced protein levels in the presence of Vif (Fig. 4c, lane 21), which were restored by co-expression with Vif(H43A Y44A) (Fig. 4c, lane 23) or when cells were treated with MG132 (Fig. 4c, lane 22). Furthermore, adding the Vif-antagonizing single amino acid substitution, D128K, protected the sNTD(F126 D128K)-CTD construct from being degraded by Vif (Fig. 4c, lane 27). Prior studies have implicated the importance of F126 in Vif-binding and Vif-mediated degradation of A3G[26,28,48]. Thus, these data indicate that a single amino acid reversion (F126) can restore Vif-susceptibility to the A3G sNTD-CTD construct, and that the overall mechanism of Vif-mediated degradation is preserved in this construct.
sNTD lacks antiviral activity
We compared wildtype A3G, sNTD-CTD, and sNTD(F126)-CTD constructs in HIV-1 restriction experiments (Fig. 4d). As reported in previously, wildtype A3G packaged into vif-deficient HIV-1 particles and restricts viral infectivity in a dose dependent manner (Fig. 4d, lanes 1–4). Antiviral activity was strongly counteracted by Vif as evidenced by less A3G in cells and viral particles, and mostly restored viral infectivity (Fig. 4d, lanes 5–8). In contrast, sNTD-CTD resisted Vif-mediated degradation and failed to package substantially into viral particles, despite near wildtype A3G levels of cellular expression (Fig. 4d, compare lanes 17–20 and 21–24). However, as described above in co-expression studies, a single amino acid substitution A126F [(sNTD(F126)-CTD], restored the susceptibility of this construct to Vif-mediated degradation but did not restore packaging and restriction activities (Fig. 4d, compare lanes 9–12 and 13–16). These data confirmed that the sNTD(F126)-CTD construct is functional for Vif degradation but compromised for packaging and restriction. Because packaging requires the formation of a ribonucleoprotein complex with Gag [e.g.
[13-17]] and similar regions of A3G have been implicated in both the Vif interaction and RNA binding, the sNTD(F126) variant appeared to separate these two activities by preserving the capacity to be bound and degraded by Vif and losing the capacity to bind RNA (which also manifests as packaging and restriction deficiencies).
Vif binding to sNTD differs from A3C and A3F
We used sNTD variants, E. coli co-expression and GST co-purification as described above to identify amino acid residues required for interaction of sNTD with the VCBC complex. Substitution Y19 at the C-terminus of α1, I26, L27 and W34 in the α1-β1 loop, V58 and Y59 of the β2-α2 loop, and Y124, W127 and D128 from the β4-α4 loop decreased binding to the VCBC complex by greater than 50% (Fig. 5a, representative SDS PAGE gel images are provided in Supplemental Fig. 5g). These residues substantially expanded the size of the A3G NTD surface area required for Vif interaction (Fig. 5b). Interestingly, using a different experimental approach, Lavens et al. implicated a similar surface area (Fig. 5c)[49]. Thus, the A3G NTD Vif-binding surfaces appeared to be in distinct areas from that of A3C and A3F CTD, which are formed by residues located in the α2 and α3 helices (Fig. 5d and 5e)[38].
Figure 5
Identification of Vif-binding regions of sNTD. (a) Relative Vif-binding capabilities of sNTD variants tested using E. coli co-expression and GST-pull down assay. The ratio between protein amounts of [GST-sNTD variant] and [Vif, CBF-β, elonginBC] is plotted (graph shows average values ± s. d. n = 3 independent cell cultures). Representative images of SDS-PAGE gels are provided in supplemental figure 5g (experiments were repeated three times using independent cell cultures). (b) Residues whose substitution reduced sNTD binding to the VCBC complex by greater than 50% are shown in red on the sNTD structure. The position of F126 is shown in green. (c) Residues whose substitution resulted in greater than 50% reduction of A3G Vif-binding as reported[49] are shown in red. (d) and (e) highlight A3C and A3F CTD residues that upon substitution cause greater than 50% reduction in Vif-binding, respectively[38].
DISCUSSION
We used sequence conservation and restorative mutagenesis to engineer a soluble variant of A3G NTD, which is 80% identical to the wildtype enzyme (Fig 1a). This variant, like wildtype A3G NTD, is not catalytically active but still binds Vif and succumbs to Vif-mediated degradation (Fig. 4
and
Supplementary Fig. 5f). This engineering approach, while labor intensive, is applicable to other systems that have evaded structural characterization due to protein solubility issues.We used NMR spectroscopy to solve the structure of sNTD, representing the first structure of an APOBEC3 non-catalytic domain and the first structure of the Vif-binding domain of A3G. Our results suggest a mechanistic basis for the lack of catalytic activity. Zn2+ is coordinated by residues H65, C97, and C100 at the protein interior (Fig. 3b) and contacts the hydrophobic side chains of L35, W90, I92, and M104 (Supplementary Fig. 6a). These interactions reduce accessibility for a putative target cytosine, as the pocket volume of the Zn2+ coordinating region is reduced compared to A3GCTD (Supplementary Fig. 6b). The active site pocket volume of A3G-CTD is about 400 Å3, while that of sNTD is approximately 100 Å3, which is smaller than the volume of a cytosine nucleobase. This result supports the general trend observed for catalytically active and inactive domains[50]. In addition, conserved glutamic acid E67 of sNTD is not in a position where Zn2+ coordination is possible, preventing E67 from performing a catalytic role (e. g.[51], Fig. 3b
and
Supplementary Fig. 6a). This altered position can be attributed to changes at the N-terminus of α2 helix, where A3G NTD has a proline at residue 66 just prior to E67 that shortens this helix to begin at M68. In all catalytically active A3 domains the helix begins at A66.To make soluble NTD we had to make four amino acid substitutions in α2, including F71L, H72S, W73L and F74V, although they are conserved mutations among APOBEC3 proteins (Supplementary Fig. 6c). Residues 71 and 74 interact with F70 (α2), W90 (β3) and F107 (α3), and F70, W90, and F107 interact with Y37 (β1), V39 (β1), and V88 (β3). Together, these residues form a hydrophobic core that stabilizes the β1-α2-β3-α3 structure (Supplementary Fig. 6d). All known A3 protein structures have the similar hydrophobic core involving corresponding residues[12,31-38]. Residues 72 and 73 are not involved in the hydrophobic core of known A3 structures, and we have shown that these residues can be substituted with alanine without compromising catalytic activity of A3GCTD[52]. Thus, substitutions at residues 71–74 are unlikely to be responsible for the unique structural features of α2 and α3 of sNTD.The known variant of Vif(H43A Y44A)[26] that disrupts wildtype A3G binding also does not permit binding to sNTD, suggesting that contacts made through H43 and Y44 are maintained between sNTD and Vif. However, sNTD fused to wildtype CTD was largely resistant to the Vif-mediated degradation. We found that A126 was responsible for this reduction, as replacement of this alanine with endogenous phenylalanine increased interaction with the VCBC complex, and restored susceptibility to Vif-mediated degradation. These results confirmed the importance of A3G F126 in Vif-binding, and, together with additional mutant data demonstrated that the sNTD construct is a good structural mimic of wildtype A3G NTD.A caveat of sNTD function is that neither sNTD-CTD nor sNTD(F126)-CTD were able to incorporate into viral particles or restricted HIV-1 infection. Encapsidation requires interaction with RNA and Gag protein[e.g.
[13,14,53-55]]. The failure of sNTD variants to encapsidate is most likely due to an abrogated interaction with RNA, because no interaction between sNTD and RNA could be detected in a phosphate buffer containing 100 mM NaCl (data not shown). This loss may be due to deletion and/or substitution of basic residues, including K2, R6, R11, R14, R78, K113, K141, R142, R146, R169 and K180 in sNTD (Figure S1). Future studies may be able to use sNTD as a starting point to determine residues essential for encapsidation and RNA-binding.The Vif-binding regions of sNTD identified by this study, including α1-β1 β2-α2 and β4-α4 loops provide a structural epitope for Vif binding and validate previous studies with wildtype A3G[26,49,56,57] (Fig. 5b, 5c). The β4-α4 loop contains residues 124-YYFWD-128 that are known to be important for Vif-mediated A3G degradation[26,58]. Structures of the A3C and A3F Vif-binding domains have been solved, and their Vif-binding surfaces were mapped by mutagenesis, co-immunoprecipitation, and degradation assays to a different surface[35,36,38,59]. In these cases, Vif is proposed to bind a continuous surface located on α2 and α3 helices of A3F CTD and A3C away from the catalytic zinc (Fig. 5d and 5e). Residues F74, L80, Y86, and F107 were found to be critical for the A3C binding to Vif but, consistent with our results, not important for Vif-mediated degradation of A3G[38]. Previous studies showed that a negatively charged region around D128 appears important for binding to Vif (Supplementary Fig. 6b), as substitution of this amino acid abrogates binding to Vif[26-29]. A3G-CTD has a more positively charged (blue) profile in the corresponding area that likely interferes with binding to Vif[30] (Supplementary Fig. 6b). In summary, we report the first structure of the Vif-binding domain of A3G (sNTD), which further elaborates a novel mode of Vif-binding and provides a substrate for future structural studies of the entire A3G-Vif ligase complex, as well as drug discovery.
ONLINE METHODS
Plasmids, antibodies, and reagents
DNA fragments coding the A3 consensus protein (Supplementary Fig. 1a) or wildtype A3G NTD (residues 1–196) were synthesized with codons optimized for expression in E. coli (Genscript). All DNA fragments coding A3G NTD variants were prepared with a PCR-based method using the synthesized DNAs coding the consensus protein or wildtype A3G NTD as a template. The prepared DNA fragments were inserted into NdeI/XhoI site of pCold-GST vector[61]. DNA fragment coding soluble A3GCTD variant (A3GCTD-2K3A that contains residues 191–384 with mutations L234KC243A F310K C321AC356A) was obtained with a PCR-based method using the previously prepared expression vector[31], and was inserted into Nde I/Xho I site of pCold-GST vector[61]. For expression of Vif (Vif-176, residue 1–176) and CBF-β (residue 1–141) in E. Coli, codon-optimized DNA fragments coding those proteins (Genscript) were inserted into BamHI/HindIII and NdeI/XhoI sites of pRSFDuet vector (Novagen), respectively. Vif-176(H43A Y44A) expression vector was prepared in the same manner with a PCR-based method using Vif-176. For expression of elongin B (residue 1–118) and elongin C (residue 17–112) in E. coli, DNA fragments coding those proteins were inserted into BamHI/HindIII and NdeI/XhoI sites of pACYCDuet vector (Novagen), respectively. DNA fragments coding Wildtype A3G NTD (residue 1–196), CTD (residue 197–384), or sNTD (residue 1–180) were synthesized with codons optimized for expression in human cells (Genscript). D128K substitution was introduced to the A3G NTD construct with a PCR-based method. Similarly, sNTD variants, including sNTD(D128K), sNTD(F126), and sNTD(F126 D128K), were prepared with a PCR-based method using the DNA fragment encoding sNTD as a template. Each DNA fragment encoding NTD and its variants was combined with the wildtype A3GCTD using a PCR-based method to yield a full-length construct including sNTD-CTD, sNTD(F126)-CTD and sNTD(F126 D128K)-CTD. All full-length constructs were inserted into NheI/HindIII site of pcDNA3.1 myc-His(-) A vector (Invitrogen). pcDNA-HVif expression plasmid was provided by Dr. Klaus Strebel at NIH. A double mutation H43A and Y44A were introduced to pcDNA-HVif with a PCR-based method to generate pcDNA-HVif(H43A Y44A). DNA sequences of all constructs were verified by DNA sequencing. DNA oligomers for PCR and deamination assay were purchased from Integrated DNA Technologies. Mouse monoclonal anti-c-myc antibody (9E10) was purchased from Sigma-Aldrich. Mouse monoclonal anti-Vif antibody (319) and mouse monoclonal anti-α-tubulin antibody (DM1A) were purchased from Abcam. Anti-mouse IgG antibody conjugated with a fluorescent dye was purchased from Rockland Immunochemicals (Code# 610-132-121). All stable-isotope labeled reagents, including [15N]-ammonium chloride, [13C]-D-glucose, [2H, 13C]-D-glucose, deuterium oxide, [15N]-L-phenylalanine (Phe), [15N]-L-isoleucine (Ile), [15N]-L-lysine (Lys), [15N]-L-leucine (Leu), [15N]-L-arginine (Arg), [15N]-L-tyrosine (Tyr), [13C]-L-alanine (Ala), [13C]-Phe, [13C]-Ile, [13C]-Lys, [13C]-Leu, [13C]-L-methyonine (Met), [13C]-Arg, [13C]-L-threonine (Thr), [13C]-Tyr, [13C]-L-valine (Val), [2H, 13C, 15N]-algal amino acid mixture, [2H]-choline o-sulfate, were purchased from Cambridge Isotope Laboratories.
Protein expression and purification
The consensus protein (Supplementary Fig. 1a), sNTD, and their variants were prepared in the similar manner described previously[31]. BL21 (DE3) cells (Invirogen) were transformed with a prepared pCold-GST expression vector[61] based plasmid, and grown at 37°C in M9 medium containing 100 μg ml−1 of ampicillin until reaching an optical density at 600 nm of 0.6–0.8. After the addition of 0.2 mM Isopropyl β-D-1-thiogalactopyranoside (IPTG) and 10 μM ZnCl2, the cells were further incubated at 16°C for overnight. The bacterial cells were harvested by centrifugation and resuspended with a lysis buffer [50 mM sodium phosphate, pH7.3, 150 mM NaCl, 0.005% Tween-20, and 0.5 mM tris(2-Carboxyethyl)phosphine (TCEP)]. The lysed cells were disrupted by sonication and then centrifuged. The supernatant was applied to glutathione-immobilized resin (GE Healthcare life sciences) and the resin was washed with the lysis buffer. The proteins bound to the resin was digested by HRV 3C protease (GE Healthcare life sciences) for overnight. The digested proteins were applied to Superdex-75 column (GE Heathcare life sciences) equilibrated with a buffer (10 mM sodium phosphate, pH7.3, 100 mM NaCl, 0.005% Tween-20, 0.5 mM TCEP, and 0.01% NaN3). The fractions including the protein of interest were collected and the protein was concentrated to 100–200 μM by using Amicon Ultra (Millipore). For NMR experiments, 0.5 M choline o-sulfate, 5% deuterium oxide, and 20 μM 2,2-dimethyl-2-silapentane-5-sulfonate sodium salt (DSS) were added to the protein solution. To prepare isotope-labeled proteins, M9 medium was prepared with 60% or 85% deuterium oxide with [15N]-ammonium chloride and [13C]- or [2H, 13C]-D-glucose. When the cells were grown in 85% deuterium oxide, 1g L−1 [2H, 13C, 15N]-algal amino acid mixture was added prior to the IPTG induction to enhance the protein harvest. To prepare amino acid selectively [13C]- or [15N]-labeled proteins, an amino acid uniformly labeled with 13C or 15N was added to the medium containing 85% deuterium oxide prior to the IPTG induction. Similarly, to prepare amino acid selectively protonated proteins, an amino acid without any isotope labeling was added to the medium containing 85% deuterium oxide prior to the IPTG induction.
NMR spectroscopy
NMR spectra were measured using Bruker AV700, AV850, and AV900 spectrometers equipped with cryogenic probes at 298K (MNMR center at University of MN). Sequential assignments of the backbone 1H, 13C, and 15N resonances of sNTD were achieved by triple-resonance NMR techniques combined with amino acid selective labeling[41,62-65]. Assignments of side chain 1H and 13C resonances were achieved by using a combination of 1H,13C-HSQC spectra of amino acid selectively [13C]-labeled sNTD (Supplementary Fig. 2) and 1H,1H-NOESY[66] spectra of amino acid selectively protonated sNTD. 1H chemical shifts were directly referenced to DSS, while 13C and 15N chemical shifts were indirectly referenced by the absolute frequency ratios (13C/1H) = 0.251449530 and (15N/1H) = 0.101329118 with internal DSS[67]. All NMR spectra were processed and analyzed by using NMRPipe[68] and Sparky[69], respectively.
Structure calculations
The solution structures of sNTD were calculated with a distance geometry-simulated annealing protocol using X-PLOR-NIH[70]. The structure calculations and NOE peak assignments were performed in an iterative and manual manner. For distance restraints, the lower limitation was set to 1.8 Å and the upper limitation was set to 2.5, 3.0, 3.5, 4.0, 4.5, or 5.0 Å according to the NOE cross-peak intensity. Additionally, pseudo-atom corrections were applied[66]. In total, 1543 distance restraints were collected from 13C- and 15N-edited NOESY spectra and a series of 1H, 1H-NOESY spectra using amino acid selectively protonated sNTD. A total of 106 backbone dihedral angles (ϕ) were estimated using TALOS software based on 13Cα, 13Cβ, and 13C′ chemical shifts[71]. A total of 33 hydrogen bonds were identified from an H/D-exchanging experiment and the structures calculated in the early stage. The hydrogen bond restraints, 2.3–3.3 Å for N-O and 1.3–2.3 Å for HN-O, were introduced into the late stage of calculations. Additionally, Zn2+ ion and empirical restraints between Zn2+ ion and sNTD residues H65, C97, and C100 were introduced into the late stage of the calculations[12]. A total of 100 structures were calculated and the 10 lower-energy structures were selected. These 10 structures were used for the further calculation of an energy-minimized average structure. The quality of obtained structures was analyzed using MOLMOL[72] and PROCHECK programs[73]. Structural figures were generated using MOLMOL and MacPyMOL softwares (Schrödinger).
Protein harvest assay
To monitor expression level and relative solubility of A3G NTD variants, the aforementioned protein expression and purification procedures were applied. The volumes of media (400 ml), lysis buffer (25 ml), and glutathion-immobilized resin (100 μl) were strictly fixed. The resin bound to the protein was resuspended with lysis buffer to adjust the total volume to 200 μl. 5 μl of the resuspension was applied to a sodium dodecyl sulfate (SDS) polyacrylamide gel electrophoresis (PAGE) that was stained with Coomassie brilliant blue, and quantified for measured protein band density by Image J provided by US National Institute of Health. Each variant was independently cultured three times to calculate mean and s. d.
DNA deamination assay
Synthesized DNA fragment d(5′-ATTCCCAATT-3′)[12] was dissolved in a buffer (10 mM sodium phosphate, pH7.3, 100 mM NaCl, 0.005% Tween-20, and 0.5 mM TCEP) at a concentration of 100 μM. Purified A3GCTD-2K3A or sNTD was added to the DNA solution at a final concentration of 20 μM. The mixture was incubated at 25°C. One-dimensional 1H spectra were measured 0.5, 3, and 24 hours after the addition of protein.
E. coli co-expression and GST-pull down assay
BL21 (DE3) cells were transformed with a pCold-GST-NTD variant expression plasmid, pRSFDuet-Vif-176-CBF-β, and pACYCDuet-EloBC and grown at 37°C in 400 ml of Luria-Bertani medium containing 100 μg ml−1 of ampicillin, 25 μg ml−1 of kanamycin, and 30 μg ml−1 of chloramphenicol until reaching an optical density at 600 nm of 0.6–0.8. After the addition of 1 mM IPTG, the cells were further incubated at 16°C for overnight. The harvested cells were applied to the protein purification procedure described above. The proteins were collected using 100 μl volume of glutathione-immobilized resin. The resin bound to the protein was resuspended with lysis buffer to adjust the total volume to 200 μl. 5 μl of the resuspension separated by SDS-PAGE, which was stained with Coomassie brilliant blue, and quantified for protein band density. For each variant, the assay was repeated in three independent cell culture.
Degradation assay using human cells
Humanembryonic kidney (HEK293T) cells in 12-well plates were co-transfected with pcDNA3.1-based A3G or sNTD expression vector (0.66 μg) and pcDNA-HVif, pcDNA-HVif(H43A Y44A), or empty pcDNA3.1 vector (1.32 μg) by using FuGENE HD (Promega). After the incubation for 24 hours, 2 μM MG132 or dimethylsulfoxide was added to the media and the cells were further incubated for 24 hours. Resulting cell lysates were separated by SDS-PAGE and immunoprobed by using mouse monoclonal anti-c-myc antibody (9E10) (Sigma-Aldrich). The protein bands were visualized and scanned with an Odyssey imaging system (LI-COR).
HIV single-cycle assays
Single-cycle infectivity experiments were conducted in triplicate as described[35,74]. 293T cells were maintained in Dulbecco’s modified Eagle medium supplemented with 10% fetal bovine serum and 0.5% pen/strep. Cells were plated 250,000 cells per well of a 6-well plate and 24 hrs later transfected (TransIT-LT1; Mirus Bio) with 1μg of previously reported Vif-proficient or Vif-deficient HIV-1 IIIB proviral expression constructs[75] and 0ng, 25ng, 50ng, or 100ng of A3G-myc expression constructs. DNA quantities were equalized with empty pcDNA3.1 (Life Technologies). 48hrs post-transfection, the producer cells and the virus containing-supernatants were harvested. A portion of the virus containing-supernatants was used to determine infectivity by incubating with CEM-GFP reporter cells plated at 25,000 cells per well in a 96 well plate following an additional 48 hrs by GFP flow cytometry. The remainder of the virus containing-supernatants was centrifuged through a 20% sucrose cushion to partially purify viral particles for analysis by immunoblotting. Cell and viral pellets were lysed in 2.5X Laemmeli reducing sample buffer for immunoblotting. Expression of A3G-myc in cell and viral lysates were detected by use of an anti-c-Myc antibody (Sigma-Aldrich) and expression of Vif in cell lysates was detected by an anti-Vif antibody (NIH AIDS Reagent Program). Tubulin and p24 served as loading controls for cell and viral lysates, respectively, and detected with an anti-tubulin antibody (Covance) and an anti-p24 antibody (in-house).
Surface charge and pocket volume analysis
The sNTD NMR structure (Model 1) was used as the template to generate the A3G-NTD homology model, using PRIME (Schrödinger, LLC), including energy minimization to resolve atomic clashes. Surface electrostatics were calculated in Maestro (Schrödinger, LLC). Pocket volume calculations were performed using SiteMap (Schrödinger, LLC).
Authors: Eric W Refsland; Mark D Stenglein; Keisuke Shindo; John S Albin; William L Brown; Reuben S Harris Journal: Nucleic Acids Res Date: 2010-03-22 Impact factor: 16.971
Authors: Markus-Frederik Bohn; Shivender M D Shandilya; John S Albin; Takahide Kouno; Brett D Anderson; Rebecca M McDougle; Michael A Carpenter; Anurag Rathore; Leah Evans; Ahkillah N Davis; Jingying Zhang; Yongjian Lu; Mohan Somasundaran; Hiroshi Matsuo; Reuben S Harris; Celia A Schiffer Journal: Structure Date: 2013-05-16 Impact factor: 5.006
Authors: Hongzhan Xu; Evguenia S Svarovskaia; Rebekah Barr; Yijun Zhang; Mohammad A Khan; Klaus Strebel; Vinay K Pathak Journal: Proc Natl Acad Sci U S A Date: 2004-03-30 Impact factor: 11.205
Authors: D S Wishart; C G Bigam; J Yao; F Abildgaard; H J Dyson; E Oldfield; J L Markley; B D Sykes Journal: J Biomol NMR Date: 1995-09 Impact factor: 2.835
Authors: Marcel Ooms; Bonnie Brayton; Michael Letko; Susan M Maio; Christopher D Pilcher; Frederick M Hecht; Jason D Barbour; Viviana Simon Journal: Cell Host Microbe Date: 2013-10-16 Impact factor: 21.023
Authors: William C Solomon; Wazo Myint; Shurong Hou; Tapan Kanai; Rashmi Tripathi; Nese Kurt Yilmaz; Celia A Schiffer; Hiroshi Matsuo Journal: Nucleic Acids Res Date: 2019-08-22 Impact factor: 16.971
Authors: Bogdan Polevoda; William M McDougall; Bradley N Tun; Michael Cheung; Jason D Salter; Alan E Friedman; Harold C Smith Journal: Nucleic Acids Res Date: 2015-09-30 Impact factor: 16.971
Authors: Christopher Richards; John S Albin; Özlem Demir; Nadine M Shaban; Elizabeth M Luengas; Allison M Land; Brett D Anderson; John R Holten; John S Anderson; Daniel A Harki; Rommie E Amaro; Reuben S Harris Journal: Cell Rep Date: 2015-11-25 Impact factor: 9.423
Authors: Shurong Hou; Tania V Silvas; Florian Leidner; Ellen A Nalivaika; Hiroshi Matsuo; Nese Kurt Yilmaz; Celia A Schiffer Journal: J Chem Theory Comput Date: 2018-12-11 Impact factor: 6.006
Authors: Atanu Maiti; Wazo Myint; Krista A Delviks-Frankenberry; Shurong Hou; Tapan Kanai; Vanivilasini Balachandran; Christina Sierra Rodriguez; Rashmi Tripathi; Nese Kurt Yilmaz; Vinay K Pathak; Celia A Schiffer; Hiroshi Matsuo Journal: J Mol Biol Date: 2020-10-22 Impact factor: 5.469