Literature DB >> 26887497

Crystal structure of the Rous sarcoma virus intasome.

Zhiqi Yin1,2,3, Ke Shi1,2,3, Surajit Banerjee4, Krishan K Pandey5, Sibes Bera5, Duane P Grandgenett5, Hideki Aihara1,2,3.   

Abstract

Integration of the reverse-transcribed viral DNA into the host genome is an essential step in the life cycle of retroviruses. Retrovirus integrase catalyses insertions of both ends of the linear viral DNA into a host chromosome. Integrase from HIV-1 and closely related retroviruses share the three-domain organization, consisting of a catalytic core domain flanked by amino- and carboxy-terminal domains essential for the concerted integration reaction. Although structures of the tetrameric integrase-DNA complexes have been reported for integrase from prototype foamy virus featuring an additional DNA-binding domain and longer interdomain linkers, the architecture of a canonical three-domain integrase bound to DNA remained elusive. Here we report a crystal structure of the three-domain integrase from Rous sarcoma virus in complex with viral and target DNAs. The structure shows an octameric assembly of integrase, in which a pair of integrase dimers engage viral DNA ends for catalysis while another pair of non-catalytic integrase dimers bridge between the two viral DNA molecules and help capture target DNA. The individual domains of the eight integrase molecules play varying roles to hold the complex together, making an extensive network of protein-DNA and protein-protein contacts that show both conserved and distinct features compared with those observed for prototype foamy virus integrase. Our work highlights the diversity of retrovirus intasome assembly and provides insights into the mechanisms of integration by HIV-1 and related retroviruses.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26887497      PMCID: PMC4881392          DOI: 10.1038/nature16950

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


INs from lentiviruses including HIV-1 and the phylogenetically closely related alpharetroviruses including avian Rous sarcoma virus (RSV) share the conserved three-domain organization consisting of the Zn++-coordinating N-terminal domain (NTD), the catalytic core domain (CCD), and the β-strand-rich C-terminal domain (CTD) (Fig. 1a, Extended Data Fig. 1). We used the three-domain RSV IN construct biochemically fully active in concerted integration [6] and a branched DNA substrate mimicking the product of the concerted integration reaction (Fig. 1b) [7] to assemble and crystallize the stable RSV intasome complex (Extended Data Figs. 2, 3). The crystallized RSV intasome showed in solution an apparent molecular mass of 255 kDa by size-exclusion chromatography (SEC) and 240 (+/−10) kDa by SEC-multi-angle light scattering analysis, larger than the expected mass of 168 kDa for a tetramer of RSV IN(1–270) bound to the branched DNA (Extended Data Fig. 4a–c). The structure of the RSV intasome was determined by molecular replacement phasing and refined to 3.8 Å resolution (Extended Data Table 1).
Figure 1

Overall structure of the RSV intasome

a, Comparison of the domain organization between RSV, HIV-1, and PFV INs. b, Branched DNA structure mimicking the product of the concerted integration reaction used in assembling the RSV intasome. c, Structure of the RSV intasome, viewed along its pseudo two-fold axis from the viral DNA side. The three structural domains of IN are color-coded as in (a). The grey spheres represent zinc ions in NTD. Two subunits within each IN dimer are colored slightly different from each other. d–f, Structure of the RSV intasome with the eight IN molecules colored individually, shown in three orthogonal orientations. The protein surface is shown in (d) and (e). The DNA strands are color-coded as in (b).

Extended Data Figure 1

Amino acid sequence alignment of RSV, HIV-1, and PFV INs

The secondary structure elements for RSV IN are color-coded based on the three IN domains similarly to Fig. 1a. The residue numbering at the top is for RSV IN. For each IN, the black dots mark every ten residues. This figure was made using ESPript [42].

Extended Data Figure 2

DNA substrate used for assembling the RSV intasome

a, The half-site (gaped duplex) substrate prepared by annealing three oligonucleotides dimerizes via the self-complementary 6-base spacer sequence (underlined) to form a branched structure mimicking the product of the concerted integration reaction (Fig. 1b). b, DNA structure in the RSV intasome. Viral DNA nucleotides are numbered, and some of the structural elements of RSV IN involved in the viral DNA interactions are shown.

Extended Data Figure 3

RSV intasome crystal

a, A crystal of the RSV intasome. b, X-ray diffraction pattern from a crystal not treated with metatungstate. c, X-ray diffraction pattern after the metatungstate-soaking (see method for details). d,e, Lattice contacts within the RSV intasome crystal. DNA strands are colored as in Fig. 1b, while IN subunits from only one intasome are colored. The unit cell is shown in green. The small blue spheres represent tungsten atoms. The view in (e) is perpendicular to the two-fold screw (b) axis of the monoclinic lattice, which lies horizontally. f, Paired-refinement analysis [41] to assess the resolution limit of the RSV intasome diffraction data. For each pairwise comparison, model refinements were run at two different resolution limits and the R-factors calculated for a common (lower) resolution cutoff were compared. Inclusion of data beyond 3.7 Å in the refinement compromised model quality.

Extended Data Figure 4

Biochemical characterization of RSV intasome

a, A representative size-exclusion chromatography (SEC) profile for RSV intasome, overlaid with that for a mixture of molecular weight markers. The buffer condition was as mentioned in the methods. b, SEC profiles for RSV intasomes formed with IN of varying C-termini. c, SEC-MALS (multi-angle light scattering) analysis of RSV intasome. The intasome formed with RSV IN (1–269 aa) was separated by SEC in a modified condition containing 20 mM HEPES pH 7.5, 1.0 M NaCl, 5 % glycerol, and 1.0 mM TCEP. The absolute molecular mass was determined by light scattering using in-line detectors described previously [43]. The mass profile for the intasome is shown in red across the peak. The molecular mass of RSV intasome was 240 ±10 kDa (n=4). A similar SEC-MALS analysis of the intasome formed with wt full-length RSV IN (1–286 aa) yielded a molecular mass of 268 kDa (n=2, data not shown). The calculated mass of an intasome containing eight RSV IN (1–269 or 270) molecules is ~288 kDa. d, Chemical cross-linking analysis of RSV intasome. The RSV intasome and free IN (1–269 aa) were purified by size-exclusion chromatography in the running buffer: 20 mM HEPES (pH 7.5), 1.0 M NaCl, 5 % glycerol, and 1.0 mM TCEP. The peak fractions of the intasome and IN were cross-linked with the indicated amount of ethylene glycol bis-succinimidylsuccinate (EGS) as described previously [43] and analyzed by SDS-PAGE. The majority of cross-linked species within the intasome were larger than a tetramer. The highest oligomeric species observed is consistent with an octamer migrating at ~220 kDa. The molecular weight markers are in the far right lane. The NuPAGE 4–12% gradient gel with a MES-based SDS-PAGE running buffer was used.

Extended Data Table 1

Data collection and refinement statistics

RSV intasome
Data collection
 Space groupP21
 Cell dimensions
  a, b, c (Å)124.9, 157.8, 126.6
  α, β, γ (°)90.0, 110.9, 90.0
 Resolution (Å)49.3-3.80(3.90–3.80)
Rmerge (%)14.7 (247.6)
I/σI6.0 (0.6)
 CC1/20.994 (0.287)
 Completeness (%)99.2 (99.3)
 Redundancy5.5(5.5)
Refinement
 Resolution (Å)49.3 - 3.80
 No. reflections44627
 Reflections for Rfree2183
Rwork/Rfree25.4/29.4
 No. atoms
  Protein19309
  Ligand/ion44
  Water0
 B-factors
  Protein206.9
  Ligand/ion205.0
  WaterN/A
 R.m.s deviations
  Bond lengths (Å)0.004
  Bond angles (°)0.72

Statistics for the highest resolution shell are shown in parentheses.

Reference:

CC1/2: P.A. Karplus and K. Diederichs, Linking crystallographic model and data quality. (2012) Science 336, 1030–1033 (Ref. 41).

In contrast to the general assumption that retrovirus intasomes consist of an IN tetramer, the RSV intasome structure shows that it contains eight IN molecules (Fig. 1c–f and Supplementary Video), clearly supported by the selenium anomalous difference Fourier peaks for a crystal grown with selenomethionine-labeled IN (Extended Data Fig. 5). The observed octameric assembly (288 kDa) is consistent with the larger apparent molecular mass of RSV intasome in vitro and chemical cross-linking analysis (Extended Data Fig. 4d). The RSV IN octamer contains a core tetramer, which consists of two sets of ‘proximal’ IN dimers, and two additional sets of ‘distal’ IN dimers. The CCDs of the four IN dimers are positioned in an approximately two-fold symmetrical arrangement that resembles a parallelogram (Fig. 1c,d). The proximal IN dimer consists of ‘inner’ and ‘outer’ IN subunits, where the active-site of the inner subunit accommodates the viral/target DNA junction (Fig. 2a–c). The NTD of each inner IN subunit interacts in trans with the viral DNA engaged by the opposing proximal IN dimer, with the extended linker between NTD and CCD traversing the grooves of both viral DNA molecules to make additional contacts (Figs. 2a, 3a–c). The domain-swapped arrangement of NTD and CCD of the proximal IN dimers is analogous to that observed in the tetrameric PFV intasome [2].
Extended Data Figure 5

Selenium anomalous difference Fourier peaks confirming the model

Anomalous difference Fourier maps calculated using the data collected on selemomethionine-labeled RSV intasome, contoured at 3.5σ (blue mesh) or 5.0 σ (orange mesh). Methionine side chains are shown in sticks. a, A view covering the octameric RSV intasome. b, c, Close-up view of a proximal (b) and distal (c) IN dimer, respectively.

Figure 2

Proximal and distal IN dimers

a, A view highlighting interactions between a proximal (green/cyan) and a distal (slate/orange) IN dimers and their interactions with DNA. The IN and DNA strands are colored as in Fig. 1d–f. b, A close-up view around the viral DNA terminus. The CCD loop (residues 144–153) centered around Ser150 of the catalytic IN subunit is colored in violet. The catalytic triad residues (D64, D121, E157) are shown in red sticks in (a–c). c,d, Superposition of the proximal IN dimer (c; green/cyan) or the distal IN dimer (d; slate/orange) on the RSV IN CCD-CTD dimer in its native conformation (PDB 4FW1, light grey) [6]. The grey spheres represent zinc ions in NTD.

Figure 3

Comparison between the RSV and PFV intasomes

a, b, The octameric RSV intasome. The eight IN molecules and the DNA strands are colored as in Fig. 1d–f. The catalytic triad (DDE) of the inner subunit of the proximal IN dimers are shown in red in (b). c, Conformation of the inner catalytic IN subunit in the RSV intasome. The protein chain is colored in a gradient of blue to red from N to C-terminus, respectively. The NTDs and CCDs of the distal IN dimer are omitted. d,e, The tetrameric PFV intasome [3]. The color scheme follows that used for the proximal IN dimers of RSV intasome in (a, b). The direction of DNA helical axis between the two integration sites on opposing strands (6 bp for RSV and 4 bp for PFV) differs significantly between the RSV (b) and PFV (e) intasomes. f, The inner catalytic IN subunit in the PFV intasome, colored in a gradient of blue to red as in (c). Note the presence of an extra domain on the N-terminus (NED) and different positioning of CTD compared to RSV IN in (c).

However, despite the shared structural features that support the basic chemistry of integration, the RSV and PFV intasomes have very different architectures (Fig. 3, Extended Data Fig. 6). The PFV intasome consists of four IN molecules, corresponding to only the core tetramer (two proximal IN dimers) of the RSV intasome. In the PFV intasome the inner IN subunits make all DNA interactions, while only the CCD is ordered for the outer subunits (Fig. 3d–f). In the RSV intasome, CTDs of the inner and outer subunits of the proximal IN dimer take unique conformations and tightly associate with each other (Fig. 2c), and this CTD dimer makes viral DNA contacts both in cis and trans (Fig. 2a). Moreover, the octameric RSV intasome contains two additional IN dimers; the CTDs from the distal IN dimers bridge between the proximal IN dimers, making additional contacts with both viral DNAs analogously to the CTD of the catalytic IN subunit in the PFV intasome (Fig. 3b,e). The CCDs of the distal IN dimers, anchored to the core of the RSV intasome through these CTD interactions (Fig. 2a), are positioned at the outer corners of the parallelogram, loosely associated with the distal regions of target DNA through non-specific interactions (Figs. 1d–f, 2a). The six remaining NTDs from the outer subunits of the proximal IN dimers and both subunits of the distal IN dimers are bound intra-molecularly to CCD (Extended Data Fig. 7c,d). In total, over 10,000 Å2 of molecular surface is buried in IN-IN interfaces within the RSV intasome, approximately half of which is accounted for by the conserved CCD dimerization interface, and ~6,000 Å2 in IN-DNA interfaces.
Extended Data Figure 6

Comparison between RSV and PFV intasomes

a, b, Protein arrangement in the octameric RSV intasome. The inner and outer subunit of one proximal IN dimer is colored in green and cyan, respectively, with the catalytic triad (DDE) of the inner subunit shown in red. The other proximal IN dimer is colored similarly but in more pale colors. Arrows indicate the two proximal IN dimers. A distal IN dimer is colored in slate and orange. DNA is omitted in (a). (b) shows the same view as Fig. 3c. c, A close-up view around the active site of the inner IN subunit in the RSV intasome. The DNA strands are colored as in Fig. 1, and the catalytic triad residues (DDE) are shown in red sticks. d,e, Protein arrangement in the tetrameric PFV intasome [2,3]. The color scheme follows that used for the proximal IN dimers of RSV IN in (a, b). Arrows indicate the two IN dimers. DNA is omitted in (d). (e) shows the same view as Fig. 3f. f, A close-up view around the active site of the inner IN subunit in the PFV intasome (PDB ID: 3OS0 [3]). The color scheme follows that in (c).

Extended Data Figure 7

Composite omit maps

Simulated annealing composite omit 2mFo−DFc density contoured at 1.0 σ, shown for area within 3.5 Å from any protein or DNA atom in the final model. In a and b, electron densities around protein and DNA are colored differently (blue and green, respectively).

Although the CCDs and CTDs from all four IN dimers within the RSV intasome individually self-dimerize in the same fashions, the relative positioning of the CTDs with respect to the CCDs differs between the proximal and distal IN dimers, corresponding to their different roles. The CCD-CTD configuration for the proximal IN dimers is very similar to that previously observed for DNA-free RSV INs [6,8] (Fig. 2c), suggesting that RSV IN dimer in its native conformation is poised for viral DNA binding and catalysis, as noted earlier [6]. The CTD of the inner catalytic subunit of the proximal dimer binds to viral DNA near the viral/target junction but on the opposite face of the double-stranded DNA, while it also makes contacts in trans with the other viral DNA (Fig. 2a–b). The CTD of the outer subunit of the proximal dimer binds to the distal region of the viral DNA in cis. Unlike in the proximal IN dimer, the CCD-CTD configuration for the distal IN dimers shows deviation from the canonical conformation, which can be described by a swing of the CTDs relative to the CCDs and disruption of the parallel β–sheet-like conformation of the CCD-CTD linkers [8] (Fig. 2d). This alternative CCD-CTD orientation allows the distal IN dimers to fit in the intasome without making steric clashes with the proximal INs or the 5’ overhang of the viral DNA strand (Fig. 2a). The CCDs of the distal IN dimers may be positioned similarly in the absence of target DNA, serving as a platform for target DNA capturing. The asymmetrically associated CTDs of the proximal and distal IN dimers further interact with each other and with the NTD of the catalytic IN subunit to crosslink between the two viral DNAs (Figs. 2a, 4a,b). Each CTD dimer interacts with both of the viral DNA molecules at different positions, resulting in four distinct DNA-binding modes of individual CTDs. The basic amino acids Arg227, Arg244, Arg263, and Lys266 from various CTD monomers make contacts with the viral DNA molecules, and each viral DNA is sandwiched between separate CTD dimers (Fig. 4c). Arg244 of the inner catalytic subunit of the proximal IN dimer is positioned in the major groove of viral DNA, closest to G7 of the non-transferred strand. The GC pair at this position is critical for concerted integration by RSV IN [9]. The corresponding residue Glu246 of HIV-1 IN was shown by disulfide cross-linking studies to interact with A7 of the non-transferred strand [10], suggesting similar modes of viral DNA sequence recognition by CTD between RSV and HIV-1 INs. Both RSV R244A/C and HIV-1 E246A IN mutants show reduced 3’ processing and strand-transfer activities [6,11,12], possibly reflecting the importance of these residues from various IN subunits. Mutation of conserved Trp233, which is stacked between the Arg227 and Lys266 side chains, to Glu or Ala but not Phe, abolishes binding to the viral DNA LTR sequence and concerted integration by RSV IN [13]. The corresponding HIV-1 IN mutations W235E/A/F have parallel effects on concerted integration activity and virus replication, suggesting the importance of an aromatic residue at this position in orienting the basic side chains [14,15]. Similarly, mutation of Trp259 buried in the CTD dimer interface (Fig. 4b) [6,8] as well as involved in multiple interactions near the viral DNA 5′ end (Fig. 2b) abolishes all enzymatic activities of RSV IN [6,16], reflecting the important roles of this residue in engaging viral DNA.
Figure 4

Viral DNA contacts

a, Surface representation of the RSV intasome similar to Fig. 3a. b, A crown-like structure formed by the CTD of all eight INs and NTD and CCD of the catalytic INs, which encircles the two viral DNA molecules in black (boxed region in a). Trp259 side chains buried in the interface of each CTD dimer are shown. c, Viral DNA contacts by the CTDs. The side chains of Arg227, Arg244, Arg263, Lys266 are shown. NTD and CCD are omitted. d, A close-up view around the viral DNA end, showing interactions by CCD of a catalytic subunit, NTD of the opposing catalytic subunit contributed in trans, and the CTDs of a distal IN dimer. The catalytic triad residues are shown in red sticks. The grey sphere represents zinc ion in NTD.

The CCD of the catalytic RSV IN molecules engages viral and target DNAs primarily through interactions in their minor grooves, as observed for PFV IN [2,3]. The long α-helix (α7; residues 154–174) that harbors one of the catalytic triad of metal-coordinating residues, Glu157, inserts into a significantly widened minor groove near the viral DNA terminus, where Arg158, Arg161, and Lys164 side chains make DNA base or backbone contacts (Fig. 4d, Extended Data Fig. 7h,i). The preceding loop centered around Ser150, which is highly flexible in the DNA-free IN [6,8,17], is positioned between the viral DNA 5′-overhang and the 3′-end of the cleaved target DNA, displacing T3 base opposite the terminal adenine (A20) of the transferred strand (Fig. 2b). DNA contacts in this minor groove also include hydrogen-bonding between Thr66 and the backbone phosphate group of the terminal nucleotide (A20) of the transferred viral DNA strand (Fig. 4d). Mutations of the corresponding HIV-1 IN residue T66A/I/K confer resistance against the IN strand-transfer inhibitors (INSTIs) [18], likely as a result of subtle changes of the IN-viral DNA interface. The NTD of the opposing catalytic subunit, contributed in trans, binds in the adjacent major groove and places Arg17 and Arg31 side chains for potential base-specific contacts (Fig. 4d). The hydrophobic NTD-CCD interface is centered on Phe199 from CCD, which explains why the F199K mutation selectively abolishes concerted integration by RSV IN [6,16]. The CTDs of the distal IN dimer further extend the viral DNA interactions in this region (Fig. 4d). The three flipped-out terminal nucleotides at the 5′ end of the non-transferred viral DNA strand, including the two overhanged bases and the displaced T3, bridge between CCD of the catalytic IN subunit and a CTD from the distal IN dimer, rather than between CCD and CTD of the same molecule as observed in the PFV intasome (Fig. 2b, 4b, 5a). Arg263 from this CTD points toward the non-transferred strand G4 opposite the CA dinucleotide at the viral DNA terminus (Fig. 2b), which could be relevant to the effect of mutation of the corresponding HIV-1 residue Arg263 on catalysis and drug resistance [19]. The two viral DNAs in the RSV intasome branch out with their helical axes skew and at an angle of ~60°, which is smaller than the viral DNA split of ~80° in the PFV intasome (Extended Data Fig. 8). Accordingly, the viral DNA molecules in the RSV intasome are positioned closer to each other than those in the PFV intasome, with the backbone phosphate oxygen atoms at the closest point ~5 Å apart. The viral DNA molecules in the RSV intasome are surrounded by a highly positively charged surface formed by a network of CTDs and NTDs, which may alleviate potential electrostatic repulsions between DNA strands and help hold the complex together (Fig. 4a–c).
Figure 5

Target DNA

a, The sharply bent target DNA spanning the CCDs of the catalytic IN subunits. Ser124 and Glu229 side chains potentially involved in target DNA contacts are shown. b, The central 6 bp region viewed from the viral DNA side, showing distorted DNA conformation with severely compromised base-stacking. c, The target DNA viewed from the viral DNA side, highlighting a large shift in the helical axis.

Extended Data Figure 8

DNA conformations in the RSV and PFV intasomes

a,b, DNA structure in the RSV (a) or PFV (b) intasome, alternatively referred to as the strand-transfer complex (STC). The PFV intasome model is PDB ID: 3OS0 [3]. c, A comparison of DNA structures between the RSV and PFV intasomes (STCs). The integration product DNAs (RSV in cyan, PFV in red) superimposed at a viral DNA terminus are shown in three different view angles. Note the significant deviation in overall trajectory of the target DNA, and difference in the orientation of the second viral DNA molecule. The region spanning the two integration sites on opposing target DNA strands is 6 bp for RSV and 4 bp for PFV.

The target DNA in the RSV intasome shows a strong overall bending of ~90° away from the core of the complex (Fig. 5a, Extended Data Fig. 7b), which may help prevent the reversal of integration as noted for the PFV intasome and related transpososome structures [3,20]. The bent conformation is stabilized by the minor groove contacts made by CCD of the catalytic subunits near the viral/target DNA junction, which include insertion of a short helix (α5) harboring Ser124, a residue important in target DNA capturing [21]. Localized kinks at the viral/target DNA junctions cause the DNA trajectory to also zigzag in the plane perpendicular to the direction of the primary bending, creating a ~20 Å shift in the helical axis with an overall positive writhe (Fig. 5c). As a result, the target DNA conformation in the RSV intasome deviates significantly from that in the PFV intasome [3] (Extended Data Fig. 8c) or the DNA structure in nucleosomes[22]. This suggests that RSV integration into nucleosomes would require a large conformational change in the target DNA, likely more extensive than that observed in the PFV intasome-nucleosome complex [4]. The active-sites of the catalytic IN subunits in the RSV intasome are separated by ~27 Å, larger than the distance of ~21 Å across the major groove between the backbone phosphates 6 bp apart in a B-form DNA. Accordingly, the 6 bp spacer region of target DNA is underwound and shows severely compromised base-stacking in addition to widening of the major groove (Fig. 5b, Extended Data Fig. 9). In particular, the central dinucleotide step shows a distorted conformation where the unstacked nucleobases stack on the deoxyribose moiety of the adjacent nucleotide; the unique conformation may explain the modest target sequence preference in RSV integration [23,24]. A CTD from each distal IN dimer has a loop (between β6 and β7) positioned over the major groove of the target DNA in this region, with Glu229 side chain positioned for potential base contacts (Fig. 5a). This loop from various CTDs are involved in viral DNA contacts, interactions between NTD and CTDs, and those linking the CTDs from the proximal and distal IN dimers in the RSV intasome (Fig. 4b, Extended Data Fig. 7e–g).
Extended Data Figure 9

Electron density for the central 6 bp of the target DNA

The sigma A-weighted 2mFo−DFc map contoured at 1.5 σ (a) or 2.5 σ (b), overlaid with the final model for the central 6 bp region between the two integration sites.

The RSV intasome structure shows a novel architecture of IN-DNA complex and highlights a remarkable diversity among retrovirus intasome assemblies. In particular, the structurally conserved CTD plays critical roles in the octameric RSV intasome distinct from those played by CTD in the tetrameric PFV intasome [2,3,5]. PFV IN features a unique ~30 amino-acid insertion between CCD and CTD (Fig. 1a, Extended Data Fig. 1), and the resulting long interdomain linker not only makes direct viral DNA contacts but also allows CTD of the catalytic IN subunit to fit between the NTD and CCD from the same molecule to interact with both viral DNAs (Fig. 3f). In case of RSV IN with a much shorter CCD–CTD linker, the NTD and CCD of the catalytic IN molecule is instead bridged by the CTDs contributed in trans from the distal IN dimer (Fig. 3c). Analogously, the extra ~50 amino acids on the N-terminus of PFV IN constitute an independent DNA-binding domain (NED), and its interaction with distal region of the viral DNA substitutes for viral DNA interactions made by the multiple copies of CTDs as observed in the RSV intasome (Fig. 3a, d). The larger (408 amino acids) IN from a gammaretrovirus murine leukemia virus (MLV) also has both the NED and a long CCD-CTD linker [25], consistent with the idea that these two structural features have complementary functions. The different modes of viral DNA engagement by either tetrameric or octameric IN, using similar sets of structurally conserved domains, suggest divergent evolution of the integration machineries. Of note, an octameric intasome assembly similar to that reported here has been recently observed by cryo-EM for another three-domain IN from mouse mammary tumor virus [26]. The integration of retroviral DNA into the host genome was first postulated based on the observation of RSV-infected cells [27], and identification and characterization of IN from retroviruses including RSV have provided a basic understanding of retrovirus integration and foundation for studying HIV-1 IN [28-31]. Our findings on the RSV intasome structure shed light on the previously unappreciated diversity in retroviral DNA integration and define a new framework for future studies of retrovirus IN. Availability of the intasome structures from multiple retrovirus systems will also allow more accurate modeling of the HIV-1 IN-DNA interactions to help development of novel antiretroviral drugs, including IN inhibitors that target outside the most conserved active-site features.

METHODS

RSV intasome preparation

RSV IN and its mutant forms were overexpressed in Escherichia coli BL21 (DE3) and purified as previously described [6]. The proteins were stored in aliquots at −80°C in 20 mM HEPES-NaOH (pH 7.5), 1.0 M NaCl, 20μM ZnCl2, 5 mM 2-mercaptoethanol and 10% (w/v) glycerol. The branched DNA substrate mimicking the product of the concerted integration reaction was obtained by annealing three synthetic oligonucleotides (Integrated DNA Technologies). A similar strategy had been used previously to prepare a PFV IN-DNA complex, which demonstrated that the IN-DNA complex assembled on the designed integration product has essentially an identical structure to the equivalent complex formed via the forward integration reaction [7]. The viral DNA branches carrying the high-affinity gain-of-function mutant RSV U3 long terminal repeat (LTR) sequence (GU3) [32] were attached to the target DNA duplex with a palindromic 6 bp spacer to generate a fully symmetrized structure. To prepare the RSV IN-DNA complexes for crystallization, 30 μM half-site DNA substrate was mixed with 120 μM RSV IN in 20 mM HEPES-NaOH (pH 7.5), 500 mM NaCl, 20% (w/v) glycerol, and 1 mM tris-(2-carboxyethyl)phosphine (TCEP). The mixture was dialyzed against low salt buffer (20 mM HEPES-NaOH pH 7.5, 125 mM NaCl, 20% (w/v) glycerol, 1 mM TCEP) at 25°C for overnight. At the end of this first dialysis, essentially all IN and IN-DNA complex precipitated. The mixture was subsequently dialyzed against high salt buffer (20 mM MES-NaOH pH 6.0, 1.2 M NaCl, 20% (v/v) dimethyl sulfoxide (DMSO), 5% (w/v) glycerol, 1 mM TCEP) at 25°C for 2–4 hours. At the end of this second dialysis, the reaction mixture became clear. The solubilized RSV IN-DNA complex (intasome) was purified through size-exclusion chromatography (Superdex 200 10/300 GL, GE Healthcare) running with 20 mM MES-NaOH pH 6.0, 1.2 M NaCl, 20% (v/v) DMSO, 5% (w/v) glycerol, and 1 mM TCEP. The isolated intasome remains stable in the high-salt condition containing 1.2 M NaCl that precludes complex formation, suggesting that once the intasome forms, it is kinetically trapped. The solubility-enhancing RSV IN mutation F199K completely abolished intasome formation. For the SEC-MALS analysis, a modified condition was used for the intasome re-solubilization and isolation for better baseline stability (Extended Data Fig. 4c).

Crystallization

For crystallization, the RSV intasome assembled through dialysis and purified by size-exclusion chromatography was concentrated to 4–6 mg/ml using centrifugal concentrator (Amicon). Various combinations of the lengths of the viral DNA (ranging from 16 bp to 25 bp) and flanking target DNA (ranging from 14 bp to 22 bp, corresponding to the full target DNA lengths of 34 bp to 50 bp) were screened in crystallization trials. The extensive screening yielded only one crystal form with a specific combination of 22-base (length of the non-transferred strand) viral DNA branches and 16 base-pair (bp) target DNA flanks on either side of the central six bp spacer (Fig. 1b, Extended Data Fig. 2). DNA substrates with two slightly different target sequences were used (5′-aatgttgtcttatgcaatactc-3′/5′-gagtattgcataagacaacagtgcacgaatcttgaagacact-3′/5′-agtgtcttcaagattc-3′, or 5′-aatgttgtcttatgcaatactc-3′/5′-gagtattgcataagacaacagtcgaccaaccttcaacttagc-3′/5′-gctaagttgaaggttg-3′), and they produced essentially the same crystals. The RSV intasome crystals were grown through reverse vapor diffusion in hanging drops at 22°C, by mixing 1.5 μL IN-DNA complex solution with 1.5 μL reservoir solution (3.2 M sodium formate). Crystals appeared within 3–5 days and reached a size of ~150–300 μm in 3~5 days. Even though the RSV intasome crystals initially diffracted X-ray poorly (~ 10 Å), soaking the crystals with a metatungstate cluster compound dramatically improved the resolution (Extended Data Fig. 3b, c). The tungsten cluster was later found to bind between IN dimers from separate intasome complexes to mitigate crystal lattice disorder. The crystals were soaked overnight with 0.15 mM metatungstate cluster Na6[H2W12O40] in a stabilization buffer consisting of 3.2 M sodium formate, 16 mM MES-NaOH pH 6.0, 0.8 M NaCl, 16% (v/v) DMSO, 4% (w/v) glycerol, and 1 mM TCEP. After soaking, the crystals were cryo-protected in 3.2 M sodium formate, 16 mM MES pH 6.0, 0.8 M NaCl, 16% (v/v) DMSO, 12% (w/v) glycerol, and 1 mM TCEP, and were frozen by rapid immersion in liquid nitrogen. The full-length wild-type RSV IN(1–286), the C-terminally truncated RSV IN(1–270), and its various mutant forms tested produced essentially the same crystals with indistinguishable X-ray diffraction properties.

Structure determination

X-ray diffraction data were collected at the Advanced Photon Source Northeastern Collaborative Access Team beamlines (24-ID-C/E) and the Advanced Light Source Molecular Biology Consortium (4.2.2) beamline and processed using HKL2000 [33] or XDS [34]. The RSV intasome crystals showed varied degrees of pseudo-merohedral twinning with twin operator [l, −k, h], owing to the very similar a and c unit cell dimensions of the primitive monoclinic lattice. Thus, we screened a large number of crystals to identify ones that diffract to higher resolution and have smaller twin fractions. The structure of RSV intasome was determined by molecular replacement with PHASER [35], using the RSV IN CCD, CTD (PDB ID: 4FW1) [6], and a 16 bp B-form DNA as search models. 8 copies of CCD, one copy of CTD and three copies of DNA molecules were located. Refinement of the partial model revealed electron density for two copies of the metatungstate clusters. The metatungstate clusters were placed into the electron density by molecular replacement using MOLREP [36]. Subsequent iterative model building using COOT [37] and refinement with PHENIX suite [38] allowed placement of the remaining 7 copies of CTD, 8 copies of NTD generated using MODELLER [39] based on HIV-1 NTD (PDB ID: 1K6Y) [40], and building of the inter-domain linkers as well as the remaining parts of the DNA molecule guided by the difference electron density maps. A third metatungstate cluster, which is more weakly bound compared to the first two, was positioned manually into residual density. The DNA base-pairs and base-stacking restraints were used throughout the refinement. The geometry restraints for protein included the reference-model restraints for CCD and CTD based on the higher resolution RSV IN structure (4FW1) [6], and the secondary structure and zinc-coordination restraints for NTD. Atomic displacement parameters were refined with grouped B-factors per residue for protein and DNA, and a total of 53 TLS groups assigned by PHENIX [38]. Twelve tungsten atoms representing each cluster were refined as a rigid body. The asymmetric unit of the crystal contains one complete RSV intasome, which includes eight IN molecules and two viral DNA branches emanating from a strongly bent 38 bp target DNA. The dataset used for the final refinement was from an RSV intasome crystal grown using selenomethionine-labeled RSV IN (1–270) with the following amino acid substitutions; C23S, L112M, L135M, L162M, L163M, L188M, and L189M, which we confirmed to be active in concerted integration and inhibited by INSTI similarly to the wild-type RSV IN, and a DNA substrate carrying a nick at the middle of each target DNA branch (the 16-base DNA strand shown in olive in Fig. 1b has a nick 8 bases from either end). The nick occasionally facilitated crystal growth, but it was not necessary for crystallization and did not change the space group or the unit cell parameters compared to crystals grown without the nick. Because this nick in the target DNA is not biologically relevant, it is not shown in Figs. 1b, 1d–f, 3b, 5a,c to avoid confusion. Twin refinement protocol was not used as the dataset used for the final refinement had a low (less than 10%) twin fraction. The summary of data collection and refinement statistics is shown in Extended Data Table 1. The paired-refinement procedure [41] was performed in steps of 0.1 Å to determine the high-resolution limit (Extended Data Fig. 3f). The register of amino acids in the final model was verified by the selenium anomalous difference Fourier peaks (Extended Data Fig. 5). Ramachandran analysis shows that 96.0, 3.9, and 0.1 % of the protein residues are in the most favored, allowed, and disallowed region, respectively. The NTD-CCD linker in some of the non-catalytic IN molecules and the last four bp at the distal end of one of the viral DNA molecules were not built due to poor electron density. The molecular graphics images were produced using PYMOL (www.pymol.org).

Amino acid sequence alignment of RSV, HIV-1, and PFV INs

The secondary structure elements for RSV IN are color-coded based on the three IN domains similarly to Fig. 1a. The residue numbering at the top is for RSV IN. For each IN, the black dots mark every ten residues. This figure was made using ESPript [42].

DNA substrate used for assembling the RSV intasome

a, The half-site (gaped duplex) substrate prepared by annealing three oligonucleotides dimerizes via the self-complementary 6-base spacer sequence (underlined) to form a branched structure mimicking the product of the concerted integration reaction (Fig. 1b). b, DNA structure in the RSV intasome. Viral DNA nucleotides are numbered, and some of the structural elements of RSV IN involved in the viral DNA interactions are shown.

RSV intasome crystal

a, A crystal of the RSV intasome. b, X-ray diffraction pattern from a crystal not treated with metatungstate. c, X-ray diffraction pattern after the metatungstate-soaking (see method for details). d,e, Lattice contacts within the RSV intasome crystal. DNA strands are colored as in Fig. 1b, while IN subunits from only one intasome are colored. The unit cell is shown in green. The small blue spheres represent tungsten atoms. The view in (e) is perpendicular to the two-fold screw (b) axis of the monoclinic lattice, which lies horizontally. f, Paired-refinement analysis [41] to assess the resolution limit of the RSV intasome diffraction data. For each pairwise comparison, model refinements were run at two different resolution limits and the R-factors calculated for a common (lower) resolution cutoff were compared. Inclusion of data beyond 3.7 Å in the refinement compromised model quality.

Biochemical characterization of RSV intasome

a, A representative size-exclusion chromatography (SEC) profile for RSV intasome, overlaid with that for a mixture of molecular weight markers. The buffer condition was as mentioned in the methods. b, SEC profiles for RSV intasomes formed with IN of varying C-termini. c, SEC-MALS (multi-angle light scattering) analysis of RSV intasome. The intasome formed with RSV IN (1–269 aa) was separated by SEC in a modified condition containing 20 mM HEPES pH 7.5, 1.0 M NaCl, 5 % glycerol, and 1.0 mM TCEP. The absolute molecular mass was determined by light scattering using in-line detectors described previously [43]. The mass profile for the intasome is shown in red across the peak. The molecular mass of RSV intasome was 240 ±10 kDa (n=4). A similar SEC-MALS analysis of the intasome formed with wt full-length RSV IN (1–286 aa) yielded a molecular mass of 268 kDa (n=2, data not shown). The calculated mass of an intasome containing eight RSV IN (1–269 or 270) molecules is ~288 kDa. d, Chemical cross-linking analysis of RSV intasome. The RSV intasome and free IN (1–269 aa) were purified by size-exclusion chromatography in the running buffer: 20 mM HEPES (pH 7.5), 1.0 M NaCl, 5 % glycerol, and 1.0 mM TCEP. The peak fractions of the intasome and IN were cross-linked with the indicated amount of ethylene glycol bis-succinimidylsuccinate (EGS) as described previously [43] and analyzed by SDS-PAGE. The majority of cross-linked species within the intasome were larger than a tetramer. The highest oligomeric species observed is consistent with an octamer migrating at ~220 kDa. The molecular weight markers are in the far right lane. The NuPAGE 4–12% gradient gel with a MES-based SDS-PAGE running buffer was used.

Selenium anomalous difference Fourier peaks confirming the model

Anomalous difference Fourier maps calculated using the data collected on selemomethionine-labeled RSV intasome, contoured at 3.5σ (blue mesh) or 5.0 σ (orange mesh). Methionine side chains are shown in sticks. a, A view covering the octameric RSV intasome. b, c, Close-up view of a proximal (b) and distal (c) IN dimer, respectively.

Comparison between RSV and PFV intasomes

a, b, Protein arrangement in the octameric RSV intasome. The inner and outer subunit of one proximal IN dimer is colored in green and cyan, respectively, with the catalytic triad (DDE) of the inner subunit shown in red. The other proximal IN dimer is colored similarly but in more pale colors. Arrows indicate the two proximal IN dimers. A distal IN dimer is colored in slate and orange. DNA is omitted in (a). (b) shows the same view as Fig. 3c. c, A close-up view around the active site of the inner IN subunit in the RSV intasome. The DNA strands are colored as in Fig. 1, and the catalytic triad residues (DDE) are shown in red sticks. d,e, Protein arrangement in the tetrameric PFV intasome [2,3]. The color scheme follows that used for the proximal IN dimers of RSV IN in (a, b). Arrows indicate the two IN dimers. DNA is omitted in (d). (e) shows the same view as Fig. 3f. f, A close-up view around the active site of the inner IN subunit in the PFV intasome (PDB ID: 3OS0 [3]). The color scheme follows that in (c).

Composite omit maps

Simulated annealing composite omit 2mFo−DFc density contoured at 1.0 σ, shown for area within 3.5 Å from any protein or DNA atom in the final model. In a and b, electron densities around protein and DNA are colored differently (blue and green, respectively).

DNA conformations in the RSV and PFV intasomes

a,b, DNA structure in the RSV (a) or PFV (b) intasome, alternatively referred to as the strand-transfer complex (STC). The PFV intasome model is PDB ID: 3OS0 [3]. c, A comparison of DNA structures between the RSV and PFV intasomes (STCs). The integration product DNAs (RSV in cyan, PFV in red) superimposed at a viral DNA terminus are shown in three different view angles. Note the significant deviation in overall trajectory of the target DNA, and difference in the orientation of the second viral DNA molecule. The region spanning the two integration sites on opposing target DNA strands is 6 bp for RSV and 4 bp for PFV.

Electron density for the central 6 bp of the target DNA

The sigma A-weighted 2mFo−DFc map contoured at 1.5 σ (a) or 2.5 σ (b), overlaid with the final model for the central 6 bp region between the two integration sites. Data collection and refinement statistics Statistics for the highest resolution shell are shown in parentheses. Reference: CC1/2: P.A. Karplus and K. Diederichs, Linking crystallographic model and data quality. (2012) Science 336, 1030–1033 (Ref. 41).

SI video. The RSV intasome crystal structure

A video showing the overall structure of the RSV intasome and positioning of the three structural domains of IN within the intasome.
  40 in total

1.  Multiple integrase functions are required to form the native structure of the human immunodeficiency virus type I intasome.

Authors:  H Chen; S Q Wei; A Engelman
Journal:  J Biol Chem       Date:  1999-06-11       Impact factor: 5.157

2.  An amino acid in the central catalytic domain of three retroviral integrases that affects target site selection in nonviral DNA.

Authors:  Amy L Harper; Malgorzata Sudol; Michael Katzman
Journal:  J Virol       Date:  2003-03       Impact factor: 5.103

3.  THE PARTICIPATION OF DNA IN ROUS SARCOMA VIRUS PRODUCTION.

Authors:  H M TEMIN
Journal:  Virology       Date:  1964-08       Impact factor: 3.616

4.  Crystal structure of the nucleosome core particle at 2.8 A resolution.

Authors:  K Luger; A W Mäder; R K Richmond; D F Sargent; T J Richmond
Journal:  Nature       Date:  1997-09-18       Impact factor: 49.962

5.  Architecture of a full-length retroviral integrase monomer and dimer, revealed by small angle X-ray scattering and chemical cross-linking.

Authors:  Ravi S Bojja; Mark D Andrake; Steven Weigand; George Merkel; Olya Yarychkivska; Adam Henderson; Marissa Kummerling; Anna Marie Skalka
Journal:  J Biol Chem       Date:  2011-03-15       Impact factor: 5.157

6.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

7.  Molecular and genetic determinants of rous sarcoma virus integrase for concerted DNA integration.

Authors:  Roger Chiu; Duane P Grandgenett
Journal:  J Virol       Date:  2003-06       Impact factor: 5.103

8.  A mutant murine leukemia virus with a single missense codon in pol is defective in a function affecting integration.

Authors:  L A Donehower; H E Varmus
Journal:  Proc Natl Acad Sci U S A       Date:  1984-10       Impact factor: 11.205

9.  Structural basis for retroviral integration into nucleosomes.

Authors:  Daniel P Maskell; Ludovic Renault; Erik Serrao; Paul Lesbats; Rishi Matadeen; Stephen Hare; Dirk Lindemann; Alan N Engelman; Alessandro Costa; Peter Cherepanov
Journal:  Nature       Date:  2015-06-10       Impact factor: 49.962

10.  Phaser crystallographic software.

Authors:  Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read
Journal:  J Appl Crystallogr       Date:  2007-07-13       Impact factor: 3.304

View more
  55 in total

1.  The Preserved HTH-Docking Cleft of HIV-1 Integrase Is Functionally Critical.

Authors:  Meytal Galilee; Elena Britan-Rosich; Sarah L Griner; Serdar Uysal; Viola Baumgärtel; Don C Lamb; Anthony A Kossiakoff; Moshe Kotler; Robert M Stroud; Ailie Marx; Akram Alian
Journal:  Structure       Date:  2016-09-29       Impact factor: 5.006

Review 2.  Integration site selection by retroviruses and transposable elements in eukaryotes.

Authors:  Tania Sultana; Alessia Zamborlini; Gael Cristofari; Pascale Lesage
Journal:  Nat Rev Genet       Date:  2017-03-13       Impact factor: 53.242

3.  Differential assembly of Rous sarcoma virus tetrameric and octameric intasomes is regulated by the C-terminal domain and tail region of integrase.

Authors:  Sibes Bera; Krishan K Pandey; Hideki Aihara; Duane P Grandgenett
Journal:  J Biol Chem       Date:  2018-09-05       Impact factor: 5.157

4.  A C-terminal "Tail" Region in the Rous Sarcoma Virus Integrase Provides High Plasticity of Functional Integrase Oligomerization during Intasome Assembly.

Authors:  Krishan K Pandey; Sibes Bera; Ke Shi; Hideki Aihara; Duane P Grandgenett
Journal:  J Biol Chem       Date:  2017-02-08       Impact factor: 5.157

Review 5.  Multifaceted HIV integrase functionalities and therapeutic strategies for their inhibition.

Authors:  Alan N Engelman
Journal:  J Biol Chem       Date:  2019-08-29       Impact factor: 5.157

6.  Allosteric HIV Integrase Inhibitors Promote Formation of Inactive Branched Polymers via Homomeric Carboxy-Terminal Domain Interactions.

Authors:  Kushol Gupta; Audrey Allen; Carolina Giraldo; Grant Eilers; Robert Sharp; Young Hwang; Hemma Murali; Katrina Cruz; Paul Janmey; Frederic Bushman; Gregory D Van Duyne
Journal:  Structure       Date:  2020-12-23       Impact factor: 5.006

7.  X-ray crystal structure of the N-terminal region of Moloney murine leukemia virus integrase and its implications for viral DNA recognition.

Authors:  Rongjin Guan; Sriram Aiyer; Marie L Cote; Rong Xiao; Mei Jiang; Thomas B Acton; Monica J Roth; Gaetano T Montelione
Journal:  Proteins       Date:  2017-02-03

8.  Structural basis for strand-transfer inhibitor binding to HIV intasomes.

Authors:  Dario Oliveira Passos; Min Li; Ilona K Jóźwik; Xue Zhi Zhao; Diogo Santos-Martins; Renbin Yang; Steven J Smith; Youngmin Jeon; Stefano Forli; Stephen H Hughes; Terrence R Burke; Robert Craigie; Dmitry Lyumkis
Journal:  Science       Date:  2020-01-30       Impact factor: 47.728

Review 9.  Structural Insights on Retroviral DNA Integration: Learning from Foamy Viruses.

Authors:  Ga-Eun Lee; Eric Mauro; Vincent Parissi; Cha-Gyun Shin; Paul Lesbats
Journal:  Viruses       Date:  2019-08-22       Impact factor: 5.048

10.  A Peptide Derived from Lens Epithelium-Derived Growth Factor Stimulates HIV-1 DNA Integration and Facilitates Intasome Structural Studies.

Authors:  Min Li; Xuemin Chen; Huaibin Wang; Kellie A Jurado; Alan N Engelman; Robert Craigie
Journal:  J Mol Biol       Date:  2020-02-13       Impact factor: 5.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.