Literature DB >> 26887496

Cryo-EM reveals a novel octameric integrase structure for betaretroviral intasome function.

Allison Ballandras-Colas1, Monica Brown2, Nicola J Cook3, Tamaria G Dewdney1, Borries Demeler4, Peter Cherepanov3,5, Dmitry Lyumkis2, Alan N Engelman1.   

Abstract

Retroviral integrase catalyses the integration of viral DNA into host target DNA, which is an essential step in the life cycle of all retroviruses. Previous structural characterization of integrase-viral DNA complexes, or intasomes, from the spumavirus prototype foamy virus revealed a functional integrase tetramer, and it is generally believed that intasomes derived from other retroviral genera use tetrameric integrase. However, the intasomes of orthoretroviruses, which include all known pathogenic species, have not been characterized structurally. Here, using single-particle cryo-electron microscopy and X-ray crystallography, we determine an unexpected octameric integrase architecture for the intasome of the betaretrovirus mouse mammary tumour virus. The structure is composed of two core integrase dimers, which interact with the viral DNA ends and structurally mimic the integrase tetramer of prototype foamy virus, and two flanking integrase dimers that engage the core structure via their integrase carboxy-terminal domains. Contrary to the belief that tetrameric integrase components are sufficient to catalyse integration, the flanking integrase dimers were necessary for mouse mammary tumour virus integrase activity. The integrase octamer solves a conundrum for betaretroviruses as well as alpharetroviruses by providing critical carboxy-terminal domains to the intasome core that cannot be provided in cis because of evolutionarily restrictive catalytic core domain-carboxy-terminal domain linker regions. The octameric architecture of the intasome of mouse mammary tumour virus provides new insight into the structural basis of retroviral DNA integration.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 26887496      PMCID: PMC4908968          DOI: 10.1038/nature16955

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


MMTV intasomes were assembled from IN and vDNA components by differential salt dialysis, akin to the strategy used for PFV intasomes[2]. Fractionation of assembly reactions by size exclusion chromatography (SEC) revealed a higher-order species with a distinct elution profile from those of IN and vDNA (Fig. 1a). To address biological relevance, reactions were conducted with supercoiled plasmid tDNA to monitor the concerted integration of two vDNA ends[10] (Fig. 1b). The SEC-purified complexes catalyzed efficient concerted integration activity, which was inhibited by the IN strand transfer inhibitor raltegravir (Fig. 1c). The sequencing of concerted integration products excised from agarose gels revealed that the majority contained 6 bp target site duplications flanking the integrated vDNA ends, which are known to occur during MMTV infection[11] (Fig. 1d). To address the specificity of complex formation, the invariant CA dinucleotide, which is essential for IN catalysis[12,13], was mutated to GT in the vDNA substrate. As the mutant vDNA failed to support complex formation (data not shown), we conclude that the higher-order species identified by SEC are bona fide MMTV intasomes. We note that divalent metal ion was a critical cofactor for MMTV intasome formation. Based on prior reports that Ca2+ promoted the assembly of active HIV-1 IN-vDNA complexes but was unable to support IN catalysis[14], it was utilized here for intasome assembly.
Figure 1

MMTV intasome (Int) characterization

a, Purification by SEC. Elution positions of mass standards in kDa are indicated. b, Integration assay schematic. Int or IN plus vDNA was reacted with supercoiled tDNA, which can yield half site (h.s.) or concerted integration (c.i.) products. c, Ethidium bromide-stained agarose gel. Lane 1–3 reactions were initiated with IN; vDNA was omitted from lane 1. Raltegravir (RAL) was included as indicated. Lanes 4 and 5, Int reactions. Migrations positions of h.s. products that co-migrate with open circular (o.c.) tDNA, c.i. products, supercoiled (s.c.) tDNA and mass standards in kb are indicated. For gel source data, see Supplementary Figure 1. d, Sequenced Int-mediated concerted integration products (n=35).

To determine the MMTV intasome structure, single-particle cryo-EM data was collected on a Titan Krios microscope equipped with a Gatan K2 direct detector. Computational processing of the data revealed the most stable structural conformation of the complex, which was refined to ~5–6 Å for different regions of the map (Fig. 2a and Extended Data Figs. 1 and 2). The MMTV intasome is composed of central core density, as well as flanking density regions that are conformationally mobile in comparison to the intasome core (Extended Data Fig. 3). Restricting data refinement to the core density region accordingly increased the resolution for the central portion of the structure to ~4 Å for the best-resolved regions (Extended Data Fig. 2d).
Figure 2

Cryo-EM structure of the MMTV intasome

a, top view (upper) of the cryo-EM map; the lower view is rotated 90°. Core density and flanking density regions are indicated. b, Individual domain crystal structures (NTD, green; CCD, orange; CTD, purple) and vDNA (blue) model fitted by rigid body docking. Rulers demarcate 20 Å.

Extended Data Figure 1

Cryo-EM data and refinement

a, Representative cryo-electron micrograph of MMTV intasomes, taken at 2.7 μm underfocus. b, Same as in panel a, marked to show selected particles. c, 2D class averages calculated using Relion[56]. d, Initial model from the class averages calculated using OptiMod[57]. e, Refined reconstruction from the full dataset, with an Euler angle distribution plot showing the relative orientations of the particles.

Extended Data Figure 2

Cryo-EM resolution analysis of reconstructed intasome maps

a, Fourier shell correlation (FSC) curve corresponding to the refined map generated from the full intasome dataset. b, FSC curve corresponding to the refined map generated from the core intasome dataset with the NTDs, CCDs and interdomain linker regions of the flanking IN dimers computationally subtracted. Average global resolutions in panels a and b are indicated. c, Refined map generated from the full dataset (left) displayed side-by-side with the same map colored for local resolution (right). d, Refined map generated from the core intasome dataset (left) displayed side-by-side with the same map colored for local resolution (right) using the coloring scheme in panel c. e, Rotational snapshots of segmented density of CCD1 with the fit of the refined model (see Extended Data Fig. 6) highlighting structural features evident at ~4–5 Å resolution. Partial separation of β-strands, which is typically evident at or beyond 4.5 Å resolution, is apparent.

Extended Data Figure 3

Structural heterogeneity of the MMTV intasome

a, Stable structural conformation of the MMTV intasome after 3D classification of the data. Slices from the density map are displayed below. b, One of several conformations of MMTV intasome refinement after 3D classification of the data. Slices from the density map are displayed below. Multiple fuzzy regions in the flanking INs are apparent in b, which are indicative of remaining heterogeneity within the data and/or continuous structural mobility of the region. c, Overlay of the two reconstructed maps, highlighting the extent of mobility within the flanking regions (brackets).

Each IN monomer is composed of an N-terminal domain (NTD), CCD, and CTD (Extended Data Fig. 4a), and the map was sufficiently detailed to readily assign these domains to their corresponding cryo-EM densities. Given a lack of MMTV IN structures, the different protein domains were crystallized as INCCD, INCTD and INNTD-CCD fragments, and these structures were refined to 1.7 Å, 1.5 Å and 2.7 Å resolution, respectively (Extended Data Fig. 5 and Extended Data Table 1). MMTV DNA was modeled using PFV intasome DNA coordinates and by extending the modeled fragment by 3 bp in the region distal from the IN active sites to account for the different vDNA lengths. Using rigid body docking, the two vDNAs and eight NTDs, CCDs and CTDs were unambiguously positioned into the cryo-EM map (Fig. 2b). Rosetta[15-17] was used to refine the X-ray structures and vDNA, and to build a subset of interdomain linker regions to best fit the density within the intasome core region (Extended Data Fig. 6 and Supplementary Videos 1–5). The resulting model revealed two molecules of vDNA and eight MMTV INs arranged as four IN dimers (Fig. 3a). Two catalytic IN dimers A and B are positioned in the core region in close contact to the vDNAs, whereas supportive IN dimers C and D locate to the flanking density regions, donating their CTDs to the core. Flexible linkers connect the IN domains, and the NTD-CCD linker, which is contracted in the majority of IN protomers, extends in IN1 and IN3 to donate these NTDs in trans to opposing CCDs (Fig. 3a and Extended Data Fig. 6e). Sedimentation velocity centrifugation indicated the molecular mass of active MMTV intasomes as 302.1 kDa, which is fully consistent with the octameric IN structure (calculated IN8-vDNA2 = 313.6 kDa; Extended Data Fig. 4b).
Extended Data Figure 4

MMTV IN domains and intasome sedimentation coefficient distribution

a, Primary IN sequence alignment with boxes denoting canonical IN structural domains. The N-terminal extension domain (NED) occurs in spuma-, γ- and ɛ-retroviral IN proteins. Identical residues between MMTV, RSV, HIV-1 and PFV INs are highlighted by red background; residues that are minimally conserved in three of the sequences are in red. PFV IN secondary structure elements are from PDB code 3L2Q; MMTV elements are from the INNTD-CCD and INCTD crystal structures described here (PDB codes 5CZ2 and 5D7U, respectively). α, β, η, TT and TTT represent α helix, β strand, 310 helix, α-turn and β-turn, respectively. Figure generated with ESPript 3.0 (ref. 61). b, Monte Carlo analysis of sedimentation velocity data for the higher loading concentrations of vDNA (green), MMTV IN (blue) and intasome (red). A clear shift to a discrete species at 10.5 s is observed for the intasome, with minor IN and vDNA populations evident. Different centrifugation parameters for IN and vDNA versus intasomes (see Methods) likely attributed to the minor variations in sedimentation coefficient between major and minor IN and vDNA species. Measured sedimentation coefficients and calculated molar masses compared to theoretical molar masses are shown beneath the graph.

Extended Data Figure 5

MMTV IN domain crystal structures

a, Stereo view of the final 2Fo-Fc density map of the INCCD crystal structure with blue mesh contoured at 1σ. Amino acid side chains are readily evident at the 1.7 Å resolution. b, Stereo view of the final 2Fo-Fc density map of the 2.7 Å resolution INNTD-CCD crystal structure with blue mesh contoured at 1σ. The map is centered on the DDE catalytic triad (red sticks); green spheres, Mg2+ ions. c, Cartoon representation of the INCCD monomer (one of 4 in the crystallographic asymmetric unit) colored in gold. Active site residues are shown as red sticks. d, Cartoon representation of the INNTD-CCD dimer structure (one of 3 in the asymmetric unit). The NTD and CCD are colored green and gold, respectively. Red sticks, active site residues; grey and green spheres, Zn2+ and Mg2+ ions, respectively. e, Stereo view of the final 2Fo-Fc density map of the 1.5 Å resolution INCTD crystal structure, shown as a green mesh contoured at 1σ. f, Cartoon representation of one of the two CTD monomers present in the asymmetric unit.

Extended Data Table 1

X-ray crystallography data collection and refinement statistics.

ConstructCCDNTD-CCDCTD
Data collection
Space groupP1P1211C2221
Cell dimensions
a, b, c (Å)51.89, 53.71,69.6554.37, 83.15, 141.1435.99,42.28, 139.09
a, b, g (°)69.69, 82.08, 63.9790,90.19,9090, 90, 90
Resolution (Å)*46.6 – 1.70(1.73–1.70)70.6 – 2.72 (2.79 – 2.72)40.4 – 1.50(1.53–1.50)
Rmerqe0.060 (0.57)0.08 (0.534)0.043 (0.585)
llsl21.0(2.0)9.5(2.0)29.2 (3.8)
Completeness (%)99.1 (95.6)99.3 (99.0)99.8 (99.9)
Redundancy5.2 (2.8)3.2(3.1)12.2 (8.9)
Refinement
Resolution (Å)32.8 – 1.7070.6 – 2.7240.4 – 1.50
No. reflections used69,07532,11517,448
Rwork/Rfree0.189/0.2220.245/0.2660.165/0.202
No. atoms
 Protein4,9839,110890
 Ligand/ion0128
 Water437069
B-factors
 Protein26.070.928.5
 Ligand/ion45.646.4
 Water33.546.9
R.m.s deviations
 Bond lengths (Å)0.0070.0100.005
 Bond angles (°)0.9541.2810.911

Data for the highest resolution shells are given in parenthesis.

Extended Data Figure 6

Molecular modeling of cryo-EM density

a, Stereo views showing comparisons between the starting X-ray domain models and refined cryo-EM domain models for IN1 highlight relatively minor structural perturbations that are evident only in the most flexible regions of the intasome. b, Linker region snapshots. Atomic models were built de novo from the cryo-EM density for the indicated linkers in the top two panels (residues 45–54 connecting NTD1 and CCD1 and CCD-CTD residues 211–213). Linkers NTD2-CCD2, CCD5-CTD5 and CCD6-CTD6 were not modeled, but are shown as cryo-EM density (red) in the lower panels. c, Stereo view of the cryo-EM model for the MMTV intasome core region (Extended Data Fig. 2d), generated using Rosetta[15–17]. All domains were refined starting with the X-ray crystal structures (Extended Data Fig. 5). Specific linker regions were built de novo (continuous red lines) from the cryo-EM density, whereas lower resolution linker regions (red dotted lines) were omitted from the model. d, FSC curve between the refined cryo-EM core intasome model and map, showing an average resolution of 4.8 Å. e, Comparison of two NTD-CCD conformations in the intasome highlights the NTD-CCD linker, which assumes a retracted state in the outer IN2 and IN4 monomers of core intasome dimers A and B, respectively, as well as in flanking IN dimers C and D (left). The linker extends in core IN molecules IN1 and IN3, which interact with the vDNA (right).

Figure 3

Comparison of MMTV and PFV intasome structures

a, MMTV (left) and PFV (right) intasomes color coded to highlight IN dimers and constituent protomers. Core dimers A and B are red-orange and green-chartreuse, respectively, while MMTV flanking IN dimers C and D are blue-sky blue and purple-light pink, respectively. Colored circles highlight similarly positioned CTDs between structures. b, Close-up views of Arg240-mediated protein (left) and vDNA (right; G6 of plus-strand) interactions. For simplicity, only one set of asymmetric interactions is shown. The interaction of IN5 with residues 258–261 of IN6 varied during model refinement, with the indicated interaction (as well as other atomic distances) observed in the final refined model. Colors are conserved between panels a and b.

The structures of the MMTV and PFV intasomes were compared to ascertain aspects of the new structure important for DNA recombination (Fig. 3a). The PFV intasome is composed of two IN dimers A and B, with the inner protomers of each dimer (IN1 and IN3; red and green in Fig. 3a) adopting extended conformations[2]. The NTDs and CTDs of the outer IN protomers (chartreuse and orange in Fig. 3a) are unseen in PFV intasome density maps. The architecture in the core density region of the MMTV intasome is strikingly similar to the PFV structure. For example, the positions of the CCDs and NTDs that contact vDNA (red IN1 and green IN3 in Fig. 3a) are analogous. The two remaining NTDs in the core region stabilize the CCD dimer interface in an arrangement identical to that seen in the INNTD-CCD crystal structure (Extended Data Figs. 5d and 6e). Both flanking density regions contain a CCD dimer that is also stabilized on each side by NTDs, mimicking the CCD dimer arrangements found in the core density region. The arrangements of the CTDs differ dramatically between the MMTV and PFV structures, with MMTV IN residue Arg240 mediating several key contacts. For example, core protomer IN1 Arg240 interacts with vDNA while IN2 Arg240 interacts with IN1 Asp233 (Fig. 3b). Flanking protomer IN5 Arg240 engages its IN6 neighbor whereas IN6 Arg240 mediates an inter-dimeric interaction with core protomer IN1 Asp223, docking the flanking IN dimer to the core structure (Fig. 3b). To test the functionality of the flanking IN dimers, complementation assays were performed by varying ratios of wild-type (INWT) and mutant IN proteins in strand transfer reactions. Similar strategies were used previously to dissect the division of labor within multimeric complexes of retroviral IN[18-21] as well as the related bacteriophage Mu transpososome[22]. INR240E, which like INWT purified as a dimer (Extended Data Fig. 7), was defective for strand transfer activity (Fig. 4a). To assess the functionality of Arg-240-mediated IN-IN interactions, we compared INR240E to INK165E, which carries a change that uniquely disrupts IN-vDNA binding[2,23]. In concordance with its inability to assume the roles of inner IN1 and IN3 subunits of the core tetramer, INK165E mildly stimulated the activity of limited INWT protein (Fig. 4b), presumably providing a source for other IN subunits within the functional complex. INR240E by contrast potently inhibited INWT function, confirming the importance of Arg240-mediated protein-protein interactions for MMTV IN activity. Two deletion mutant constructs, INCCD-CTD and INCTD, which purified as dimers and monomers, respectively (Extended Data Fig. 7), were additionally analyzed. The reaction composed of 75% INCCD-CTD supported near INWT activity, indicating that this mutant could function when present in up to 6 of 8 octamer positions. This finding strongly supports flanking IN dimer functionality, as the absence of the NTD would likewise preclude INCCD-CTD from assuming intasome core positions 1 and 3. As the INCTD response curve overlaid that predicted for non-functional protein, we moreover conclude that CCD-mediated dimerisation is critical for flanking IN CTD function (Fig. 4).
Extended Data Figure 7

Gel filtration profiles of INWT and IN mutant proteins

Migration positions of mass standards in kDa as well as theoretical protein monomer (M) and dimer (D) positions are indicated.

Figure 4

MMTV intasome functionality

a, Representative agarose gels. The reactions in lanes 1–4 contained 1,0, 0.75, 0.5, 0.25 μM INWT, respectively; IN was omitted from the reaction in lane 5. Subsequent 5-reaction sets contained the same INWT concentrations with 0, 0.25, 0.5, 0.75, 1.0 μM of the indicated mutant protein, for a total concentration of 1 μM IN in lanes 6–25. Lanes 1–5 versus lanes 6–15 and 16–25 were from separate agarose gels (see Supplementary Fig. 1 for gel source data); other labeling as in Fig. 1. b, Dashed lines indicate theoretical activities (graphed as percent INWT activity) for mixtures that contain a mutant protein that supports full INWT function when present in 6 of 8 octamer positions (blue dashed line), 4 of 8 positions (green dashes), 2 positions (purple dashes) or is unable to complement INWT function (pink dashes). Actual activities are from 4 technical replicates (average ± s.e.m.; see Supplementary Table 1 for source data). The non-linear response of INWT (grey line with red diamonds) likely reflects concentration-dependent cooperative multimerization of IN with vDNA[30]. The INWT alone and INWT + INCTD values were not significantly different (P >0.1; two-tailed t-test). *, P <0.05; **, P <0.01.

Analysis of IN primary sequences suggests an explanation for the octameric arrangement of IN within the MMTV intasome when an IN tetramer suffices for PFV integration. Whereas 50-residue CCD-CTD linkers afford the positioning of inner PFV IN CTDs for vDNA and tDNA engagement[2,3], the analogous 8-mer MMTV linker is simply too short to accomplish the task (Extended Data Fig. 8a). MMTV has accordingly evolved to employ flanking IN dimers to nestle CTDs into the core intasome structure to provide essential CTD function in trans for integration. As flanking IN dimer CTDs 6 and 8 structurally mimic the PFV domains (Fig. 3a and Extended Data Fig. 8a), we presume these CTDs will engage tDNA during MMTV integration. Extending our analysis to other retroviruses indicates that in addition to the spumaviruses, IN tetramers could suffice for γ- and ɛ-retroviral intasome activity, while an IN octamer will be required to catalyze α-retrovirus in addition to β-retrovirus integration (Extended Data Fig. 8b). We note that Aihara and colleagues have independently determined an octameric IN architecture for the α-retrovirus Rous sarcoma virus (RSV) intasome[24]. Whereas the majority of studies have highlighted a tetramer as the IN species that catalyzes concerted HIV-1 integration[9,25,26], others have implicated a role for octameric IN[27,28]. Given the intermediary length of lentiviral IN CCD-CTD linker regions (Extended Data Fig. 8b), the higher-order nature of IN in active HIV-1 intasomes may need to be reevaluated.
Extended Data Figure 8

Comparisons of PFV and MMTV intasome structures

a, Cartoon representations of the inner IN3 green subunits of the MMTV and PFV intasomes (Fig. 3a; vDNA strands are in grey). CCD-CTD linker regions are highlighted in orange, and dashed lines circle analogously positioned CTDs. Of note, this CTD in the MMTV structure is colored differently because it originates from a separate IN molecule (IN8 from flanking dimer D). b, Lengths of NTD-CCD and CCD-CTD interdomain linker regions across retroviral IN proteins; ‘+’ indicates the presence of an NED. The multimeric state of IN in known intasome structures is indicated by bold type. c, The PFV intasome with bound tDNA (PDB code 3OS2; orange) was superimposed with the MMTV intasome (blue). The distance between overlaid active sites is in each case ~26 Å. d, 90° rotation of superimposed structures, with proteins omitted for clarity. Canonical B-form tDNA (magenta) was superimposed with PFV intasome tDNA. The positions of phosphodiester bonds staggered by 4 bp in the PFV crystal structure or by 6 bp in the modeled tDNA are indicated by spheres.

PFV IN, which cleaves tDNA phosphodiester bonds separated by 4 bp, preferentially integrates into flexible sequences, whereas MMTV and RSV IN, which cleave tDNA with 6 bp staggers, are relatively unconstrained by tDNA flexibility[3,29]. Superposition of the MMTV and PFV intasome structures revealed that the two sets of catalytic IN active sites almost perfectly aligned (Extended Data Fig. 8c). The same practical spacing of IN active sites therefore catalyzes PFV and MMTV integration into sharply bent versus relatively non-deformed tDNA, respectively (Extended Data Fig. 8d). Due to their positions in the structure, we note that the flanking IN dimers dramatically expand the potential contact area with tDNA, which is likely to have consequences for the docking of α- and β-retroviral intasomes to host chromatin.

METHODS

Statistical methods were not used to predetermine sample sizes. Experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.

DNA constructs

Full-length (FL) MMTV IN[31] and INCTD (IN212–266 and IN212–319) expression constructs provided N-terminal His6 tags followed by human rhinovirus (HRV) 3C protease cleavage sites. The INNTD-CCD expression construct was made by introducing a stop codon after the TCA that encodes for IN residue Ser212. INK165E and INR240E expression constructs were made by PCR-directed mutagenesis. DNA fragments corresponding to IN51–212 (INCCD) and IN51–319 (INCCD-CTD) were amplified by PCR and subcloned into expression vector pET-20b (Novagen); these proteins harbored cleavable C-terminal His6 tags. The sequences of all PCR amplified regions of plasmid DNAs were verified by sequencing.

Protein expression and purification for intasome and IN activity assays

FL INs, INCCD-CTD and INCTD/212–319 were expressed in Escherichia coli strain PC2 (ref. 32) in LB broth (supplemented with 50 μM ZnCl2 for FL INs) by induction with 0.4 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) (1 mM IPTG for INCCD-CTD) at 30 °C (37 °C for INCCD-CTD and INCTD) for 4 h. Bacteria pellets were resuspended in 20 mM HEPES, pH 7.6, 1 M NaCl, 5 mM 3-[(3-cholamidopropyl)dimethylammonio]-1-propanesulfonate (CHAPS), complete EDTA-free protease inhibitor (Roche). Following sonication for 5 min at 50 mA, cell lysates were centrifuged at 45,000 × g for 1 h. The supernatant, supplemented with 5 mM imidazole, was filtered through a 0.45 μm filter and purified using a Ni2+-charged HisTrap 5 mL column (GE Healthcare) equilibrated with 20 mM HEPES, pH 7.6, 1 M NaCl, 5 mM CHAPS, 15 mM imidazole. Proteins were eluted by a linear gradient of imidazole (15–500 mM) containing a step wash at 65 mM imidazole using the ÄKTA purifier system (GE Healthcare; for INCCD-CTD, a second step wash was done at 115 mM imidazole). IN-containing fractions were diluted 1:5 with 20 mM HEPES, pH 7.6, 5 mM CHAPS, 2 mM dithiothreitol (DTT) and immediately loaded on a Heparin HiTrap 5 mL column equilibrated with 20 mM HEPES, pH 7.6, 200 mM NaCl, 5 mM CHAPS, 2 mM DTT. Proteins were eluted by a linear NaCl gradient from 200 mM to 2 M (INCTD was isolated in the column flow through). IN-containing fractions were pooled and cleaved with HRV 3C protease (GE Healthcare) overnight at 4 °C to remove the His6 tag. In lieu of purification by Heparin HiTrap, INCCD-CTD was dialysed against 20 mM HEPES, pH 7.6, 1 M NaCl, 5 mM CHAPS, 2 mM DTT, 2 mM EDTA at 4 °C for 2 h, cleaved with HRV 3C protease overnight at 4 °C, followed by dialysis against 20 mM HEPES, pH 7.6, 1 M NaCl, 5 mM CHAPS, 2 mM DTT, 0.5 mM EDTA (SEC1 buffer). Cleaved proteins were purified by SEC using a Superdex 200 10/300 column (GE Healthcare) equilibrated with SEC1 buffer. Purified INs were concentrated by ultracentrifugation using 10-kDa molecular weight cutoff Millipore concentrators and dialysed overnight against SEC1 buffer supplemented to contain 10% glycerol. Protein concentration was determined by spectrophotometry, and aliquots flash-frozen in liquid N2 were stored at −80 °C. Purified INs were analyzed by SEC using a Superdex 3.2/300 column equilibrated with SEC1 buffer; protein standards were from Bio-Rad.

MMTV intasome assembly

Intasomes were assembled by mixing 128 μM MMTV IN with 38 μM 22 bp preprocessed vDNA (5′-CAGGTCGGCCGACTGCGGCA/5′-AATGCCGCAGTCGGCCGACCTG) in 20 mM HEPES, pH 7.6, 600 mM NaCl, 2 mM DTT, prior to dialysis for 16 h at 4 °C against 25 mM Tris-HCl, pH 7.4, 80 mM NaCl, 2 mM DTT, 25 μM ZnCl2, 10 mM CaCl2. The resulting milky white precipitate was dissolved by adding NaCl to the final concentration of 250 mM, followed by incubation on ice for 1 h. Following 10 min centrifugation at 20,000 g at 4 °C, soluble intasomes were purified by SEC using Superdex 200 10/300 equilibrated with 25 mM Tris-HCl, pH 7.4, 200 mM NaCl, 2 mM DTT, 25 μM ZnCl2, 10 mM CaCl2 (SEC2 buffer). Intasome-containing fractions, which eluted around 10.5 mL, were concentrated by ultracentrifugation using 10-kDa cut off concentrators.

In vitro integration assays

Strand transfer assays were performed as described previously[31]. Briefly, 1 μM intasome or 1 μM MMTV IN plus 0.5 μM vDNA were mixed with 300 ng pGEM-3 tDNA in 40 μL of 20 mM HEPES, pH 7.4, 60 mM NaCl, 5 mM MgCl2, 4 μM ZnSO4, 10 mM DTT. Reactions incubated for 1 h at 37 °C were terminated by adding 25 mM EDTA–0.5 % SDS. DNA products deproteinized by digestion with proteinase K and precipitated with ethanol were analyzed by electrophoresis through 1.5% agarose gels and visualized by staining with ethidium bromide. Raltegravir, which was used at the final concentration of 100 μM, was obtained from Selleck Chemicals. Proteins were premixed on ice prior to addition to reactions for biochemical complementation assays. Concerted integration products were measured by band intensity quantification relative to INWT product formation, which was set to 100% using Molecular Imager® Gel Doc TM XR+ System with Image Lab software (BioRad); the background across 8 gel images corresponded to 1.26% ± 0.47% of INWT function. Concerted integration reaction products were cloned and sequenced essentially as previously described[32]. Briefly, DNA excised from agarose gels was repaired using Phi29 DNA polymerase (New England Biolabs) and ligated to a PCR-amplified kanamycin resistance cassette. Plasmids recovered following transformation of ligation mixtures into E. coli were sequenced using primers that annealed to the ends of the cassette DNA.

Analytical ultracentrifugation

Sedimentation velocity analysis was performed at 20 °C in a Beckman Optima XL-I analytical ultracentrifuge using an An60Ti rotor and standard 2-channel epon centerpieces (Beckman-Coulter). Samples were prepared in 20 mM phosphate buffer, pH 7.5, 150 mM NaCl at two loading concentrations, A280 values of 0.3 and 0.9 for MMTV IN and the intasome, and A280 values of 0.18 and 0.53 for vDNA, to exclude potential mass action oligomerization. IN and vDNA were spun simultaneously at 35,000 rpm for 22 h while the intasome was spun at 27,000 rpm for 12 h; the different rotor speeds were based on the predicted masses of the different macromolecules. Data were analyzed using UltraScan-III version 2.2, release 2000 (ref. 33). Hydrodynamic corrections for buffer density and viscosity were estimated with UltraScan to be 1.041 g/mL and 1.101 cP, respectively. The partial specific volume of IN (0.728 mL/g) was estimated by UltraScan from its protein sequence using a method analogous to the methods outlined in Laue et al.[34]. Sedimentation velocity data were analyzed as described[35]. Optimization was performed by two-dimensional spectral analysis (2DSA)[36] with simultaneous removal of time-invariant and radially-invariant noise contributions[37]. 2DSA solutions, which are subjected to parsimonious regularization by genetic algorithm analysis[38], were further refined using Monte Carlo analysis to determine confidence limits for the determined parameters[39]. Calculations were performed on the Lonestar cluster at the Texas Advanced Computing Center at the University of Texas at Austin.

Protein expression and X-ray crystallography

MMTV INCCD, INNTD-CCD and INCTD fragments spanning MMTV IN residues 51–212, 1–212 and 212–266, respectively, were expressed in BL21(DE3)-CodonPlus cells (Stratagene) in LB medium (supplemented with 50 μM ZnCl2 for INNTD-CCD) by induction with 0.01% (w/v) IPTG. Bacteria were lysed by sonication in 0.5 M NaCl, 50 mM Tris-HCl, pH 7.4 and the proteins were isolated by absorption to Ni-nitrilotriacetic acid agarose (Qiagen). Following digestion with HRV 3C protease to release His6 tags, the proteins were further purified by ion exchange and size exclusion chromatography. Crystals were grown by vapor diffusion in hanging drops by mixing 1 μL protein (6–10 mg/mL in 200 mM NaCl, 2 mM DTT, 25 mM Tris-HCl, pH 7.5) and 1 μL reservoir solution, which contained 12.5% PEG-3350, 0.15 M ammonium citrate, pH 6.5 (INCCD), 19% PEG-3350, 0.2 M MgCl2, 5% (v/w) 1-butyl-3-methylimidazolium dicyanamide (INNTD-CCD), or 19% isopropanol, 50 mM ammonium acetate, 0.1 M HEPES-NaOH, pH 7.5 (INCTD). Crystals, cryoprotected with 25% glycerol (INCCD, INNTD-CCD) or 30% PEG-400 (INCTD), were frozen by immersion in liquid N2. Diffraction data for the INCCD was collected using a charged coupled device detector at beamline BM14 (European Synchrotron Radiation Facility [ESRF]) whereas INCTD and INNTD-CCD crystals were analyzed at beamline I03 (Diamond Light Source) equipped with a PILATUS direct detector. The data integrated with XDS[40] were scaled with Aimless[41]. The structures, which were each derived from a single crystal, were solved by molecular replacement in Phaser[42] using search models generated from PDB entries 1ASV (CCD)[43], 3F9K (NTD)[10] and 1EX4 (CTD)[44]. The models were rebuilt using ARP/wARP[45] and/or manually in Coot[46] and refined in Phenix[47] and/or Refmac[48]. Pseudo-merohedral twin law (-h,-k,l) was accounted for during refinement of the INNTD-CCD structure. Final models, validated with MolProbity[49], had at least 96.9% of residues in the favored regions and none in the disallowed regions of the Ramachandran plot. Detailed X-ray data collection and refinement statistics are given in Extended Data Table 1.

Cryo-EM data acquisition

Sample containing MMTV intasomes in SEC2 buffer supplemented to contain 0.05% NP-40 was applied onto freshly plasma treated (6s, Gatan Solarus plasma cleaner) holey carbon C-flat grids (CF-1.2/1.3–4C, Protochips, Inc.), adsorbed for 30 sec and then plunged into liquid ethane using a manual cryo-plunger in an ambient environment of 4 °C. Data were acquired over three separate sessions using Leginon software[50] installed on an FEI Titan Krios electron microscope operating at 300 kV, with a dose of 40 e-/Å2 at the rate of ~6.9 e-/pix/sec and an estimated underfocus ranging from 1–4 μm (centered at 2.6 ± 0.6 μm). The dose was fractionated over 50 raw frames collected over 10-sec exposure time (200 ms per frame) on the Gatan K2 Summit direct detection device, with each frame receiving a dose of ~6.5 e-/pixel/sec. 2714 movies were collected and recorded at a nominal magnification of 22,500, corresponding to a pixel size of 1.31 Å at the specimen level. The individual frames were gain corrected, aligned and summed using a GPU-enabled whole frame alignment program as previously described[51,52], and exposure filtered[53] according to the 6.9 e-/pix/sec dose rate. See Extended Data Table 2 for additional details on cryo-EM data collection.
Extended Data Table 2

Cryo-EM data statistics.

Constructcore MMTV intasomefull MMTV intasome
EM data collection/processing
MicroscopeTitan KriosTitan Krios
Voltage300300
CameraGatan K2 SummitGatan K2 Summit
Defocus range (μm)1.0–4.01.0–4.0
Defocus mean ± std (μm)2.6 ± 0.62.6 ± 0.6
Exposure time (s)1010
Dose rate (e-/pixel/s)6.96.9
Total dose (e-/Å2)4040
Pixel size (Å)1.311.31
Number of micrographs2,7142,714
Number of particles (processed)147,850147,850
Number of particles (refined)77,36577,365
Number of particles (in final map)30,30741,475
SymmetryC2C2
Resolution (global) (Å)*4.86.0
Resolution range (local) (Å)4–55–6
Map sharpening B-factor (Å2)−300−460
Model refinement
Space groupP1
Cell dimensions
 a = b = c (Å)151.2
a=b = g (°)90
Number of atoms (modeled)11,462
Validation
MolProbity score1.46 (96th percentile)
Clashscore, all atoms2.27 (99th percentile)
Protein
 Ramachandran favored (%)1,115(92.76)
  allowed (%)87 (7.24)
  Disallowed (%)0(0)
 Good rotamers (%)1,035(99.71)
 Cβ deviations >0.25Å (%)0(0)
 Cis Prolines (%)8/88(9.09)
 Bad bonds (%)2/10,140(0.02)
 Bad angles (%)3/13,810(0.02)
DNA
 Bad bonds (%)0/1,834(0)
 Bad angles (%)1/2,822 (0.04)
r.m.s. deviations
 Bond lengths (Å)0.012
 Bond angles (°)1.334

Resolution assessment based on frequency-limited refinement using the 0.143-threshold for resolution analysis.

Cryo-EM image analysis

Pre-processing operations prior to the refinement of the final models were performed using the Appion package[54] and are conceptually identical to those previously described[52]. Briefly, single intasome particles (244,315) were selected from the aligned and summed micrographs, from which 147,850 were used to create an initial raw particle stack after removing regions of the micrographs containing carbon and large areas of aggregation. 2D alignments and classifications were performed using the CL2D[55] and Relion[56] algorithms (Extended Data Fig. 1c), and an initial model was generated directly from the class averages using OptiMod[57] (Extended Data Fig. 1d). Following iterative rounds of 2D alignment and classification, 77,365 particles remained for 3D refinement and classification. 3D refinements and classifications were initially performed within Relion[56], after which the parameters were converted for use in Frealign[58]. The final map was refined in Frealign. Several conformational states of the intasome were observed following 3D classification in both Relion and Frealign[59]. Whereas one of the resulting maps yielded the stable intasome structure from 41,475 particles (Fig. 2a, Extended Data Fig. 2c and Extended Data Table 2), all other maps (one of which is displayed in Extended Data Fig 3b) displayed mobility in the flanking regions, which did not resolve by further classifying the data. To improve the resolution of the core region, we ran Relion and recovered four models in the classification. For each of the resulting maps, the flanking regions were segmented and treated with a soft-edged mask that adopted the shape of the remaining density. Subsequently, for each raw particle, the flanking region from the respective conformational state to which that particle belonged was computationally subtracted from the raw particle image. The contrast transfer function was included in the computational subtraction process. In this manner, datasets lacking the majority of the flanking INs were created. Refinement of the core intasome dataset was then conducted using the likelihood-based approach in Frealign[59], effectively a focused classification of the core region. The best class was resolved to ~4 Å resolution in the most homogeneous regions using 30,307 particles (Extended Data Fig. 2d and Extended Data Table 2). Although slight ghost images remained for the flanking regions within certain particles, they did not dramatically affect the refinement; the use of a tighter mask facilitated the recovery of higher resolution information.

Assembly of the atomic model

Models of the core intasome and the full octamer structures were built and refined in a stepwise manner using Rosetta[15] starting with rigid-body fitted X-ray structures of individual domains as input. Rosetta protocols were used for all parts of the modeling[60]. To optimally fit X-ray models into the EM density, we first independently refined each individual domain (NTD, CCD and CTD) using multiple input starting seeds. CCD1 and CCD2 were each seeded with 6 starting X-ray models: independent CCD monomers from chains A-D of the INCCD structure and monomers A-B of the CCD portions of the INNTD-CCD structures. CTDs 1, 2, 5 and 6 were seeded with subunits A and B of the INCTD X-ray model. Likewise, for NTD1 and NTD3, the two different NTDs of the INNTD-CCD X-ray structure were used as input seeds. All models were refined against the core intasome structure resolved to ~4–5 Å resolution (Extended Data Fig. 2d). At least 2000 models were generated from each and the lowest energy model was selected for moving forward. Modeling quality was assessed by energy scores, structural similarity of the top scoring models, and visual inspection (Extended Data Fig. 6a). We next proceeded to independently model IN1, IN2, IN5 and IN6, thereby filling in the linker regions between individual domains using 7-mers from the PDB[15]. This enabled de novo modeling for linker residues 45–54 between NTD1-CCD1 and residues 211–213 between CCD1-CTD1 and CCD2-CTD2 (some residues, as well as outlier linker regions, were not modeled due to disorder; Extended Data Fig. 6b and 6c); modeling was facilitated by the presence of ‘bumps’ within the density that corresponded to bulky amino acid side chains, in particular within NTD1-CCD1, which is located in the best resolved region of the structure (Extended Data Fig. 2d). IN1 and IN2 were each seeded with combinations of the best models arising from refinement of individual domains and were subsequently refined against the core intasome density map. 2000 models were again generated for each, and the best were selected to move forward. This set of procedures produced FL models for IN1 and IN2 and models for CTD5 and CTD6 fitted to the EM protein density. MMTV DNA was modeled based on the X-ray structure of the PFV intasome (PDB 3L2Q). This model was rigid-body docked into the EM density and then relaxed with Rosetta. The complete intasome model was iteratively relaxed with Rosetta and then adjusted manually using Coot[46]. Several iterative rounds of refinement and inspection were performed using MolProbity[49] at the end of each round until a consensus model was obtained (Extended Data Fig. 6c,d and Extended Data Table 2).

IN linker regions

Linker lengths for Extended Data Figure 8b were assessed by aligning published[30] or in-house generated IN sequence alignments against alignments based on known domain structures[2] (Extended Data Fig. 4a). The following sequences were included: γ-retroviruses, Moloney murine leukemia virus (Genbank accession number J02255.1), reticuloendotheliosis virus strain A (DQ237900.1), feline leukemia virus (NC_001940.1); ɛ-retroviruses, walleye dermal sarcoma virus (NC_001867.1), walleye epidermal hyperplasia virus types 1 and 2 (AF133051.1 and AF133051.2, respectively); spumaviruses, PFV (U21247.1), macaque simian foamy virus (NC_010819.1), spider monkey foamy virus (EU010385.1); lentiviruses, HIV-1 strain NL4–3 (U26942.1), HIV-2 strain ROD (X05291.1), simian immunodeficiency virus strain agm.tan-1 (U58991.1), equine infectious anemia virus (M16575.1), feline immunodeficiency virus (M25381.1), caprine arthritis encephalitis virus (M33677.1), bovine immunodeficiency virus (NC_001413.1); δ-retroviruses, bovine leukemia virus (K02120.1), human T-cell lymphotropic virus types 1 and 2 (NC_001436.1 and NC_001488.1, respectively); β-retroviruses, MMTV (NC_001503.1), Mason Pfizer monkey virus (NC_001550.1), Jaagsiekte sheep retrovirus (NC_001494.1); α-retroviruses, RSV (J02342.1), lymphoproliferative disease virus (KC802224.1).

Cryo-EM data and refinement

a, Representative cryo-electron micrograph of MMTV intasomes, taken at 2.7 μm underfocus. b, Same as in panel a, marked to show selected particles. c, 2D class averages calculated using Relion[56]. d, Initial model from the class averages calculated using OptiMod[57]. e, Refined reconstruction from the full dataset, with an Euler angle distribution plot showing the relative orientations of the particles.

Cryo-EM resolution analysis of reconstructed intasome maps

a, Fourier shell correlation (FSC) curve corresponding to the refined map generated from the full intasome dataset. b, FSC curve corresponding to the refined map generated from the core intasome dataset with the NTDs, CCDs and interdomain linker regions of the flanking IN dimers computationally subtracted. Average global resolutions in panels a and b are indicated. c, Refined map generated from the full dataset (left) displayed side-by-side with the same map colored for local resolution (right). d, Refined map generated from the core intasome dataset (left) displayed side-by-side with the same map colored for local resolution (right) using the coloring scheme in panel c. e, Rotational snapshots of segmented density of CCD1 with the fit of the refined model (see Extended Data Fig. 6) highlighting structural features evident at ~4–5 Å resolution. Partial separation of β-strands, which is typically evident at or beyond 4.5 Å resolution, is apparent.

Structural heterogeneity of the MMTV intasome

a, Stable structural conformation of the MMTV intasome after 3D classification of the data. Slices from the density map are displayed below. b, One of several conformations of MMTV intasome refinement after 3D classification of the data. Slices from the density map are displayed below. Multiple fuzzy regions in the flanking INs are apparent in b, which are indicative of remaining heterogeneity within the data and/or continuous structural mobility of the region. c, Overlay of the two reconstructed maps, highlighting the extent of mobility within the flanking regions (brackets).

MMTV IN domains and intasome sedimentation coefficient distribution

a, Primary IN sequence alignment with boxes denoting canonical IN structural domains. The N-terminal extension domain (NED) occurs in spuma-, γ- and ɛ-retroviral IN proteins. Identical residues between MMTV, RSV, HIV-1 and PFV INs are highlighted by red background; residues that are minimally conserved in three of the sequences are in red. PFV IN secondary structure elements are from PDB code 3L2Q; MMTV elements are from the INNTD-CCD and INCTD crystal structures described here (PDB codes 5CZ2 and 5D7U, respectively). α, β, η, TT and TTT represent α helix, β strand, 310 helix, α-turn and β-turn, respectively. Figure generated with ESPript 3.0 (ref. 61). b, Monte Carlo analysis of sedimentation velocity data for the higher loading concentrations of vDNA (green), MMTV IN (blue) and intasome (red). A clear shift to a discrete species at 10.5 s is observed for the intasome, with minor IN and vDNA populations evident. Different centrifugation parameters for IN and vDNA versus intasomes (see Methods) likely attributed to the minor variations in sedimentation coefficient between major and minor IN and vDNA species. Measured sedimentation coefficients and calculated molar masses compared to theoretical molar masses are shown beneath the graph.

MMTV IN domain crystal structures

a, Stereo view of the final 2Fo-Fc density map of the INCCD crystal structure with blue mesh contoured at 1σ. Amino acid side chains are readily evident at the 1.7 Å resolution. b, Stereo view of the final 2Fo-Fc density map of the 2.7 Å resolution INNTD-CCD crystal structure with blue mesh contoured at 1σ. The map is centered on the DDE catalytic triad (red sticks); green spheres, Mg2+ ions. c, Cartoon representation of the INCCD monomer (one of 4 in the crystallographic asymmetric unit) colored in gold. Active site residues are shown as red sticks. d, Cartoon representation of the INNTD-CCD dimer structure (one of 3 in the asymmetric unit). The NTD and CCD are colored green and gold, respectively. Red sticks, active site residues; grey and green spheres, Zn2+ and Mg2+ ions, respectively. e, Stereo view of the final 2Fo-Fc density map of the 1.5 Å resolution INCTD crystal structure, shown as a green mesh contoured at 1σ. f, Cartoon representation of one of the two CTD monomers present in the asymmetric unit.

Molecular modeling of cryo-EM density

a, Stereo views showing comparisons between the starting X-ray domain models and refined cryo-EM domain models for IN1 highlight relatively minor structural perturbations that are evident only in the most flexible regions of the intasome. b, Linker region snapshots. Atomic models were built de novo from the cryo-EM density for the indicated linkers in the top two panels (residues 45–54 connecting NTD1 and CCD1 and CCD-CTD residues 211–213). Linkers NTD2-CCD2, CCD5-CTD5 and CCD6-CTD6 were not modeled, but are shown as cryo-EM density (red) in the lower panels. c, Stereo view of the cryo-EM model for the MMTV intasome core region (Extended Data Fig. 2d), generated using Rosetta[15-17]. All domains were refined starting with the X-ray crystal structures (Extended Data Fig. 5). Specific linker regions were built de novo (continuous red lines) from the cryo-EM density, whereas lower resolution linker regions (red dotted lines) were omitted from the model. d, FSC curve between the refined cryo-EM core intasome model and map, showing an average resolution of 4.8 Å. e, Comparison of two NTD-CCD conformations in the intasome highlights the NTD-CCD linker, which assumes a retracted state in the outer IN2 and IN4 monomers of core intasome dimers A and B, respectively, as well as in flanking IN dimers C and D (left). The linker extends in core IN molecules IN1 and IN3, which interact with the vDNA (right).

Gel filtration profiles of INWT and IN mutant proteins

Migration positions of mass standards in kDa as well as theoretical protein monomer (M) and dimer (D) positions are indicated.

Comparisons of PFV and MMTV intasome structures

a, Cartoon representations of the inner IN3 green subunits of the MMTV and PFV intasomes (Fig. 3a; vDNA strands are in grey). CCD-CTD linker regions are highlighted in orange, and dashed lines circle analogously positioned CTDs. Of note, this CTD in the MMTV structure is colored differently because it originates from a separate IN molecule (IN8 from flanking dimer D). b, Lengths of NTD-CCD and CCD-CTD interdomain linker regions across retroviral IN proteins; ‘+’ indicates the presence of an NED. The multimeric state of IN in known intasome structures is indicated by bold type. c, The PFV intasome with bound tDNA (PDB code 3OS2; orange) was superimposed with the MMTV intasome (blue). The distance between overlaid active sites is in each case ~26 Å. d, 90° rotation of superimposed structures, with proteins omitted for clarity. Canonical B-form tDNA (magenta) was superimposed with PFV intasome tDNA. The positions of phosphodiester bonds staggered by 4 bp in the PFV crystal structure or by 6 bp in the modeled tDNA are indicated by spheres. X-ray crystallography data collection and refinement statistics. Data for the highest resolution shells are given in parenthesis. Cryo-EM data statistics. Resolution assessment based on frequency-limited refinement using the 0.143-threshold for resolution analysis.
  57 in total

1.  FREALIGN: high-resolution refinement of single particle structures.

Authors:  Nikolaus Grigorieff
Journal:  J Struct Biol       Date:  2006-06-02       Impact factor: 2.867

2.  Retroviral DNA integration: reaction pathway and critical intermediates.

Authors:  Min Li; Michiyo Mizuuchi; Terrence R Burke; Robert Craigie
Journal:  EMBO J       Date:  2006-02-16       Impact factor: 11.598

3.  Zn2+ promotes the self-association of human immunodeficiency virus type-1 integrase in vitro.

Authors:  S P Lee; J Xiao; J R Knutson; M S Lewis; M K Han
Journal:  Biochemistry       Date:  1997-01-07       Impact factor: 3.162

4.  A two-dimensional spectrum analysis for sedimentation velocity experiments of mixtures with heterogeneity in molecular weight and shape.

Authors:  Emre Brookes; Weiming Cao; Borries Demeler
Journal:  Eur Biophys J       Date:  2009-02-27       Impact factor: 1.733

5.  How good are my data and what is the resolution?

Authors:  Philip R Evans; Garib N Murshudov
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2013-06-13

6.  Complementation between HIV integrase proteins mutated in different domains.

Authors:  D C van Gent; C Vink; A A Groeneger; R H Plasterk
Journal:  EMBO J       Date:  1993-08       Impact factor: 11.598

7.  Structural basis for retroviral integration into nucleosomes.

Authors:  Daniel P Maskell; Ludovic Renault; Erik Serrao; Paul Lesbats; Rishi Matadeen; Stephen Hare; Dirk Lindemann; Alan N Engelman; Alessandro Costa; Peter Cherepanov
Journal:  Nature       Date:  2015-06-10       Impact factor: 49.962

8.  MolProbity: all-atom structure validation for macromolecular crystallography.

Authors:  Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21

9.  RELION: implementation of a Bayesian approach to cryo-EM structure determination.

Authors:  Sjors H W Scheres
Journal:  J Struct Biol       Date:  2012-09-19       Impact factor: 2.867

10.  Phaser crystallographic software.

Authors:  Airlie J McCoy; Ralf W Grosse-Kunstleve; Paul D Adams; Martyn D Winn; Laurent C Storoni; Randy J Read
Journal:  J Appl Crystallogr       Date:  2007-07-13       Impact factor: 3.304

View more
  56 in total

1.  NKNK: a New Essential Motif in the C-Terminal Domain of HIV-1 Group M Integrases.

Authors:  Marine Kanja; Pierre Cappy; Nicolas Levy; Oyndamola Oladosu; Sylvie Schmidt; Paola Rossolillo; Flore Winter; Romain Gasser; Christiane Moog; Marc Ruff; Matteo Negroni; Daniela Lener
Journal:  J Virol       Date:  2020-09-29       Impact factor: 5.103

2.  The Preserved HTH-Docking Cleft of HIV-1 Integrase Is Functionally Critical.

Authors:  Meytal Galilee; Elena Britan-Rosich; Sarah L Griner; Serdar Uysal; Viola Baumgärtel; Don C Lamb; Anthony A Kossiakoff; Moshe Kotler; Robert M Stroud; Ailie Marx; Akram Alian
Journal:  Structure       Date:  2016-09-29       Impact factor: 5.006

Review 3.  Integration site selection by retroviruses and transposable elements in eukaryotes.

Authors:  Tania Sultana; Alessia Zamborlini; Gael Cristofari; Pascale Lesage
Journal:  Nat Rev Genet       Date:  2017-03-13       Impact factor: 53.242

Review 4.  Cryo-EM in drug discovery: achievements, limitations and prospects.

Authors:  Jean-Paul Renaud; Ashwin Chari; Claudio Ciferri; Wen-Ti Liu; Hervé-William Rémigy; Holger Stark; Christian Wiesmann
Journal:  Nat Rev Drug Discov       Date:  2018-06-08       Impact factor: 84.694

5.  Differential assembly of Rous sarcoma virus tetrameric and octameric intasomes is regulated by the C-terminal domain and tail region of integrase.

Authors:  Sibes Bera; Krishan K Pandey; Hideki Aihara; Duane P Grandgenett
Journal:  J Biol Chem       Date:  2018-09-05       Impact factor: 5.157

6.  A C-terminal "Tail" Region in the Rous Sarcoma Virus Integrase Provides High Plasticity of Functional Integrase Oligomerization during Intasome Assembly.

Authors:  Krishan K Pandey; Sibes Bera; Ke Shi; Hideki Aihara; Duane P Grandgenett
Journal:  J Biol Chem       Date:  2017-02-08       Impact factor: 5.157

Review 7.  Multifaceted HIV integrase functionalities and therapeutic strategies for their inhibition.

Authors:  Alan N Engelman
Journal:  J Biol Chem       Date:  2019-08-29       Impact factor: 5.157

8.  X-ray crystal structure of the N-terminal region of Moloney murine leukemia virus integrase and its implications for viral DNA recognition.

Authors:  Rongjin Guan; Sriram Aiyer; Marie L Cote; Rong Xiao; Mei Jiang; Thomas B Acton; Monica J Roth; Gaetano T Montelione
Journal:  Proteins       Date:  2017-02-03

9.  Structural basis for strand-transfer inhibitor binding to HIV intasomes.

Authors:  Dario Oliveira Passos; Min Li; Ilona K Jóźwik; Xue Zhi Zhao; Diogo Santos-Martins; Renbin Yang; Steven J Smith; Youngmin Jeon; Stefano Forli; Stephen H Hughes; Terrence R Burke; Robert Craigie; Dmitry Lyumkis
Journal:  Science       Date:  2020-01-30       Impact factor: 47.728

10.  A Peptide Derived from Lens Epithelium-Derived Growth Factor Stimulates HIV-1 DNA Integration and Facilitates Intasome Structural Studies.

Authors:  Min Li; Xuemin Chen; Huaibin Wang; Kellie A Jurado; Alan N Engelman; Robert Craigie
Journal:  J Mol Biol       Date:  2020-02-13       Impact factor: 5.469

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.