Literature DB >> 31711961

X-ray Structures of the Post-fusion 6-Helix Bundle of the Human Syncytins and their Functional Implications.

Katinka Ruigrok¹, Marie-Christine Vaney², Julian Buchrieser³, Eduard Baquero², Jan Hellert², Bruno Baron⁴, Patrick England⁴, Olivier Schwartz⁵, Felix A Rey⁶, Marija Backovic⁷.

Abstract

The retroviral envelope-derived proteins syncytin-1 and syncytin-2 (syn1 and syn2) drive placentation in humans by forming a syncytiotophoblast, a structure allowing for an exchange interface between maternal and fetal blood during pregnancy. Despite their essential role, little is known about the molecular mechanism underlying the syncytins' function. We report here the X-ray structures of the syn1 and syn2 transmembrane subunit ectodomains, featuring a 6-helix bundle (6HB) typical of the post-fusion state of gamma-retrovirus and filovirus fusion proteins. Contrary to the filoviruses, for which the fusion glycoprotein was crystallized both in the post-fusion and in the spring-loaded pre-fusion form, the highly unstable nature of the syncytins' prefusion form has precluded structural studies. We undertook a proline-scanning approach searching for regions in the syn1 6HB central helix that tolerate the introduction of helix-breaker residues and still fold correctly in the pre-fusion form. We found that there is indeed such a region, located two α-helical turns downstream a stutter in the central coiled-coil helix - precisely where the breaks of the spring-loaded helix of the filoviruses map. These mutants were fusion-inactive as they cannot form the 6HB, similar to the "SOSIP" mutant of HIV Env that allowed the high-resolution structural characterization of its labile pre-fusion form. These results now open a new window of opportunity to engineer more stable variants of the elusive pre-fusion trimer of the syncytins and other gamma-retroviruses envelope proteins for structural characterization.

Entities: CellLine Chemical Disease Gene Mutation Species

Keywords: Fusion protein; Membrane fusion; Structural biology; Syncytin; X-ray crystallography

Mesh：

Substances：

Year: 2019 PMID： 31711961 PMCID： PMC7094397 DOI： 10.1016/j.jmb.2019.10.020

Source DB: PubMed Journal: J Mol Biol ISSN： 0022-2836 Impact factor: 5.469

Introduction

Syncytins are cell–cell fusion proteins that drive placenta formation by inducing the merger of peripheral blastocyst cells, called cytotrophoblasts, into a giant syncytium called syncytiotrophoblast. This structure provides an exchange interface between maternal and fetal blood, thereby establishing a nutrient supply to the fetus [1]. Despite the syncytins' key role in placental morphogenesis and likely in a number of pregnancy diseases [2], as well as their involvement in osteoclast fusion during bone formation [3], they remain poorly characterized molecularly. The syncytin genes are among the numerous endogenous retroviral elements [4] that account for 8% of the human genome [5]. Although the majority of such elements have accumulated mutations and sequence rearrangements that make them non-functional, some have preserved their open reading frames due to selective advantages for the host. The syncytins are prime examples of such appropriations, involving domestication of the retroviral genes coding for the viral envelope protein (Env), which mediates fusion of the viral lipid envelope with the plasma membrane of target cells during entry. One or more syncytins have been found in all so far investigated placental mammals, including marsupials [2], [6], and also in more distant vertebrates such as viviparous lizards [7]. They have been the product of independent endogenization events, with the earliest syncytin capture envisioned as the driving force behind the evolution of egg-laying vertebrates into placental mammals [1]. In humans, there are two syncytins, syncytin-1 and syncytin-2 (syn1 and syn2), that derive from the env genes of the human endogenous retroviruses W and FRD (HERV-W and HERV-FRD), respectively [8]. Both are gamma-retroviruses [9] that entered the human genome around 25 and 45 million years ago, respectively [10], [11], [12]. Human syncytins possess a gamma-type Env protein, which is present not only in gamma-retroviruses but also in delta-, alpha-, and in a subset of the beta-retroviruses [13]. The Env protein of all retroviruses infecting vertebrates is a class I viral fusion protein, a class that also includes the fusion glycoproteins (GPs) of the otherwise unrelated influenza viruses, coronaviruses, paramyxoviruses, pneumoviruses, filoviruses, and arenaviruses [14]. Class I proteins are type 1 single-pass transmembrane proteins that fold in the endoplasmic reticulum of the infected cell as a precursor that in most cases trimerizes directly upon folding. An activating proteolytic step cleaves the precursor in two - an N-terminal portion, termed “surface” subunit (SU) in retroviruses - which in general has a receptor binding function, and a C-terminal subunit, which carries the membrane fusion function and is termed “trans-membrane” (TM) subunit, as it includes the trans-membrane anchor (TMA) near its C-terminal end (Fig. 1 A). The SU subunit remains peripherally associated and in some cases is covalently linked to TM by a disulfide bond [13]. Cleavage of the precursor into SU and TM traps the complex in a metastable state (the “pre-fusion” form), with the TM subunit spring-loaded underneath a crown made by SU. Interactions with target cells induce the release of the SU crown, allowing TM to spring out and undergo a major conformational change that drives membrane fusion [15]. A specific feature of the gamma-type Env proteins is the presence of two cysteine-rich motifs in the primary sequence, CXXC in SU and CX6CC in TM (where X is any aa), which control formation of an inter-subunit disulfide bond in the pre-fusion form that is reduced upon interaction with a receptor to release SU and induce the fusogenic conformational change in TM required for entry [16].

Fig. 1

(A) Organization of the full-length syncytin proteins. The syncytin main features are displayed as boxes on the linear diagram. Black horizontal lines at the top show the extent of the SU and TM subunits, together with the TM ectodomain (TME). The boundaries of the crystallized constructs and the residues resolved in the structures are marked above and below the rectangular diagram of the full-length syncytin-1 and syncytin-2, respectively. The residues limiting different regions within SU and TM are indicated with numbers on the scheme of the full-length protein. The intersubunit disulfide bond is represented with dashed lines, and the intra-TM disulfide bond within CX6CC motif in T-loop as an arc symbol. (B) Sequence alignment of syncytin-1 and syncytin-2 TM subunits. The residues are colored according to their conservation: identical residues in white font on a red background, nonconserved residues in black font, and the residues with similar physicochemical properties in red font on a white background. The boundaries of the expression constructs are marked with black vertical lines, and the regions that are not resolved in the structures with dashed lines. The helical register (abcdefg) and the turns in N- and C-helices are plotted above and below the sequence alignment. The stutter residues are underlined. The N-helix asparagines that coordinate chloride ions are labeled with “Cl−,” and the T-loop residues conferring the ISU + fusion properties to syncytin-2 are highlighted in yellow. Orange arrowheads indicate the boundaries of the previously crystallized syn2-TME. An important functional element of the TM subunit is the “fusion peptide”, a hydrophobic segment located at its N-terminal end and normally buried under the SU crown. The fusogenic conformational change entails first projection of the fusion peptide to a distance about 10 nm or more away from its original location, such that it can reach and insert into the target cell membrane. In this transient conformation, the TM protein has the N- and C-termini at opposite ends of a long trimeric rod, thereby bridging viral and cellular membranes at a distance of about 15 nm [17]. The trimer protomers bend in half, adopting a “hairpin” conformation that brings the N- and C-terminal ends of the TM ectodomain (TME) into contact, forcing apposition of the two membranes at a distance within 1 nm. The conformational change of TM thus lowers a kinetic barrier arising from short-range repulsive forces between the two membranes - resulting from the necessary dehydration of both outer leaflets to allow inter-membrane lipid contacts [18] - to catalyze lipid merger. The X-ray structure of TME in the post-fusion hairpin conformation is known for Env from several retroviruses [19], [20], [21], [22], [23], [24], [25]. These structures showed a long α-helix (termed N-helix) immediately downstream of the fusion peptide, which forms a characteristic central, parallel trimeric α-helical coiled-coil. The amino acid sequence in this region has characteristic heptad repeats (HRs), represented as positions (Fig. 1B), where and are typically occupied by apolar amino acids that form the hydrophobic core of the coiled-coil. Downstream the N-helix, the polypeptide chain makes an overall 180° turn, such that the C-terminal segment runs backward, antiparallel to the coiled-coil and inserting an α-helix (the “C-helix”) along the grooves in-between the long N-helices in the trimer to complete the hairpin. The resulting 6-helix bundle (6HB) thus has the C-terminal segment connecting to the trans-membrane anchors in proximity of the fusion peptides projecting from the N-terminal end. The structure of the pre-fusion form of the retroviral Env ectodomain, in which the N-helix is broken into several short helices to maintain a more compact form underneath the SU crown, remains a challenge. HIV-1, which has a “beta-type” Env, is the only retrovirus for which pre-fusion Env structure is known (reviewed in Refs. [26], [27]). In this case, as with other class I fusion proteins, the introduction of mutations interfering with formation of the post-fusion 6HB were necessary to stabilize the pre-fusion trimer and make it accessible to structural studies [28]. Structural studies on syncytins have been limited to the X-ray structure of a syn2-TME fragment missing around half of the TME [29]. We present here the structures of the complete syn2- and syn1-TME in post-fusion conformation, which allowed us to perform structural comparisons of syncytins with other class I fusion proteins, confirming the anticipated structural homology to the retroviral gamma-type Envs and also to GP2 fusion subunit of filoviruses and reptarenaviruses. Our results identified a vertical string of 9 salt bridges crisscrossing the N-terminal part of the N-helix and the C-terminal part of the C-helix, found in syn1 and not in syn2, in line with the higher thermal stability measured for syn1-TME. The new structures, together with their close structural homology with the filovirus counterparts, guided further functional studies with full-length syn1 variants carrying helix-breaker proline substitutions along the N-helix region. We thus identified locations in which the proline appeared to block formation of the post-fusion form - as monitored by their loss of function in a syncytium formation assay - but remained compatible with folding in the spring-loaded pre-fusion form as they were trafficked to the cell surface as efficiently as wild-type protein.

Results

Construct design and structure determination

Although syn1 and syn2 have the same number of residues in total (538 aa) and their TME are also about the same length (~140 aa), syn1 has a shorter SU and a longer cytosolic tail (Fig. 1A). We used structural information available for a number of postfusion retroviral TME conformations [22], [24], [25] together with aa sequence alignments and secondary structure/hydrophobicity plots [30], [31] to define the boundaries of the constructs we used for the structural studies. These analyses (Fig. S1) suggested that the fusion peptide spans the TM N-terminal 23–24 aa and that the HRs begin 6-7 residues downstream, encompassing around 48 aa in total for both syncytins. The N-terminal end in the previously reported X-ray structure of a syn2-TME fragment [29] was about 10 residues downstream these predictions, suggesting that the N-helix was truncated. In the C-terminal segment, our predictions indicated a roughly 25-residue long α-helix downstream the characteristic gamma-type Env CX6CC motif, which would correspond to a C-helix that was absent in the reported syn2-TME structure (Figs. 1A and 2 ; PDB accession code 1Y4M [29]). A relatively hydrophobic element is present between the predicted C-helix and the TMA, most likely corresponding to the amphipathic membrane-proximal external region (MPER) identified in a number of other retroviral Env proteins. From these analyses, we designed expression vectors for a recombinant TME core, termed here syn1-TME (residues 342–435) and syn2-TME (residues 375–468) starting immediately downstream of the fusion peptide and ending before the putative MPER (Fig. 1B).

Fig. 2

Postfusion structures of syn1-TME and syn2-TME. Our X-ray structures of syn1- and syn2-TME are compared with the previously reported syn2 structure (top row) and the structures of closely related MPMV and XMRV TME (bottom row). The structural elements forming the TME—N-helix (pale gray), T-loop (blue), C-helix (dark gray)—are indicated, as well as the turns in the helices. The intrachain disulfide bonds are shown as yellow sticks. A pale green tube inside the N-helices follows the axis of each helix, with the stutter location marked black. A black vertical line marks the 3-fold axis of the coiled-coil. A chloride ion coordinated by asparagine residues is shown as a purple sphere. The residues conferring ISU properties to syn2-TME and the equivalent residues in other TME have their side chains displayed as orange sticks, and their names displayed on yellow background. The conserved N-glycosylation site at the beginning of C-helix is labeled as N409, N443, and N487 in syn1-, syn2-, and MPMV TME, respectively. The PDB codes are displayed below the structure names. We cloned these constructs into a pET28 vector for production in E scherichia coli, including a histidine tag fused at the N-terminus. Upon purification by nickel affinity followed by size exclusion chromatography (SEC), we obtained yields of about 30 mg of each protein per liter of cell culture. Both proteins formed trimers with a molecular mass of ~36 kDa as judged from their SEC elution profiles (data not shown). Syn1-TME crystallized in a rhombohedral space group (H32), and the crystals, which contained one protomer in the asymmetric unit, diffracted to 2.1 Å resolution. Syn2-TME formed trigonal crystals (space group P3221) that diffracted to 2.2 Å and had a trimer in the asymmetric unit. The corresponding structures were determined by molecular replacement using the truncated syn2-TME structure as search model (PDB accession number 1Y4M). The final map displayed clear electron density for residues 345–433 in the crystals of syn1-TME, with the first three and last two residues of the construct being disordered. For syn2-TME, the trimer is local, and residues 380–465 are resolved in all three chains of the trimer, with individual protomers displaying additional N- or C-terminal extensions ordered because of stabilization by crystal packing contacts. The atomic models were refined to the maximum resolution of the diffraction data to Rwork/Rfree of 0.22/0.24 and 0.21/0.26 for syn1- and syn2-TME, respectively. The statistics concerning the structure determination and refinement of the atomic model are listed in Table S1. The crystallographic details are described in the Materials and Methods section.

The syncytins are typical gamma-type Env proteins

The structures showed that the syn1 and syn2 6HBs are formed by an N-helix of 12 turns (about 44 aa organized in 6 HRs) followed by a “turn” (T) region (21 aa), where the chain reverses orientation, and a C-helix of 6 turns (23 aa) (Fig. 2). Aligning the 6HBs of syn1 and syn2 resulted in 0.9 Å root mean square deviation (rmsd) for 258 aligned Cα atoms (Table S2). A DALI search [32] using the syn1- and syn2-TME structures as queries showed that they are most similar to the 6HB of the beta-retrovirus Mason–Pfizer monkey virus (MPMV), which has a gamma-type Env [33], and also to the prototypical gamma-retrovirus xenotropic murine leukemia virus–related virus (XMRV) (Fig. 2, lower panel). The observed structural similarity is in line with the 51% and 45% aa sequence identity of the MPMV TME core with syn1 and syn2, respectively, which is higher than between syn1 and syn2 (44% aa sequence identity). In three dimensions (3D), the 6HB of syn1 and syn2 align with 0.9 Å and 1.2 Å rmsd, respectively, onto the MPMV counterpart, with 258 aligned Cα atoms (Table S2). The XMRV 6HB is slightly more distant, but still highly similar, as are the 6HBs of the delta-retroviruses human T-lymphotropic virus (HTLV) and bovine leukemia virus (BLV) Env proteins, which are also gamma-type Envs. The most obvious difference is that the BLV and HTLV proteins have two or a single short helix, respectively, replacing the C-helix in the 6HB [24], [25] (Fig. S2A). The corresponding rmsd values obtained upon 3D alignment on the syn1 and syn2 6HBs are given in Table S2 and Fig. S2B.

Polar interactions stabilizing the 6HB

As also observed in the previously reported gamma-type TME structures [22], [24], [25], the 6HB of the syncytins features two “belts” or networks of hydrogen bonds: belt 1 ties the N-helix at turn 7 to its adjacent protomers in the trimer and belt 2 at turns 9 and 10, close to the bottom of the bundle (Fig. 3 A). Belt 1 involves the side chains of several glutamine residues in syn1 and syn2. In the 6HB of MPMV and XMRV TME, numerous electrostatic interactions were described between charged residues mapping to the same region (Fig. 3D) [22]. In addition, the syn1 6HB contains a string of 8 inter-chain and 1 intra-chain salt bridges that tie the first 5 turns of the N-helix to the C-helix of the adjacent protomer in a vertical fashion, i.e. by establishing numerous successive inter-chain and intra-chain contacts (Fig. 3B). The corresponding residues in the C-helix of the syn2 6HB are uncharged (Fig. S2, Fig. 3D), and therefore no salt bridge is formed. Instead, the syn2 6HB features two different salt bridges, one inter- and one intra-chain, in the upper half of the bundle (Fig. 3C, Table S3). Akin to what we observe here in syn2, electrostatic interactions also clip the termini of the N- and C-helices of MPMV and XMRV at the open end of the hairpin, although they are more dispersed and less numerous (1 and 2, respectively) (Fig. 3D).

Fig. 3

Interactions within syncytin 6HBs. (A) Interchain polar interactions. Side views on syn1- and syn2-TME are shown on left and right, respectively, zooming into the belt 1 and belt 2 regions in the center. Residues belonging to the neighboring protomer are labeled with an asterisk. (B) Syn1-TME “vertical” string of salt bridges. Interchain hydrogen bonds formed between charged residues forming 8 salt bridges are shown on the left, and the single intrachain salt bridge is indicated on the right. (C) Syn2-TME inter- and intrachain salt bridges. Hydrogen bonds formed between the charged residues forming salt bridges at the N- and C-termini of the bundle are shown. (D) Comparison of interactions within 6HBs of syncytins and gamma-type Envs. The residues forming belt 1 and 2 are displayed on orange and green background, respectively. The charged amino acids forming a vertical string in syn1-TME and the charged residues forming salt bridges mapping to the same region in syn2-, MPMV, and XMRV TME are indicated as red, bold letters. The numbers below the sequence alignment mark the turns in the N- and C-helices of syncytins' 6HB. Residues at positions a and d within the HRs are indicated (the a position in the stutter is in red). The ionic character of the interactions stabilizing the syn1 6HB is reflected in the negatively charged electrostatic surface potential of the upper half of the coiled-coil in syn1, which is uncharged in syn2 (Fig. 4 A). To understand the contribution of these differences to the stability of the 6HB, we measured the melting temperature (Tm) of the syn1 and syn2 6HBs using differential scanning fluorimetry. We found that the syn1 6HB had a Tm of 85 °C, about 20 °C higher than that of the syn2 6HB (Fig. 4B), highlighting the higher stability of the former.

Fig. 4

(A) Electrostatic potentials of syncytin 6HBs. The electrostatic potentials of syn1- and syn2-TME coiled-coils are plotted on the molecular surface, with the C-helix presented as cartoon in pale gray and with numbers 1–5 designating the turns (left side of each panel). The C-helix rotated around the Y-axis for 180° is shown on the right to illustrate the complementarity of the positively charged C-helical surfaces and the highly negatively charged groove formed by the N-helices in syn1-TME. (B) 6HB bundle of syncytin-1 is more stable than of syncytin-2. The graph shows unfolding of syn1- and syn2-TME displayed as increase in the intrinsic fluorescence as a function of temperature. The syn1-TME unfolds at a temperature ~20 °C higher than syn2-TME. At the core of belt 2, there is a conserved QNRR motif (aa 381–384 in syn1 and 415–418 in syn2), with an asparagine in place of the hydrophobic residue expected at position or in the HR pattern (Fig. 3D). The asparagine side chain coordinates a central chloride ion on the 3-fold molecular axis, as seen previously in the HTLV-1, BLV, MPMV, and XMRV TME structures. We refer to the interacting residues in the QNRR motif according to their location relative to the conserved asparagine (N0), as Q− 1, R+1, and R+2. In this scheme, the conserved salt bridge connects R+2 with E+20. Both residues were shown to be essential for the fusion activity of XMRV, MPMV, and HTLV Envs [22], [34]. The TME of the delta-retroviruses contains a second, upstream hydrophilic layer, which is in the case of HTLV-1 gp21 organized around Asn364 that traps a second chloride ion, and in BLV gp21 around Thr353 that chelates a water molecule on the 3-fold axis. These additional hydrophilic layers of inter-chain polar contacts established around a second trapped molecule/ion map to the same region as belt 1 in XMRV, MPMV, and the syncytins 6HB (Fig. S2A). The polar interactions in belt 2, with the asparagine side chain pointing into the coiled-coil cavity, are likely to be important for establishing the correct register and orientation of the helices, as has been demonstrated for other coiled-coils [35], [36]. The interactions between the residues of the N- and C-helices of the same or the adjacent protomer in the trimer (Fig. 3A [belt 1], B, and C) might play a role in stabilization of the 6HB, which is in turn important to drive fusion. A direct test of the role of the residues involved in these interactions by site-directed mutagenesis is complicated by the fact that the 6HB forms only after the fusogenic conformational change, and the lack of functionality of any given point mutant could also be because of an alternative interaction in the pre-fusion form. A detailed analysis of these residues therefore awaits the knowledge of the structure of the TME moiety in the context of a pre-fusion syncytin trimer.

The immunosuppressive (ISU) motif in syn2

The T-loop at the chain reversal region starts with a 3/10 helical turn, termed ηT helix, followed by a short α-helix (T helix, ~2 turns; Fig. 2). The single residue responsible for the ISU properties of syn2 (Gln427) resides in the ηT helix (Fig. 1, Fig. 2). Syn1, which does not have ISU properties, has R393 at the corresponding position. Another residue located in the syn2-TME T-loop, A433, is important for its fusion activity, and only the simultaneous introduction of these two mutations into syn1, R393Q and F399A (Fig. 1B), converts syn1 into a functional protein with both ISU and fusogenic activities [37]. These pairs of residues, Q427 and A433 in syn2-TME, are conserved in MPMV (Q471, A477), while a glutamic acid replaces the glutamine in XMRV (E526) (Fig. 2). Although syn1 is not immunosuppressive, the main chain conformation in the region spanning the ISU is the same as in syn2, MPMV, and XMRV, which all have functional ISU properties.

Similarities to non-retroviral 6HBs

In addition to the expected structural similarity to the gamma-retrovirus type 6HB, syn1- and syn2-TME are also structurally very close to the fusion proteins of filoviruses such as Ebola virus (EBOV) and Marburg virus (MARV), as well as the filovirus-like fusion protein of reptarenaviruses such as the California Academy of Sciences virus (CASV), as already pointed out [38], [39]. This is reflected in the low rmsd values obtained upon superposition of their trimers (≤1.9 Å for ≥83% residues aligned; Table S2), as well as conservation of structural motifs such as CX6CC, ISU [40], and belts 1 and 2 (Fig. S2A). Such a close conservation of features is not observed when comparing syncytins to the 6HB of retroviruses bearing a beta-type Env protein, like the lentiviruses, which lack the conserved CXXC/CX6CC motifs and do not form the inter-subunit TM–SU disulfide bond. In the filoviral-like proteins, the length of the structural elements of the 6HB is similar to the gamma-retroviral counterparts, with the C-helices showing some variability (Table S2, Fig. S2B). The QNRR-E+20 motif at the core of belt 2 is LNRK-D+21 and INRH-D+21 in EBOV and MARV GP2, respectively (Fig. S2A). The chloride ion coordinated by the central asparagine is in addition coordinated by a serine side chain located three residues upstream—at the position of the same HR [41], [42] (Fig. S2). CASV GP2 is slightly different, as the central asparagine is replaced by a histidine residue that coordinates a central water molecule on the trimer axis [39]. The high structural similarity between the syncytins 6HB and the filovirus counterpart is of particular interest, as the fusion proteins of EBOV and MARV have also been structurally characterized in their pre-fusion form [43], [44], [45].

Potential break points of the N-helix in the prefusion form

The X-ray structures of a number of class I fusion proteins from different viral families have been determined in both pre- and post-fusion forms (reviewed in Ref. [14]), and the breaks in the N-helix in the spring-loaded form have been mapped. In several of these structures, including filoviral GP, the break points were found downstream of a special turn in the N-helix that makes a “stutter” in the central coiled-coil. Stutters are defined as HR disruptions in the form of 4-residue insertions in between two HRs [46] and are found in most alpha-helical coiled-coils. The stutter-induced deviation from the ideal geometry results in local unwinding of the coiled-coil superhelix, and while the helical axes are almost straight for the first three turns of the N-helix, their twist becomes evident as the coil progresses towards the C-terminus (Fig. 2). Stutters have been proposed to impart structural plasticity to regions that function as conformational switches [39], in agreement with their potential to suggest possible sites for spring-loading the N-helix. Analysis of the coiled-coil by program TWISTER [47] revealed the presence of a stutter at the 3rd turn of the N-helix in both syncytin structures, residues 354LNGD357 in syn1 and 388IANN391 in syn2-TME (Figs. 1B and 2). A stutter is present at the same location in the 6HB of MPMV, XMRV, and BLV, as well as in CASV, two helical turns upstream in HTLV and two turns downstream in the filovirus GP2 6HB (Fig. S2).

Identification of helix-breaker residues along the syncytin N-helix that do not affect transport of the full-length protein to the cell surface

Our attempts to produce a biochemically tractable syncytin ectodomain (SU/TME)3 for structural studies have not been successful, and a likely reason is its instability in the spring-loaded form. We therefore explored the effect of introducing helix-breaking residues (prolines) along the whole length of the N-helix, with the goal of identifying positions that may not interfere with folding of the syncytins in their pre-fusion conformation. Such an approach was applied successfully to stabilize the pre-fusion state of HIV-1 Env [28] and later of other class I proteins, such as the spike protein of the Middle East respiratory syndrome–related coronavirus [48] or the fusion (F) protein of the human respiratory syncytial virus as well as the human metapneumovirus [49], [50]. We generated a set of single proline variants of syn1, containing a helix-breaking residue at and positions in the N-helix: Y347P, L350P, L354P, M358P, L365P, L368P, L372P, L375P, and V379P. We monitored by flow cytometry the effect of these mutations on the surface expression of the syn1 variants upon transient transfection (described in Section ‘Cell-surface expression of STsyn1 variants analyzed by FACS’), as indicative of a proper pre-fusion conformation. With this system, we found that the proline substitutions introduced between turns 6 and 7 of the N-helix (L365P, L368P, and L372P) led to a protein that was transported to the cell surface as efficiently as wild type (Fig. 5 ). In contrast, proline substitutions further upstream (turns 1, 2, and 3, Y347P, L350P, L354P, M358P) or downstream (turns 8 and 9, L375P and V379P) resulted in impaired transport to the cell surface compared with wild-type (Figs. S3A and S3B, respectively), while the transfection efficiency and total syn1 expression were similar in all the samples.

Fig. 5

L365P, L368P, and L372P syncytin-1 variants are expressed at the surface of ExpiCHO cells at a level similar to WT. Surface or total expression of syncytin-1 variants, marked on the left side of each row, was measured by FACS. The pink numbers in each dot plot represent the percentage of cells that tested positive for syncytin surface or total expression. A) The proline variants L365P, L368P, and L372P were expressed at the cells surface at a level similar to the WT protein (left panel). The side chains of L365P, L368P, and L372P are shown as dark green sticks, and their names are displayed in bold letters, while the other residues mutated in this study are shown as pale green sticks (right panel). B) Lower surface expression was measured for the C186S variant, which contains cysteine to serine mutation in the C186XXC motif, and the variant with mutated furin site, suggesting impaired folding and/or trafficking caused by the mutation. The experimental details are given in Section ‘Cell-surface expression of STsyn1 variants analyzed by FACS’. As controls, we tested in parallel other sites in the 6HB, for instance, the effect of altering the polar layer at the QNRR motif, a mutation that had been tested for BLV Env [24]. We found that several substitutions at this location - syn1 N382D, N382A, and N382L - partially affected its trafficking to the cell surface (Fig. S3C), in contrast to the previously tested BLV Env variants (N367D and N367L), which were expressed at the cell surface at a level similar to that of the WT protein [25]. Mutation of the syn1 disulfide isomerization motif CXXC to SXXC in SU (C186S) also resulted in partially impaired trafficking to the cell surface. Finally, mutating the furin cleavage site (R314S, R317S) resulted in a protein that trafficked to the cell surface without significant differences to the wild-type protein (Fig. 5B). We then used 293T cells stably transfected with either one of the two complementary fragments of the green fluorescence protein (GFP) to analyze syncytia formation upon transient transfection with full-length syn1 and selected variants. Syncytia formation leads to reconstitution of a functional GFP, allowing easy monitoring of cell–cell fusion by bimolecular fluorescence complementation [51] (described in detail in Ref. [52] and in the Material and Methods Section ‘Cell–cell fusion assay (293T cells)’). These cells constitutively express the syn1 receptor, the alanine–serine–cysteine transporter 2 molecule, which is required for syn1-driven cell–cell fusion [53]. As expected, the GFP signal appeared in the cells transfected with wild-type syn1, indicative of its fusogenicity (Fig. 6 , Fig. 7 A). In contrast, no GFP signal was detected when transfected with the proline variants, including those that underwent efficient trafficking to the cell surface, indicating that the mutants were unable to induce cell–cell fusion because they could not form the required 6HB. The variants carrying N382 mutations did induce GFP signal, although at ≤20% of what was observed for the wild-type syn1. N382D was more active than N382A and N382L, which showed essentially no fusion activity (Figs. 6 and 7A, Fig. S4), similar to what was observed when asparagine to aspartic acid or leucine mutations were introduced into BLV TME [25]. Finally, the controls with the altered disulfide motif or the furin site knockout did not result in measurable cell fusion either, consistent with the already published data on the human syncytins [54], [55], HTLV [16], and MLV Envs [56].

Fig. 6

Fig. 7

Prolines in fusion-inactive, correctly surface expressed syncytin-1 variants map to the same location relative to the stutter as the helix breaks in the pre-fusion conformation of filoviral GP2. (A) The bar chart displays surface expression measured by FACS (white bars) and fusion index (dark gray bars), both set to 100% for the WT syncytin-1 and expressed relative to the WT values. The error bars indicate standard error of the mean values (SEM). Statistical analyses were done as described in Section ‘Statistical analyses’. (B) The structures are colored to highlight the stutters (black) and the regions that form helix breaks in the pre-fusion forms of EBOV and MARV GP2 (blue). The blue region in syncytin-1 indicates the locations of the proline mutations that rendered the protein inactive but did not impact its trafficking to the cell surface, indicating that the same region may form a helix break in the syncytin-1 metastable state. The open brackets illustrate the conserved distance between the stutters and the predicted or already known helix breaks in syncytin-1 and filoviral GP2, respectively. The HR2 regions are omitted for clarity.

Syncytin-1–induced fusion of 293T cells. The fusion assay results are displayed in the first two columns (raw data and the processed data, respectively, as explained in Section ‘Cell–cell fusion assay (293T cells)’). Results for only one of the proline variants are shown (L368P), as it is representative of all the proline variants, none of which were active in fusion (for complete data, see Fig. S4). The images in the third column correspond to the syncytin-1 surface expression (red), observed by immunofluorescence of 293T cells transfected with STsyncytin-1 variants (described in Section ‘Surface expression of STsyncytin-1 variants analyzed by immunofluorescence’). The images for the fusion assay and immunofluorescence were taken by different microscopes and under different magnifications, indicated by the scale bars of 200 μm. Prolines in fusion-inactive, correctly surface expressed syncytin-1 variants map to the same location relative to the stutter as the helix breaks in the pre-fusion conformation of filoviral GP2. (A) The bar chart displays surface expression measured by FACS (white bars) and fusion index (dark gray bars), both set to 100% for the WT syncytin-1 and expressed relative to the WT values. The error bars indicate standard error of the mean values (SEM). Statistical analyses were done as described in Section ‘Statistical analyses’. (B) The structures are colored to highlight the stutters (black) and the regions that form helix breaks in the pre-fusion forms of EBOV and MARV GP2 (blue). The blue region in syncytin-1 indicates the locations of the proline mutations that rendered the protein inactive but did not impact its trafficking to the cell surface, indicating that the same region may form a helix break in the syncytin-1 metastable state. The open brackets illustrate the conserved distance between the stutters and the predicted or already known helix breaks in syncytin-1 and filoviral GP2, respectively. The HR2 regions are omitted for clarity.

Discussion

The structures of the syn1- and syn2-TME core revealed a 6HB very similar to that of the closely related gamma-type Env proteins of the gamma-retrovirus XMRV, the beta-retrovirus MPMV, and the delta-retroviruses HTLV and BLV. They also revealed close similarity to the more distant fusion GPs of the filoviruses MARV and EBOV, and of the filovirus-like CASV. The two polar layers described in the structures of the XMRV and MPMV TME [22] are also present in the syncytins and have their counterpart in the 6HB of all the proteins mentioned above. A number of these polar interactions on the MPMV and XMRV 6HB were tested by site-directed mutagenesis and shown to interfere with the fusion activity of the corresponding full-length protein [22]. The same residues, however, can potentially be involved in a different set of interactions in the pre-fusion form, and it is therefore difficult to draw strong conclusions about their specific role in stabilization of the post-fusion 6HB and its relation to the membrane fusogenic activity of the protein. Our studies also confirmed that the core TME fragment of syn2 reported back in 2005 [29] was truncated roughly at half the length of the protein core (Fig. 2), similar to the first described structure of a gamma-retroviral protein in the post fusion form of the murine Moloney leukemia virus reported 23 years ago [19] on which the construct had been based. In those structures, only the second polar belt was present, and as the C-helix was absent, there was no 6HB in the trimer. To fully understand the role of the polar residues observed stabilizing the 6HB, it is important to also know the structure of the spring-loaded, (SU/TME)3 prefusion form. No such structure is available for any gamma-type retroviral Env protein, and the available structure of the beta-type HIV-1 Env protein is too distant to be a useful guide. In contrast, the structure of the fusion GP of the EBOV and MARV filoviruses in the pre-fusion (GP1/GP2E)3 spring-loaded trimer is available; in the filovirus GP, GP1 and GP2 are equivalent to the retroviral SU and TM subunits, respectively. The high structural similarity of the post-fusion GP2E 6HB suggests that the region spanning the N-helix in the gamma-type retroviral Env proteins can potentially undergo a similar change in going from the pre-fusion to the post-fusion conformation. Our helix-breaking proline-scanning mutagenesis studies, targeting the and positions of the HR downstream of the stutter, identified that proline residues can be introduced at turns 6 and 7 of the N-helix (mutants L365P, L368P, L372P; Fig. 5A) without interfering with full-length syn1 being transported to the surface of the transfected cells. This observation strongly indicates that the mutants were well-folded and consequently had passed the folding quality control system of the cell, which was not the case for the proline mutants engineered further upstream (Y347P, L350P, L354P, M358P at turns 1–3 i.e. between the N-terminus and the stutter) or downstream (L375P and L379P at turns 8 and 9, closer to belt 2), which led to decreased transport to the cell surface (Fig. S3). The similarities between the syn1-TME and filoviral ectodomain GP2E post-fusion 6HBs are displayed in Fig. 7B, where the region of the N-helix that tolerates proline mutations is highlighted, as well as the region that maps in between shorter helices (i.e., to a helix break) in the pre-fusion form of the filovirus GP2. Remarkably, this region is also located two turns downstream of the stutter. The GP1 moiety of the filovirus GP is, however, most likely unrelated to the gamma-retrovirus SU, as the two fusion proteins have very different properties. For instance, filoviruses enter cells by fusion in late endosomes, and the EBOV GP requires further proteolytic cleavage of GP1 by cellular cathepsins after uptake into a target cell, partially removing the GP1 crown [57]. The remainder of GP1 then interacts with a second internal receptor upon this cleavage, the Niemann–Pick C1 protein [58], and fusion ensues only at this late-stage membrane. As a result, the pre-fusion form of the filovirus (GP1/GP2E)3 spike is particularly robust, and no additional stabilization was necessary for the structural studies. The removal of the mucin-like domain, which is heavily glycosylated, and complexation with neutralizing antibody fragments served to improve crystal packing but were not required for stabilization of the pre-fusion conformation per se [43], [44]. In contrast, the syncytins act directly at the plasma membrane, and the fusion trigger is the interaction with their cell-surface receptor (reviewed in Ref. [59]). In this sense, they are more like the beta-type HIV-1 Env protein, which similarly drives fusion of the viral envelope with the plasma membrane for entry. The syncytins are also reminiscent of the paramyxoviruses and pneumoviruses, which also induce fusion of the viral envelope with the plasma membranes. These viral fusion proteins all required mutagenesis to maintain them in the pre-fusion form to maintain conformational epitopes to use as immunogens and to carry out structural studies [49], [50]. An initial structure of the pre-fusion form, however, guided those trials. With the syncytins, we have worked backward, devising trials by explicit homology of the postfusion form without any information of the pre-fusion form. Our results nonetheless now point to new ways of engineering the pre-fusion form of gamma-type retroviral Env proteins, using mutants with helix-breaking residues in the identified region of the N-helix to destabilize the post-fusion form. Such a strategy may also require further mutations directly stabilizing the pre-fusion state, in addition to those in the region identified here. Possible combinations include the SU/TM CXXC/CX6CC motif to lock the inter-subunit disulfide bond and avoid shedding of the SU subunit, and also the addition of a trimerization motif to keep the pre-fusion trimer together for a high-resolution structural characterization.

Materials and Methods

Expression construct design, protein expression, and purification

Synthetic genes encoding the ectodomain portion of the TM subunit (TME) of syn1 (residues 342 to 435, GenBank accession number Q9UQF0) and of syn2 (residues 375 to 468, GenBank accession number P60508) were codon optimized for protein expression in bacteria and purchased from GeneArt. Cloning into the expression vector pET28-a vector (Novagen) was done using Gibson Assembly Master Mix (New England Biolabs). The expression constructs contained the N-terminal six-histidine tag followed by a tobacco etch virus cleavage site ENLYFQS (Fig. 1A). The expression plasmids, syn1-TME and syn2-TME, were transformed into E . coli BL21 DE3 cells (New England Biolabs), which were used as the expression host. Cell cultures were grown in Miller lysogeny broth (LB) containing 50 mg/L kanamycin (Sigma) at 37 °C until the optical density at 600 nm reached 0.5. Protein expression was induced by addition of 0.5 mM of isopropyl-β-d-thiogalactopyranoside, and 3 h later, the cells from 1 L of culture were collected by centrifugation at 4000×g for 20 min (Beckman Coulter centrifuge, JLA 8.1 rotor). Cells were lysed by sonication (Bioblock Scientific Vibra Cell sonicator 75041) in 50 ml of buffer containing 50 mM Tris pH 8, 150 mM NaCl, 1 mM ethylenediaminetetraacetic acid, and 0.2 mM phenylmethylsulfonyl fluoride. The insoluble fraction was removed by two rounds of centrifugation (3000×g for 20 min [Eppendorf 5810R centrifuge] and 50,000×g for 30 min [Beckman Coulter centrifuge, JA25.50 rotor]). The clarified lysate was filtered through a 0.22 μm filter (Millipore) before being loaded to 1 ml HisTrap Excel column (GE Healthcare). Protein purification was performed according to the manufacturer's protocol. Eluted material was concentrated and further purified by SEC on a Superdex 75 16/60 column (GE Healthcare) using 10 mM Tris pH 8, 100 mM NaCl as running buffer. Protein concentration was calculated using the extinction coefficient (calculated on ExPASy server [60]) of 0.6 ml/mg and 0.7 ml/mg for syn1- and syn2-TME, respectively.

Protein crystallization

Purified syn1-TME protein was concentrated to 10 mg/ml in Vivaspin PES cutoff 10 kDa and crystallized at 18 °C by the sitting-drop method with a reservoir solution containing 0.1 M HEPES pH 7.5, 30% v/v 2-propanol, and 0.2 M MgCl2. Syn2-TME was concentrated to 6 mg/ml and crystallized under the same conditions in 0.1 M Tris pH 8.5, 25% v/v tertiary-butanol. The crystals of both proteins were harvested in their mother liquor solutions containing 25% glycerol as a cryoprotectant and flash-frozen in liquid nitrogen.

X-ray diffraction data collection and structure determination

X-ray diffraction data were collected at the Proxima 2 beamline at the French national synchrotron facility SOLEIL (Gif-sur-Yvette, France). The data were processed using XDS package [61], and the initial phases were obtained by molecular replacement with Phaser [62] in CCP4 suite [63], using the coordinates of the previously crystallized fragment of the syn2-TM [29] as a search model (PDB accession number 1Y4M). Model refinement was done in Buster [64] and structure building in Coot [65].

Structural analyses

Coiled-coils were analyzed by program TWISTER to determine the presence and position of stutters (https://pharm.kuleuven.be/apps/biocryst/twister.php). The sequence alignments were generated in Clustal Omega [66], and the one shown in Fig. S2A was further manually modified, in the T-loop and C-helix regions, to correspond to the structural superpositions of the proteins. ESPript [67] was used to calculate residue conservation and generate all the images showing sequence alignments. PyMol was used to create all the structure images and to perform the structural superpositions (pair_fit command) [68]. The electrostatic potentials shown on Fig. 4A were calculated with the APBS tool in PyMol [69], on a scale −5 to +5, with red depicting negative and blue positive surface potentials. The abbreviations and the accession codes for proteins used throughout the text and in the figures are syncytin-1 (syn1, Q9UQF0), syncytin-2 (syn2, P60508), XMRV (xenotropic murine leukemia virus–like virus, D0UFA8), MoMLV (Moloney murine leukemia virus, P03385), MPMV (Mason–Pfizer monkey virus, P07575), BLV (bovine leukemia virus, AAF97918), HTLV-1 (human T-cell leukemia virus 1, BBD74589.1), EBOV (Ebola virus, O11457), MARV (Marburg virus, Q6UY66), and CASV (California Academy of Science virus, YP_006590086.1). These sequences in some cases do not entirely match the sequences deposited with the PDB files because mutagenesis was performed in several cases to facilitate protein expression and purification. Abbreviations and the accession numbers of protein sequences used in the alignment in Fig. S1 are MPMV (Mason–Pfizer monkey virus, P07575), SRV-1 (simian retrovirus-1, P04027.1), SRV-2 (simian retrovirus-2, AAA47563.1), SMRV (squirrel monkey retrovirus, NP_041262.1), REV (reticuloendotheliosis virus, ACT75575.1), FeLV (feline leukemia virus, AAC31802.1), GA-FeSV (Gardner-Arnstein feline sarcoma virus, P03391.1), MLV (murine leukemia virus, AAA46526.1), Mo-MLV (Moloney murine leukemia virus, P03385), XMRV (xenotropic murine leukemia virus, AEI59730.1), GALV (gibbon ape leukemia virus, ALV83310.1), WMSV (woolly monkey sarcoma virus, ALV83313.1), KoRV (koala retrovirus, YP_008169844.1), BLV (bovine leukemia virus, AAF97918.1), HTLV-1 (human T-lymphotropic virus 1, BBA30582.1), STLV-1 (simian T-lymphotropic virus 1, Q03817.1), HTLV-2 (human T-lymphotropic virus 2, AAD34843.1), STLV-2 (simian T-lymphotropic virus 2, NP_056912.1), HTLV-3 (human T-lymphotropic virus 3, ACF40914.1), STLV-3 (simian T-lymphotropic virus 3 CAA61321.1), ALV (avian leukosis virus, ANF29601.1), AMV (avian myeloblastosis virus, AAA46303.1), RSV (Rous sarcoma virus, AAC08989.1), JSRV (Jaagsiekte sheep retrovirus, AAK38688.1), MMTV (mouse mammary tumor virus, AAF31475.1), BIV (bovine immunodeficiency virus, NP_040566.1), JDV (Jembrana disease virus, AAA64390.1), CAEV (caprine arthritis encephalitis virus, AAX81904.1), VMV (visna/maedi virus, AAB25463.1), FIV (feline immunodeficiency virus, CAA40321.1), HIV-1 (human immunodeficiency virus 1, AAL01570.1), HIV-2 (human immunodeficiency virus 2, AAA76847.1), SIV (simian immunodeficiency virus, CDZ86805.1), and PLV-14 (puma lentivirus 14, AAA67170.1).

Temperature melts of syn1- and syn2-TME

Syn1- and syn2-TME were diluted to a concentration of 20 μM in 100 mM sodium phosphate buffer pH 7 containing 2 M guanidinium chloride. The intrinsic fluorescence signal was measured as a function of increasing temperature in Prometheus NT.48 fluorimeter (NanoTemper), with 80% excitation light intensity and 2C°/minute temperature ramp. Intrinsic fluorescence signal expressed by the 350 nm/330 nm emission ratio, which increases as the proteins unfold, is plotted as a function of temperature on Fig. 4B. The plots are one of the three independent data collections that were performed for each protein.

Functional studies: syn1-mediated cell–cell fusion

Plasmids and cells

Plasmids pQCXIP-GFP1-10 (Addgene plasmid #68715) and pQCXIP-BSR-GFP11 (Addgene plasmid #68716), encoding a GFP variant residues 1–155 (strands β1–10) and residues 156 to 172 (strand β11), respectively, were gifts from Yutaka Hata and are described in Ref. [51]. The synthetic gene encoding the full-length syn1 (residues 1–538, GenBank accession number Q9UQF0) was codon optimized for protein expression in mammalian cells, cloned into pcDNA3.1 expression vector, and purchased as such from GenScript. The expression construct containing the affinity tag, residues D WSHPQFEK GGSA WSHPQFEK, at the syn1 N-terminus was created using standard molecular biology protocols and is referred to as the STsyn1 (the two strep tags [ST] are highlighted in bold, while the short linker separating them is in italics). The single point mutations were introduced into this construct by the “QuikChange II Site-directed Mutagenesis Kit” from Agilent Technologies. HEK293T platinum-E retroviral packaging cell line (“Plat-E” from Cell Biolabs), HEK293T cells (293T, ATCC), and derivatives were cultured in DMEM medium supplemented with Glutamax (Gibco), 10% fetal bovine serum (FBS), and 1% Penicillin-Streptomycin. Puromycin resistant cells were maintained in the presence of 1 μg/ml puromycin. The ExpiCHO cells (Thermo Fisher®) were grown in ExpiCHO media at 37 °C with 8% CO2, in 50 ml tubes or flasks, on the orbital shaker InforsHT Celltron at 130 rotations per minute (rpm).

Cell-surface expression of STsyn1 variants analyzed by FACS

The cell-surface expression of the syn1 variants was measured by FACS analysis of nonpermeabilized ExpiCHO cells transfected with the same expression constructs as used for the fusion assays. These vectors were engineered to have the affinity ST at the N-terminus of the protein, thus being accessible for binding to the antibodies used in FACS and immunofluorescence experiments. 293T cells could not be used for FACS analysis because they fused forming syncytia, complicating cell sorting and counting. ExpiCHO cells were transiently transfected with the STsyn1 plasmids, according to the manufacturer's protocol and using ExpifectamineCHO transfection reagent and 2 μg of plasmid DNA per 2 ml of cells diluted to density of 3 × 106 cells/ml. Cells were grown in 50 ml conical tubes (Corning) and shaken in a rack at 37 °C with 8% CO2 at 130 rpm. Six hours posttransfection, the cells were centrifuged for 3 min at 200×g, washed three times with ExpiCHO media, and left to grow overnight in 2 ml of fresh ExpiCHO media. Eighteen hours later, the cells were counted, pelleted by centrifugation at 200×g for 3 min, and treated with 1 ml of 4% paraformaldehyde diluted in PBS for 15 min at 4 °C. The rest of the staining procedure was performed in round bottom 96-well plates; 100,000 cells were plated per well and in duplicate for each sample. After washing the cells with PBS containing 1% BSA (the washing buffer [WB]), the primary, StrepMAB-Immo antibody (IBA Biosciences, #2-1517-001) that recognizes the SAWSHPQFEK sequence in the ST at the N-terminus of syn1, was incubated for 30 min at room temperature (RT). The antibody was diluted 1:5000 in WB and 0.05% saponin for the samples that required permeabilization (Fig. S3). After 2 washes with WB, the samples were incubated with a goat anti-Mouse IgG (H + L) highly cross-adsorbed secondary antibody conjugated to Alexa Fluor 647 (Invitrogen) that was diluted 1:500 in WB, for 30 min at RT, followed by two washes in WB. FACS acquisition on 10,000 cells per sample was performed with the Attune NxT Flow Cytometer (Thermo) for Y347P, L350P, L354P, and M358P variants (Fig. S3A) and on BD FACSCanto™ cytometer (BD Biosciences) for the remaining samples (Fig. 5, Figs. S3B and C). Two different cytometers had to be used due to technical problems and the unavailability of the latter instrument for experiments involving Y347P, L350P, L354P syn1 variants. All the data were analyzed via FlowJo software.

Cell–cell fusion assay (293T cells)

For murine retroviral vector production, the Plat-E retroviral packaging cell line was transfected with pQCXIP-BSR-GFP11 or pQCXIP-GFP1-10 using calcium chloride at 70% confluency. Media were changed 12h after transfection. Retroviral vectors were harvested at three time points (36h, 48h, and 72h after transfection) and ultracentrifuged at 22,000×g for 1h at 4 °C. For lentiviral transduction, 2 × 104 HEK293T cells were resuspended in 150 μl of DMEM-HEPES, and 15 μl of retroviral vector was added. Cells were agitated 30 s every 5 min for 2 h at 37 °C in a thermomixer and then transferred to 6-well plates with complete DMEM and incubated for 2 days at 37 °C before addition of puromycin. The two cell lines stably expressing the GFP parts are referred to as 293T-GFP1 − 10 and 293T-GFP11. Transient transfection of the 293T-GFP1 − 10 and 293T-GFP11 cell lines with the STsyn1 constructs was performed using Lipofectamine® 2000 (Invitrogen) following the manufacturer's protocol. Briefly, after a PBS wash, the 293T-GFP1 − 10 and 293T-GFP11 cells were detached with PBS-EDTA (Sigma) and resuspended in DMEM medium supplemented with Glutamax, 10% FBS, and 1 μg/ml puromycin. The two cell types were mixed in 1:1 ratio at a final density of 6 × 105 cells/ml, and 100 μl of such cell mixture was seeded in a 96-well cell culture flat-bottom plate (6 × 104 cells/well). The same amount of cells was seeded in Lab-Tek chamber slides (#177402, ThermoFisher) for immunofluorescence analysis of syncytin surface expression (Section ‘Surface expression of STsyncytin-1 variants analyzed by immunofluorescence’). A transfection mixture containing 50 μl of OptiMEM (Gibco), 100 ng of STsyn1 plasmid DNA, and 0.3 μl of the lipofectamine reagent was prepared for each sample, per well. 50 μl of the mixture was added, and the cells were allowed to fuse for next 24 h when the images were collected on live cells using the “EVOS FL Cell Imaging System” setup to record GFP fluorescence. An ImageJ script created by J.B. for the ImageJ software [70] was applied to (1) convert the images collected on the GFP channel from the *tif format to black and white images, green signal being replaced by black color (Fig. 6), and (2) to integrate the surface occupied by the black fields and represent it as the percentage of total area, thereby giving what is referred to as “the fusion index.” Two fields (1142.84 × 857.13 μm) per experiment were taken for quantifications. The experiment was repeated twice, and the bar graph shown in Fig. 7A is an average of 4 measurements. The fusion was calculated as follows: the raw data obtained by recording GFP fluorescence (Fig. 6, left column) were processed so that the green signal (fusion events) was replaced with black color, and the background signal was colored in white, as shown in the middle column. The “fusion index” was calculated as the percent of black area in the total visualized surface. The right column on Fig. 6 represents the same cell population seeded on a separate slide for immunofluorescence staining of the surface-expressed syncytin variants (red staining).

Surface expression of STsyncytin-1 variants analyzed by immunofluorescence

The 293T-GFP1 − 10 and 293T-GFP11 cells were seeded in Lab-Tek chamber slides and transfected with STsyn1 constructs as described in Section ‘Cell–cell fusion assay (293T cells)’. Twenty-four hours later the cells were fixed in 4% paraformaldehyde in PBS for 15 min. Free aldehydes were blocked by incubation with 40 mM NH4Cl in PBS for 5 min. Nonspecific binding was blocked with 1% FBS in PBS containing 0.02% sodium azide for 10 min, followed by 1 h, RT incubation with the StrepMAB-Immo antibody (IBA Lifesciences, Cat. No. 2-1517-001) diluted 3000 times in blocking buffer. After 3 washes in PBS, the secondary antibody, goat antibody against the mouse IgG conjugated with Alexa Fluor 555 (Invitrogen, #A28180), diluted 1:2000 was added and incubated for 1 h. Nuclei were stained with Hoechst stain (Invitrogen) at 1 μg/mL in PBS for 5 min before mounting the coverslips onto 76 × 26 mm slides (Thermofisher) using Fluoromount-G mounting media (Southern Biotech). Images were taken with a Leica DM LB microscope and analyzed with ImageJ software.

Statistical analyses

In Fig. 7A, surface expression results of STsyn1 variants are represented as the mean of three independent experiments, fusion indices as the mean of four measurements from two independent experiments, in all cases normalized to the value of the positive control (syn1-WT) □ SEM. One-way ANOVA and Dunnett's multiple comparisons tests were used to determine the statistical significance of each result in comparison with syn1-WT surface expression or fusion index (*p < 0.05; **p < 0.01; ****p < 0.0001). A p value of <0.05 was accepted as indicative of a statistically significant difference. The fusion index bars are missing from most of the samples because syncytia were not observed. The ****p < 0.0001 values were calculated for the fusion indexes for all the samples and are not plotted on the graph for clarity reasons. Accession numbers: Coordinates and structure factors have been deposited in the Protein Data Bank with the accession numbers 6RX1 and 6RX3, for syn1- and syn2-TME, respectively.

CRediT authorship contribution statement

Katinka Ruigrok: Investigation, Visualization, Methodology, Formal analysis. Marie-Christine Vaney: Formal analysis, Visualization. Julian Buchrieser: Methodology, Investigation. Eduard Baquero: Formal analysis, Visualization. Jan Hellert: Investigation. Bruno Baron: Resources. Patrick England: Resources. Olivier Schwartz: Resources. Felix A. Rey: Conceptualization, Writing - review & editing, Supervision, Funding acquisition. Marija Backovic: Conceptualization, Writing - original draft, Writing - review & editing, Supervision, Project administration.

68 in total

1. An envelope glycoprotein of the human endogenous retrovirus HERV-W is expressed in the human placenta and fuses cells expressing the type D mammalian retrovirus receptor.

Authors: J L Blond; D Lavillette; V Cheynet; O Bouton; G Oriol; S Chapel-Fernandes; B Mandrand; F Mallet; F L Cosset
Journal: J Virol Date: 2000-04 Impact factor: 5.103

2. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen.

Authors: Jesper Pallesen; Nianshuang Wang; Kizzmekia S Corbett; Daniel Wrapp; Robert N Kirchdoerfer; Hannah L Turner; Christopher A Cottrell; Michelle M Becker; Lingshu Wang; Wei Shi; Wing-Pui Kong; Erica L Andres; Arminja N Kettenbach; Mark R Denison; James D Chappell; Barney S Graham; Andrew B Ward; Jason S McLellan
Journal: Proc Natl Acad Sci U S A Date: 2017-08-14 Impact factor: 11.205

Review 3. Transmission, Evolution, and Endogenization: Lessons Learned from Recent Retroviral Invasions.

Authors: Alex D Greenwood; Yasuko Ishida; Sean P O'Brien; Alfred L Roca; Maribeth V Eiden
Journal: Microbiol Mol Biol Rev Date: 2017-12-13 Impact factor: 11.056

4. The crystal structure of the SIV gp41 ectodomain at 1.47 A resolution.

Authors: Z N Yang; T C Mueser; J Kaufman; S J Stahl; P T Wingfield; C C Hyde
Journal: J Struct Biol Date: 1999-06-15 Impact factor: 2.867

5. Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1.

Authors: Rogier W Sanders; Mika Vesanen; Norbert Schuelke; Aditi Master; Linnea Schiffner; Roopa Kalyanaraman; Maciej Paluch; Ben Berkhout; Paul J Maddon; William C Olson; Min Lu; John P Moore
Journal: J Virol Date: 2002-09 Impact factor: 5.103

6. Ebola virus entry requires the cholesterol transporter Niemann-Pick C1.

Authors: Jan E Carette; Matthijs Raaben; Anthony C Wong; Andrew S Herbert; Gregor Obernosterer; Nirupama Mulherkar; Ana I Kuehne; Philip J Kranzusch; April M Griffin; Gordon Ruthel; Paola Dal Cin; John M Dye; Sean P Whelan; Kartik Chandran; Thijn R Brummelkamp
Journal: Nature Date: 2011-08-24 Impact factor: 49.962

7. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors: Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal: Mol Syst Biol Date: 2011-10-11 Impact factor: 11.429

8. Overview of the CCP4 suite and current developments.

Authors: Martyn D Winn; Charles C Ballard; Kevin D Cowtan; Eleanor J Dodson; Paul Emsley; Phil R Evans; Ronan M Keegan; Eugene B Krissinel; Andrew G W Leslie; Airlie McCoy; Stuart J McNicholas; Garib N Murshudov; Navraj S Pannu; Elizabeth A Potterton; Harold R Powell; Randy J Read; Alexei Vagin; Keith S Wilson
Journal: Acta Crystallogr D Biol Crystallogr Date: 2011-03-18

9. Dali server update.

Authors: Liisa Holm; Laura M Laakso
Journal: Nucleic Acids Res Date: 2016-04-29 Impact factor: 16.971

Review 10. Pushing the endogenous envelope.

Authors: Jamie E Henzy; Welkin E Johnson
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2013-08-12 Impact factor: 6.237

3 in total

1. Structural insights into the lysophospholipid brain uptake mechanism and its inhibition by syncytin-2.

Authors: Maria Martinez-Molledo; Emmanuel Nji; Nicolas Reyes
Journal: Nat Struct Mol Biol Date: 2022-06-16 Impact factor: 18.361

2. Human Endogenous Retrovirus Type W Envelope from Multiple Sclerosis Demyelinating Lesions Shows Unique Solubility and Antigenic Characteristics.

Authors: Benjamin Charvet; Justine Pierquin; Joanna Brunel; Rianne Gorter; Christophe Quétard; Branka Horvat; Sandra Amor; Jacques Portoukalian; Hervé Perron
Journal: Virol Sin Date: 2021-03-26 Impact factor: 4.327

3. Structure of the Core Postfusion Porcine Endogenous Retrovirus Fusion Protein.

Authors: Trevor T Dean; Vitor Hugo B Serrão; Jeffrey E Lee
Journal: mBio Date: 2022-01-25 Impact factor: 7.867

3 in total