Literature DB >> 29596909

Protein-nucleic acid interactions of LINE-1 ORF1p.

M Nabuan Naufer1, Anthony V Furano2, Mark C Williams3.   

Abstract

Long interspersed nuclear element 1 (LINE-1 or L1) is the dominant retrotransposon in mammalian genomes. L1 encodes two proteins ORF1p and ORF2p that are required for retrotransposition. ORF2p functions as the replicase. ORF1p is a coiled coil-mediated trimeric, high affinity RNA binding protein that packages its full- length coding transcript into an ORF2p-containing ribonucleoprotein (RNP) complex, the retrotransposition intermediate. ORF1p also is a nucleic acid chaperone that presumably facilitates the proposed nucleic acid remodeling steps involved in retrotransposition. Although detailed mechanistic understanding of ORF1p function in this process is lacking, recent studies showed that the rate at which ORF1p can form stable nucleic acid-bound oligomers in vitro is positively correlated with formation of an active L1 RNP as assayed in vivo using a cell culture-based retrotransposition assay. This rate was sensitive to minor amino acid changes in the coiled coil domain, which had no effect on nucleic acid chaperone activity. Additional studies linking the complex nucleic acid binding properties to the conformational changes of the protein are needed to understand how ORF1p facilitates retrotransposition.
Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29596909      PMCID: PMC6428221          DOI: 10.1016/j.semcdb.2018.03.019

Source DB:  PubMed          Journal:  Semin Cell Dev Biol        ISSN: 1084-9521            Impact factor:   7.727


Introduction

The long interspersed nuclear element 1 (LINE1, L1) non-LTR retrotransposon is an autonomously replicating genomic parasite that has been amplifying and evolving in mammalian genomes for >100 Myr and now constitutes 20% or more of certain mammalian genomes [1-3]. In addition, L1 can also copy non-L1 transcripts into genomic DNA, and as a consequence L1 activity has generated upwards of 40% of the mass of many mammalian genomes [4,5]. Only a subset of L1 elements actively retrotranspose in the mammalian genomes [6-9]. Currently, 80–100 L1 copies that belong to a single subfamily Ta are active in the human genome [10]while ~3000 L1 copies of 3 subfamilies, TF, A and GF, are active in the mouse genome [11-13]. Replication and evolution continue in humans [14-16], and L1 activity creates genetic diversity and various genetic alterations [17-20]. Given its detrimental [21-23] and potential catastrophic effects [24,25], explaining its persistence as well as determining a mechanistic understanding of its replication remain formidable biological issues. Mammalian L1 elements are 6–7 kb in length, contain a regulatory 5′ UTR, two long open reading frames that encode two proteins ORF1p and ORF2p, and a 3′ UTR (Fig. 1A). Both ORF1p (Fig. 1B) and ORF2p are required for retrotransposition [26]. The two proteins associate with their encoding transcript to form a ribonucleoprotein complex (RNP), which mediates retrotransposition [27-30]. L1 replicates by reverse transcription of its transcript. While this is analogous to retroviruses and retroviral-like retrotransposons, L1 replicates by the dramatically different process of target-site-primed reverse transcription, or TPRT [31,32] (Fig. 1C). In this mechanism reverse transcription of the L1 RNA transcript is primed from a 3′ hydroxyl at the genomic insertion site [33]. Highly conserved endonuclease and reverse transcriptase domains in ORF2p indicate its replicase function in TPRT [26,34,35]. In contrast, ORF1p lacks any known enzymatic domains, although it does contain highly conserved non-canonical RNA binding domains and phosphorylation sites [36,37], that are required for retrotransposition activity.
Fig. 1.

A) L1 domain structure. The domain organization of a typical full-length human L1 element. Approximate positions (not to scale) of the 5′ untranslated region (5′ UTR), ORF1, ORF2 (which includes endonuclease (EN) and reverse transcriptase (RT)), 3′ UTR, and the poly A tail are noted. B) Domain structure and trimeric depiction of ORF1p. The amino acid positions of the N-terminal domain (NTD), coiled coil domain (CC); RNA recognition motif (RRM) and C-terminal domain (CTD) for mouse [13,40,47,55] and human [36,44,46] ORF1p are denoted in red and blue text, respectively. C) Schematic of the proposed steps in TPRT [3,31,32]. Target site of the genomic DNA is depicted in blue shades and L1 RNA in purple. The first DNA strand at the target site is cleaved by ORF2p EN. The L1 RNA is annealed to the cleaved site and reverse transcribed by ORF2p RT to synthesize the first L1 cDNA (red). The second genomic target site strand is cleaved, and it primes the L1 cDNA to synthesize the second L1 DNA strand (magenta). Subsequent DNA synthesis required for completion is denoted in dashed lines. Consequently, target site duplications (TSD) are produced at the flanks of the newly synthesized L1 element. ORF1p may mediate strand exchange reactions required to anneal the primers and/or facilitate nucleic acid arrangements during reverse transcription.

Human and mouse ORF1 encode a ~40 kDa protein and early studies showed that they could be isolated as L1 RNP particles associated with their encoding L1 RNA [30,38]. As presented in detail below, subsequent studies showed that the 40 kDa monomer forms a coiled coil-mediated trimer that binds nucleic acids with high affinity (Fig. 1B) [38,39]. In vitro studies using bulk biochemical and single molecule assays showed that human and mouse ORF1p (from here on referring to the trimer) act as nucleic acid chaperones, presumably during TPRT [32,40-43], however, at which steps and by what means is unknown. In addition, both human and mouse ORF1p can polymerize in the presence or absence of nucleic acids [43-45]. Recent structural and theoretical studies on human and mouse ORF1p addressed its complex nucleic acid binding properties [46-49], yet we do not understand how they contribute to the molecular mechanism of L1 retrotransposition. We now review the current understanding of ORF1p in the context of those nucleic acid binding properties that had been obtained with the purified protein. Therefore, we will not be discussing the extensive and informative cell biological literature on the structure and properties of the L1RNP and its possible interaction with host factors as exemplified by two recently published papers [50,51].

ORF1p structure

The amino- and carboxy-terminal halves of mammalian ORF1p evolved under dramatically different evolutionary constraints. The carboxy terminal half is highly conserved (Fig. 2) and contains a non-canonical RNA recognition motif (RRM) and a distinct C-terminal domain (CTD) (Fig. 1B). Residues in the carboxy-terminal half mediate high affinity RNA binding and nucleic acid chaperone activity [32,39-42,44,52]. In contrast, the sequence of the amino-terminal half can be highly variable, although a coiled coil motif is conserved throughout vertebrate evolution [21,53,54]. Direct visualization of mouse ORF1p using atomic force microscopy revealed an elongated dumbbell-like structure, consistent with its trimeric conformation [55] mediated by the coiled coil [44,46,55], which is parallel in nature and stabilized by additional inter-chain hydrogen bonds [46]. Molecular structures for the human C-terminal half [36,46] and the mouse CTD [47] were solved. The RRM domain of human ORF1p is structured with a typical βαββαβ fold and contains non-canonical RNP1 and RNP2 sequences (Fig. 2) with aromatic side chains that could mediate base-stacking or hydrophobic interactions with nucleic acid substrates. It also contains two highly conserved salt bridges (E165-R215 and E169-R202) that are formed between two loops, L(β1-α1) and L (β2–β3) [36,46]. The L (β2-β3) loop is intrinsically disordered [46] and contains the two T/SP motifs (T203P204, and T213P214) out of the four (S18P19, S27P28 in the NTD) highly conserved proline-directed protein kinase (PDPK) targets in ORF1p. T or S phosphorylatable residues at these respective sites are required for L1 retrotransposition [37,56].
Fig. 2.

Alignment of mouse and human ORF1p sequences.

Consensus sequences of the mouse L1Tf family [13] and the human Ta1 subfamily of the currently active L1Pa1 [15] family were aligned. The middle L1T1f/Ta1 sequences shows the comparison between the aligned elements. A dot indicates identity, dashes indicate gaps, and lower case indicates conservative amino acid differences. The bottom entry (Identical) shows the 100% identical positions. The numbers in black refer to the L1Tf, and those in red, the L1Ta1 sequence. The beginning and end of the coiled coil domain, and the starts of the RNA recognition motif (RRM) and C-terminal domain (CTD) are indicated. The coordinates of the RRM and CTD were taken from references [36,47]. The alignment is also annotated with the following information: The beginning of truncated M128 [44], which is largely a monomer at ≥20 °C; the location of a natural variant in the mouse coiled coil domain (D159H, using the L1Tf numbering with an offset of plus 4 in the mouse alignment only to account for the 4 position gap introduced into the alignment of L1Tf between positions 40 and 45) [41,26,38,46] paired mutations in two adjacent highly conserved arginine residues of the CTD, in human ([26,29]) and in mouse [42]. Heptad repeats are highlighted in green and yellow. RNP1 and RNP2 sequences are highlighted in light brown [36]. Regarding the “phenotype” of these mutations: retro means retrotransposition as measured in a cell culture based retrotransposition assay [26]; RNP means the presence of ORF1p in RNP particles isolated from in vivo assays; RNA binding means as performed with purified ORF1p protein in vitro; chaperone means chaperone activity (as described in [42]).

A crystal structure of human ORF1p, which lacked most of its N-terminal half, yielded three distinct orientations[46] consistent with three structural states for the CTD: resting, lifting, and twisting. The π-stacking interaction between Y282 and R155 was shown to primarily influence the dynamics of the CTD. A lifting motion of the CTD was proposed to open the RRM–CTD cleft and expose the basic residues for nucleic acid binding. Subsequent twisting or rotational motions would then wrap the nucleic acid, such that a continuous strand would occupy binding sites at each ORF1p monomer. The resting position of the CTD render the basic patch residues of the coiled coil domain accessible, which could allow further nucleic acid binding. This binding mechanism is speculative, however, because the structure did not contain nucleic acid. To support the proposed binding mechanism, several mutants were generated and probed in NA binding and retrotransposition studies. For instance, the highly conserved R261 (Fig. 2), at which substitutions eliminated retrotransposition both in human [26,29] and mouse ORF1p [42], was predicted from the structure to inhibit RNA binding, which was also confirmed by size-exclusion chromatography. Other mutations, such as R220A, abolished retrotransposition but did not inhibit RNA binding. The effect of this and other similar mutations was proposed to be due to a defect in ORF1p flexibility and dynamics based on the structure. However, it is important to keep in mind that, as mentioned above, the crystal structure was derived from a protein that lacks the NTD and slightly more than half of the coiled coil domain, both of which contain structures critical to ORF1p function. The intrinsically disordered NTD contains highly conserved PDPK phosphorylation sites [37] and the coiled coil facilitates rapid polymerization of the trimers on single-stranded NA [43], two functions that are vital for L1 retrotransposition. Furthermore, the structural conformation of ORF1p may significantly vary in the presence of nucleic acids. Therefore, although the ORF1p C-terminal half structure suggests that flexible inter-domain orientations are functionally important, the exact details of the structure-function correlation may differ for full length protein. Nevertheless, it is clear that multiple domains in ORF1p simultaneously interact with nucleic acids and are also capable of mediating inter-protein interactions [44] (discussed in the next section, see Fig. 3). This results in a complex set of possible configurations, which in turn appear to depend strongly on solution conditions. An understanding of these configurations is essential to determine how ORF1p facilitates different stages of retrotransposition.
Fig. 3.

ORF1p polymerization.

Schematic representation of possible ORF1p species and their cross-linked products as observed in Callahan et al. [44]. The numbers to the left of the cartoons of the cross-linked species indicate their monomer content. 4..n and 5..n indicate higher orders of multimers beyond 3 or 4, respectively. Higher orders of ORF1p multimers of trimers were observed in crosslinking experiments with 1 mM EGS, in the presence of oligonucleotides at 0.05 M NaCl.

Nucleic acid binding properties of ORF1p

Several early studies demonstrated co-localization of ORF1p and L1 RNA in L1 RNPs. For example, L1 RNA of L1RNP isolated from mouse embryonal carcinoma cells was efficiently cross-linked to protein by a brief exposure to ultraviolet light, indicating that ORF1p and L1 RNA were in close contact [57]. Similarly, high concentrations of monovalent cations and RNase dissociated L1 RNPs isolated from human teratocarcinoma cell lines, again indicating co-localization between the protein and L1 RNA [38,58]. Co-localization of ORF1p with L1 RNA was further confirmed when HeLa cells were transfected with an L1 that contained an epitope tagged ORF1p [29]. Subsequent quantitative binding studies of ORF1p to NA showed that ORF1p binds without sequence specificity to both RNA and DNA [41-44,52,59,60]. Electrophoretic mobility shift assays showed that mouse L1 ORF1p binds RNA with positive cooperativity, providing early evidence of protein-protein interactions between the polypeptides [61]. The nature of protein-protein interactions was extensively studied in Callahan et al. [44] with human ORF1p purified from insect cells (Fig. 3). In this study, the cross-linking reagent EGS, which interacts with primary amines (e.g.,ε-amino group of lysine), was used to capture ORF1p interaction products, prior to treatment with 8 M urea, which fully denatures non-crosslinked ORF1p into its constituent components, i.e., monomers, partial trimers, and multiples of trimers (Fig. 3). In 0.5 M NaCl ORF1p is a soluble trimer but is unable to bind nucleic acids. Optimal nucleic acid binding occurs at 50 mM NaCl, but in the absence of nucleic acids, ORF1p trimers rapidly form massive polymers that precipitate from solution. When cross linked with 1 mM EGS the cross-linked polymers are too large to enter a 6% denaturing polyacrylamide (PAGE) gel (Fig. 3). Addition of equimolar oligonucleotides (N ≥ 20) immediately resolves the ORF1p polymers to trimers or multimers of trimers (depending on the length of the oligonucleotide, (Fig. 3)). Therefore, ORF1p remains active in its polymeric conformation, its presumed state in the L1RNP. Partial crosslinking occurs with limiting EGS (0.05 mM) and generates a ladder of products starting at the size of the ~40 kDa monomer and multimers thereof (Fig. 3). Carrying out these experiments with an amino-terminal truncated ORF1p (M128, Fig. 2) [44], which contains just 3.5 heptads of the 14-heptad coiled coil and is largely a monomer at ≥20 °C, showed that the amino acid residues that mediate inter-trimer interactions reside in the conserved C-terminal half of the protein. The ability to form large oligomers in the absence of nucleic acid may provide a mechanism to prevent the diffusion of the newly synthesized protein into the cytoplasm before binding to L1 RNA. However, the relationship between ORF1p translation from the L1 transcript and formation of the L1 RNP with the L1 transcript remains obscure as any such large clusters must dissociate in order to optimally coat the transcript. ORF1p binds RNA and ssDNA with similar affinity, but with an ~8-fold lower affinity to a perfectly matched duplex. In contrast, ORF1p binds mismatched duplexes with about the same affinity as ssDNA, and, in fact, stabilizes it from dissociation [44]. This occurs at near equimolar or a 10-fold higher molar excess of ORF1p/duplex but at sufficient molar excess, ORF1p dissociated the duplex. The monomeric form of ORF1p (i.e., N-terminal truncated M128p, Fig. 2) bound nucleic acids with substantially lower affinities, in comparison to the trimer, and could not protect a mismatched duplex from dissociation [44], even though M128p contains the intact RRM and CTD domains critical for both high affinity nucleic acid binding and NA chaperone activity [36,42,45,47]. These results indicate that intra-trimeric interactions play vital roles for the mechanism of nucleic acid binding by ORF1p. Single molecule DNA stretching studies have been used to quantitatively characterize the affinity of mouse ORF1p for DNA. Fig. 4 shows a diagram of a typical single molecule DNA stretching experiment while Fig. 5A shows a DNA stretching curve in the presence and absence of wild type mouse ORF1p. The width of the transition, shown as F in Fig. 5A, demonstrates the strong effect of ORF1p on DNA conformation. Transition width is positively correlated with protein binding to DNA, so the width as a function of concentration was used to determine the DNA binding affinity by fitting to a binding isotherm [41]. Here the protein concentration dependence of the change in the DNA force-extension profile is used to determine the DNA binding affinity. Below we will discuss the biophysical interpretation of the change in transition width in relation to nucleic acid chaperone activity. The DNA binding affinity measurements demonstrate that mouse ORF1p binding to DNA is in the range of 10 nM in near-physiological solution conditions (50–100mM Na+). However, many ORF1p mutants did not alter binding affinity but strongly affected retrotransposition [41,42]. Thus, more complex interactions beyond simple nucleic acid binding determine the effect of mouse ORF1p on retrotransposition, as will be discussed below in the section on nucleic acid chaperone activity
Fig. 4.

Single molecule DNA stretching.

A) Schematic depiction of an optical tweezers system. A single molecule of DNA (48.5 kbp) is attached by its biotinylated ends to streptavidin-coated polystyrene beads. One bead is immobilized by a glass micropipette attached to a flow cell while the other is held in an optical trap. The optical trap is created by converging two counter-propagating laser beams to overlap in space using microscope objectives. By moving the glass micropipette attached to the flow cell, the tethered DNA molecule is stretched, and the force exerted on the DNA is measured as a function of extension in the presence and absence of protein. B) Solid and empty black circles (also the solid and dashed black lines in Figs. 5A and 6 A) represent the stretch and return curves of a bare dsDNA molecule. The green circles represent the force-extension curve of an ssDNA molecule. The distinct regimes of the dsDNA force-extension profile are denoted in grey text. At forces ≪~60 pN the DNA molecule is primarily double-stranded. The plateau at ~60 pN represents a sharp overstretching transition, where very little additional force causes the DNA molecule to stretch to almost twice its contour length. This is due to force-induced melting of the DNA molecule and this transition is referred to as the helix-coil transition [73]. The force range over which this transition occurs is defined as the helix-coil transition width (or transition width). In the salt conditions used in these studies, the stretched dsDNA molecule is mostly single-stranded at forces higher than the helix-coil transition. The stretch and return cycles of the bare DNA molecule are almost reversible and immediate re-annealing of the duplex is observed when the stretched DNA molecule is returned to zero-force (empty black circles).

Fig. 5.

Single molecule DNA stretching results from mouse ORF1p mutational analyses [41,42,59].

Solid and dashed blue lines are the stretch and return curves of dsDNA in the presence of 15 nM wild type mouse ORF1p. ΔF is the relative increase in the helix-coil transition width due to bound protein, where ΔF = δF–δF0. Here, δF0 (~4pN) and δF are the helix-coil transition widths in the absence and presence of protein, respectively. δF (shown as the force difference between the solid red lines) is determined by the intersection of the dashed red lines on the figure, which represent ds- and ss- DNA regimes of the ORF1p-DNA complex. Thus, ΔF is a measure of the relative increase in force (due to bound protein) required in the cooperative conversion of ds- to ss- DNA, which was shown to be positively correlated with the nucleic acid chaperone capabilities of the bound protein as described in the text. B) ΔF at 15 or 20 nM protein concentration, as measured in single molecule DNA stretching assays (blue), and retrotransposition activity in cultured in vivo assays (red) for ORF1p mutants, presented after normalizing with the corresponding values for the wild type protein. The transition width for the RR297:298AA mutant at these concentrations was not reported due to its lower binding affinity. However, it was shown to saturate at a 5-fold higher concentration than what was used for wild type protein. The empty bar is to denote that the transition width is likely much smaller at these concentrations in comparison with the wild type protein. Extensive aggregation of single- or double- stranded DNA in comparison with the wildtype protein, are denoted with a plus sign. While aggregation is important to facilitate DNA interactions and can increase chaperone activity, extensive aggregate formation can also slow down DNA interaction kinetics, which may in turn inhibit chaperone activity [79]. The ability to bind RNA as observed in the bulk solution assays is also reported. The plus sign denotes comparable binding affinity as wild type and these mutants are ranked according to their relative binding affinities, with (−) the lowest and (3+) the highest affinity. ‘nd’ represents ‘not determined’.

Nucleic acid chaperone activity of ORF1p

The L1 retrotransposon, like retroviruses, replicates by reverse transcription of an RNA intermediate. Reverse transcription of retroviruses is facilitated by the nucleic acid chaperone activity of its nucleocapsid (NC) proteins. This appears to be achieved by lowering the energy barrier of rearranging nucleic acids to achieve maximal base complementarity [62]. Therefore, in vitro, a nucleic acid chaperone will primarily mediate nucleic acid annealing and strand exchange reactions. NC proteins from multiple retroviruses and retroelements [63-66] act as nucleic acid chaperones and contain the CCHC zinc finger motif, critical for retroviral replication processes [67,68]. In the case of HIV-1 NC, nucleic acid chaperone activity is positively correlated with retroviral infectivity [69,70]. ORF1p of I factor, a LINE-like element of Drosophila melanogaster [71], also behaves as a nucleic acid chaperone [72]. Although some retrotransposons such as Ty3 [66] contain an NC protein, L1 does not. Thus, as mammalian L1 ORF1p does not contain a CCHC motif it likely does not function by the same mechanism as NC proteins. Nevertheless, both mouse and human L1 ORF1p were shown to be nucleic acid chaperones [32,43]. To address the mechanism of ORF1p chaperone activity mutational analyses on mouse ORF1p were performed to correlate in vitro nucleic acid binding with retrotransposition. As discussed above, DNA binding affinities for several mutants was determined by single molecule DNA stretching assays using helix-coil transition width as a function of ORF1p concentration. The helix-coil transition in the absence of protein is very narrow (δF is small), due to cooperative conversion of dsDNA to ssDNA, i.e., the requirement for near simultaneous melting of multiple base pairs. Therefore, it was suggested that when an NA-binding protein increases the transition width (transition is less cooperative) it does so by lowering the energetic barrier to rearrangement of nucleic acid secondary structure, thereby promoting melting of small numbers of base pairs [67,68]. Thus, measurement of the transition width reflects both DNA binding affinity and its chaperone activity; i.e., that a greater transition width represents greater chaperone capabilities. When fully bound by protein, mouse ORF1p mutants altered the transition width to different degrees, and the extent to which the proteins increased the transition width at saturated protein binding appeared to correlate with retrotransposition activity. This result is shown in Fig. 5B, which shows the transition width at high protein concentration, near saturated binding, for several mutants [73]. As described above, HIV-NC protein dramatically increased the transition width [67,68,74,75] as also observed with ORF1p. Thus, it was suggested that the mutations reduced ORF1p nucleic acid chaperone activity, which was responsible for reduced retrotransposition [42]. However, the exact mechanism by which this dramatic increase in helix-coil transition occurs is not clear. A similar effect is also observed in the presence of DNA intercalators such as ethidium [76] and ruthenium complexes [77,78], which prevent DNA strand separation. Therefore, the observed increase in the helix-coil transition width in the presence of HIV-1 NC was subsequently attributed to the simultaneous destabilization of the duplex while maintaining protein-mediated interactions between the two DNA strands [65,69,70]. Below we discuss the ORF1p mutations that altered both retrotransposition and their interaction with DNA as measured by single molecule DNA stretching. The mutations are shown in Fig. 2 and the results from stretching studies are summarized in Fig. 5B. Substitutions in the highly conserved R284 (Fig. 2, note the +4 offset in the mouse ORF1p residue positions as described in the legend to Fig. 2) in the RRM and RR297–298 and Y318 in the CTD as well as D159 in the coiled coil of mouse ORF1p (Fig. 2) were analyzed by DNA stretching [41,42,59]. Alanine substitutions at 284 and 318 (R284A and Y318A) eliminated retrotransposition, while substitutions that conserved charge or hydrophobicity (R284K and Y318F) were active in retrotransposition. In contrast, KR297–298, KK297–298 or AA297–298 substitutions at RR297–298 eliminated retrotransposition while the RK297–298 mutation remained active, so simply preserving the charge is not always sufficient to retain retrotransposition activity. A [26,29] D159H substitution at the coiled coil lowered the retrotransposition frequency by a factor of ~15. As shown in Fig. 5B, many of the mutants show a direct correlation between reduced transition width and reduced retrotransposition relative to wild type ORF1p (R284A, R284K, Y318A, Y318F, R298K), suggesting that the reduced retrotransposition is due to inhibition of ORF1p chaperone activity. However, there are some outliers, which show similar transition widths to wild type ORF1p but are defective in retrotransposition (D159H, RR297:298KR, RR297:298KK). These outliers required further analysis to understand why retrotransposition was inhibited, however we note that D159H is a coiled coil mutant and in Section 5, we explore the effect of coiled coil mutants of human ORF1p and their effect on retrotransposition. To further understand the interactions between these additional ORF1p mutants and DNA, the full stretching curve was analyzed. In addition to an increase in transition width, extensive dsDNA aggregation by dsDNA binding proteins can increase the force required to reach the dsDNA contour length (RR297–298 KR, RR297–298KK, and RR297–298AA in Fig. 5B) [59]. In a recent study, the dsDNA aggregation by NC proteins from SIV and HIV-1 were quantitatively measured and used to differentiate the nucleic acid chaperone activity of these two proteins [63]. Strong dsDNA aggregation and wild-type transition widths was observed for RR297–298KR and RR297–298KK. While this shows that these proteins behave differently than wild type ORF1p, we do not know why retrotransposition is inhibited. Although dsDNA aggregation is a strong component of nucleic acid chaperone activity, aggregation can also inhibit chaperone activity if it slows protein-nucleic acid interaction kinetics [79]. The reduced retrotransposition for RR297–298AA was attributed to weak RNA binding. In comparison, all of the other mutants studied showed RNA binding affinity similar to wild type ORF1p. Finally, while D159H showed a wild type transition width and wild type levels of dsDNA aggregation, it was defective in retrotransposition. However, upon analysis of the full stretching curve, it was found that the defective activity of the protein is correlated with its extensive ssDNA aggregation. The DNA observed at extensions beyond 0.55 nm/bp is mostly single-stranded (Fig. 4B) in the salt conditions (50–100mM Na+) used in the studies discussed here[80-82]. Therefore, a relative decrease in ssDNA extension after the helix-coil transition in the presence of protein is indicative of ssDNA aggregation due to bound protein. This reduced retrotransposition activity resulting from ssDNA aggregation may also be attributed to a reduction in protein-DNA interaction kinetics [41,79].Thus, all of the mutants described here that exhibited wild type nucleic acid binding but were defective in retrotransposition showed a difference in ssDNA aggregation, dsDNA aggregation, or helix-coil transition properties in single molecule stretching experiments. These differences in turn may be related to nucleic acid chaperone activity. However, the mechanisms behind these changes were not determined in wild type and mutant mouse ORF1p studies. Therefore, understanding the reason for the observed behavior will require quantitative DNA interaction kinetics studies and characterization of ORF1p oligomerization states as discussed below [43,44].

ORF1p oligomerization and single molecule DNA binding kinetics

Single molecule DNA stretching assays were also carried out on human ORF1p to determine its DNA interaction kinetics in the context of its previously measured oligomerization properties [43,44]. A single DNA molecule was captured in the optical tweezers instrument and stretched to high forces beyond the helix-coil transition, such that most of the DNA molecule was single-stranded. The DNA molecule was held at this force in the presence of ORF1p for different incubation times, as shown in Fig. 6A. After subsequent relaxation to lower forces, the resulting length of DNA strongly depended on the time of incubation, indicating that ORF1p formed stably bound oligomers on ssDNA, with more stably bound (longer) oligomers at longer incubation time [43]. Three ORF1p variants were compared; modern human ORF1p, its resuscitated primate ancestor and a mosaic variant of the modern protein in which 9 of the 30 substitutions in the coiled coil domain retained their ancestral state (Fig. 2 and inset of Fig. 6B). Although these proteins formed equally stable trimers and behaved indistinguishably in bulk binding and in vitro nucleic chaperone assays, the mosaic 151p was completely inactive in an in vivo retrotransposition assay (Fig. 7). Here, the rate of ORF1p oligomerization on ssDNA was determined (Fig. 6). During incubation, stable oligomers formed on the ssDNA molecule held between polystyrene beads in solution in the optical tweezers. After a fixed time, the fraction of stable oligomers was determined. The results showed that both retrotransposition-competent ORF1p proteins (i.e, the modern human protein and its resuscitated ancestral counterpart) rapidly formed stably-bound oligomers on ssDNA. In contrast, the retrotransposition-defective mosaic ORF1p formed such stably bound oligomers at a rate at least 10-fold slower (Fig. 6B, green curve). This property is a function of the coiled coil sequence, presumably by positioning the carboxy-terminal halves ORF1p in a way that is conducive to the inter-trimer contacts that mediate oligomerization. Other than trimer formation per se (which is necessary but clearly is not sufficient for retrotransposition), this study revealed the first functional role for the coiled coil in L1 retrotransposition.
Fig. 6.

Single molecule studies of human ORF1p [43].

A) In contrast to the studies described in Fig. 5, here, the DNA molecule is overstretched above the helix-coil transition up to the ssDNA regime before incubating it with the protein, in order to minimize dsDNA binding effects and exclusively investigate ssDNA-ORF1p binding kinetics. Solid and dashed line represent the stretch-return cycle of a bare dsDNA, with the return curve also referred to as 0 min incubation. The colored empty circles are the return curves of a DNA molecule after incubating with 2 nM protein at ~70 pN for different time periods. The gold curve listed as saturated represents the maximum shift in extension towards ssDNA, obtained at long incubation times and greater than 15 nM protein concentration. The ssDNA-bound protein during the incubation prevents reannealing and thereby shift the relaxation curve after protein incubation towards the ssDNA curve (see green circles, Fig. 4B), allowing one to quantitatively probe the ssDNA fraction bound as a function of time. The solid lines represent the fitted curves as described elsewhere [43]. ORF1p rapidly binds ssDNA. However, it transforms relatively slowly into stable oligomers. Less stable proteins (presumably un-transformed trimers) dissociate during the return cycle and the ssDNA fraction bound by such proteins is defined as the fast fraction in this study. The fast fraction decreases with incubation time as they transform into more stable oligomers on ssDNA. B) Fast fraction as a function of time for modern human (111p, from L1Pa1 family), its resuscitated ancestral primate (555p, from L1Pa5 family) and a mosaic ORF1p (151p) in which 9 residues in the coiled coil are replaced with the corresponding ancestral residues, as shown in the inset (also see Fig. 2 where these residues are indicated in teal). Domain boundaries in the inset correspond to Fig. 1B and white and grey shades represent the amino acid residues of the modern and ancestral ORF1p, respectively. The number of amino acid substitutions relative to the modern ORF1p is denoted in the relevant domains in the other two ORF1p variants. The measured fast fraction is modeled as a sum of increasing and decreasing exponential functions (solid lines) and red, blue and green represent modern, ancestral, and mosaic proteins, respectively. The fast fraction rapidly saturates for all three variants, indicating rapid protein binding to ssDNA. However, this fraction decreases with increasing incubation time as the proteins form more stable oligomers. Therefore, the rate at which the fast fraction decreases is proportional to the rate of stable oligomerization of protein on ssDNA. The retrotransposition-incompetent mosaic ORF1p was at least 10-fold slower in forming stable oligomers in comparison with the active human and primate protein variants.

Fig. 7.

Retrotransposition assay.

A) Organization of an L1 vector in a typical retrotransposition assay. The L1 vector contains an antisense copy of the neo gene disrupted by an intron in the sense orientation. sd and sa are the splice donor and splice acceptor sites, respectively. The intron of the transcribed L1 vector will be spliced and contain the antisense copy of the neo gene. Retrotransposition competent elements will support subsequent cDNA synthesis of this transcript at a DNA target site and ultimately the insertion of an active copy of the neo gene, which when expressed from its promoter (Pr, in red) generates colonies of G418 resistant cells or foci. B) An example of stained foci generated from the retrotransposition assay described in (A) [43]. p111_rtc, p151_rtc and p555_rtc are the L1 vectors containing the modern human, mosaic and the resuscitated primate ORF1 sequences (see inset of Fig. 6B), respectively.

Conclusions and future directions

Although ORF1p is known to be critical for L1 retrotransposition in mouse and human systems, the mechanism by which it facilitates retrotransposition is not yet understood. To provide a clear mechanism, it will be necessary to fully characterize its structure function relationship and correlate its in vitro activities with its function in the cell. For mouse ORF1p, several studies have identified specific mutations that determine the locations of regions that are critical for nucleic acid chaperone activity and for retrotransposition. However, studies of mouse ORF1p have not yet characterized its oligomerization properties, although aggregation of DNA was observed in many single molecule experiments. This indicates that oligomerization and polymerization, as observed for human ORF1p, likely also occur for mouse ORF1p. It would be interesting to characterize these properties for wild type and mutant mouse ORF1p using similar methods for comparison. The recent experiments on human ORF1p demonstrate a wide variety of protein-protein and protein-nucleic acid interactions that likely govern ORF1p function in the cell. It is important to quantify the behavior of human ORF1p under multiple solution conditions to understand the dynamics of protein oligomerization. Although protein oligomerization is difficult to study, it is now clear that oligomerization kinetics are a determinant of retrotransposition activity. Therefore, future studies should attempt to determine the relationship between structure and human ORF1p function while including supramolecular structures formed during oligomerization. Not only are these large structures themselves important, but the process of assembly of the structures is clearly also critical for protein function.
  80 in total

1.  Trimeric structure for an essential protein in L1 retrotransposition.

Authors:  Sandra L Martin; Dan Branciforte; David Keller; David L Bain
Journal:  Proc Natl Acad Sci U S A       Date:  2003-11-13       Impact factor: 11.205

Review 2.  Single-molecule stretching studies of RNA chaperones.

Authors:  Hao Wu; Ioulia Rouzina; Mark C Williams
Journal:  RNA Biol       Date:  2010-11-01       Impact factor: 4.652

3.  Rapid kinetics of protein-nucleic acid interaction is a major component of HIV-1 nucleocapsid protein's nucleic acid chaperone function.

Authors:  Margareta Cruceanu; Robert J Gorelick; Karin Musier-Forsyth; Ioulia Rouzina; Mark C Williams
Journal:  J Mol Biol       Date:  2006-08-30       Impact factor: 5.469

4.  Characterization of a LINE-1 cDNA that originated from RNA present in ribonucleoprotein particles: implications for the structure of an active mouse LINE-1.

Authors:  S L Martin
Journal:  Gene       Date:  1995-02-14       Impact factor: 3.688

5.  A novel active L1 retrotransposon subfamily in the mouse.

Authors:  J L Goodier; E M Ostertag; K Du; H H Kazazian
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

6.  Mobile interspersed repeats are major structural variants in the human genome.

Authors:  Cheng Ran Lisa Huang; Anna M Schneider; Yunqi Lu; Tejasvi Niranjan; Peilin Shen; Matoya A Robinson; Jared P Steranka; David Valle; Curt I Civin; Tao Wang; Sarah J Wheelan; Hongkai Ji; Jef D Boeke; Kathleen H Burns
Journal:  Cell       Date:  2010-06-25       Impact factor: 41.582

7.  High frequency retrotransposition in cultured mammalian cells.

Authors:  J V Moran; S E Holmes; T P Naas; R J DeBerardinis; J D Boeke; H H Kazazian
Journal:  Cell       Date:  1996-11-29       Impact factor: 41.582

8.  Non-LTR retrotransposons encode noncanonical RRM domains in their first open reading frame.

Authors:  Elena Khazina; Oliver Weichenrieder
Journal:  Proc Natl Acad Sci U S A       Date:  2009-01-12       Impact factor: 11.205

9.  The ORF1 protein encoded by LINE-1: structure and function during L1 retrotransposition.

Authors:  Sandra L Martin
Journal:  J Biomed Biotechnol       Date:  2006

10.  The Evolution of LINE-1 in Vertebrates.

Authors:  Stéphane Boissinot; Akash Sookdeo
Journal:  Genome Biol Evol       Date:  2016-12-01       Impact factor: 3.416

View more
  9 in total

1.  Reactivity of IgG With the p40 Protein Encoded by the Long Interspersed Nuclear Element 1 Retroelement: Comment on the Article by Carter et al.

Authors:  Mary K Crow
Journal:  Arthritis Rheumatol       Date:  2019-12-27       Impact factor: 10.995

2.  The L1-ORF1p coiled coil enables formation of a tightly compacted nucleic acid-bound complex that is associated with retrotransposition.

Authors:  Ben A Cashen; M Nabuan Naufer; Michael Morse; Charles E Jones; Mark C Williams; Anthony V Furano
Journal:  Nucleic Acids Res       Date:  2022-08-26       Impact factor: 19.160

Review 3.  mRNA Vaccines: Why Is the Biology of Retroposition Ignored?

Authors:  Tomislav Domazet-Lošo
Journal:  Genes (Basel)       Date:  2022-04-20       Impact factor: 4.141

Review 4.  Control of LINE-1 Expression Maintains Genome Integrity in Germline and Early Embryo Development.

Authors:  Fabiana B Kohlrausch; Thalita S Berteli; Fang Wang; Paula A Navarro; David L Keefe
Journal:  Reprod Sci       Date:  2021-01-22       Impact factor: 3.060

5.  Toxinology provides multidirectional and multidimensional opportunities: A personal perspective.

Authors:  R Manjunatha Kini
Journal:  Toxicon X       Date:  2020-05-11

6.  Properties of LINE-1 proteins and repeat element expression in the context of amyotrophic lateral sclerosis.

Authors:  Gavin C Pereira; Laura Sanchez; Paul M Schaughency; Alejandro Rubio-Roldán; Jungbin A Choi; Evarist Planet; Ranjan Batra; Priscilla Turelli; Didier Trono; Lyle W Ostrow; John Ravits; Haig H Kazazian; Sarah J Wheelan; Sara R Heras; Jens Mayer; Jose Luis García-Pérez; John L Goodier
Journal:  Mob DNA       Date:  2018-12-15

7.  Early life stress during the neonatal period alters social play and Line1 during the juvenile stage of development.

Authors:  Amelia Cuarenta; Stacey L Kigar; Ian C Henion; Liza Chang; Vaishali P Bakshi; Anthony P Auger
Journal:  Sci Rep       Date:  2021-02-11       Impact factor: 4.379

Review 8.  Factors Regulating the Activity of LINE1 Retrotransposons.

Authors:  Maria Sergeevna Protasova; Tatiana Vladimirovna Andreeva; Evgeny Ivanovich Rogaev
Journal:  Genes (Basel)       Date:  2021-09-30       Impact factor: 4.096

9.  MET canonical transcript expression is a predictive biomarker for chemo-sensitivity to MET-inhibitors in hepatocellular carcinoma cell lines.

Authors:  Wafaa M Rashed; Mohamed A Kandeil; Mohamed O Mahmoud; Doha Maher; Sameera Ezzat; Mohamed H Abdel-Rahman
Journal:  J Cancer Res Clin Oncol       Date:  2020-09-26       Impact factor: 4.553

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.