Literature DB >> 27136327

Structure of a group II intron in complex with its reverse transcriptase.

Guosheng Qu1, Prem Singh Kaushal2, Jia Wang3, Hideki Shigematsu4, Carol Lyn Piazza1, Rajendra Kumar Agrawal2,5, Marlene Belfort1,5, Hong-Wei Wang3,4.   

Abstract

Bacterial group II introns are large catalytic RNAs related to nuclear spliceosomal introns and eukaryotic retrotransposons. They self-splice, yielding mature RNA, and integrate into DNA as retroelements. A fully active group II intron forms a ribonucleoprotein complex comprising the intron ribozyme and an intron-encoded protein that performs multiple activities including reverse transcription, in which intron RNA is copied into the DNA target. Here we report cryo-EM structures of an endogenously spliced Lactococcus lactis group IIA intron in its ribonucleoprotein complex form at 3.8-Å resolution and in its protein-depleted form at 4.5-Å resolution, revealing functional coordination of the intron RNA with the protein. Remarkably, the protein structure reveals a close relationship between the reverse transcriptase catalytic domain and telomerase, whereas the active splicing center resembles the spliceosomal Prp8 protein. These extraordinary similarities hint at intricate ancestral relationships and provide new insights into splicing and retromobility.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27136327      PMCID: PMC4899178          DOI: 10.1038/nsmb.3220

Source DB:  PubMed          Journal:  Nat Struct Mol Biol        ISSN: 1545-9985            Impact factor:   15.369


Group II introns, which are dynamic bacterial elements, are ancestrally related to eukaryotic spliceosomal introns and retrotransposons [1,2]. Group II introns are self-splicing, but like nuclear spliceosomal introns, they splice through two transesterification reactions to give rise to an excised intron lariat and ligated exons [1]. Whereas nuclear intron splicing is catalyzed by a spliceosome, a complex ribonucleoprotein (RNP) assembly consisting of small nuclear RNAs (snRNAs) and multiple protein factors [3], group II intron splicing is catalyzed by the intron ribozyme complexed to one intron-encoded protein (IEP) [1]. As mobile retroelements, these introns reverse-splice into DNA whereupon reverse transcriptase (RT) of the IEP uses the intron RNA as a template for target-primed reverse transcription (TPRT) [4,5]. Similarly, telomerase uses TPRT to extend chromosome ends [6,7]. These structural and mechanistic parallels between group II and spliceosomal introns, as well as between group II introns and retrotransposons, suggest that these self-splicing retroelements have played a pivotal role in eukaryotic evolution. Group II introns have a conserved RNA structure consisting of six domains. An appreciation of each domain's role in splicing has been achieved from crystal structures of in vitro transcribed group II intron RNAs [8-10]. The largest domain, domain I (DI), is the scaffold for folding the remainder of the intron, and contains sequences that bind exons through base-pairings to specify sites for splicing and retromobility into DNA. DII and DIII have been suggested to be a regulator and an effector of splicing, respectively. DIV often accommodates the open-reading frame (ORF) encoding the IEP, and contains the IEP binding site [11-13]. The most structurally conserved domain of the intron is DV, which is a short hairpin with a conserved “catalytic triad” at its base. Besides the triad, nucleotides from the junction of DII and DIII contribute to the catalytic core that coordinates metal ions and directly participates in catalysis [8-10]. DVI contains the branch-point, a bulged adenosine, the 2’-OH of which is the nucleophile that attacks the phosphate at the 5’ splice site to initiate the first step of splicing. This catalytic RNA structure is bound by the IEP for both splicing and retromobility reactions. The IEPs are also complex entities. The IEP encoded by the intron of interest in Lactococcus lactis, LtrA, has maturase activity, which facilitates splicing [11,14,15], in addition to RT and endonuclease (EN) activities, which can mobilize the intron [14]. The IEP binds to the intron RNA after translation from the DIV ORF, to form an RNP complex [12,13]. Once excised from the primary transcript, the intron RNA can reverse splice into DNA, utilizing the IEP to complete the retromobility reaction. The EN domain of the IEP bound to the integrated intron cleaves the antisense DNA strand, generating the primer with a 3’-OH that is extended by the RT in a TPRT reaction [1,5]. To enhance our understanding of these reactions, we set out to define the structure of a group II intron RNP complex. Previous structural studies yielded important insights into protein-free group II intron ribozymes synthesized in vitro [8-10] but not into RNPs isolated from cells. We previously reported isolation and purification of excised Ll.LtrB intron RNP particles from the native host L. lactis, and described their hydrodynamic properties, as well as their overall shape by small-angle X-ray scattering [16]. However, the molecular mechanism whereby the IEP interacts with the intron and facilitates the intron's multiple activities remains largely unknown. Here we present a 3.8 Å resolution single particle cryo-EM structure of the L. lactis L1.LtrB group II intron in complex with its IEP, LtrA. In addition, we determined a cryo-EM structure of an LtrA-depleted intron RNA at 4.5 Å resolution. Our study provides the first glimpse of a bacterial RT interacting with a mobile intron RNA. The structure reveals LtrA as a mosaic of a bacterial HNH endonuclease and RT with similarities to eukaryotic telomerase and spliceosomal RT domains, while the configuration of the post-catalytic splicing complex provides clues to how the RNP engages DNA for retrohoming.

RESULTS

Overall architecture of native group II intron RNP

We purified the Ll.LtrB group IIA intron RNP from its native lactococcal host by an intein-based strategy as previously described [16]. The intron had been engineered to delete most of the ORF encoding LtrA, which was expressed downstream of the intron (). The resulting RNPs, which reveal only intron RNA and LtrA bands on silver-stained gels (), were active in splicing (), target DNA binding, integration and bottom strand DNA cleavage (). The excised intron RNP complex, comprising a 902-nt intron RNA and 599-amino acid LtrA protein, appeared as mono-dispersed intact molecules with dimensions of ~20 nm by negative staining EM, allowing us to obtain a three-dimensional (3D) reconstruction of the RNP complex at ~30 Å resolution, which served as the reference for high resolution refinement (). Through subsequent cryo-EM analysis of the complex, we obtained a 3D reconstruction of the group II intron RNP complex at an average resolution of 3.8 Å from a major portion of the particles (). While core regions of the complex are better resolved (up to 3.5 Å; ; ), peripheral regions have somewhat lower resolution (~6 Å), indicating an intrinsic flexibility. In most parts, the map shows clear secondary and tertiary structure characteristics of the RNA and protein components. These features allowed us to determine the 3D structure of the group IIA intron (), revealing some substructures not observed in group IIB and group IIC introns (). Classification of the cryo-EM dataset revealed a 3D class that has similar shape to the RNA portion of the RNP complex but completely lacks density for the IEP (. Since the affinity-purification is IEP-based (), this RNA form must be depleted of LtrA after RNP binding to the affinity column. Refinement of this class to 4.5 Å resolution reveals conformational differences in the LtrA-depleted RNA (). In the RNP structure, the intron RNA exhibits a roughly Y shape, and the density corresponding to LtrA bridges the two arms of the Y by binding the DI and DIV domains (). In addition to clear secondary structure features in most parts, the LtrA protein density also reveals some bulky side-chains of amino acids (), allowing modeling of the entire protein (). Notably, the DNA-binding and EN domains at the C-terminus of LtrA appear to be dynamic, with the densities corresponding to these two domains relatively less resolved ().

The intron ribozyme

The core regions of the intron RNA within the LtrA-bound and -depleted forms show similarity to structures of group IIB and group IIC introns [9,10] (). Because group IIA and IIB introns have additional peripheral RNA helices, they are more similar to each other than to the smaller group IIC intron. Therefore, we focus mainly on the comparison between group IIA (LtrA-bound form, which is better resolved and more complete) and IIB introns. As for group IIB and IIC introns [9,10], DI of Ll.LtrB contains exon binding sites, EBS1, EBS2 and δ, which base-pair to specify sites for splicing and retromobility into DNA [1] (). Interestingly, we find that in the absence of LtrA, EBS1 is disordered, whereas EBS2 adopts a slightly different conformation (). DI in the group IIA Ll.LtrB intron possesses additional structural features not previously observed, for instance, the β-β’ interaction [17] (), which is consistent with biochemical studies[18], and a unique RNA helix, d(iii)a, interacting with LtrA protein (). DII and DIII, which enhance the catalytic activity and interact via τ-τ’ in the IIB intron [9], lack this interaction in Ll.LtrB, from which a requisite hairpin is absent. However, tertiary interactions between DII and DVI (π-π’ and η-η’) [9] are present in both types of intron (). DIV, which both encodes and interacts with LtrA, is longer and adopts a different conformation than in the group IIB intron (), and is made up of two hairpin-like branches that emerge from one basal stem. This difference could be innate, or attributed to engineering of this region to remove coding sequences, or due to the absence of an IEP in previous structural studies. DV, the main catalytic domain, is highly conserved across the group II introns, and exhibits the best defined structure in the EM map. Surprisingly, there is a clear density accommodating a 12-nt strand of nucleic acid running along EBS1 and EBS2 of DI in our RNP structure (). We assumed that this strand represents the ligated exons that have remained associated with the intron using intron binding sites (IBSs), as also observed after splicing of reconstituted RNPs in vitro [19], and that the rest of the loosely associated portion of the mRNA has been degraded or is disordered. This supposition was supported by traces of mRNA found in RNP particles from which the IEP has been removed, where primer extension analysis shows cDNA corresponding to the ligated exons (trapped mRNA ~5% of total ligated exons, ). Moreover, our LtrA-depleted intron structure also lacks the density for mRNA (), suggesting a concomitant release of mRNA along with the IEP. Nevertheless, how this mRNA is displaced to allow DNA targeting in the presence of the IEP is a matter of conjecture (discussed below).

LtrA, a bacterial RT similar to telomerase and Prp8 RTs

LtrA is a multi-domain protein, consisting of an RT domain, a DNA-binding domain (DBD), and an EN domain [20] (). The RT domain is further divided into three subdomains, the N-terminal domain (NTD), a fingers-palm domain containing the catalytic residues, and a thumb domain corresponding to the maturase (sometimes referred to as domain X) (). The DBD appears specific to LtrA, whereas the C-terminal HNH endonuclease is in wide use by mobile bacterial elements to initiate homing[21]. These modules are variously involved in RNA binding, splicing and reverse-splicing (NTD, fingers-palm, thumb, DBD), DNA binding (DBD), cDNA synthesis (RT fingers-palm) and target DNA cleavage (EN). In contrast to suggestions that LtrA interacts with the group IIA intron as a dimer like HIV-RT [16,19,20,22], our structure clearly shows that LtrA interacts as a monomer (). This discrepancy with particles generated in vitro [19,22] and in vivo [16] remains unexplained, but interestingly, most telomerase RTs (TERTs), which maintain the integrity of chromosome ends, and the pre-mRNA processing protein 8 (Prp8) at the heart of the spliceosome, also bind their cognate RNAs as monomers [23-28]. The possibility of LtrA dimer formation in the RNP is ruled out because: (i) we do not see cryo-EM density that would accommodate another molecule of LtrA, and (ii) some of the regions proposed to be involved in dimerization [20] are occupied in RNA-protein interactions (). Consistent with this observation, LtrA carries an insertion in fingers domain (IFD), also present in Prp8 () and in the catalytic subunit of TERT () [29,30] but not in the previously suggested homolog HIV-RT[20] (Figs. 3a-d; Supplementary Fig. 6d; Supplementary Videos 3-5). Further dissection of LtrA protein indicates its similarity to TERT and Prp8.
Fig. 3

Structure Model of LtrA

(a) Overall structure of LtrA protein, with structural domains identified by matching colors. The bar diagram in panel a depicts the overall domain organization of LtrA. (b) Tertiary structure alignment of LtrA RT fingers-palm domain with that in Tribolium castaneum TERT (PDB: 3DU5, left panel; sequence alignment is shown in ) and thumb domain with that in Saccharomyces cerevisiae Prp8 (PDB: 3ZEF, right panel; sequence alignment is shown in ). LtrA domains are in the same colors as in panel , with purple bars representing RT motifs and blue bars representing the thumb domain, whereas homologous domains in TERT and Prp8 are in grey. The RMSDs between the structures of fingers-palm domains of LtrA and TERT and the thumb domains of LtrA and Prp8 are 2.0 Å and 1.5 Å, respectively. (c) Schematic comparison of homologous RT domains. Structure-based sequence alignments[45] were used to identify conserved sequence blocks RT1-RT7 (purple) and three parallel helices of the thumb domain (cylinders in red, orange and green). Insertions between RT1 and RT7 (2a, 4a, 5a, 7a, and ti, white bars) are labeled in red. IFDs, magenta bars. The two catalytic aspartates are highlighted (red). (d) Comparison of tertiary structures of RT domains in LtrA and related proteins, TERT, Prp8, and HIV-1 RT. Subdomains are color coded as in panel . For structure-based sequence alignments of RT domains, see .

The NTD of LtrA, which contains a β-hairpin and three α-helices, is unique among characterized RTs, and has likely evolved to contact DIVa and anchor LtrA to the RNA template for TPRT [31]. The NTD is thus proposed to be a functional equivalent to a telomerase RNA-binding domain (TRBD) [29,30]. The fingers-palm and thumb domains form a “right-hand” configuration of a typical RT with all seven RT motifs, RT-1 to RT-7 (). A comparison of fingers-palm and thumb domains with their nearest structural homologs, reveals that LtrA's fingers-palm domain is most closely related to that of a TERT [29], whereas the thumb domain is structurally most closely related to the thumb domain of spliceosomal Prp8 [32] (Figs. 3b-d; Supplementary Note; Supplementary Figs. 6-8; Supplementary Table 1; Supplementary Videos 3-6).

Intron RNA-LtrA protein interactions

LtrA protein makes multiple contacts with specific regions of DI and DIV in the intron RNA ( & 4). Here we describe the RNA-protein contacts within 4 Å of each other. DI is intimately involved in interactions with the C-terminal half of LtrA, including the DBD, thumb domain, and regions of fingers-palm domains. Whereas the β-strand of the DBD interacts with the RNA helix c2(ii) of DI (), the third α-helix (Tα3) of LtrA's thumb domain interacts with DI primarily through the bound 12-nt exon fragment (). Thus, the exon-binding sites, EBS1 and EBS2, are primarily occupied by the mRNA fragment present in our RNP structure. Additionally, we find that EBS1 and EBS2 adopt different conformations in LtrA-depleted intron RNA, which lacks mRNA (), corroborating a study with a group IIC intron, wherein conformational changes of the EBSs are related to exon recognition [33]. The loop region of the DBD also interacts with DI through the mRNA (). This segment of the DBD includes three basic amino acids RRY (residues 501-503) that were previously shown to affect reverse splicing activity [34]. Notably, Tα3 of the thumb domain that directly contacts the EBS-IBS duplex () contains the amino-acid sequence EYSC (residues 460-463), which was suggested by unigenic evolution screening to interact with the core region [35]. Additionally, the tip of α-helix 5 (FPα5) and loops between β-sheets of the LtrA fingers-palm domain (RT-7 motif, as described in [35]) interact with the RNA hairpin Id(iii)a of DI (). We also observed clear interactions of the ti loop [20] of LtrA's thumb domain, encompassing Lys407, Lys408, Asp409 and Phe413, with a cavity formed by the apical loops of RNA helices Id3(ii) and Id(iii) of DI (). The NTD and fingers of LtrA interact with DIVa (). While the NTD terminal α-helices interact with DIVa helix iii (), the fingers’ α helix 4 (FPα4) and the connecting β-strand interact with the RNA helices DIVa(i, ii) (). These results are substantially in accord with previously described protein-RNA interactions including those involving the Shine-Dalgarno sequence required for LtrA translation[12,13,35]. More generally, the observed interactions of the NTD and fingers domain with DIVa, and of the palm, thumb and DBD with DI of the Ll.LtrB intron are consistent with previous biochemical studies [11-13,31,34,35]. In our structure, as in group IIB and IIC introns [9,10], the catalytic core for splicing and reverse splicing comprises the AGC triad, the dinucleotide bulge from DV, and a conserved guanosine (G471 here) in J2/3, the junction between DII and DIII (). In the IIB and IIC introns, these conserved residues coordinate magnesium ions for catalysis [9,10]. Additionally, the group IIA intron's 5’- and 3’-ends are in close proximity to the core, whereas the bulged adenosine (A) branch-point in DVI, and the lariat structure formed between the 5’-end of the intron and the A branch-point are adjacent (). Also, the exon-binding sites, EBS1 and EBS2, are in the vicinity of the core. Notably, the current structure allows us to view the intron active site in the context of the IEP, the thumb domain of which interacts with both the EBSs and the IBS-containing mRNA.

DISCUSSION

This work provides an unprecedented view of the interactions between a catalytic intron RNA and its cognate protein that helps effect RNA catalysis. Furthermore, the structural analysis of the IEP highlights ancestral relationships with related proteins: bacterial LtrA comprises domains with structural similarities to spliceosomal Prp8 RT and telomerase RT from eukaryotes. Whereas parallels to Prp8 allow us to appreciate the role of the protein in RNA splicing, the similarities to TERT reveal how the intron RNP utilizes RT and a bacterial HNH endonuclease domain of the IEP to prepare for DNA integration and TPRT.

How the IEP facilitates splicing reactions

The stability of the LtrA-depleted intron RNA structure () indicates that whereas LtrA assists intron folding [36], it is not needed for maintenance of the overall RNA structure. Rather, after RNA folding, LtrA is required specifically for interactions involved in RNA splicing and retrohoming. One difference from the LtrA-bound structure is in DIVa, the primary IEP binding site, which is disordered in the absence of the LtrA (). Strikingly, in the absence of LtrA, the mRNA, which is sandwiched between EBSs and the thumb domain of LtrA in the RNP complex, is lost, leaving the loops of the EBSs, particularly of EBS1, disordered (). Similarly, disordered EBSs of a group IIC intron become ordered by providing exogenous mRNA [33]. These observations imply that LtrA has a direct role in stabilization of EBS-IBS interactions. Thus, whereas initial high-affinity binding of LtrA to DIVa serves to anchor the protein and regulates its translation [12], interactions with DI not only enhance binding[11], but most importantly, provide key functional interactions for splicing and reverse splicing. We propose that the EBS-IBS interaction can form independently but is stabilized by RNA-protein interactions. Interestingly, LtrA binds less tightly to spliced intron RNA than to the exon-containing precursor and LtrA binding to the intron lariat is enhanced by RNA or DNA oligonucleotides corresponding to IBS sequences [11]. These observations have implications for promoting both RNA splicing and reverse-splicing into RNA or DNA by involving the maturase thumb domain, which would promote splicing by stabilizing the EBS-IBS duplex, an action that could also facilitate reverse splicing into DNA for retrohoming.

Active sites of the Ll.LtrB intron and the spliceosome

Our study supports the idea of a common ancestry between group II introns and the spliceosome. Not only are there structural similarities between LtrA and Prp8 (), but also, these proteins interact with the equivalent RNA components that direct 5’ splice site recognition, the EBS1 loop in the group II intron and the U5 loop I in the spliceosome (). Remarkably, the thumb domains of LtrA and Prp8 interface with their respective EBS1-exon and U5-exon complexes (). Similarly, the active site helices, catalytic residues and branch-points are analogous between DV and DVI of the group II intron and U6-U2 snRNAs () [1,2,8-10,28,37,38]. In the spliceosome, U6 and U2 partner while U2 base-pairs with the intron to present the bulged adenosine to the active site helix and catalytic triad formed mainly by U6 [3]. This occurs in much the same way as DVI of the group II intron presents the bulged adenosine to the active site helix of DV (), which is in turn juxtaposed to EBS1-exon 1 for catalysis at the 5’ splice site [37,38]. In both cases the thumb domain of the respective protein buttresses these conserved interactions.

Structure-function similarities between LtrA and TERT

Group II intron retrohoming and telomere lengthening both occur by TPRT [5-7] (). Whereas TERT uses telomerase RNA as the template and the chromosome 3’ end as primer, LtrA uses the group II intron RNA that is reverse-spliced into the target DNA as template and the opposite strand, which is cleaved by the EN domain to generate a 3’ end, as primer. Considerable progress has been made recently on the structure of telomerase from both protozoa and beetles [24,29,30]. Like TERT, ~25 Å diameter cavity formed by LtrA is able to accommodate a DNA-RNA heteroduplex (). Docking of the DNA-RNA heteroduplex from the model of Tribolium casteneum TERT[30] (PDB ID: 3KYL) into the analogous cavity of LtrA shows that although the RT-1 and RT-2 loop partially penetrates the RNA-DNA heteroduplex, there is no perturbation of the main protein structure. It is noteworthy that many other features of the active center of TERT and LtrA are similar. Two of the three identical active site aspartate residues, D343 and D344 in LtrA, are catalytically essential for reverse transcription, but not for splicing [14,35]. Additionally, most of the residues that either constitute the dNTP pocket or interact with the RNA template and the DNA primer in the crystal structures of TERT are conserved between TERT and LtrA (). These observations are consistent with LtrA initiating cDNA synthesis in TPRT in a manner similar to TERT.

An intron RNP poised for retrohoming

Our structure of the post-splicing RNP represents a functional state ready to target DNA for reverse splicing after displacement of the mRNA (). Topographies of the RT, DBD and EN domains provide insight into subsequent steps toward TPRT, while raising some mechanistic questions. The first question is how the mRNA is removed, such that the double-stranded DNA (dsDNA) can be scanned by the RNP, for the IEP to melt the IBS and thereby to allow base-pairing between the intron RNA's EBSs and the DNA top strand. Dissociation of the RNP from the ligated exons could possibly occur during translation or in the initial steps of retromobility, by replacement of the mRNA with DNA[1,39]. Although this question remains unanswered, the mRNA resolved in the current cryo-EM structure is the equivalent of the DNA top strand, and the RNP structure indicates that the EBS-IBS interaction is stabilized by LtrA's thumb domain (). Based on this premise, the DNA insertion site (IS) on the top strand is brought to the active site (DV) in proximity to DVI and the intron lariat, for reverse splicing (). Because RNP invasion of the top DNA strand is a relatively unfavorable event[40], we propose that the IEP's DBD domain, which stacks on the thumb, would be positioned to interact with the 3’-exon downstream of the IS to trap intron invasion products. This interaction occurs through recognition of the single T+5 residue[39], which also serves to bend the DNA [41]. The EN domain next to the DBD tips toward the entrance of the cavity of LtrA (), suggesting that dsDNA bending may serve to bring the DNA cleavage site (CS) to the EN active site in LtrA (). EN cleavage of the bottom strand to generate the 3’-OH primer for cDNA synthesis, 9 nt downstream of the IS, then occurs after reverse splicing of the intron into the top DNA strand (). After reverse splicing, the DNA-RNA hybrid would be enveloped within the LtrA cavity, but the RT active site is 45 Å from the EN active site, raising the second question, of how this distance is bridged. Importantly, the EN domain appears partially disordered (), and therefore hints not only at its conformational flexibility but also that its precise state could control cleavage activity, as does the HNH nuclease domain of Cas9 in the CRISPR system [42]. However, larger structural rearrangements of the trimolecular DNA-RNA-protein complex must be invoked to bring the 3’-OH primer to the RT active site (). Possibly, the interaction of LtrA with DIV, which provides the main anchoring site for LtrA on the intron, is maintained, while the weaker interactions with DI are relaxed to enable translocation of the RNA-DNA template-primer or LtrA protein or both. The cleaved DNA 3’-end in proximity to the tri-aspartate active site of LtrA RT would then serve to prime TPRT for intron retrohoming.

Evolutionary relatedness to spliceosomes and telomeres

Clearly, the group II intron RNP is a molecular mosaic of endonuclease and RT domains. The RT domain of the bacterial LtrA is structurally related to cellular functions in extant eukaryotes; the RNA-binding thumb resembles Prp8 of the spliceosome in structure, whereas the catalytic fingers and palm appear more akin to telomerase TERT. Prp8, which is present in primitive eukaryotes, also forms a large internal cavity thought to interact in spliceosomes with the 5’ and 3’ splice sites and the branch-point A [32]. There is little doubt that group II intron RNAs are ancestral to the spliceosome, and the likely donors of the snRNAs, and the hypothesis that the group II IEP gave rise to Prp8, which then lost RT activity, is also favored [2,27,32,43]. TERT, which is present in the most deeply branching of eukaryotes, performs telomere addition by a TPRT mechanism similar to that used in group II intron retrotransposition [6,7]. Cellular TERT may have evolved from a parasitic intron to solve the problem of protecting chromosome ends, or opportunistic introns may have recruited primitive cellular RTs in order to invade genomes. Hereto a group II intron origin for telomerase is favored [6,7,44]. If we then assume a group II intron origin for spliceosomal Prp8 and TERT, the presumption is that the components of the structure were subjected to different evolutionary pressures, where the thumb domain of Prp8 was constrained in the spliceosome by specific RNA interactions involved in splicing, whereas the palm and fingers in TERT were under evolutionary pressure to maintain RT function. In each case, the RT domain would co-evolve with the RNA on which it acts. Regardless, structural and functional parallels between RNA-protein interactions involved in exon recognition by the group II intron RNP and the spliceosome and between TPRT in intron mobility and telomere lengthening are inescapable.

ONLINE METHODS

RNP purification

The Ll. LtrB group II intron RNP complex with most of the intron's open reading frame deleted was produced and purified by following previous procedures [16] with minor modifications. Lactococcus lactis IL1403 containing the plasmid pLNRK-nisLtrB (ΔORF+A)-12MS2E2+nisLtrA-intein-CBD was grown, lysed and applied to chitin resin. After washing the resin, 200 μg of MS2-MBP protein was introduced onto each column to separate the unspliced intron RNP. The columns were washed, then immersed in 40 mM DTT for 16-18 h to release the bound LtrA protein and its associated intron RNAs. After purification by sucrose gradient sedimentation, for cryo-EM analysis, residual genomic DNA was digested with RQ1 RNase-free DNase (Promega) for 15 min at 37 °C, and the pure RNP was recovered by applying the digestion mixture to an Amicon Ultra-2 MWCO 100 KDa filter unit. The purified RNP product was snap-frozen in liquid nitrogen and stored at −80 °C. The purity of the RNA content of the RNP was verified on a 1.2% agarose-formaldehyde gel and by primer extension analysis (see below), and the protein was observed on polyacrylamide gels with Coomassie and silver staining.

Primer extension analysis

Reverse transcription reactions were performed using SuperScript® III Reverse Transcriptase (Invitrogen) as per manufacturers protocol, using 2.0 μg of DNA-free total RNA or 20 ng purified intron RNP preparation, and 0.4 pmol of 5’ end 32 P-labeled primer (Exon 2 specific primer IDT1072 - CTGCGGCCGCAGAATTAAAAATG, or Intron specific primer IDT1073 - GTACCTTAAACTACTTGACTTAACACC). Products were analyzed on a denaturing 7 M urea-8% polyacrylamide gel, exposed on phosphor screens, and scanned on a GE Healthcare Typhoon Trio.

RNP activity assay

DNA oligonucleotides (IDT3204 - CAACCCCGTCGTCGTGAACACATCCA TAACCATATCATTTTTAATTCTAC, and IDT3206 - GTAGAATTAAAAATGATATG GTTATGGATGTGTTCACGATCGACGTGGGTTG), which contain all the sequence required for intron retrohoming, were labeled at the 5’ end using 32P-γ-ATP (PerkinElmer) and T4 polynucleotide kinase (NEB), and annealed to make a double-stranded substrate. To assay activity, 100 ng of purified intron RNP was incubated in reverse splicing buffer (50 mM Tris-HCl, pH7.5, 10 mM KCl, 10 mM MgCl2, 5 mM DTT, 0.1% Nonidet P-40, 0.1% Tween-20) at 37 °C for 50 min, with 0.4 pmol of 32P-end-labeled substrate DNA. Reaction products were put on ice, divided in half, and diluted with glycerol (25%, v/v) (for native gel) or 2X Gel Loading Dye II (Ambion) (for denaturing gel). Products were analyzed for DNA binding and reverse splicing, on a 4% native polyacrylamide gel and a denaturing 7 M urea-10% polyacrylamide, respectively, exposed on phosphor screens, and scanned on a GE Healthcare Typhoon Trio. Original images of gels, autoradiographs and blots used in this study can be found in .

Electron Microscopy

The purified intron RNP complex was diluted to a concentration of 76 nM in reverse splicing buffer, containing 50 mM Tris-HCl, pH 7.5, 10 mM KCl, 10 mM MgCl2, 5 mM DTT, immediately before preparing the negatively-stained or frozen-hydrated grids. For negative staining EM, 4 μl of the RNP solution was applied to a glow-discharged holey carbon grid that was pre-coated with a thin layer of continuous carbon film over the holes. After 1 min, the grid was washed consecutively with three droplets of 2% (w/v) uranyl acetate solution for total of 30 sec. After another 1 min, the residual stain was blotted off and the grid was air-dried. The EM data for the negatively-stained specimen was collected on an FEI Tecnai-12 transmission electron microscope, equipped with a LaB6 filament and operated at 120 kV acceleration voltage. Images were collected at a nominal magnification of 68,000×, using a total dose of about 20 e−/Å2 and defocus value ranging from −1.0 to −1.5 μm on a Gatan Ultrascan4000 CCD camera, with a pixel size of 1.65 Å on the object scale. Total of 66 tilt-pair images of the specimen at 0° and 50° were recorded manually for the random-conical tilt 3D reconstruction. For cryo-EM, frozen-hydrated specimens were prepared using the FEI Vitrobot Mark IV plunger. 4 μl of the diluted RNP complex was placed on a glow-discharged holey carbon grid (Quantifoil Cu-Rh R1.2/1.3) pre-coated with the home-made continuous carbon film (with ~2 nm thickness). The excess of solution from the grid was blotted for 2.0 s at 100% humidity at 22 °C before the grid was flash frozen into liquid ethane slush cooled at liquid nitrogen temperature. Cryo-EM data was collected on an FEI Titan Krios electron microscope, equipped with a Gatan K2 Summit direct-electron counting camera. The microscope was operated at 300 kV and images of the specimen were recorded with a defocus range of −1.2 to −3 μm at a calibrated magnification of 38,269 in super resolution mode of the K2 camera, yielding a pixel size of 0.653 Å on the object scale. A total of 2,266 movie stacks, each containing 32 sub-frames, were recorded using the semi-automated low-dose acquisition program UCSF-Image4 (developed by X. Li), with an electron dose rate of 6.25 electrons per Å2 per second and total exposure time of 8 seconds. Thus, the total accumulated dose on the specimen was about 50 electrons per Å2 and each frame had an exposure time of 0.25 s (see ).

Image processing of electron micrographs

In order to generate a reliable initial model of the RNP complex, we performed the random-conical tilt (RCT) reconstruction method of the negatively stained specimen to get a low resolution 3D model of the complex [46]. First, we used the e2boxer.py in EMAN2 package [47] to semi-automatically pick 15,783 particles with a box size of 180×180 pixels from the negatively-stained micrographs of untilted specimens. We then used the particle coordinates of the untitled-specimen micrographs to locate the corresponding tilt-paired particle coordinates in the 50° tilted-specimen, based on the particle distribution pattern feature of the paired micrographs (using a home-made python algorithm developed by J. Wang and H.W. Wang). In brief, we estimated the rough tilt axis, directions and angles of the tilted specimen using CTFTILT program [48]. Next, we compressed the coordinates of particles from the untilted specimen to estimate the location of their partners in tilted specimen. This was followed with local cross-correlation calculation to further refine the particle location and tilting parameters in the tilted micrograph. Finally, we picked 14,933 tilt-pairs of particles from the raw micrographs. The untilted-specimen particle images were decimated twice, band-pass filtered, normalized, aligned, and classified using multivariate statistical analysis and multi-reference alignment iteratively in IMAGIC-4D software package [49]. Based on the 2D analysis, we performed reconstruction from the tilted-specimen particle images of the same class using back projection methods in SPIDER [50]. This procedure generated 39 structures, among which two distinct volumes with the most populated particles were used as initial models for further 3D refinement using both the untilted and tilted-specimen particle images in RELION 1.3 [51]. After refinement, all models converged to a very similar structure. For the high-resolution 3D reconstruction from the cryo-EM dataset, the 32 frames of each movie stack collected from the K2 Summit camera were decimated twice, aligned and summed using the frame-based motion correction algorithm to generated a final drift-corrected micrograph for further processing [52]. The contrast transfer function (CTF) parameters of each micrograph were determined using CTFFIND3 [48]. We first manually picked 92,665 particle images of the RNP from 256 drift-corrected micrographs using EMAN2 and performed reference-free 2D-classification and alignment in RELION 1.3. This yielded 100 2D class averages with detail features. We then used the best 2D class averages as reference to perform auto picking procedures in RELION 1.3 of all the micrographs to generate a total of 608,019 particle images. After two rounds of reference-free 2D-classification, particle images cleaned of ice contaminants and other impurities were subjected to reference-based 3D-classification procedures against the 3D RCT reconstruction from the negatively stained specimen. The 3D classification yielded four major classes, among which three classes showed an intact RNP complex and one class lacked density for the LtrA protein. A total of 450,296 particle images from the first three classes were merged for 3D auto-refinement in RELION 1.3. The 3D refinement of this dataset yielded a 4.21 Å resolution map. Then we applied a soft mask on the core region of the map for further auto-refinement (). This yielded a final 3D map at average 3.8 Å resolution (corrected gold-standard FSC at 0.143), after post-processing with auto-mask and the auto-bfactor option (B-factor of −161 Å2) in RELION 1.3. We also performed 3D auto-refinement of the class with LtrA-depleted RNA, containing 102,522 particle images, and obtained a final refined map at 4.5 Å resolution (corrected gold-standard FSC at 0.143), after post-processing with auto-mask and the auto-bfactor option in RELION 1.3 ().

Molecular Interpretation of the cryo-EM map

Modeling of the group IIA intron

The X-ray crystallographic structure of group IIB intron (PDB ID: 4R0D) [9] was used as a starting reference model to generate an initial molecular model for the group IIA intron RNA. For this, structure of the group IIB intron was docked first as a rigid body into the cryo-EM density. This initial docking revealed substantial conformational differences between the crystallographic structure of the group IIB intron and our cryo-EM density. In order to obtain a better fit, individual domains of group IIB intron structure were computationally separated and docked independently into the cryo-EM density using Chimera [53]. While the docking of the RNA helices in the conserved core region was straightforward, most of the peripheral and non-homologous RNA helices that do not fit into the cryo-EM map were deleted. In the next step, nucleotides in the best-fit regions were mutated according to the sequence of group IIA intron using Coot [54], which also matched better to the cryo-EM density. To build the non-homologues helices, the sequence and secondary structure of the group IIA intron [18] was used to generate 3D models for individual RNA helices, using RNAComposer package [55]. Each of these helices was individually fit into the corresponding cryo-EM density, and then connected using Coot. In the final model most of the helices in the core region retained a very similar structure to that of initial group IIB structure ().

Modeling of the LtrA protein

To obtain an initial model, online homology modeling server I-TASSER [56] and Phyre [57] were used. These servers take the sequence information provided by user to search whole protein data bank (PDB) for homologous structures, and use multiple structural templates to generate the final homology model. The I-TASSER gives the best top 5-homology models, whereas Phyre gives a single homology model. The homology modeling using both I-TASSER and Phyre servers failed to generate a full-length model of LtrA that could be rigid-body docked into the cryo-EM density corresponding to the LtrA protein. Therefore, modeling of individual domains was pursued, according to structural domain information in the previously reported homology model of LtrA[20]. For the homology models of RT fingers-palm (residues 82-360) both I-TASSER and Phyre server gave very similar models that could be easily docked into the cryo-EM mass corresponding to the LtrA fingers-palm region (, left panel). This model was further adjusted wherever necessary and few small segments of α-helices were modeled based structural features in our cryo-EM density (, middle panel). The final fitted model retains the same fold and shows RMSD of 1.9 Å with initial homology model (, right panel). Similarly, the initial homology model of thumb domain (residues 424-473) was docked in the corresponding cryo-EM mass (, left panel). The complete thumb domain (residues 391-474) was built based on secondary structure prediction (http://www.compbio.dundee.ac.uk/jpred4/indexup.html), features in the cryo-EM map (, middle panel). The model building for the ti-loop within the thumb domain was also supported by the clearly observed cryo-EM densities for bulky side chain residues (). The final model retains the three helices (, right panel) and has a RMSD of 1.6 Å with the initial model. In order to eliminate the potential model bias of TERT and Prp8 in our homology modeling practice for the fingers-palm and thumb domains, we also intentionally removed TERT and Prp8 from the database and performed the homology modeling again. We found little difference of the final modeling outcome with that from the entire database. For the modeling of the EN domain, the antiparallel β-sheet of the initial EN homology model (residues 525-591) was docked in the cryo-EM (, left panel). The alpha helix was subsequently modeled into the cryo-EM mass (, middle panel). Note that this helix in all five homology models obtained from I-TASSER shows different conformations and does not make any tertiary interaction with EN domain, which allows us to move this helix with respect to the antiparallel β-sheet, into the density. Cryo-EM masses that correspond to the loop regions that connect the antiparallel β sheet was disordered in our map, and therefore the loop region was not modeled (, right panel). The final model retains the antiparallel β-sheet that is characteristic of HNH endonuclease () and shows RMSD of 1.7 Å with initial homology model. The homology models obtained for NTD and DBD could not be rigid-body docked in corresponding cryo-EM mass. To model the NTD and DBD domains, secondary structure prediction using JPred4 (http://www.compbio.dundee.ac.uk/jpred4/indexup.html) and matching the secondary structure within cryo-EM mass was applied (). Because of weaker densities and relatively low resolution for the DBD and the EN domains in our highest 3.8 Å resolution cryo-EM map, we used a lower resolution (5 Å) map to partially model these domains (). Disordered densities for the DBD and EN domains in our highest resolution (3.8 Å) map, might be due to the flexibility of these domains in the absence of a substrate DNA, which may be needed to stabilize these domains through functional interactions. All individually fitted domains were connected subsequently using Coot. In the final LtrA model residues 1-3, 252-301, 364-374, 503-524, 562-580 and 592-599 were not modeled. The models of the group IIA intron and LtrA protein were combined and refined in two steps. First, using the Molecular Dynamics Flexible Fitting (MDFF) protocols [58], and subsequently using the PHENIX software [59]. MDFF was performed by applying restraints to maintain secondary structures, cis-peptide and chirality of the molecule. Initial 2000 cycles of energy minimization was followed by 20 ps of flexible fitting using a gscale value of 0.3. The structure obtained after MDFF was subjected to real space refinement, by applying secondary structure restraints in PHENIX. Iterative cycles of model building in Coot and model refinement using PHENIX were performed. For the model building of the LtrA-depleted group II intron, the RNA from the final refined model of the RNP complex was docked into the cryo-EM map. Disordered regions were deleted and structure was refined using PHENIX. The final models were validated using MolProbity () [60]. The FSC curves calculated between the final RNP model and the map, and between the model for the best resolved RNA domain V and corresponding density in the cryo-EM map, are shown in .
Table 1

Data collection, refinement and modeling information

Group IIA RNPLtrA-depleted intron RNA
Data Collection
    Particles450,296102,522
    Pixel size (Å)0.6530.653
    Defocus range (μm)1.2 - 31.2-3
    Voltage (kV)300300
    Electron dose (eÅ−2)5050
Model composition
    Non-hydrogen atoms1907013494
    Protein residues486-
    RNA bases704630
    Ligands--
Refinement
    SoftwareRELION1.3RELION1.3
    Resolution (Å)3.84.5
    Accuracy of rotation (°)2.4853.06
    Accuracy of translation1.0041.324
    Map sharpening B-factor (Å2)−161−120
RMS deviations
    Bonds (Å)0.0010.001
    Angles (°)0.3210.219
Validation (proteins)
    Molprobabity score2.02-
    Clashscore, all atoms4.39 (95th2.37 (99th
    Good rotamers (%)99.5-
Ramachandran plot
    Favored (%)74-
    Outliers (%)6-
Validation (RNA)
    Correct sugar puckers (%)9998.7
    Good backbone7978.0
  59 in total

1.  Mechanism of maturase-promoted group II intron splicing.

Authors:  M Matsuura; J W Noah; A M Lambowitz
Journal:  EMBO J       Date:  2001-12-17       Impact factor: 11.598

2.  The pathway for DNA recognition and RNA integration by a group II intron retrotransposon.

Authors:  Yasunori Aizawa; Qing Xiang; Alan M Lambowitz; Anna Marie Pyle
Journal:  Mol Cell       Date:  2003-03       Impact factor: 17.970

3.  Accurate determination of local defocus and specimen tilt in electron microscopy.

Authors:  Joseph A Mindell; Nikolaus Grigorieff
Journal:  J Struct Biol       Date:  2003-06       Impact factor: 2.867

4.  Protein structure prediction on the Web: a case study using the Phyre server.

Authors:  Lawrence A Kelley; Michael J E Sternberg
Journal:  Nat Protoc       Date:  2009       Impact factor: 13.491

5.  Prp8, the pivotal protein of the spliceosomal catalytic center, evolved from a retroelement-encoded reverse transcriptase.

Authors:  Mensur Dlakić; Arcady Mushegian
Journal:  RNA       Date:  2011-03-25       Impact factor: 4.942

Review 6.  Spliceosome structure and function.

Authors:  Cindy L Will; Reinhard Lührmann
Journal:  Cold Spring Harb Perspect Biol       Date:  2011-07-01       Impact factor: 10.005

7.  Features and development of Coot.

Authors:  P Emsley; B Lohkamp; W G Scott; K Cowtan
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2010-03-24

8.  Structure of the Tribolium castaneum telomerase catalytic subunit TERT.

Authors:  Andrew J Gillis; Anthony P Schuller; Emmanuel Skordalakes
Journal:  Nature       Date:  2008-08-31       Impact factor: 49.962

9.  Structural basis for exon recognition by a group II intron.

Authors:  Navtej Toor; Kanagalaghatta Rajashankar; Kevin S Keating; Anna Marie Pyle
Journal:  Nat Struct Mol Biol       Date:  2008-10-26       Impact factor: 15.369

10.  MolProbity: all-atom structure validation for macromolecular crystallography.

Authors:  Vincent B Chen; W Bryan Arendall; Jeffrey J Headd; Daniel A Keedy; Robert M Immormino; Gary J Kapral; Laura W Murray; Jane S Richardson; David C Richardson
Journal:  Acta Crystallogr D Biol Crystallogr       Date:  2009-12-21
View more
  56 in total

1.  Reverse transcriptases lend a hand in splicing catalysis.

Authors:  Joseph A Piccirilli; Jonathan P Staley
Journal:  Nat Struct Mol Biol       Date:  2016-06-07       Impact factor: 15.369

Review 2.  Structural biology of telomerase and its interaction at telomeres.

Authors:  Yaqiang Wang; Juli Feigon
Journal:  Curr Opin Struct Biol       Date:  2017-07-18       Impact factor: 6.809

3.  DNA cleavage and reverse splicing of ribonucleoprotein particles reconstituted in vitro with linear RmInt1 RNA.

Authors:  María Dolores Molina-Sánchez; Nicolás Toro
Journal:  RNA Biol       Date:  2019-04-14       Impact factor: 4.652

Review 4.  Group II Intron RNPs and Reverse Transcriptases: From Retroelements to Research Tools.

Authors:  Marlene Belfort; Alan M Lambowitz
Journal:  Cold Spring Harb Perspect Biol       Date:  2019-04-01       Impact factor: 10.005

5.  The Reverse Transcriptase/RNA Maturase Protein MatR Is Required for the Splicing of Various Group II Introns in Brassicaceae Mitochondria.

Authors:  Laure D Sultan; Daria Mileshina; Felix Grewe; Katarzyna Rolle; Sivan Abudraham; Paweł Głodowicz; Adnan Khan Niazi; Ido Keren; Sofia Shevtsov; Liron Klipcan; Jan Barciszewski; Jeffrey P Mower; André Dietrich; Oren Ostersetzer-Biran
Journal:  Plant Cell       Date:  2016-10-19       Impact factor: 11.277

6.  Forks in the tracks: Group II introns, spliceosomes, telomeres and beyond.

Authors:  Rajendra Kumar Agrawal; Hong-Wei Wang; Marlene Belfort
Journal:  RNA Biol       Date:  2016-10-11       Impact factor: 4.652

7.  LocalSTAR3D: a local stack-based RNA 3D structural alignment tool.

Authors:  Xiaoli Chen; Nabila Shahnaz Khan; Shaojie Zhang
Journal:  Nucleic Acids Res       Date:  2020-07-27       Impact factor: 16.971

8.  A group II intron-encoded protein interacts with the cellular replicative machinery through the β-sliding clamp.

Authors:  Fernando M García-Rodríguez; José L Neira; Marco Marcia; María D Molina-Sánchez; Nicolás Toro
Journal:  Nucleic Acids Res       Date:  2019-08-22       Impact factor: 16.971

9.  Small molecule-RNA targeting: starting with the fundamentals.

Authors:  Amanda E Hargrove
Journal:  Chem Commun (Camb)       Date:  2020-11-26       Impact factor: 6.222

Review 10.  Structural Insights into the Mechanism of Group II Intron Splicing.

Authors:  Chen Zhao; Anna Marie Pyle
Journal:  Trends Biochem Sci       Date:  2017-04-21       Impact factor: 13.807

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.