Dahai Luo1, Andrew Kohlway, Anna Marie Pyle. 1. Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, CT, USA. dahai.luo@yale.edu
Abstract
Double-stranded RNAs are an important class of functional macromolecules in living systems. They are usually found as part of highly specialized intracellular machines that control diverse cellular events, ranging from virus replication, antiviral defense, RNA interference, to regulation of gene activities and genomic integrity. Within different intracellular machines, the RNA duplex is often found in association with specific RNA-dependent ATPases, including Dicer, RIG-I and DRH-3 proteins. These duplex RNA-activated ATPases represent an emerging group of motor proteins within the large and diverse super family 2 nucleic acid-dependent ATPases (which are historically defined as SF2 helicases). The duplex RNA-activated ATPases share characteristic molecular features for duplex RNA recognition, including motifs (e.g., motifs IIa and Vc) and an insertion domain (HEL2i), and they require double-strand RNA binding for their enzymatic activities. Proteins in this family undergo large conformational changes concomitant with RNA binding, ATP binding and ATP hydrolysis in order to achieve their functions, which include the release of signaling domains and the recruitment of partner proteins. The duplex RNA-activated ATPases represent a distinct and fascinating group of nanomechanical molecular motors that are essential for duplex RNA sensing and processing in diverse cellular pathways.
Double-stranded RNAs are an important class of functional macromolecules in living systems. They are usually found as part of highly specialized intracellular machines that control diverse cellular events, ranging from virus replication, antiviral defense, RNA interference, to regulation of gene activities and genomic integrity. Within different intracellular machines, the RNA duplex is often found in association with specific RNA-dependent ATPases, including Dicer, RIG-I and DRH-3 proteins. These duplex RNA-activated ATPases represent an emerging group of motor proteins within the large and diverse super family 2 nucleic acid-dependent ATPases (which are historically defined as SF2 helicases). The duplex RNA-activated ATPases share characteristic molecular features for duplex RNA recognition, including motifs (e.g., motifs IIa and Vc) and an insertion domain (HEL2i), and they require double-strand RNA binding for their enzymatic activities. Proteins in this family undergo large conformational changes concomitant with RNA binding, ATP binding and ATP hydrolysis in order to achieve their functions, which include the release of signaling domains and the recruitment of partner proteins. The duplex RNA-activated ATPases represent a distinct and fascinating group of nanomechanical molecular motors that are essential for duplex RNA sensing and processing in diverse cellular pathways.
Entities:
Keywords:
ATPases; Dicer; RIG-I like receptors; RNAi; antiviral immunity; helicase; molecular motor
Many cellular processes result in the production of double-stranded RNA molecules, including transcription of convergent cellular genes or mobile genetic elements, self-annealing of cellular transcripts and the replication of common RNA viruses. Duplex RNAs are important for numerous cellular functions, including gene regulation, chromatin remodeling, antiviral defense and maintenance of genomic integrity., Most of these processes involve the interaction of double-stranded RNAs with conserved and highly specialized intracellular machines. Well-characterized examples include Dicer and the RIG-I like receptors, as well as Dicer-like RNA helicases 1 and 3 (DRH-1 and -3), which are two mechanical proteins involved in the RNA interference (RNAi) pathway in worms., Although there are fundamental differences between these proteins, they share a similar, highly conserved motor domain that is essential for duplex RNA sensing, signaling and processing. This domain is similar in sequence and form, if not function, to the helicase domain that is found in many DNA and RNA remodeling proteins.,Helicases have been classically defined as enzymes that couple ATP hydrolysis to the unwinding of nucleic acid duplexes, and they were originally phylogenetically grouped into families based on sequence conservation rather than function. However, these family members were subsequently shown to have diverse mechanical functions, of which duplex unwinding is only one type of activity. Therefore, these enzymes are now commonly referred to as nucleic acid remodeling proteins or, perhaps more correctly, as nucleic acid-dependent ATPases., Other classifications have grouped these proteins by their nucleic acid target (RNA or DNA), the nucleic acid strandness (α for single stranded NA or β for double stranded NA) and the translocation polarity on the nucleic acid (A for 3′ to 5′ or B for 5′ to 3′) as defined by Wigley et al. Sequence and structure analysis have revealed a common arrangement of conserved motifs for the Superfamily 1 and 2 (SF1 and 2) nucleic acid dependent ATPases. In these proteins, two conserved RecA-like domains lie against each other, forming a cleft that binds and hydrolyzes ATP, thereby serving as the catalytic core. This ATPase core includes conserved motifs Q, I, II and VI (Fig. 1A), which are aligned and rigidified through binding of RNA along the surface of the RecA folds. Conserved motifs Ia, Ib, Ic, IV, IVa, V and Vb mediate RNA binding, while motifs III and Va help to couple nucleic acid binding with ATP hydrolysis. Despite the high degree of conservation in both RNA binding and ATPase motifs, SF1 and SF2 proteins have unique functions and are usually not interchangeable. Specialization in mechanical function and the presence of accessory domains makes each nucleic acid-dependent ATPase unique. These enzymes are involved in every aspect of nucleic acid metabolism in all living organisms and viruses.,- Because of their conserved molecular functions, they are also heavily involved in genetic, autoimmune and infectious diseases, and they are potential targets for drug discovery.
Figure 1. DRAs and related nucleic acid-dependent ATPases. (A) The conserved SF2 ATPase/helicase core. The figure is prepared from eIF4aIII, which is a component of the exon junction complex (PDB: 2J0S). Positions of the characteristic motifs are highlighted. (B) Domain organization of DRAs. Domains are not to the scale. C-terminal regions that resemble the ATPase domains of Dicer and FANCM-like proteins are simplified and are not labeled. (C) Schematic cladogram showing DRAs within the SF2 family of proteins, specifically the DEAD box family and double stranded nucleic acid binding ATPases. The alignment and family trees were determined with the UGENE software package. The multiple sequence alignment was run with T-Coffee on the core ATPase/helicase domains listed in Table 1 and the family tree was determined using the PHYLIP Neighbor Joining method with the Jones-Taylor-Thornton distance matrix. Pair-wise sequence identity for the ATPase core regions of DRAs range from the highest 42% (hsMDA5: hsLGP2) and 36% (hsRIG-I: hsMDA5), to 26% (hsRIG-I: ceDRH-1) and 21% (ceDRH-1: ceDRH-3), with the lowest 14% between hsDicer1 and ceDRH-3.
Figure 1. DRAs and related nucleic acid-dependent ATPases. (A) The conserved SF2ATPase/helicase core. The figure is prepared from eIF4aIII, which is a component of the exon junction complex (PDB: 2J0S). Positions of the characteristic motifs are highlighted. (B) Domain organization of DRAs. Domains are not to the scale. C-terminal regions that resemble the ATPase domains of Dicer and FANCM-like proteins are simplified and are not labeled. (C) Schematic cladogram showing DRAs within the SF2 family of proteins, specifically the DEAD box family and double stranded nucleic acid binding ATPases. The alignment and family trees were determined with the UGENE software package. The multiple sequence alignment was run with T-Coffee on the core ATPase/helicase domains listed in Table 1 and the family tree was determined using the PHYLIP Neighbor Joining method with the Jones-Taylor-Thornton distance matrix. Pair-wise sequence identity for the ATPase core regions of DRAs range from the highest 42% (hsMDA5: hsLGP2) and 36% (hsRIG-I: hsMDA5), to 26% (hsRIG-I: ceDRH-1) and 21% (ceDRH-1: ceDRH-3), with the lowest 14% between hsDicer1 and ceDRH-3.
Table 1. SF2 nucleic acid-dependent ATPases listed in this study
Notes: Members of DRA family are boxed and bolded. a Abbreviations: ce, Caenorhabditis elegans; dm, Drosophila melanogaster; ec, Escherichia coli; hs, Homo sapiens; sc, Saccharomyces cerevisiae; pf, Pyrococcus furiosus. b Sequence ID refers to the protein sequence taken from the UniProt Knowledgebase (www.uniprot.org). c PDB Code refers to the structures available from Protein Data Bank (www.rcsb.org).
Recently, significant progress has been made in our understanding of RNA-dependent ATPases, including the identification and characterization of new examples like DRH-1 and DRH-3, from nematodes and new structural and functional insights of known cases like Dicer and RIG-I. In this review, we focus on this emerging group of specialized RNA-dependent ATPases that include Dicer, a ribonuclease that plays an essential role in miRNA and siRNA biogenesis; RIG-I, an intracellular surveillance protein for detecting pathogenic RNA; and DRH-3, Dicer-related-helicase-3, a component of the siRNA pathway from Caenorhabditis elegans., The mechanistic feature shared by all these proteins [hereafter named Duplex RNA-activated ATPases (DRAs)] is that dsRNA is required to stimulate their ATPase activity and thereby activate all subsequent functions, which is in sharp contrast to other SF2 proteins that are specifically activated by single-stranded RNA. Further, unlike the bona fide RNA helicases, DRAs are unlikely to display RNA unwinding activity., Rather, the conformational changes that occur upon binding to RNA and ATP are coupled to other processes, such as the release of signaling domains and binding to partner proteins. Here we review the discovery of the DRAs, highlight recent advances in understanding of their function and discuss how this is related to their structural features.
Comparison of DRAs with Related Mechanical Proteins
DRA proteins are phylogenetically classified as a subgroup within Helicase Superfamily 2 (known as SF2 proteins, Fig. 1C), and present a core ATPase domain that is very similar in both sequence and structure to the DEAD box family ATPases/Helicases. Unlike DEAD box proteins, DRAs contain a unique α-helical insertion domain (HEL2i) within the second RecA fold of the core ATPase/Helicase domain. Structural studies have shown that this adaptation is important for duplex RNA binding (Fig. 1B
and C).- As the closest phylogenetic relatives of of RIG-I, innate immune sensors MDA5 and LGP2 are more similar to each other than to RIG-I, although there are conflicting reports on which of these proteins should be considered the evolutionary precedent of the others., A constructed family tree of SF2 proteins from several subgroups suggests that DRH-1 is more closely related to RIG-I than to Dicer (Fig. 1C), underscoring the difficulties in naming these proteins based on functional associations.Perhaps most significant given their function, DRAs are most closely related to proteins that function not on RNA, but on double-stranded DNA (Fig. 1C). The DRAs are relatives of the FANCM family of proteins that function during DNA repair, and these include Hef, FANCM, Mph1 and Fml1 (http://www.rnahelicase.org/rig.htm database) (Fig. 1 and Table 1). Members of the FANCM family bind dsDNA and contain a similar α-helical HEL2i insertion as the DRAs. Other closely related dsDNA binding proteins include members of the Swi/Snf, RecG and the T1R families (Fig. 1C and Table 1). Swi/Snf proteins have a six α helix motif that is inserted within two parts of the HEL2 domain. This motif interacts with the 5′ strand of dsDNA and occupies the same position relative to HEL1 and HEL2 as the Hef and the RIG-I HEL2i domain.- Interestingly, RecG proteins have a TRG motif (ranslocation in ec) located immediately after motif VI that forms a helical hairpin proposed to be a transmission system for driving double-stranded translocation. This hairpin appears to be analogous to the pincer domain of RIG-I and other DRAs, although the pincer domain is significantly longer and more complex. Similarly, members of the prokaryotic T1R family have an α-helical domain at their C terminus that plays a role in recognizing foreign dsDNA, either from transmissible plasmids or from phages.Notes: Members of DRA family are boxed and bolded. a Abbreviations: ce, Caenorhabditis elegans; dm, Drosophila melanogaster; ec, Escherichia coli; hs, Homo sapiens; sc, Saccharomyces cerevisiae; pf, Pyrococcus furiosus. b Sequence ID refers to the protein sequence taken from the UniProt Knowledgebase (www.uniprot.org). c PDB Code refers to the structures available from Protein Data Bank (www.rcsb.org).In addition to the conserved ATPase/Helicase core and the specialized RNA recognition and transduction domains, DRAs contain two semi-conserved motifs that contribute to binding of double-stranded nucleic acid (Fig. 2). In these proteins, motifs IIa and Vc form contacts with the 5′-3′ “second strand,” which contrasts with the other nucleic acid binding motifs that contact only the 3′-5′ “tracking strand” that is bound by all translocative helicases. Motif IIa in particular was initially noted in the Sulfolobus solfataricus Rad54 structure and later within the motor subunit of the T1R restriction modification enzyme, EcoR124, from Escherichia coli., Not surprisingly, motif IIa also appears to be present in members of the DEAD box family, and its functional role in duplex RNA binding is supported by recent structural studies of DEAD-box protein Mss116 bound to duplex RNA. Motifs IIa and Vc are primarily conserved structurally rather than in primary sequence (Fig. 2).,, Although there are semi-conserved lysines and asparagines in motif IIa and a semi-conserved asparagine in motif Vc, the majority of the contacts made with nucleic acids involve the peptide backbone.
Figure 2. Sequence and structural features of RIG-I that contribute to duplex RNA recognition. (A) Sequence alignment of DRAs and other SF2 proteins. Notice motifs IIa and Vc (boxed in dotted lines) are not very conserved in amino acid sequence. (B) HEL2i domain juxtaposes with the duplex RNA backbone (PDB codes: 3TMI in green; 2YKG in yellow; 4A36, in magenta). (C) Specialized motifs IIa and Vc recognize the top strand of the duplex RNA (Botton strand or tracking strand is the strand nucleic acid that binds to the SF2α proteins; Top strand is the complementary strand). (D) Possible structural conservation of motif IIa and motif Vc found in DEAD-box RNA family members. Figure shows the aligned structures of DEAD-box protein:ssRNA complexes with duplex RNA (PDB codes: 2J0S, 3I5X, 3G0H and 2DB3). The possible presence of motif IIa and Vc in DEAD-box proteins are labeled in parenthesis.
Figure 2. Sequence and structural features of RIG-I that contribute to duplex RNA recognition. (A) Sequence alignment of DRAs and other SF2 proteins. Notice motifs IIa and Vc (boxed in dotted lines) are not very conserved in amino acid sequence. (B) HEL2i domain juxtaposes with the duplex RNA backbone (PDB codes: 3TMI in green; 2YKG in yellow; 4A36, in magenta). (C) Specialized motifs IIa and Vc recognize the top strand of the duplex RNA (Botton strand or tracking strand is the strand nucleic acid that binds to the SF2α proteins; Top strand is the complementary strand). (D) Possible structural conservation of motif IIa and motif Vc found in DEAD-box RNA family members. Figure shows the aligned structures of DEAD-box protein:ssRNA complexes with duplex RNA (PDB codes: 2J0S, 3I5X, 3G0H and 2DB3). The possible presence of motif IIa and Vc in DEAD-box proteins are labeled in parenthesis.By contrast, RNA-dependent ATPases that function as helicase enzymes preferentially bind single stranded nucleic acid before unwinding adjacent duplex regions. Two distinct mechanisms of unwinding have been proposed, and these include melting of the RNA backbone through local distortions of the A-form duplex RNA, as proposed for the DEAD box family of helicases including Ded1p and Mss116p, and displacement of adjacent duplex strands during translocation, as shown for the viral SF2 DExH helicases including NS3 and NPH-II., In contrast to these bona fide helicases, DRAs preferentially bind RNA duplex instead of single stranded regions,,, and in all existing structures of RNA-RIG-I complexes, the duplex RNA maintains an undistorted A-form conformation.-, Furthermore, the crucial β-hairpin motif that participates in strand separation by DExH helicases is missing in RIG-I and other DRAs.,, Therefore, because DRAs have no structural features designed to disrupt duplex RNA, and no structural motifs for coupling translocation with strand separation, it is not surprising that DRAs have not yet been shown to function as unwindases.
The Molecular and Structural Biology of DRAs
Structural studies on nucleic acid-dependent ATPases are hampered by the intrinsic flexibility that arises from their function as molecular motors. In these proteins, the two conserved Rec-A like domains are loosely connected in the absence of nucleic acid and ATP. This is particularly true for DEAD box proteins, which tend to be captured crystallographically only in the presence of both ssRNA and ATP analogs,- thereby limiting our understanding of their functional cycles. Structural studies of DRAs face the same challenges as those focused on DEAD box proteins. Adding to this difficulty, DRAs are large multi-domain proteins with several moving parts that usually function within even larger protein complexes. Nevertheless, recent cryo-electron microscopy studies of Dicer- and crystallographic studies on RIG-I have advanced our understanding of the biological function and mechanical properties of DRA proteins.,-
Dicer: the small RNA processing machine
Ever since Fire et al., published the groundbreaking paper on RNA interference (RNAi), great strides have been made in understanding the biogenesis and functional mechanisms of the small RNAs that facilitate dsRNA-mediated gene regulation.,- There are two major types of small RNAs: microRNA (miRNA) and small interfering RNA (siRNA). While these two RNAs differ in their pathway of biogenesis, they share similarities in function. Of central importance to the RNAi pathway is the formation of an RNA-induced silencing complex (RISC),, which binds to the target mRNA and results in downregulation of gene expression by either RNA degradation or translational arrest., RISC is assembled from the RISC-loading complex (RLC), which includes dsRNA, Ago, Dicer and other additional dsRNA binding and accessory proteins.,Dicer plays a major role in gene regulation by processing dsRNA precursors into short fragments that are used to target the silencing of specific genes.- Dicer cleaves long duplex precursor RNAs (pre-miRNA or pre-siRNA) into short miRNA and siRNA fragments and then loads the correct “guide” strand into RISC. Dicer received its name because of its dsRNA cleavage, or “dicing” activity. Phylogenetically, Dicer is a class III RNase, members of which are conserved among eukaryotic species. In humans, there is only one Dicer protein, hsDicer1, however Drosophila and plants contain two and four Dicer proteins respectively. HumanDicer1 mutations have been found in various cancer syndromes,- emphasizing its fundamental roles in gene regulation. In general, all Dicer and Dicer like proteins (DCL) from eukaryotic species share a similar domain architecture, containing a SF2 RNA-dependent ATPase domain at the N terminus, a DUF283 domain (Domain of Unknown Function), a PAZ domain, two tandem RNase III domains and a dsRNA-binding domain (dsRBD) at the C terminus,(Fig. 1B). The first structural insights into Dicer components came from a crystal structure of a Dicer homolog obtained from the unicellular eukaryote Giardia intestinalis. This structure revealed a specific spatial arrangement of the PAZ domain relative to the two RNaseIII domains, suggesting that Dicer contains a molecular ruler that enables it to generate dsRNA fragments of specific length. Unfortunately, unlike Dicer genes from other organisms, Giardia intestinalisDicer does not contain an ATPase motor domain.The ATPase motor domain of Dicer is highly conserved across species and it is phylogenetically distinguishable as a DRA protein (Fig. 1C). The precise biochemical function of the motor domain is still unclear, and it is not yet known whether the RNA-dependent ATPase activity is actually linked to duplex unwinding, and whether the motor domain behaves like a helicase. Recent studies have indicated that it plays a role in helping to select the “guide strand” from the two duplex strands that are initially bound within the RISC-loading complex. This is accomplished by sensing thermodynamic features of the RNA duplex, and determining which terminus is more easily opened. The selected siRNA guide strand will then be loaded into the Ago protein, resulting in formation of a functional RISC complex.The overall three-dimensional architecture and domain organization of Dicer is well conserved among orthologs., Dicer adopts an L-shape as determined by negative-stain electron microscopy (EM). Using a streptavidin tagging method and domain deletion constructs, Lau et al. accurately located the position of the motor domain at the base of the L shaped structure (Fig. 3). Furthermore, when the motor domain of the RIG-I was docked into the EM structure of Dicer, the RNA binding interfaces of the motor domain and the RNase III domain creates an adjacent central RNA binding groove. A complex between Dicer and its TRBP (TAR RNA Binding Protein, an accessory protein of Dicer and a dsRBD) forms a similar L shape with a long edge of 150 Ǻ and a 100 Ǻ extension at the bottom end. Because of the small size and intrinsic flexibility of TRBP, it is difficult to accurately assign its location, particularly in the absence of a siRNA or miRNA substrate. A low resolution EM structure of the human RISC-loading complex (containing Dicer, AGO2 and TRBP in a 1:1:1 stoichiometric ratio) was obtained by crossing-linking the complex. In the resulting model, AGO2 was proposed to interact with the C-terminal region of Dicer. The RNA binding site of AGO2 is located in close proximity to the C-terminal region of Dicer, pointing away from the N-terminal motor domain. This model is consistent with biochemical data suggesting that the motor domain of Dicer may not be required for loading mature siRNA into the AGO2. Two discrete conformations of the Dicer motor domain have been identified, suggesting that it may adopt multiple conformations on dsRNA. This structural flexibility contributes to specific dsRNA recognition and may support a processive dicing mechanism (Fig. 3).,, Not surprisingly, it is similar to structural rearrangements observed when the RIG-I motor domain binds to RNA duplex.,
Figure 3. (A) Segmented map of human Dicer with crystal structures of homologous domains docked. (B) Model for pre-miRNA recognition. A pre-miRNA hairpin is modeled into the proposed binding channel of Dicer, with the stem-loop fit in the RNA-binding cleft of the protein. (C) Schematic for processive dicing in which dsRNA is translocated into the nuclease core (1). The PAZ domain (purple) recognizes the dsRNA end, positioning RNase III (orange) for cleavage (2). The siRNA product is released while the dsRNA substrate remains bound to the protein (3). Reprinted with permission from Lau et al.
Figure 3. (A) Segmented map of humanDicer with crystal structures of homologous domains docked. (B) Model for pre-miRNA recognition. A pre-miRNA hairpin is modeled into the proposed binding channel of Dicer, with the stem-loop fit in the RNA-binding cleft of the protein. (C) Schematic for processive dicing in which dsRNA is translocated into the nuclease core (1). The PAZ domain (purple) recognizes the dsRNA end, positioning RNase III (orange) for cleavage (2). The siRNA product is released while the dsRNA substrate remains bound to the protein (3). Reprinted with permission from Lau et al.
RIG-I: The innate immune sensor for viral RNA detection and defense
A diverse group of cytoplasmic surveillance proteins sensitively detect the presence of viral genomes and gene products and then initiate inflammatory responses that enable vertebrates to fight viral infections., These proteins form the foundation of our innate immune response. The RIG-I-like receptors (RLRs) are a specialized subclass of DRA proteins that detect double stranded viral RNAs in the cytoplasm and initiate a series of signaling events to elicit an antiviral response. The RLR motor proteins include RIG-I, MDA5 (Melanoma Differentiation Associated gene 5) and LGP2 (Laboratory of Genetics and Physiology 2). They were initially identified in different biological contexts and were later re-discovered to be key members in antiviral innate immunity.,, RIG-I is the most extensively studied member of the RLRs and has been demonstrated to be the major antiviral RLR. RIG-I recognizes a broad range of viruses, including negative stranded viruses, e.g., vesicular stomatitis virus, Sendai virus, influenza virus and rabies virus; positive stranded viruses such as dengue virus, Japanese encephalitis virus, West Nile virus and Hepatitis C virus; dsRNA virus (reovirus) and DNA virus (Epstein-Barr virus)., MDA5 is both structurally and functionally similar to RIG-I and complements RIG-I by recognizing a distinct set of virus RNAs although there might be some overlap. LGP2 is thought to serve as a feedback regulator but its exact function is still not clearly defined.,RIG-I contains two tandem caspase activation and recruitment domains (CARDs; CARD1 and CARD2) at its N-terminus, which mediate a downstream signaling relay; a central DRA motor domain, and a C-terminal domain (CTD) that facilitates viral RNA recognition (Fig. 1B).,-, It is commonly believed that RIG-I is inactive in resting cells and it is activated upon detection and binding of viral RNA. The activated RIG-I is believed to hydrolyze ATP and initiate a signaling cascade and type I interferon (IFN) response via the adaptor protein MAVS, also known as IPS-1, VISA or CARDIF. MAVS in turn activates several transcription factors including IRF3, IRF7 and NF-κB, and leads to the production of IFN and inflammatory cytokines.,, Moreover, RIG-I displays apoptosis-inducing properties in tumor cells., Effective therapeutic RIG-I antagonists and agonists may provide new tools for the treatment of viral infections and cancer.Recent research has focused on characterizing the molecular determinants for RNA-RIG-I recognition, the mechanisms of activation and signaling, and regulatory pathways that help control RIG-I signaling. Structural and biochemical studies on RIG-I have revealed that 5′ tri-phosphorylated blunt-ended duplex RNAs are the optimal substrate for RIG-I binding and activation.-,,, The exact length of the duplex is unclear although it is generally accepted that RIG-I recognizes RNA that is ten to hundreds of base pairs in length, while MDA5 forms filaments on longer RNA in the thousands of base pairs.,- The CTD is primarily responsible for 5′tri-phosphate recognition and both the CTD and helicase domain form critical contacts with the RNA duplex.,,,,- The latest structural studies indicate RNA binding induces a dramatic conformational change in RIG-I (Fig. 4A and B). The role of ATP binding and hydrolysis has not been determined, although mutations in the ATPase domain are clearly deleterious to function.,, Post translational modifications of RIG-I, including ubiquitination, phosphorylation and SUMOylation, have been reported to be important for its function.- Non-covalent polyubiquitin binding to the CARDs is likely to be essential for full activation of RIG-I and possibly oligomerization.,
Figure 4. Structural basis for dsRNA recognition and activation of RLRs. Models were created by aligning and merging known duck and human RIG-I structures and by taking consideration of our recent solution hydrodynamic studies on RIG-I conformational dynamics upon RNA and ATP binding. (A) Model of full length RIG-I apoenzyme based on structures of duck RIG-I (PDB: 4A2W) and the CTD (PDB: 4A2V). In the autoinhibited conformation, the N-terminal CARDs are sequestered from signaling and maintain RIG-I in an autoinhibited state. (B) RIG-I switches into a semi-closed conformation upon RNA binding. Binding of dsRNA to the CTD brings the HEL domains in contact with dsRNA (PDB: 4A2Wand 2YKG). (C) ATP binding (yellow) closes the HEL domains that would cause a clash between the CARDs and CTD (PDB: 4A2W, 4A36 and 2YKG). (D) The change in conformation upon dsRNA and ATP binding releases the CARD domains for signaling.
Figure 4. Structural basis for dsRNA recognition and activation of RLRs. Models were created by aligning and merging known duck and humanRIG-I structures and by taking consideration of our recent solution hydrodynamic studies on RIG-I conformational dynamics upon RNA and ATP binding. (A) Model of full length RIG-I apoenzyme based on structures of duck RIG-I (PDB: 4A2W) and the CTD (PDB: 4A2V). In the autoinhibited conformation, the N-terminal CARDs are sequestered from signaling and maintain RIG-I in an autoinhibited state. (B) RIG-I switches into a semi-closed conformation upon RNA binding. Binding of dsRNA to the CTD brings the HEL domains in contact with dsRNA (PDB: 4A2Wand 2YKG). (C) ATP binding (yellow) closes the HEL domains that would cause a clash between the CARDs and CTD (PDB: 4A2W, 4A36 and 2YKG). (D) The change in conformation upon dsRNA and ATP binding releases the CARD domains for signaling.In the resting state, RIG-I adopts an autoinhibited conformation in which the motor domain is sterically blocked., The CARDs are trapped in a fixed conformation relative to the HEL domains (synonymous with Rec-A folds 1 and 2) through an interaction between the second CARD and the insertion domain (HEL2i) (Fig. 4A). This conformation was speculated to inhibit both CARD1 ubiquitination by the ubiquitin E3 ligase TRIM25 and non-covalent binding of polyubiquitin to the CARDs, both of which are required for RIG-I activation. In the apoenzyme state the RNA binding surfaces of RIG-I (and particularly the CTD) are largely exposed, allowing RIG-I to search for viral RNAs. The CTD, which is connected to the HEL domain through a long and flexible pincer domain, enhances the specificity of RIG-I for tri-phosphorylated RNA.It is believed that viral activation of RIG-I signaling occurs in a carefully choreographed sequence of events. Binding of viral RNA is the initial trigger for RIG-I activation, whereupon the motor domain (comprised of HEL1, HEL2and HEL2i) of RIG-I forms a ring-shaped clamp around the sugar-phosphate backbone of the duplex and the CTD caps the helical terminus, even in the absence of a 5′ triphosphate. The tight and specific interaction of the CTD with the duplex terminus may prevent RIG-I from binding with high affinity to internal sites on the duplex-(Fig. 4B). Structural analysis suggests that binding of RIG-I to RNA alone may not be sufficient to disrupt the autoinhibitory interaction between the CARD2 and HEL2i domains, hinting that an additional trigger might be needed to activate signaling. In the crystal structures of RIG-I:dsRNA with AlFx and BeFx, ATP binding appears to bring the RIG-Ihelicase into a more closed and compact conformation relative to RIG-I structures that contain only dsRNA (Fig. 4C).,, This ATP-induced conformational change shifts the CTD and HEL2i toward each other, resulting in a clash between the CARDs and CTD (Fig. 4C). Consequently the structure is likely to reorganize, reorienting the relative positions of the CARDs and HEL2i, and potentially releasing the CARDs which makes them available for interaction with MAVS and activates the innate immune response (Fig. 4D).,In agreement with this idea, ATP is required for in vitro reconstitution of the RIG-I signaling pathway,, although ATP hydrolysis and turnover is not essential. Activation of RIG-I is therefore a tightly-regulated, multi-checkpoint process, starting with recognition of the correct RNA substrate, followed by ATP binding, and then subsequent coupled structural rearrangements that release auto-inhibition and switch RIG-I into a signaling-competent state (Fig. 4).,
DRH-3: Attenuating the siRNA pathway in Caenorhabditis elegans
A group of endogenous siRNAs, named 22G endo-siRNA, from Caenorhabditis elegans are linked to a variety of biological processes that are vital to maintaining genetic stability, including transposon silencing and chromosome segregation in germline cells.- Defects in the endo-RNAi pathway can result in many forms of genetic instability such as loss of chromosomes during mitosis, abnormal gene expression and increased sensitivity to X-ray irradiation.- These siRNAs are classified as secondary siRNA molecules because they are produced directly by RNA-dependent RNA polymerase (RdRP) transcription, without a double-stranded RNA intermediate or cleavage.,Dicer-related helicase 3 (DRH-3) is a large multi-domain, multi-functional protein that is essential for the biogenesis of these endogenous secondary siRNAs.,,, DRH-3 interacts with members of the C. elegans RNAi machinery, including Dicer (DCR-1) and the RdRP, RRF-1., A large protein (1119 amino acids, ca. 130 kDa), DRH-3 contains 3 sub-domains (Fig. 1B). These include an N-terminal domain of novel sequence that lacks a known homolog, and the central motor domain that is common to DRA proteins.,, The C-terminal domain of DRH-3 shares sequence similarity with the CTD that plays an important role in RNA and triphosphate recognition by RIG-I. Although there are no structural studies of DRH-3, a preliminary biochemical characterization has been reported. Several key features of DRH-3 are now apparent: the protein binds more strongly to dsRNA than to ssRNA; potent ATP hydrolysis by DRH-3 is only stimulated by dsRNA; DRH-3 does not have unwinding activity. DRH-1, a homolog of DRH-3, is implicated in both germline and somatic RNA interference (RNAi) pathways as well as virus sensing and viral siRNA formation, and may be an equally important target for future biochemical and structural studies.DRH-3 has a domain organization that is very similar to RIG-I. It is tempting to speculate that DRH-3 might bind to the RNA duplexes generated by the endogenous siRNA pathway and recruit signaling partners through its NTD. This speculation is further supported by the absence of helicase activity and the preference for canonical RNA duplex binding.
Concluding Remarks and Future Directions for Research on DRA Proteins
DRAs share several characteristic features that distinguish them from other groups of RNA-dependent ATPases. First, in addition to the conserved motifs that classify DRAs as SF2 RNA-dependent ATPases, DRAs contain unique motifs (e.g., motifs IIa and Vc) and domains (HEL2i) that specialize in duplex RNA recognition. Second, although the literature on DRAs is somewhat unclear on this point, DRAs do not appear to possess RNA unwinding activity and they may accomplish their biological function by simply binding duplex RNA or by translocating along the duplex without unwinding. Lastly, the DRAs discussed in this review are all part of larger protein complexes that function in duplex RNA sensing and processing.One of the most intriguing questions about DRAs is whether they, in fact, require ATP hydrolysis for function. At the present time, it is not established that DRAs require ATP binding and/or hydrolysis and, like DEAD-box proteins, they may only utilize ATP for recycling. ATPase activity is unnecessary for pre-miRNA processing by humanDicer,,, but in contrast, DrosophilaDicer-2 appears to require ATP for siRNA production., The ATPase motor domain from C. elegans DCR is required for the biogenesis of some but not all siRNAs. Evidently, there is no consensus for the function of ATP hydrolysis by Dicers from different species. RIG-I has shown a clear dependence for ATP in in vitro reconstitution systems,, but mutagenesis studies by Bamming and Horvath suggest that signaling by RIG-I and MDA5 can occur independent of ATPase enzymatic activity. To reconcile this, recent structural data suggests that ATP binding but not necessarily hydrolysis induces a conformational change on the RIG-Ihelicase domain that may eventually lead to RIG-I activation (Fig. 4). Further experiments are needed to verify this structure driven hypothesis.Although the recent crystal structures of RIG-I advance our understanding of DRAs, there are several questions that remain unanswered. One future question is whether DRAs recognize specific RNA sequences or structures. Several studies suggest that Dicer recognizes specific terminal structures of RNA as evidenced by the fact that humandicer is more efficient in processing pre-miRNA than siRNA. Furthermore, DrosophilaDicer-1 recognizes the terminal loop structure of pre-miRNAs through its motor domain, and both DrosophilaDicer-2 and C. elegansDCR-1 differentiate the end structures of long duplex RNAs for endo-siRNA processing. This suggests there might be a special structural feature in the Dicer RNA-dependent ATPase domain that is responsible for recognizing RNA ends. RIG-I specifically recognizes tri-phosphorylated RNA through its accessory CTD domain, and given that RIG-I recognizes a broad but distinct set of RNA viruses, it will be interesting to determine if RIG-I can recognize unique viral RNA sequences or structures, in addition to 5′ triphosphate and duplex RNA.Oligomerization is a variable characteristic that can be important for the function of SF2 proteins, including DRAs. To generalize, the minimum functional unit of SF2 proteins is monomeric, but there is biochemical evidence suggesting that oligomerization can enhance biological activity of certain SF2 proteins., As for DRAs, there is no indication that Dicer forms oligomers, however RIG-I has been proposed to dimerize or tetramerize upon activation. The exact molecular basis for RIG-I oligomerization is still unknown; however the downstream target of RIG-I, MAVS, forms filaments upon activation. Distinct from RIG-I, MDA5 cooperatively binds to long duplex RNAs and may form a filament-like structure itself,, which would be unique not only among DRAs but also SF2 proteins in general. It is therefore important to establish whether oligomerization obligatory for RIG-I or MDA5 function.Given the central role of DRA proteins in diverse cellular pathways that range from epigenetic regulation to the innate immune response, it will be interesting to characterize the regulatory cofactors that help to specify and control the activity of DRAs. It will also be exciting to identify new DRA proteins that have distinct molecular functions. The DRA proteins seem to be markers of interesting biology, and investigation of this protein family will continue to yield major insights into the nanomechanical features of living systems.
Authors: Noah C Welker; Derek M Pavelec; David A Nix; Thomas F Duchaine; Scott Kennedy; Brenda L Bass Journal: RNA Date: 2010-03-30 Impact factor: 4.942
Authors: Dahai Luo; Na Wei; Danny N Doan; Prasad N Paradkar; Yuwen Chong; Andrew D Davidson; Masayo Kotaka; Julien Lescar; Subhash G Vasudevan Journal: J Biol Chem Date: 2010-04-07 Impact factor: 5.157
Authors: Megan E Fitzgerald; David C Rawling; Olga Potapova; Xiaoming Ren; Andrew Kohlway; Anna Marie Pyle Journal: Nucleic Acids Res Date: 2017-02-17 Impact factor: 16.971