| Literature DB >> 35229468 |
Anna Baud1, Magdalena Derbis1, Katarzyna Tutak1, Krzysztof Sobczak1.
Abstract
Short tandem repeats are repetitive nucleotide sequences robustly distributed in the human genome. Their expansion underlies the pathogenesis of multiple neurological disorders, including Huntington's disease, amyotrophic lateral sclerosis, and frontotemporal dementia, fragile X-associated tremor/ataxia syndrome, and myotonic dystrophies, known as repeat expansion disorders (REDs). Several molecular pathomechanisms associated with toxic RNA containing expanded repeats (RNAexp ) are shared among REDs and contribute to disease progression, however, detailed mechanistic insight into those processes is limited. To deepen our understanding of the interplay between toxic RNAexp molecules and multiple protein partners, in this review, we discuss the roles of selected RNA-binding proteins (RBPs) that interact with RNAexp and thus act as "partners in crime" in the progression of REDs. We gather current findings concerning RBPs involved at different stages of the RNAexp life cycle, such as transcription, splicing, transport, and AUG-independent translation of expanded repeats. We argue that the activity of selected RBPs can be unique or common among REDs depending on the expanded repeat type. We also present proteins that are functionally depleted due to sequestration on RNAexp within nuclear foci and those which participate in RNAexp -dependent innate immunity activation. Moreover, we discuss the utility of selected RBPs as targets in the development of therapeutic strategies. This article is categorized under: RNA Interactions with Proteins and Other Molecules > Protein-RNA Interactions: Functional Implications RNA in Disease and Development > RNA in Disease.Entities:
Keywords: RAN translation; RNA binding proteins; liquid-liquid phase separation; repeat expansion; short tandem repeats
Mesh:
Substances:
Year: 2022 PMID: 35229468 PMCID: PMC9539487 DOI: 10.1002/wrna.1709
Source DB: PubMed Journal: Wiley Interdiscip Rev RNA ISSN: 1757-7004 Impact factor: 9.349
FIGURE 1Characteristics of expanded STRs specific for different diseases. (a) Localization and size of STRs within specific gene regions. Expanded STRs, depending on the sequence, are located in different parts of the gene. Size of expansion of STRs necessary for the development of individual REDs differ between diseases, however, it may be roughly specified that the longest expansions are located within introns and further within 3′UTRs, middle‐size expansions within 5′UTRs and the shortest within exons, and thus with protein‐coding sequences. Here we show representative REDs from the larger group of diseases. (b) Structures formed by RNA. Trinucleotide CNG repeats form RNA hairpin structures, all characterized by high thermodynamic stability, the highest for CGG, next CAG, and the lowest for CUG repeats. Hairpin structures are also formed by CCUG and G4C2 repeats. Moreover, G4C2 and CGG repeats are able to form G‐quadruplex structure. (c) Protein products derived from repeat‐associated non‐AUG (RAN) translation. RAN translation may potentially start and produce proteins in all three reading frames. In the process of RAN translation of trinucleotide repeats, the homopolymeric proteins with tracts of single amino acids are biosynthesized. CCUG tetranucleotide repeats are RAN translated to proteins containing tracts of four amino acids, the same in all reading frames. RAN translation of G4C2 hexanucleotide repeats is the source of DPRs composed of tracts of two amino acids repeats, one of which is glycine in all reading frames
Proteins implicated in pathomechanisms of repeat expansion diseases (REDs)
| Process | RNAexp | Protein | Protein name | UniProt ID | Effect/method | |
|---|---|---|---|---|---|---|
| Transcription | CAG | SUPT4H1 | Transcription elongation factor SPT4 | P63272 (human) | KO in mouse reduced production of mutated huntingtin | (C. R. Liu et al., |
| CAG | SPT4‐A | Transcription elongation factor SPT4‐A | P63271 (mouse) | KD (ASO) in mouse selectively reduced mutated mRNA and protein | (H. M. Cheng et al., | |
| G4C2; C4G2 | SUPT4H1 | Transcription elongation factor SPT4 | P63272 (human) | KO in yeast and in fly reduced RNA foci and RAN translation | (Kramer et al., | |
| G4C2 | PAF1, LEO1 | RNA polymerase II‐associated factor 1 homolog; RNA polymerase‐associated protein LEO1 | Q8N7H5 (human), Q8WVC0 (human) | KD in fly suppressed toxicity, upregulated in patients | (Goodman, Prudencio, Kramer, et al., | |
| G4C2 | AFF2/FMR2 | AF4/FMR2 family member 2 | P51816 (human) | KD in fly reduced RNAexp and RAN protein level, KO in iPSNs reduced level of | (Yuva‐Aydemir et al., | |
| GGCCUG | SPT4‐A, SPT5 | Transcription elongation factor SPT4‐A; Transcription elongation factor SPT5 | P63271 (mouse), O55201 (mouse) | KD in Neuro2A cells reduced RNA foci and RAN translation | (Furuta et al., | |
| Sequestration, co‐localization | CAG | MBNL1 | Muscleblind‐like protein 1 | Q9NR56 (human) | OE rescued splicing abnormalities in HeLa cells expressing CAGexp | (Mykowska et al., |
| G4C2 | Nucleolin | Nucleolin | P19338 (human) | Nucleolar stress in C9‐ALS/FTD patients | (Haeusler et al., | |
| G4C2 | Pur alpha | Transcriptional activator protein Pur‐alpha | Q00577 (human) | OE reduced neurodegeneration in fly, cell, and fish models. | (Swinnen et al., | |
| G4C2 | hnRNP H | Heterogeneous nuclear ribonucleoprotein H | P31943 (human) | Colocalized with RNA foci in C9‐ALS/FTD patients, missplicing of hnRNP H‐dependent transcript | (Y. B. Lee et al., | |
| G4C2 | hnRNP H1/F, ALYREF, and SRSF2 | Heterogeneous nuclear ribonucleoprotein F, THO complex subunit 4, SRSF protein kinase 2 | P52597 (human), Q86V81 (human), P78362 (human) | Colocalized with RNA foci in C9‐ALS/FTD patients | (Cooper‐Knock et al., | |
| C4G2 | SRSF1, hnRNP A1, hnRNP H/F, ALYREF | SRSF protein kinase 1, Heterogeneous nuclear ribonucleoprotein A1, Heterogeneous nuclear ribonucleoprotein F, THO complex subunit 4 | O70551 (human), P09651 (human), P52597 (human), Q86V81 (human) | Colocalized with RNA foci in C9‐ALS/FTD patients | (Cooper‐Knock et al., | |
| G4C2 | ADARB2 | Double‐stranded RNA‐specific editase B2 | Q9NS39 (human) | KD decreased RNA foci in iPSNs, colocalized with RNA foci in C9‐ALS/FTD patients | (Donnelly et al., | |
| G4C2 | ZPF106 | Zinc finger protein 106 | Q9H2Y7 (human) | OE suppressed neurotoxicity in C9‐ALS/FTD fly model | (Celona et al., | |
| G4C2 | Matrin‐3 | Matrin‐3 | P43243 (human) | OE suppressed neurotoxicity in C9‐ALS/FTD fly model | (Ramesh et al., | |
| CUG | MBNL1 | Muscleblind‐like protein 1 | Q9NR56 (human) | OE/ASO blockers complementary to RNAexp reversed DM1 phenotype in mouse model | (Kanadia et al., | |
| CUG | hnRNP H | Heterogeneous nuclear ribonucleoprotein H | P31943 (human) | Colocalized with RNA foci in DM1 patients | (D. H. Kim et al., | |
| CCUG | MBNL1 | Muscleblind‐like protein 1 | Q9NR56 (human) | Colocalized with RNA foci in DM2 patients | (Mankodi, | |
| CCUG | hnRNP core protein and snRNP Sm antigen | Small nuclear ribonucleoprotein E | P62304 (human) | Colocalized with RNA foci in DM2 patients | (Perdoni et al., | |
| CCUG | RBFOX1 | RNA binding protein fox‐1 homolog | Q9NWB1 (human) | Colocalized with RNA foci in DM2 patients | (Sellier et al., | |
| CGG | FMRpolyG/PolyG | FMRpolyG | – | Colocalized with RNAexp, cause neurotoxicity in primary neurons derived from FXTAS mouse | (Asamitsu et al., | |
| CGG | hnRNP A2/B1 | Heterogeneous nuclear ribonucleoproteins A2/B1 | P22626 (human) | OE suppressed neurotoxicity in FXTAS fly model | (Sofola et al., | |
| CGG | CELF1 | CUGBP Elav‐like family member 1 | Q92879 (human) | OE suppressed neurotoxicity in FXTAS fly model | (Sofola et al., | |
| CGG | Pur alpha | Transcriptional activator protein Pur‐alpha | Q00577 (human) | OE suppressed neurotoxicity in FXTAS fly model | (Jin et al., | |
| CGG | TRA2A | Transformer‐2 protein homolog alpha | Q13595 (human) | Colocalized with RNAexp foci in FXTAS cellular model and with inclusions in FXTAS mouse model and patients | (Cid‐Samper et al., | |
| CGG | DROSHA‐DGCR8 | Ribonuclease 3 and Microprocessor complex subunit DGCR8 | Q9NRR4 (human), Q8WYQ5 (human) | OE suppressed neurotoxicity in FXTAS cellular model, miRNA processing is impaired in FXTAS patients | (Sellier et al., | |
| CGG | SAM68 | KH domain‐containing, RNA‐binding, signal transduction‐associated protein 1 | Q07666 (human) | Colocalized with RNAexp foci in FXTAS cellular model, SAM68‐dependent alternative splicing is impaired in FXTAS patients | (Sellier et al., | |
| Nucleocytoplasmic transport | CAG | U2AF65 | Splicing factor U2AF 65 kDa subunit | P26368 (human) | KD in fly enhanced RNAexp toxicity and accumulation, reduction in HD mouse caused RNAexp nuclear accumulation | (Tsoi et al., |
| G4C2 | RanGAP1 | Ran GTPase‐activating protein 1 | P46060 (human) | OE suppressed RNAexp toxicity in C9‐ALS/FTD fly, OE rescued impairment of NCT in C9‐ALS iPSNs | (K. Zhang et al., | |
| G4C2; CGG | SRSF1 | Serine/arginine‐rich splicing factor 1 | O70551 (human) | KD reduced nuclear export of mutated G4C2exp/CGGexp transcripts in cells and neurotoxicity in C9‐ALS/FTD iPSNs and in FXTAS & C9‐ALS/FTD fly models | (Hautbergue et al., | |
| G4C2 | POM 121 | Nuclear envelope pore membrane protein POM121 | Q8TEM1 (human) | OE rescued expression of nuclear pore components and NCT impairment in C9‐ALS/FTD iPSNs | (Coyne et al., | |
| CUG | hnRNP H | Heterogeneous nuclear ribonucleoprotein H | P31943 (human) | KD reduced nuclear retention of RNAexp in cells | (D. H. Kim et al., | |
| CUG | Staufen1 | Double‐stranded RNA‐binding protein Staufen homolog 1 | O95793 (human) | OE reduced nuclear retention of RNAexp in cells | (Ravel‐Chapuis et al., | |
| RAN translation | G4C2; CGG | eIF4A | Eukaryotic initiation factor 4A‐I | P60842 (human) | Inhibition reduced RAN translation in vitro | (Green et al., |
| CGG; G4C2 | eIF4B and eIF4H | Eukaryotic translation initiation factor 4B and Eukaryotic translation initiation factor 4H | P23588 and Q15056 (human) | KD reduced RAN translation in C9‐ALS/FTD fly model | (Goodman, Prudencio, Srinivasan, et al., | |
| CGG | DDX3X | ATP‐dependent RNA helicase DDX3X | O00571 (human) | KD reduced RAN translation in FXTAS fly model | (Linsalata et al., | |
| G4C2 | DDX3X | ATP‐dependent RNA helicase DDX3X | O00571 (human) | KD enhanced RAN translation in C9‐ALS/FTD fly model and in C9‐ALS/FTD iPSNs | (W. Cheng et al., | |
| G4C2; CGG | DHX36 | ATP‐dependent DNA/RNA helicase DHX36 | Q9H2U1 (human) | KD reduced RAN translation in C9‐ALS/FTD iPSNs | (H. Liu, Lu, Paul, Periz, Banco, Ferré‐D'Amaré, et al., | |
| CGG | eIF1 and eIF5 | Eukaryotic translation initiation factor 1 and Eukaryotic translation initiation factor 5 | P41567 and P55010 (human) | Modulated RAN translation in vitro | (Linsalata et al., | |
| CAG; G4C2 |
eIF3F | Eukaryotic translation initiation factor 3 subunit F | O00303 (human) | KD reduced RAN translation in vitro | (Ayhan et al., | |
| CGG | 5MP | eIF5‐mimic protein (also known as BZW2) | E9PFD4 (human) | OE reduced RAN translation in FXTAS fly model | (Singh et al., | |
| G4C2 | eIF2D | Eukaryotic translation initiation factor 2D | P41214 (human) | KD reduced RAN translation in | (Sonobe et al., | |
| G4C2; CAG | RPS25 | 40S ribosomal protein S25 | P62851 (human) | KD reduced RAN translation in C9‐ALS/FTD fly model and in C9‐ALS/FTD iPSNs | (Yamada et al., | |
| Stress response | G4C2; CAG; CCTG; CAGG | PKR | Interferon‐induced, double‐stranded RNA‐activated protein kinase | P19525 (human) | Inhibition of PKR reduced RAN translation in C9‐ALS/FTD mouse model | (Zu et al., |
| G4C2; CGG | eIF2α | Eukaryotic translation initiation factor 2 subunit 1 | P05198 (human) | Enhanced RAN translation when phosphorylated in stress conditions in vitro | (W. Cheng et al., | |
| G4C2; CAGG; CCTG | eIF2A | Eukaryotic translation initiation factor 2A | Q9BY44 (human) | KO reduced RAN translation in vitro | (Sonobe et al., | |
| G4C2 | PERK | Eukaryotic translation initiation factor 2‐alpha kinase 3 | Q9NZJ5 (human) | Elevated activity in response to accumulation of RAN peptides increases RAN translation | (Zu et al., | |
| CUG/CAG | Dicer | Endoribonuclease Dicer | Q9UPY3 (human) | Cleaves RNA repeat regions into ~21 nt fragments | (Krol et al., | |
| CAG/CUG | ADAR1 | Double‐stranded RNA‐specific adenosine deaminase | P55265 (human) | Co‐expression of ADAR1 with (CAG/CUG)100 dsRNA rescued repeat‐related pathology in Drosophila | (van Eyk et al., | |
| CAG/CUG | TLR | Toll‐like receptors | KD of TLRs in | (Samaraweera et al., | ||
| MLO formation (Phase separation) | CAG | SRSF2 | Serine/arginine‐rich splicing factor 2 | Q01130 (human) | Marker of nuclear speckles, colocalized with RNAexp foci | (Jain & Vale, |
| G4C2 | SRSF2 | Serine/arginine‐rich splicing factor 2 | Q01130 (human) | Marker of nuclear speckles, colocalized with RNAexp foci | (Jain & Vale, | |
| G4C2 | G3BP1, Caprin1, USP10, eIF3b, ELAVL1, TIAR | Ras GTPase‐activating protein‐binding protein 1, Caprin‐1, Ubiquitin carboxyl‐terminal hydrolase 10, Eukaryotic translation initiation factor 3 subunit B, ELAV‐like protein 1, Nucleolysin TIAR | Q13283 (human), Q14444 (human), Q14694 (human), P55884 (human), Q15717 (human) Q01085 (human) | Markers of stress granules, condensed in vitro with RNAexp and lysates from cell lines and mouse brain | (Fay et al., | |
| G4C2 | FMRP | Synaptic functional regulator FMR1 | Q06787 (human) | Marker of transport granules, colocalized with RNAexp, FMRP‐dependent translation regulation was impaired in C9‐ALS/FTD iPSNs | (Burguete et al., |
Abbreviations: ASO, antisense oligonucleotides; iPSNs, induced pluripotent stem cells‐derived neurons; KD, knockdown; KO, knockout; MLO, membraneless organelles; NCT, nucleocytoplasmic transport; OE, overexpression.
FIGURE 2Nuclear processing and accumulation of RNAexp molecules. (a) Transcription. DSIF and PAF1 complexes promote transcription of repeat expansion regions through inhibition of the formation of DNA secondary structures and R‐loops (described for CAGexp and G4C2exp). (b) Splicing. The majority of pre‐mRNAs with expanded repeats undergo correct splicing, however, in some parts of mature transcripts intron retention takes place (described for CCUGexp and G4C2exp). Moreover, G4C2exp‐containing spliced intron is stabilized in a circular form. Bold line, exon; thin line, intron. (c) Sequestration. RNAexp molecules accumulate in nuclei where they bind multiple RBPs and sequester some of them and form RNAexp foci (described for CAGexp, CUGexp, CCUGexp, CGGexp, G4C2exp). (a–c) Arrow with a dotted line, change in place and/or in time; solid lines show induction or inhibition of certain processes
FIGURE 3Involvement of RNAexp in nucleocytoplasmic transport (NCT). (a) Impairment of NCT by RNA. Gradient of RanGDP/RanGTP proteins between nucleus and cytoplasm, supported by RanGAP1, enables proper NCT. Binding of G4C2exp to RanGAP1 leads to impaired import of nuclear proteins, exemplified by TDP‐43. (b) Export of RNA. Nuclear export adaptor SRSF1 binds to RNA with G4C2exp and C4G2exp and supports its transport to cytoplasm through NXF1‐dependent pathway. NXF1 and its cofactor NXT1 participate also in export of circular RNAexp (circRNAexp) derived from G4C2exp‐bearing intron lariat. (a,b) Arrow with a dotted line, change in place and/or in time; solid lines show induction or inhibition of certain processes
FIGURE 4Processes involving RBPs and RNAexp in cytoplasm. (a) The role of eIF2D in RAN translation initiation. Non‐canonical translation initiation factor eIF2D delivers Met‐tRNA to the P‐site of 40S ribosomal subunit at CUG codon contributing to RAN translation initiation (described for G4C2exp). A (aminoacyl) site, P (peptidyl) site, E (exit) site in the ribosome. (b) Elongation of RAN translation. DHX36 helicase unwinds G‐quadruplexes formed by RNAexp and thus facilitates ribosome processivity and production of toxic homopolymeric proteins or dipeptide repeat proteins (described for G4C2exp and CGGexp). (c) RAN translation initiation upon stress. Stress related to the presence of double‐stranded RNA (dsRNA) formed by RNAexp and RAN proteins activates PKR and PERK kinases, respectively, which catalyze phosphorylation of eIF2α. This, in turn, inhibits eIF2α‐P binding to Met‐tRNA and the formation of preinitiation complex (PIC) with 40S ribosome subunit. Under the stress, eIF2A may take over role of phosphorylated eIF2α‐P, bind Met‐tRNA and participate in translation initiation at near‐cognate start codons (described for G4C2exp and CCUGexp). (d) Stress response caused by RNA. RNAi pathway component, ribonuclease Dicer, cleaves dsRNA formed by RNAexp hairpin or bidirectionally transcribed CUGexp/CAGexp duplex into 21‐mer fragments. Such RNA fragments may next activate TLRs and trigger innate immune response. (e) Stress granules formation. Stress stimuli such as RNAexp and RAN proteins lead to phosphorylation of eIF2α, followed by global translation suppression and formation of stress granules (SG; described for CGGexp and G4C2exp). Moreover, G4C2exp may serve as a core component of SG and promote formation of membraneless organelles composed of different mRNAs and SG protein markers. Chronic stress may entail SG transition towards more solid‐like structures. (f) Phase separation of RNA. Homopolymeric RAN protein, polyG, binds to its own RNAexp what promotes its phase transition from liquid droplets towards gel‐like aggregates (described for CGGexp). (a–c) arrow with a dotted line, change in place and/or in time; solid lines show induction or inhibition of certain processes