| Literature DB >> 35053321 |
Joana R Loureiro1,2, Ana F Castro1,2,3, Ana S Figueiredo1,2,3, Isabel Silveira1,2.
Abstract
The number of neurodegenerative diseases resulting from repeat expansion has increased extraordinarily in recent years. In several of these pathologies, the repeat can be transcribed in RNA from both DNA strands producing, at least, one toxic RNA repeat that causes neurodegeneration by a complex mechanism. Recently, seven diseases have been found caused by a novel intronic pentanucleotide repeat in distinct genes encoding proteins highly expressed in the cerebellum. These disorders are clinically heterogeneous being characterized by impaired motor function, resulting from ataxia or epilepsy. The role that apparently normal proteins from these mutant genes play in these pathologies is not known. However, recent advances in previously known spinocerebellar ataxias originated by abnormal non-coding pentanucleotide repeats point to a gain of a toxic function by the pathogenic repeat-containing RNA that abnormally forms nuclear foci with RNA-binding proteins. In cells, RNA foci have been shown to be formed by phase separation. Moreover, the field of repeat expansions has lately achieved an extraordinary progress with the discovery that RNA repeats, polyglutamine, and polyalanine proteins are crucial for the formation of nuclear membraneless organelles by phase separation, which is perturbed when they are expanded. This review will cover the amazing advances on repeat diseases.Entities:
Keywords: ATTTC repeat insertion; FAME1; RNA foci; RNA-binding protein; SCA37; familial adult myoclonic epilepsy; liquid–liquid phase separation; polyalanine; polyglutamine; spinocerebellar ataxia
Mesh:
Year: 2022 PMID: 35053321 PMCID: PMC8773600 DOI: 10.3390/cells11020205
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1Genomic localization of pathogenic repeat expansions. Pathogenic repeats are found in protein-coding genes or non-coding RNAs; in gene-coding regions, these repeats are found in promoters, 5′ UTRs, exons, introns and 3′ UTRs; in red boxes are represented expanded repeats that trigger gene silencing or downregulation (gene loss-of-function); in blue boxes the non-coding transcribed repeat expansions (RNA-mediated toxicity); in green boxes are expanded repeats in protein coding regions or that lead to upregulation (protein gain-of-function). The pentanucleotide repeats are highlighted in blue. In promoters are located the pathogenic repeats causing progressive myoclonus epilepsy of the Unverricht-Lundborg (EPM1) [8] and Baratela-Scott Syndrome (BSS) [9]; the repeat expansion leading to spinocerebellar ataxia type 12 (SCA12) is in the promoter or the 5’ UTR, depending on the transcript [10,11]. In 5′ UTRs are found the expanded repeats associated with fragile X syndrome (FXS or FRAXA) [12], FRAXE [13], FRA2A [14], FRA7A [15], FRA11A [16], FRA12A [17], glutaminase deficiency (GAD) [18], fragile-X-associated tremor ataxia syndrome (FXTAS) [19], fragile X-associated primary ovarian insufficiency (FXPOI) [20], neuronal intranuclear inclusion disease (NIID) [21,22,23], and oculopharyngodistal myopathy (OPDM1) [23]. In introns, abnormal repeat tracts may lead to Friedrich ataxia (FRDA) [24], frontotemporal dementia/amyotrophic lateral sclerosis (FTD/ALS) [25,26], SCA10 [27], SCA36 [28,29], myotonic dystrophy type 2 (DM2) [30], and Fuchs endothelial corneal dystrophy (FECD) [31]. The pathogenic pentanucleotide repeats causing cerebellar ataxia with neuropathy and vestibular areflexia (CANVAS) [32,33], SCA31 [34], SCA37 [35], familial adult myoclonic epilepsy type 1 (FAME1) [36], FAME2 [37], FAME3 [38], FAME4 [39], FAME6, and FAME7 [36] are also intronic; a repeat-containing retrotransposon insertion potentially interferes with transcription of surrounding genomic elements in X-linked dystonia-parkinsonism (XDP) [40]. In 3′ UTR, expanded trinucleotide repeats originate DM1 [41,42]. The Huntington’s disease-like 2 (HDL2) repeat expansion is located both in the 3′ UTR and in an exon, depending on the transcript, being bidirectional transcribed [43,44,45] as in SCA8 [46,47]. In exons, trinucleotide repeat expansions encode abnormal polyglutamine (polyQ) tracts in dentatorubral-pallidoluysian atrophy (DRPLA) [48], HD [49], SCA1 [50], spinobulbar muscular atrophy (SBMA) [51], SCA2 [52], Machado-Joseph disease/SCA3 [53], SCA6 [54], SCA7 [55], and SCA17 [56] or polyalanine (polyA) stretches in oculopharyngeal muscular dystrophy (OPMD) [57], X-linked mental retardation (XLMR) [58], blepharophimosis syndrome (BPES) [59], cleidocranial dysplasia (CCD) [60], congenital central hypoventilation syndrome (CCHS) [61], hand-foot-genital syndrome (HFGS) [62], holoprosencephaly (HPE) [63], synpolydactyly (SPD) [64], X-linked mental retardation and abnormal genitalia (XLAG) [65], or XLMR and growth hormone deficit (XLMRGHD) [66]. An expanded repeat in opposite DNA strands of non-coding RNAs LOC642361/ NUTM2B-AS1 cause oculopharyngeal myopathy with leukoencephalopathy (OPML1) [23]. * The pathogenic repeat threshold is not clear in FTD/ALS, some studies consider >30 repeats.
Figure 2Representation of cellular condensates and location of RNA and proteins with expansions. Transcriptional condensates, formed by liquid–liquid phase separation, concentrate TFs, mediators, and other crucial players in transcription initiation. They are closely located to nuclear speckles, small reservoirs of splicing factors acting in mRNA splicing, and to paraspeckles, that assemble TFs and RBPs, thus regulating gene expression. Histone loci bodies drive histone synthesis and storage [71], while RNA transport granules carry mRNAs from and to the nucleus. The nucleolus is the place of rRNA synthesis and early-stage ribosomal subunit assembly. Under stress conditions (1) in nuclear stress bodies, the phosphorylation of RBPs leads to alterations in mRNA splicing of their target genes; in SCA31, the toxic (UGGAA)n competes with SatIII (UGGAA)n non-coding RNA for SRSF9 binding; (2) in the cytoplasm, the stress granules sequester mRNA, arresting translational initiation; Ataxin-2 protein has been implicated in the assembly of stress granules when expanded interacts with TDP-43 and recruits Staufen1 (STAU1) protein, leading to alterations in stress granule dynamics [4]. Perinucleolar compartment is thought to be the place where RNA Pol III transcripts are processed; additionally, recent work has demonstrated that the recruitment of PTBP1 by PNCTR long non-coding RNA controls the splicing patterns of RNA Pol II transcripts [72]; cleavage bodies are regions of mRNA processing. P bodies stall untranslated mRNAs, mediating its decay. U bodies assemble and store small nuclear ribonucleoproteins (snRNPs), previously biosynthesized in Cajal bodies. There are several mechanisms contributing to pathology in repeat expansion diseases (e.g., bidirectional transcription, RNA foci formation and RAN translation; most of them disturb phase-separation capacity, interfering with the correct assembly of condensates. Polyalanine—polyA; polyglutamine—polyQ.
Figure 3Pathogenic pentanucleotide repeats. (A) Schematic representation of pathogenic pentanucleotide repeats located in poly-A regions of ancient Alu families. Repeats are represented in Alu orientation; orange for repeats in genes with the Alu in antisense DNA strand; blue for repeats with Alu in shared introns of genes transcribed in opposite directions. The SCA37 repeat is located in the middle poly-A of an AluJb and the remaining are in 3′ poly-A tails of AluS elements. LM—left Alu monomer; RM—right Alu monomer; boxes A and B—internal regions of an RNA pol III promoter degenerated in ancient Alu families. (B) Schematic representation of non-pathogenic and pathogenic alleles with ATTTC repeat insertions, in intronic gene regions. Orange represents repeat composition of non-pathogenic alleles, whereas red is the pathogenic repeat insertion. Most non-pathogenic chromosomes have less than 30 repeats, but some can be larger than the pathogenic tract.
Neurological diseases caused by pentanucleotide repeat expansions.
| Disease | Type | Gene | Pathogenic Alleles | Non-Pathogenic Alleles | |
|---|---|---|---|---|---|
| Common | Less Common | ||||
| SCA | 27SCA10 |
| (ATTCT)800–4500 | (ATTCT)10–16 | (ATTCT)17–32 |
| 34SCA31 |
| TAAAA(TAGAA)1–4(TGGAA)n | (TAAAA)8–20 | (TAAAA)exp | |
| (TAAAA)n(TAGAA)n and (TAAAATAGAA)n | |||||
| 35SCA37 |
| (ATTTT)60–79(ATTTC)31–75(ATTTT)58–90 | (ATTTT)7–30 | (ATTTT)31–400 | |
| CANVAS | 32CANVAS |
| (AAGGG)400-2000 | (AAAAG)11 | (AAAAG)15–200 |
| (AAAGG)40–1000 | |||||
| 130(ACAGG)~1000 | |||||
| 131(AAAGG)10–25(AAGGG)exp | |||||
| FAME | 36FAME1 |
| (ATTTT)exp(ATTTC)exp | (ATTTT)<100 | (ATTTT)>100 |
| [(ATTTT)exp(ATTTC)exp(ATTTT)exp]440–3680 | |||||
| 143(ATTTT)n(ATTTG)n | |||||
| 37FAME2 |
| (ATTTT)340–390(ATTTC)345–588 | (ATTTT)n | n.d. | |
| 38FAME3 |
| [(ATTTT)exp(ATTTC)exp]791–1035 | (ATTTT)9–20 | n.d. | |
| 39FAME4 |
| (ATTTT)819(ATTTC)221 | (ATTTT)4–120 | (ATTTT)121–1219 | |
| 36FAME6 |
| (ATTTT)22(ATTTC)exp(ATTTT)exp | (ATTTT)<200 bp * | (ATTTT)>200 bp * | |
| 36FAME7 |
| (ATTTT)exp(ATTTC)exp(ATTTT)n | (ATTTT)<300 bp * | (ATTTT)300–3000 bp * | |
n.d.—not determined; * Repeat size is in base pairs.