Literature DB >> 32060489

A slipped-CAG DNA-binding small molecule induces trinucleotide-repeat contractions in vivo.

Masayuki Nakamori1, Gagan B Panigrahi2, Stella Lanni2, Terence Gall-Duncan2,3, Hideki Hayakawa1, Hana Tanaka1, Jennifer Luo2,3, Takahiro Otabe4, Jinxing Li4, Akihiro Sakata4, Marie-Christine Caron5,6, Niraj Joshi5,6, Tanya Prasolava2, Karen Chiang2,3, Jean-Yves Masson5,6, Marc S Wold7, Xiaoxiao Wang8, Marietta Y W T Lee8, John Huddleston9,10, Katherine M Munson9, Scott Davidson2, Mehdi Layeghifard2, Lisa-Monique Edward2, Richard Gallon11, Mauro Santibanez-Koref11, Asako Murata4, Masanori P Takahashi1, Evan E Eichler9,10, Adam Shlien2, Kazuhiko Nakatani4, Hideki Mochizuki1, Christopher E Pearson12,13.   

Abstract

In many repeat diseases, such as Huntington's disease (HD), ongoing repeat expansions in affected tissues contribute to disease onset, progression and severity. Inducing contractions of expanded repeats by exogenous agents is not yet possible. Traditional approaches would target proteins driving repeat mutations. Here we report a compound, naphthyridine-azaquinolone (NA), that specifically binds slipped-CAG DNA intermediates of expansion mutations, a previously unsuspected target. NA efficiently induces repeat contractions in HD patient cells as well as en masse contractions in medium spiny neurons of HD mouse striatum. Contractions are specific for the expanded allele, independently of DNA replication, require transcription across the coding CTG strand and arise by blocking repair of CAG slip-outs. NA-induced contractions depend on active expansions driven by MutSβ. NA injections in HD mouse striatum reduce mutant HTT protein aggregates, a biomarker of HD pathogenesis and severity. Repeat-structure-specific DNA ligands are a novel avenue to contract expanded repeats.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32060489      PMCID: PMC7043212          DOI: 10.1038/s41588-019-0575-8

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Expansion of unstable trinucleotide repeats causes over 40 neurodegenerative diseases, including Huntington’s disease (HD), and 17 of these are caused by gene-specific CAG/CTG trinucleotide repeat expansions[1-5]. Ongoing repeat expansions in affected tissues correlate with disease age-of-onset, progression and severity[6,7], and dramatic repeat length variations exist between tissues of the same individual[1]. For at least six CAG diseases (HD, spinocerebellar ataxia type 1 (SCA1), SCA2, SCA3, SCA7, and SCA17), DNA repair proteins are major modifiers of age-of-onset[8,9] and disease progression[10], suggesting that ongoing somatic expansions contribute to age-of-onset and progression[2]. Therefore, arresting or reversing somatic CAG/CTG repeat expansions may arrest or reverse disease progression[2]. Inheritance of larger HD CAG expansions also leads to earlier age-of-onset and accelerated disease progression[11,12]. Thus, inducing contractions of the expanded repeat to lengths shorter than the inherited tract length should have beneficial effects. While the exact mechanism(s) of repeat expansions are unclear, all models involve the aberrant repair of unusual slipped-DNA structures formed by the repeats[4,13-22]. Slipped-DNAs form by out-of-register annealing of complementary repeat strands during DNA repair, replication or transcription, and have slip-outs with intra-strand hairpins of CAG or CTG repeats[16,21]. Slipped-DNAs were detected at the mutant myotonic dystrophy (DM1) locus in patient tissues, including the CNS, and were not present at the non-expanded allele[13]. Levels of slipped-DNAs were greatest in tissues showing the largest expansions, supporting the involvement of slipped-DNAs in the expansion processes. A compound with specificity for slipped-DNAs may alter their formation and/or processing—attributes that could modulate repeat instability. We previously designed a CAG-specific DNA binding compound, Naphthyridine-Azaquinolone (NA)[23-25], which binds to a distorted intra-strand CAG hairpin, with the naphthyridine and azaquinolone moieties forming hydrogen bonds to guanine and adenine, respectively, causing two cytosines to flip-out from the CAG hairpin (Fig. 1a,b)[25]. NA bound with high affinity and increased the melting temperature of (CAG)10 hairpins by >30 °C[24], and preferentially bound to longer (>30) CAG hairpins[25]. The high selectivity of NA for longer CAG hairpins coupled with the presence of slipped-DNAs at a mutant repeat locus[13] suggested it may specifically target the expanded allele.
Figure 1 |

NA binds to long CAG slip-outs.

a, Structure of NA comprising two heterocycles, naphthyridine (red) and 8-azaquinolone moiety (blue)[25]. b, Schematic of NA–(CAG)•(CAG) triad complex revealed by NMR spectroscopy[25]. The (CAG)n DNA sequence (left) can fold into hairpins involving mismatched A-A pairs flanked by C-G and G-C pairs (middle). NA molecules intercalate into the DNA helix, with the 2-amino-1,8-naphthyridine moiety hydrogen bonding to guanine (in red) and 8-azaquinolone moiety hydrogen bonding to adenine (in blue), forcing the flipped-out cytosine bases. c, Binding of NA to gel-purified DNA fragments with (CAG)•(CAG) repeats in both strands flanked by 59 bp and 54 bp of non-repetitive DNA labeled with 32P-labeled on both strands (green star). d, Quantification of NA binding. Relative migration was measured as the ratio of migration distance of each NA-DNA complex to the migration distance of free DNA. Densitometric analysis was performed for the (CNG)1–3 DNA substrate. Graphs indicate the mean of three independent experiments +/− standard deviation (s.d.). e, Footprinting on (CAG)50•(CTG)30 uniquely 32P-labeled on the (CAG)50 strand, cleaved throughout the repeat using mung bean nuclease (MBN)[16]. In the presence of NA, all scissile sites (red arrows) with the exception of the hairpin tip, are protected, revealing binding specificity for the slip-out stem. Two independent experiments were performed with similar results. f, Gel-purified DNAs with a long (CAG)20 slip-out from (CAG)50•(CTG)30, 32P-labeled on both strands, were treated as indicated and resolved on 4% polyacrylamide. NA-DNA complexes formed with both the (CAG)50 strand (both panels) and the heteroduplexed (CAG)50•(CTG)30 (right panel) are shown by brackets; free DNA is indicated by arrowheads. NA did not bind (CTG)30 hairpin fragment in either experiment (white arrowheads), and did not inhibit re-hybridization of complementary strands. Two independent experiments were performed with similar results. Blots have been cropped and the corresponding full blots are available in the Source Data files.

RESULTS

NA binds long CAG slip-out structures.

To determine if NA can bind disease-relevant (CAG)•(CTG) duplexes with and without slip-outs, we performed band-shifts using slipped-DNAs[16] that mimic those at the mutant DM1 locus in patient tissues[13]. NA did not bind to fully duplexed DNAs containing (CAG)50•(CTG)50 repeats, and high concentrations of NA did not induce structural alterations of the DNA (Fig. 1c,d). NA bound molecules of (CAG)50•(CTG)50 with clustered short slip-outs[22] to a limited degree, but their short slip-outs and electrophoretic heterogeneity prohibited detection of a single-shifted species (Fig. 1c,d). Slipped-heteroduplexes with a single long slip-out of 20 CAG excess repeats in (CAG)50•(CTG)30 were extensively bound by NA (Fig. 1c,d). NA-binding caused electrophoretic band-broadening, previously observed for other DNA-binding ligands[26]. The progressively slower migration of the slipped-DNA with added NA is consistent with NA binding to additional (CAG)•(CAG) pairs in the hairpin[25]. In contrast, DNAs with long slip-outs of CTG repeats, (CAG)30•(CTG)50, were not bound by NA (Fig. 1c,d). Thus, NA specifically bound CAG but not CTG slip-outs. We confirmed this with footprinting experiments using the single-strand DNA-specific mung bean nuclease (MBN), which cleaves only the slip-out regions between the AG of every CAG unit[16] (Fig. 1e). Upon NA binding to the CAG slip-out, MBN digestion is inhibited, with the exception of the hairpin tip (Fig. 1e), not bound by NA. Denaturation of purified slipped (CAG)50•(CTG)30 to individual (CAG)50 and (CTG)30 single-strands showed NA binding specifically to the single-stranded (CAG)50 strand (Fig. 1f). Subsequent renaturation of the individual single-strands revealed that NA did not block complementary strand hybridization, but bound specifically to the slipped-out CAG strand (Fig. 1f). The greater degree of electrophoretic shift of the isolated single-stranded (CAG)50 tract compared to (CAG)50•(CTG)30 is caused by it being forced into a long hairpin of 50 repeats that can bind more NA molecules. Note that the amount of NA bound to (CAG)50•(CTG)30 or (CAG)50 following re-hybridization (Fig. 1f, 7.5 μM NA) is considerably greater than in the absence of re-hybridization (Fig. 1c, 50 μM NA), even though the amount of NA is more than 5-fold higher in the latter. These results suggest that, in biological situations where the CAG repeat is denatured from its complementary CTG strand, the full-length of the isolated CAG tract would become fully bound by NA in a hairpin-like conformation. Furthermore, NA binding was significantly greater for longer CAG slip-outs (Supplementary Fig. 1). We previously estimated a ratio of two NA molecules to one CAG-CAG, wherein a slip-out of (CAG)20 would bind maximally 20 NA molecules[25]. The molar ratio of 2:1 NA to slipped-out repeats was evident for the longer slip-outs. Affinity of each NA molecule/CAG-CAG was estimated as 1.8 × 106 M–1 as Ka or 0.56 × 10–6 M as Kd[25]. NA can be removed from the DNA using phenol/chloroform extraction (Extended Data Figure 4c). Together, these findings support the structure-specificity of NA for long CAG repeat slip-outs of disease-relevant tract lengths, and indicate that NA does not induce slip-out extrusion from an expanded fully duplexed molecule, but induces formation of longer slip-outs from a single-stranded CAG tract.
Extended Data Fig. 4

NA does not affect HTT transcription or translation

a, NA does not affect transcription across expanded repeats in HTT in HD patient cells, determined by quantitative real-time reverse transcriptase (qRT)-PCR and normalized to U6 RNA. Data are indicated as the mean ± s.d. of independent triplicates. b, Western blot showing that NA does not affect HTT translation in HD patient cells with (CAG)43. Western blots were repeated 4 times with similar results. Blots have been cropped and the corresponding full blots are available in the Source Data files.

NA inhibits repair of long CAG slip-outs.

We tested whether NA could block processing of slipped-DNAs, which could forecast its ability to modify repeat instability. NA specifically inhibited repair of slipped-DNAs with long (CAG)20 slip-outs, but not (CTG)20 slip-outs (Fig. 2a–c). This specificity is consistent with the inability of DNA polymerases to extend primers along NA-bound (CAG)20 templates[23,24]. We assessed the effects of NA on repair of a single extra CAG repeat slip-out, too small to be bound by NA. Repair of this substrate was unaffected by NA (Fig. 2d and Supplementary Fig. 1), consistent with its binding specificity for long slip-outs. NA did not bind to slip-outs of 3 excess repeats, but binding could be detected for 5, 11, 15, or 20 excess CAG repeats, with increased binding for slip-outs >10 (Supplementary Fig. 1, upper panel). Repair of DNA substrates with a similar range of slip-out sizes from 6, 10, 13, or 20 excess CAG repeats was inhibited by NA (Supplementary Fig. 1, lower panel), indicating that, once the DNA is bound by NA, repair is inhibited and not further altered by increased NA-bound lengths. These data suggest that there is a less-than-additive effect for both NA binding and NA’s effect on slip-out repair. NA had no effect on repair of a G-T mismatch, the most frequent base-base mismatch, which depends upon mismatch repair (MMR) proteins[22,27] (Fig. 2e). Thus, NA is unlikely to cause genome-wide mutations that occur in the absence of MMR.
Figure 2 |

NA specifically inhibits repair of long CAG slip-outs by human (HeLa) cell extracts.

a, Schematic of the in vitro repair assay. b-e, Starting DNAs and repair products (schematics) have the repeat-containing fragment released, resolved on PAGE and assessed on a molar level by Southern blotting and densitometry. Repair of DNA substrates containing a long (CAG)20 slip-out (b), a long (CTG)20 slip-out (c), a single CAG slip-out (d), or a G-T base-base mismatch (e), in the absence or presence of NA. Slipped-DNAs were hybrids of (CAG)50•(CTG)30, (CAG)30•(CTG)50 or (CAG)50•(CTG)49 (Methods)[22,56]. Repair of the G-T mismatch reconstitutes a HindIII restriction site. Graphs show percentage repair efficiencies to repaired product relative to all repeat-containing fragments in the lane; values are normalized to the NA-free efficiency. Values represent the mean of three independent experiments ± s.d. Blots have been cropped and the corresponding full blots are available in the Source Data files.

NA modulates instability of expanded mutant repeats in HD cells, with limited off-target effects.

At 50 μM, NA is cell-permeable and enters nuclei without causing acute or long-term cytotoxicity, slowing proliferation, or altering DNA replication or transcription across CTG tracts (Fig. 3a–c and Extended Data Fig. 1). Tract lengths of >200 repeats are frequent in brain cells that have experienced somatic expansions in HD individuals that inherited (CAG)40–50[1,7,28]. NA induced a significant shift in repeat population towards contraction of a (CAG)180•(CTG)180 tract at the HTT locus in HD patient-derived primary fibroblasts[5]. NA enhanced the number of contractions of the expanded HD repeat (Fig. 3d; P = 7.25 × 10−6, Supplementary Table 1), causing significant repeat loss (Fig. 3f; P = 0.0003). NA also induced a significant reduction in the number of expansions of the HD repeat (Fig. 3d; P = 4.34 × 10−5, Supplementary Table 1). We also treated HD patient fibroblasts with (CAG)43, a mutation length common in the majority of patients. A significant number of NA-treated cells showed contractions of the expanded repeat, with contractions as low as 20 repeats, below the HD disease threshold of 35 units (Fig. 3e,g; P = 3.28 × 10−5, Supplementary Table 1). NA also reduced the number of expansions (Fig. 3e; P = 8.32 × 10−5, Supplementary Table 1). Thus, NA can induce contractions of expanded tract lengths common in inherited and somatically expanded HD alleles. In contrast, NA does not affect either the non-expanded HD repeat or other microsatellite repeats known to be prone to instability under stressed conditions (Fig. 3h and Extended Data Fig. 2a), suggesting that NA is specific for structures formed by expanded CAG/CTG repeats and will not deleteriously affect other repeats.
Figure 3 |

NA cellular distribution, non-toxicity, and effects on repeat instability in HD patient cells.

a, NBD-labeled NA (green) was distributed throughout nuclei and cytoplasm of human HD primary fibroblasts with (CAG)43 (GM02191). Nuclei and cell membranes were stained with DAPI (blue) and Cell Light Plasma Membrane-RFP (red), respectively. Scale bar: 20 μM. Three independent experiments were performed. b, Cell toxicity of (CAG)43 cells treated with NA for 72 h. Viability estimated by WST-1 assays. Values represent means of three independent measures ± s.d.. c, Population doubling levels and percentage of cell viability of (CAG)43 cells treated with or without 50 μM NA. Error bars indicate the s.d. of three independent experiments. Unpaired two-tailed t-test was used to calculate P-values. d,e, Repeat instability was analyzed by small-pool-PCR across the HD repeat tract (Methods, see Supplementary Table 1). Histograms show repeat-length distributions in human HD primary fibroblasts with (CAG)180[5] (GM09197) or with (CAG)43 (GM02191), after 40 days growth +/− NA. Frequency distribution of repeat alleles is indicated as gray bars. Dashed line indicates peak CAG size. Allele lengths are grouped in bins spanning 10 repeats; >230 alleles were sized per group. Percentage of repeat population was calculated by dividing the number of alleles grouped in bins spanning 10 repeats by the number of total alleles. Shown is a summary of 3 independent experiments. P-values were calculated by χ2-test comparing the frequencies of expanded, unchanged, and contracted alleles in each set of experiments (Supplementary Table 1). f,g, Average (mean) repeat size in HD fibroblasts after 40 days incubation with or without NA. *P < 0.001, two-sided Student’s t test. Error bars: 99% confidence limits. h, Repeat lengths of the HTT (normal allele), CASK, and Mfd15 loci in HD fibroblast cells (after 40 days incubation +/− NA). Three independent experiments were performed. i, Rate of single base substitutions (SBS), insertions/deletions (INDELs), and double substitutions (DBS) in NA-treated HD cells with (CAG)43 or (CAG)180 compared to mock-treated cells. SBS/INDELs/DBS/rearrangements detected in the NA-treated (CAG)43 or (CAG)180 cells were 172/29/7/0 and 187/42/5/0, respectively. Dashed lines indicate the 0.8 mutations/Mb, which is the lowest burden observed in a typically “quiet” adult cancer (bone marrow myelodysplastic syndrome[45]), and the 5–10 and 100–1,000 mutations/Mb in biallelic mismatch repair deficient cancers, with and without polExpo activity[44–48]. Blots have been cropped and the corresponding full blots are available in the Source Data files.

Extended Data Fig. 1

NA does not affect replication efficiency or replication fork progression

Three circular plasmids containing the SV40 origin of replication, and an expanded (CAG)79•(CTG)79 repeat tract (pDM79EF and pDM79HF) or no repeats (pKN16), were replicated in vitro by human (HeLa) cell extracts without or with NA (7.5 μM or 15 μM) treatment. The location of SV40-ori determines the replication direction and which strand will be used as the leading or the lagging strand template. pDM79HF uses the CAG strand as the lagging strand template, while pDM79EF uses the CTG strand as the lagging strand template (schematic on the top of the gel panel). Replication products were purified and linearized with BamHI. An equal portion of the reaction material was also digested with BamHI and DpnI as DpnI digests un-replicated and partially-replicated material, as shown in the schematic (top figure). The digestion products were electrophoresed on a 1% agarose gel to resolve completely replicated and un-replicated material (bottom figure). Equal amount of unreplicated plasmid DNA was digested with DpnI and stained with Ethidium Bromide to show the complete digestion of unreplicated plasmid DNA (Bottom panel). Panel I, ethidium bromide stained, Panel II, autorad: marker (lane 1); DpnI undigested plasmid DNA (lane 2); DpnI digested unreplicated plasmid DNA (lane 3–4); replicated plasmid DNA, DpnI resistant (lane 5). No difference in DpnI resistant material is observed between replication in the presence or absence of NA, in all the three templates tested (panel III, IV, V). Blots have been cropped and the corresponding full blots are available in the Source Data files.

Extended Data Fig. 2

NA does not affect non-mutant genetically stable repeats

a,b, Representative data showing small-pool PCR for the non-expanded CAG/CTG repeat length of CASK and Mdf15 in HD primary fibroblast cells (a) and small-pool PCR for the non-expanded CAG/CTG repeat length of CASK, Mdf15, ATXN8 and the non-expanded HTT allele genes in HT1080–(CAG)850 cells (b). Even under the sensitive mutation detection capacity of spPCR, length variation was not observed in either NA treated- and untreated-cells. Notably, some reactions did not show any product as is typical of the low genomic DNA template dilutions used in small-pool PCR. c, Repeat-tract lengths of the CASK, ATXN8, and Mfd15 loci in HT1080–(CAG)850 cells (initial clone and cells after 30 days incubation with or without NA). Length variation was not observed at any of these repeats of normal length loci in HT1080-(CAG)850 cells (after 30 days incubation with or without NA). Three independent experiments were performed. d, Small-pool PCR for the non-expanded CAG tracts in TBP alleles in HD patient fibroblasts treated with or without NA for 40 days. e, Small-pool PCR for the non-expanded CAG tracts in TBP alleles in HD R6/2 mouse striatum with four injections of NA or saline. f, Microsatellite instability assay. Assay scores >1.3 indicate increased MSI relative to a control sample set from peripheral blood leukocytes. Both NA positive and NA negative HD cells with (CAG)43 or (CAG)180 scored <1.3, indicating no effect of treatment on MSI. 8 known CMMRD-negative controls and 3 known CMMRD-positive controls were included in the assay.

As an additional control, we first assessed the effect of NA at the very long but genetically stable (i.e., not forming slipped-DNA structures) CAG tract of the TBP gene in HD patient cells. Stretches of >49 CAG repeats at the TBP locus cause fully-penetrant SCA17 disease[29]. Our HD cell lines have 43 unstable repeats at the HTT gene and a similar length of 38 and 34 stable repeats of the non-mutant TBP gene. NA did not change the length distribution of TBP repeats[30,31] (Extended Data Fig. 2d and Supplementary Table 1), further supporting its specificity for actively unstable tracts that form slipped-DNAs. To assess possible mutagenic activity of NA, we assessed its effect on a cancer diagnostic MSI panel of >20 slippage-prone mononucleotide repeats, using a sequencing-based assay capable of detecting low-level microsatellite instability in non-neoplastic tissues[32]. There was no evidence of increased MSI from NA treatment in either HD cells with (CAG)43 or (CAG)180 (Extended Data Fig. 2f). As a control for non-repetitive regions, we assessed the HPRT1 gene (exons 2 to 3), which is an established surrogate for evaluating mutagen-induced mutation spectra[33-36]. Since HPRT1 is on the X-chromosome, each of multiple single long-read sequences of HPRT1 can be used as a proxy for a single cell in male HD cells (GM09197, (CAG)180/21), permitting assessment of the effect of NA on individual cells without overlapping alleles. We sequenced up to 2,402 individual HPRT1 alleles in three independent replicates and found no sequence differences between NA- and mock-treated cells (Extended Data Fig. 3a–c). We also assessed whether NA could induce an unsuspected mutation signature in HD cells ((CAG)43 or (CAG)180) using whole-genome sequencing. Environmental and anti-cancer agents can lead to specific mutation signatures[37-39]. We screened the NA-treated and untreated HD cells for mutational signatures shown in COSMIC using an established pipeline (SigProfiler version 0.0.5.75[40]). We were unable to detect a statistically significant mutation signature. The genome of NA-treated HD cells showed very low numbers of single or double substitutions (SBS or DBS) or insertions/deletions (INDELs), and no rearrangements (Fig. 3i). The SBS/INDELs/DBS/rearrangements for the (CAG)43 and (CAG)180 cells were 172/29/7/0 and 187/42/5/0, respectively. Similar background mutations attributed to DNA damage incurred during cell culture (~245 SBS, ~35 INDELs, ~6 DBS, and ~0 rearrangements per genome) have been detected[41-43]. Moreover, cells treated for 24 hours with the food additive potassium bromate (875 μM) or the anticancer drug cisplatin (12.5 μM) showed ~576/~31 and ~926/~65 SBS/INDELs, respectively[41], both substantially “noisier” than our experiment treating HD cells with NA chronically for 40 days (50 μM). The “silence” of NA off-target mutagenesis is contrasted by the “noisy” mutational burden of the hyper- and ultra-hyper-mutated genomes of brain cancers of MMR-deficient individuals assessed by the same pipeline[44-48]—a portent of non-specific targeting of MutSβ (Fig. 3i). Together, these results argue against NA as a general mutagen.
Extended Data Fig. 3

NA is not a general mutagen

Towards assessing whether NA-treatment acted as a general mutagen to sequences other than CAG slip-outs, we harnessed the high read accuracy and depth of single molecule, real time, circular consensus sequencing (SMRT-CCS). Single-molecule sequencing was done on the HPRT1 gene – widely used as a surrogate indicator of the global effect of induced genetic variation. For each replicate, we calculated the relative mutation rate between NA- and saline-treated cells as the mutation rate for NA-treated cells minus the rate for saline-treated cells and identified excess mutation rates based on an absolute relative rate >0.5%. a, Schematic of HPRT1 sequencing for mutation detection. Briefly, cells were grown under identical conditions differing only by the addition of NA (50 μM) or saline, DNAs were isolated, HPRT1 exons 2 and 3 PCR amplified and sequenced. b, Quality control for our analysis. c,d, Comparison of sequence variations between NA-treated and saline treated is presented. We chose to compare the single-molecule sequence reads of individual X chromosome-linked HPRT1 alleles (exons 2 and 3) from our male HD patient-derived cells (c), and our male R6/2 mice (d), thsat had been NA- or saline-treated. Each read represents a single cell (Supplementary Note). Graphs show the distribution of sequence variants by relative mutation rate between three experimental replicates of NA-treated and saline-treated cells sequenced with PacBio single-molecule long reads.

Effect of NA depends on transcription but not replication through the repeat tract.

NA could affect repeat instability during replication or transcription. NA treatment induced a significant shift in repeat population towards contraction of a (CAG)850•(CTG)850 tract in human cells, expressing r(CAG)850[20] (Fig. 4a,b; P = 4.78 × 10−5, Supplementary Table 1), and significantly shortened the average size of the repeat tract, with losses of up to 790 repeats (Fig. 4b; P = 6.44 × 10−3). NA’s effects were independent of cell proliferation and DNA replication, as near-complete arrest of proliferation using Palbociclib, contact inhibition, or serum starvation had no effect on NA-induced CAG contractions (Fig. 4d,e and not shown). This is consistent with NA’s lack of effect on cell proliferation (Fig. 3c) or replication fork progression (Extended Data Fig. 1). To address the requirement for transcription, we used an established cell model, HT1080-nontranscribing (CAG)850, having a single (CAG)850 transgene floxed by transcription-terminator elements, so that the expanded repeat is not transcribed in either direction[20,49]. We confirmed the absence of transcription (Fig. 4c) and treated cells with NA in transcribing or non-transcribing conditions. NA only induced contractions of the (CAG)850 tract when transcription was permissible (Fig. 4b,f). Since NA did not alter transcription across the expanded repeat, either in these cells or HD patient fibroblasts (Fig. 4g and Extended Data Fig. 4a), this shows that NA induces CAG contractions during transcription across the expanded repeat, consistent with transcription having a driving role in inducing CAG instability and NA-induced contractions[15,20,50,51]. The ability of NA to induce contractions required transcription across the CTG repeat, but did not involve arrest of transcription. Again, NA did not affect non-expanded CAG/CTG tracts or other microsatellite repeats in the HT1080 cell model (Extended Data Fig. 2b,c). Thus, NA depends upon transcription across the expanded repeat to induce repeat contractions, and is effective independent of cell proliferation.
Figure 4 |

NA induces CAG contractions, independent of proliferation, dependent on rCAG transcription.

a, Schematic transcription bubble, and representative data showing spPCR repeat length analysis of HT1080–(CAG)850 cells (initial cell clone and cells after 30 days incubation +/− NA) (Methods)[20,49]. Scale at left shows molecular weight markers (M) converted into repeat number for CAG-repeat fragments of equivalent size. b, Histograms showing repeat length distributions, after 30 days incubation +/− NA, in proliferating HT1080-(CAG)850 cells, in which transcription was permissible. Mean repeat size change is shown below. Frequency distribution of unstable and stable alleles is shown by gray bars and black bars, respectively. Dashed lines indicates the unchanged CAG size. Allele lengths are grouped in bins spanning 50 repeats. Percentage of repeat population was calculated by dividing the number of alleles grouped in bins by the number of total alleles; >50 alleles were sized for each group (Supplementary Table 1). P-values were calculated by χ2-test. Error bars: 99% confidence limits. Shown is a summary of 3 independent experiments. c, RT–PCR analysis of the CAG repeat in HT1080-non-transcribing cells, showing that the transgene is integrated as a single copy. Three independent experiments were performed. d, BrdU-positive cells after 24 h incubation with BrdU in Palbociclib-treated HT1080-(CAG)850 cells. P-value was calculated by t-test. Data are the mean ± s.d. of triplicates. e, Histograms showing repeat length distributions after 30 days incubation with or without NA, in non-proliferating (Palbociclib-arrested) HT1080-(CAG)850 cells. Mean repeat size change is shown below. P-values were calculated by χ2-test (Supplementary Table 1). Error bars: 99% confidence limits. Shown is a summary of 3 independent experiments. f, Histograms showing repeat length distributions after 30 days incubation with or without NA, in non-transcribing HT1080-(CAG)850 cells. Mean repeat size change is shown below. P-values were calculated by χ2-test (Supplementary Table 1). Error bars: 99% confidence limits. Shown is a summary of 3 independent experiments. g, RNA transcript levels of transgene (transcript in the CAG-direction) in HT1080–(CAG)850 cells treated with or without NA (50 μM)[49]. Data are the mean ± s.d. of triplicates; t-test (two-sided) was used for independent biological triplicate experiments. Blots have been cropped and the corresponding full blots are available in the Source Data files.

NA does not block in vitro R-loop formation, R-loop stability, or RNase digestion, but enhances formation of repeat contractions.

Transcription-induced R-loops can lead to CAG/CTG instability[17,20] and may cause the transcription dependency of NA on repeat instability. NA did not alter transcript levels of the expanded repeat or translation of the mutant Huntingtin protein in cells (Extended Data Fig. 4a,b), nor did it affect transcription in vitro (Supplementary Fig. 2a–d). NA (120 μM) did not affect R-loop formation, nor RNaseA or RNaseH processing of pre-formed R-loops (Supplementary Fig. 2c,d). At NA concentrations used on cells (50 μM), R-loops remained detectable, suggesting NA does not affect R-loop biophysical stability (Supplementary Fig. 2d). However, NA altered processing of CAG/CTG R-loops in extracts of neuron-like human SH-SY5Y cells, significantly increasing the number of contraction products with a milder effect on expansion products (Extended Data Fig. 5a–c, P = 0.002 vs. P = 0.04). These results suggest NA causes preferential R-loop processing to repeat contractions.
Extended Data Fig. 5|

NA induces contractions during R-loops processing.

a, Schematic of R-loop formation, processing, and analysis. Pre-formed double-R-loops were processed by terminally differentiated (retinoic acid) human neuron-like cell extracts (SH-SY5Y) in the absence or presence of NA (50 μM), as described and DNA repeat lengths scored as expansions, contractions, or stable, by the STRIP assay (Methods). b, Representative example of STRIP analysis. Transcription products were isolated, processed and transformed in E. coli cells, previously shown to stably maintain the (CAG)79•(CTG)79 lengths (Methods). Plasmids isolated from individual bacterial colonies were digested with restriction enzymes to release the repeat containing fragment, resolved on 4% polyacrylamide gels and scored for instability. c, Graphical analysis of STRIP results. Two-sided χ2 test was performed to compare 191 untreated colonies vs. 100 NA-treated colonies.

NA alters activity of several DNA repair proteins on long CAG slip-outs.

NA could block interaction of DNA repair proteins with slipped-DNAs, similar to its ability to block MBN activity on (CAG)20 slip-outs (Fig. 1e). As the proteins involved in large (CAG)20 slip-out repair are unknown, we assessed four candidate proteins MutSβ, RPA, polδ and polβ. In mice, the mismatch repair MutSβ complex (MSH2-MSH3) with a functioning ATPase drives CAG/CTG expansions[2,3,19,52,53]. MSH3 is a modifier of age-of-onset and disease progression in HD patients, and a modifier of repeat instability in HD and DM1 patients[54,55]. MutSβ is not required to repair long (CAG)10–25 slip-outs[22,27,56,57], but may be involved in formation of slipped-DNAs following resolution of transcriptionally induced R-loops[21,56,58]. This process, expected to involve MutSβ binding to DNA and ATP-mediated dissociation of the MutSβ-DNA complex, may be affected by NA. NA did not block binding of MutSβ to long CAG slip-outs, nor did NA affect the ATP-mediated dissociation of this complex (Fig. 5a) that was previously reported[59]. Given also that NA does not block formation of slip-outs from denatured repeat-containing DNAs (Fig. 1e), our data do not support a role of NA in blocking formation of slipped-DNAs either with or without MutSβ. We further tested possible overlap of MutSβ and NA pathways. MSH3 knock-down suppressed CAG repeat expansions in cells, confirming the essential role of MutSβ in active repeat instability[2,3,52,53,60]. MSH3 knockdown also blocked the effect of NA on repeat instability (Fig. 5b). Thus, NA depends upon MutSβ, and its effect on inducing contractions depends upon active CAG instability.
Figure 5 |

NA affects interaction of DNA repair proteins on long CAG slip-outs.

a, MutSβ binding to radiolabelled slipped-DNA (CAG)50•(CTG)30 containing a single (CAG)20 slip-out at room temperature for 30 min. Addition of ATP (+/− Mg2+) disrupts binding of MutSβ to the DNA, as described[59]. Addition of NA has no effect on binding or dissociation of MutSβ with this substrate. b, Histograms showing repeat length distributions in HT1080-(CAG)850 cells treated with control siRNA (left) and siRNA against MSH3 (right), +/− NA. Frequency distribution of unstable and stable alleles is shown by gray and black bars, respectively. Dashed lines indicate unchanged CAG size. Percentage of repeat population was calculated by dividing the number of alleles grouped in bins spanning 50 repeats by the number of total alleles; >50 alleles were sized for each group (Supplementary Table 1). P-values were calculated by χ2-test. Error bars: 99% confidence limits. Shown is a summary of 3 independent experiments. c, RPA (250 nM) binding to slipped-DNA (CAG)50•(CTG)30 containing a single (CAG)20 slip-out (lane 2), 32P-labeled on both strands (schematic on top). DNA was incubated with NA (50-microM) for 10 min at room temperature, prior to addition of RPA. Percentage of RPA-bound DNA quantified by densitometry. Histograms indicate mean of three independent experiments +/− s.d. d, Polymerase extension assay performed as described[23]. (CAG)10 template oligo was annealed with a 32P-labeled primer and incubated +/−NA (50 μM) for 30 min at room temperature. 250 nM RPA and/or 20 nM Polδ was added and incubated for 15 min at 37 °C. Products were separated on a 6% sequencing gel together with Maxam-Gilbert sequencing reactions (lane 1). Primer only is in lane 2. Three independent experiments were performed. Blots have been cropped and the corresponding full blots are available in the Source Data files.

Replication Protein A (RPA) inhibits formation of unusual DNA structures, like hairpins, by binding and stabilizing single-stranded regions[61,62], and enhances DNA polymerase progression through structured DNA templates[63,64]. Polymerase delta (polδ) is implicated in CAG repeat instability[65,66] and active in brains in a repair capacity[67]. Polδ was unable to synthesize across a CAG tract, and this was rescued by RPA (Fig. 5c). NA competitively blocked interaction of RPA with slipped-CAG repeats (Fig. 5c) and blocked enhanced progression of polδ along the CAG template (Fig. 5d). NA was also able to block progression of polβ (data not shown). These results support a mechanism whereby NA induces CAG contractions by blocking RPA from binding slip-outs, which would otherwise facilitate polymerases to synthesize through CAG templates (Fig. 5c). NA may disrupt the interaction or activity of other repair proteins on CAG slip-outs.

NA modulates repeat instability in the striatum of HD mice.

In HD mice and HD patients, the largest somatic CAG expansions and most neurodegeneration occur in the striatum[1,2,14,68]. R6/2 mice harbor a single-copy transgene of HTT exon 1[69,70]. R6/2 is one of the best characterized models of repeat instability, with instability patterns and tissue-specificity comparable to HD knock-in models[1,28,71-74] and human HD patient brains[1,7,28,73-75]. CAG expansions are evident in the striatum of these mice as early as a few weeks, which continue as the mice age[14]. All HD CAG mouse models assessed show a similar pattern of ongoing spontaneous CAG expansions in striatum[74], which was quantified in one model to show broadly distributed sizes of additional repeats gained at a rate of ~3.5 CAG units/month/cell[76,77]. Our dosing protocol involved one, two, or four stereotactic injections into the striatum of 6-week old R6/2 HD mice, each spanning a total of 4 weeks, when DNAs were harvested. Since pups of the same litter inherit different CAG lengths, each inheriting around 150–160 CAG repeats, this complicates a direct comparison of the effect of NA between mice. Instead, we injected the left striatum with NA (in saline) and the right striatum with saline only to serve as an internal control. NA intra-striatal injections led to contractions of the expanded repeat relative to the saline-only injections in the same mice (Fig. 6). In the absence of treatment, CAG length distributions between left and right halves of the striatum for a given mouse were identical within the limits of experimental resolution. Thus, NA specifically induces CAG length differences in mice (Fig. 6a).
Fig. 6|

NA induces CAG contractions in R6/2 mouse striatum.

a, GeneMapper traces showing the distribution of CAG-repeat lengths in striatum from six representative 10-week-old R6/2 mice that received one (mouse 1), two (mice 2 and 3) or four (mice 4–6) injections of NA, over a 4-week period. NA, dissolved in saline, was injected into the left striatum (blue) and saline alone was injected into the right striatum (red). All mice treated four times are shown in Supplementary Figs. 3–6. b, The effect of one, two or four NA injections reflected by contraction and expansion instability indices (Methods)[77,78]. c, The effect of one, two or four NA injections reflected by the relative composition of contractions and expansions (Methods)[77,78]. d, The CAG length distributions by GeneMapper traces in the striatum, frontal cortex, cerebellum and tail from one representative R6/2 mouse (mouse 6) following four injections of NA into the left striatum, over a 4-week period. DNAs were isolated from the left (NA, blue) and right (saline, red) sides of the striatum, frontal cortex and cerebellum, and from the tail before (red) and after (green) NA treatment. The repeat size change is in brackets, with the first number representing the NA-induced contractions of the major peak relative to the somatic expansions without NA, and the second number representing the contractions relative to the inherited (tail) allele. The brackets do not account for the size changes in the second mode of the bimodal distribution in the striatum. e, Instability indices in various tissues shown in d, where the red and blue diamonds represent values of the saline-treated/right and NA-treated/left sides of the striatum, respectively. f, The repeat tract lengths of the Mapkap1 and Fgd4loci in both sides of the striatum from an R6/2 mouse with four injections. Three independent experiments were performed. Uncropped gels are available as source data.

Continued repeat contractions resulted from additional NA administrations over a four-week period, with the most striking effect after the fourth administration. This was highly reproducible for a total of 13 mice (one for 1 injection, two for 2 injections, and ten for 4 injections; Fig. 6a, Supplementary Figs. 3–6, and Extended Data Fig. 8). NA injection did not alter brain morphology or induce cell death or cell proliferation (Extended Data Fig. 9a–c). The effect of NA was similar between mice, regardless of inherited repeat length. A bimodal distribution of repeat sizes is present in HD patient brains[7,28,75]. The second peak of larger expansions is evident in the striatum of most CAG mouse models and was found to consist of the most vulnerable HD cells, medium spiny neurons[1,4,68], and to have greater levels of expanded CAG transcripts[74]. NA had a greater contracting effect on the larger expansions of the bimodal repeat distribution (Fig. 6d). This portion of larger-sized expansions in the striatum may have arisen by mutation events involving large slip-outs, large enough to be bound by NA. Instability analyses in HD individuals suggests that the number of CAG units gained or lost at each mutation event are predominantly changes of one repeat unit, but may include changes of 5–15 repeat units, sizes that could be bound by NA[78-80]. Slipped-DNAs at the DM1 locus, isolated from various tissues of DM1 patients, including brain, also presented a bimodal distribution of slip-out sizes, with peaks of ~30 and <10 repeats[13], where the former could be bound by NA.
Extended Data Fig. 8

A total of ten HD mice revealed consistent NA-induced contractions of expanded CAG repeats

Instability Indices in striatum of ten mice (iv-xiii) treated four times with saline in the right striatum and NA in the left striatum. Indices in NA-treated striatum were significantly different from the control saline-treated striatum (Mann-Whitney, P = 0.00035). Instability Indices for mouse v and xi are positive for both NA and saline as there are less data points to the left of the highest peak compared to the points to the right. Still, after NA treatment there is a reduction in the index.

Extended Data Fig. 9

NA does not induce cell death in the CNS and cell proliferation, and does not affect transcription across the Htt locus

a, Histological study, mouse striatum with saline, NA in saline, or no injection, followed by H&E staining. Three independent experiments were performed. b, NeuN staining showing that NA does not induce cell death. Quantification of NeuN positive cells below. Data are indicated as mean ± s.d. of triplicates. c, Doublecortin staining showing that NA does not induce cell proliferation. Three independent experiments were performed. d, The effect of NA on TUNEL signal as assessed via fluorescent microscopy and immunohistochemistry. Representative 40x magnification confocal images of striatal medium spiny neurons (MSNs) of R6/2 mice treated with sale (left striata) and 50 μM NA (right striata) stained for TUNEL (red, staining apoptotic cells), and DARPP32 (green, staining MSNs). Panel locations (i-vi) correspond to the locations outlined in Figure 7 (middle panels). e, NA does not affect transcription across expanded repeats in HTT in HD patient cells and mouse striatum, determined by quantitative real-time reverse transcriptase (qRT)-PCR and normalized to U6 RNA, expressed as the ratio of NA-treated vs. PBS-treated R6/2 striatum. Data are indicated as mean ± s.d. of independent triplicates.

Most alleles in the NA-treated striatum showed contractions of CAG repeats, indicating that NA affected most cells (Fig. 6a, Supplementary Figs. 3–6, and Extended Data Fig. 8). Notably, contractions were detected by standard PCR and did not require more sensitive small-pool PCR. Hence, they were robust, seeming to occur en masse in most cells. We quantified the somatic “instability index”[76,77]: higher indices reflect greater expansions, whereas lower indices reflect lower expansions or greater contractions (Extended Data Fig. 7b). The number of contracted vs. expanded peaks (Fig. 6b) or the relative composition (%) of contracted vs. expanded peaks (Fig. 6c) was greater with subsequent NA treatments. Repeat size distributions in striatum treated four times with NA were significantly different from mock saline-treated striatum (Mann-Whitney, P = 0.00034). The effect of NA was localized to the site of injection, as the CAG tract in cerebral cortex and cerebellum from the same mice showed identical patterns of CAG length heterogeneity between the right and left sides (Fig. 6d,e and Supplementary Figs. 3–6).
Extended Data Fig. 7

Instability Index calculation

Instability index determination was as described[76,77], using a relative peak height threshold, with modifications. To quantify the levels of instability from GeneMapper traces peak height was used to determine a relative threshold of 20% based upon the main peak in the shorter mode of the control striatum (see points 1 & 2 in the figure). We used a conservative threshold factor (20%) as this detects peaks with good signal intensity and is more resistant to amplification variation than lower thresholds. Lower thresholds (10%, 5%) can provide more sensitive quantification. Peaks falling below his threshold were excluded from analysis. Peak heights were scored (see point 3) and normalized to the total of all peak heights in a given scan (see point 4). Since we are comparing the effect of NA versus saline upon instability in the striatum, the CAG length distribution in tail is not a factor in this comparison, but is for determining absolute instability, as in previous studies[61,65]. So as to facilitate comparison between NA and saline-treated striatum, these were normalized by multiplying the values by the change in CAG length of each peak relative to the highest peak in saline-treated striatum (see point 5), as opposed to the highest peak in the tail, as previously done[76,77]. These normalized values (see point 6) were summed to generate the instability index (see point 7). Striatum analysis for mouse vi is shown as an example R6/2, 6-weeks treated with four injections spanning 4 weeks of saline (red) or NA (blue). Peaks of the main allele in the saline-treated striatum, NA-treated striatum and tail of the same mouse, are indicated by triangle-brackets at the top (see point 1).

NA induced contractions rather than only arresting expansions because the peak repeat length in the NA-treated striatum was shorter than the inherited tract length in the tail of the same mouse, which remains constant throughout the mouse’s lifetime (Fig. 6d,e and Supplementary Figs. 3–6). For a given mouse, the repeat size distributions in the NA-treated half of the striatum shifted towards contraction, relative to the inherited length in the tail, while the saline-treated half continued to incur expansions (Fig. 6b–e and Supplementary Figs. 3–6; see Δ of main peaks where saline- vs. NA-treated vs. tail CAG lengths are reduced by 4–7 CAG units). This suggests that injections of NA into the striatum induced contractions of the expanded CAG tract, and thereby countered the expansion bias in that tissue (showing a spontaneous expansion rate of ~3.5 CAG units/month/cell, calculated as described[76,77]). The effect of NA in vivo was specific for the expanded repeat, having no effect on non-expanded repeats (Fig. 6f and Extended Data Fig. 2). Furthermore, analysis of the mouse Hprt1 sequence in treated male mice did not show significant sequence changes in NA-treated striatum (Extended Data Fig. 3d), indicating that NA is not a general mutagen in vivo, but specific for CAG tracts that are actively unstable. We also performed TUNEL analyses on HD mouse striatum to determine if NA can induce DNA fragmentation and apoptosis[81,82]. Studies with R6/2 HD mice did not show obvious TUNEL signals[83-85]. Neither NA- nor saline-treatment (four administrations spanning four weeks) induced detectable TUNEL staining (Extended Data Fig. 9d). To identify molecular phenotypes caused by NA-induced CAG contractions, we assessed aggregation of mutant HTT protein (mHTT) in the medium spiny neurons (MSNs) of R6/2 mice. mHTT aggregates are a key biomarker of HD: aggregation level and number of MSNs expressing aggregates are directly linked to CAG repeat length and disease progression[86-91]. Furthermore, reductions in mHTT aggregate levels in the striatum of HD mice can increase cell viability, decrease neurodegeneration, and improve behavioral phenotypes[92-97]. The rapid rate of mHTT aggregate turnover (~39 hours[92,95,97]) coupled with the considerable CAG contractions induced by NA over 4 weeks (Fig. 6a) may permit detection of NA-mediated alteration of mHTT aggregate levels. To test this, we assessed the intensity of fluorescent mHTT aggregates and the number of MSN nuclei expressing aggregates in NA-treated and saline-treated striatum of R6/2 mice. The intensity of mHTT aggregates in NA-treated striatum (4 weeks), relative to the contralateral saline-treated striatum, was significantly reduced (P = 0.0002) (Fig. 7a,b). Furthermore, NA-treated striata also showed significant reductions (10–20%) in the number of MSNs expressing mHTT aggregates (P < 0.0001), with aggregates being smaller and less intense (Fig. 7a,c). Since NA does not affect transcription (Extended Data Figs. 4a and 9e) or translation of mHTT (Extended Data Fig. 4b), the reduced mHTT aggregation is most likely due to NA-induced CAG contractions and hence polyglutamine contractions arising from the contracted templates. This interpretation is consistent with the CAG/polyQ length sensitivity of mHTT aggregate formation[86-90,98-100]. Together, these results suggest that NA can disrupt accumulation of mHTT into aggregates. Thus, NA can improve a pathological molecular phenotype by inhibiting formation of a biomarker associated with neurodegeneration and disease progression.
Fig. 7|

NA induces a reduction in mHTT aggregates in R6/2 mice.

The effect of NA on mHTT aggregates in striatal MSNs of R6/2 mice treated with saline (right striata) and 50 μM NA (left striata), the slide indicates saline- and NA-treated halves with red and blue arrowheads, respectively. a, Top middle panel: a representative ×20 magnification epifluorescent image of a whole-brain slice stained for DARPP-32 (green, staining MSNs), mHTT aggregates (red) and counterstained with DAPI (blue). The numerals and associated dotted white squares mark the location of the corresponding representative ×40 magnification confocal images for saline-treated (i, ii, iii) and NA-treated (iv, v, vi) striata. The dotted lines demarcate borders of striatum (DARPP-32-positive cells) with the white dotted lines demarcating borders of striatal cell bodies (used for quantification) and the yellow dotted lines demarcating axons that form striatonigral bundles (not used for quantification). Bottom middle panel: the mHTT (red) channel alone. Scale bars, 19 μm (magnifications) and 500 μm (middle panels). b, Quantification by box plots of the red pixel intensity per striatal area in R6/2 mouse striatum treated with saline (right striata) and 50 μM NA (left striata) obtained via ImageJ quantification of ×20 mHTT/red channel epifluorescent images (4 mice total, 3 slides quantified per mouse, 30 images per mouse, 60–75 cells per image). An unpaired two-tailed t-test was used to compare the intensity of the red pixels in treated versus untreated striata. c, Quantification of the percentage of MSN nuclei containing mHTT aggregates in R6/2 mouse striatum treated with saline (right striata) and 50 μM NA (left striata) obtained via counting of mHTT-positive MSN nuclei in ×40 confocal images (4 mice total, 3 slides quantified per mouse, 10 images quantified per striata, ~50–100 cells per image). An unpaired two-tailed t-test was used to compare the percentage of MSN nuclei containing mHTT aggregates in treated versus untreated striata.

DISCUSSION

Targeting DNA repeats to eliminate or correct the expansion in disease cells is of interest[2,49,101-107], with substantial mechanistic and human data validating somatic instability as a driver of disease age-of-onset, progression and severity[8,9,19,54,55,108,109]. Here we targeted slipped-DNA structures using a small molecule, NA, which induced contractions in vivo in patient cells and in the striatum of HD mice. Targeting this DNA intermediate permitted high specificity for the actively unstable mutant allele and limited off-target effects. There are many paths by which repeat instability can arise, and various ways through which NA may act, all of which involve slipped-DNAs[3,4,13]. Our data show that the effect of NA on CAG instability is independent of proliferation/replication, yet dependent upon transcription through the repeat (Fig. 4d,e), consistent with the reported enhanced CAG instability upon transcription[20]. Involvement of slipped-DNAs in NA’s mechanism of action is supported by their existence in patient cells at disease loci[13], the ability to form slipped-DNAs during transcription across repeats and through R-loop processing[17,21], and NA’s ability to block repair of slipped-DNAs (Fig. 2b). We propose a mechanism by which NA enhances contractions of expanded repeats, while an expansion bias spontaneously arises in the absence of NA (Fig. 8). When HTT is transcribed along the coding CTG DNA strand, the non-coding CAG DNA strand is displaced as single-stranded, which is extended and stabilized through formation of an R-loop. Upon removal of RNA from the R-loop, which can occur in the presence of NA (Extended Data Fig. 3), slipped-DNAs can arise on both strands[17,21]. Evidence suggests that MutSβ facilitates formation of slipped-DNAs by binding to repeats[21,56,58,110], even in the presence of NA (Fig. 5a). To resolve these slipped-DNAs, transcription-coupled repair will be directed to the coding strand[111], which in HTT is the CTG strand, using the displaced non-coding CAG strand as a template for repair. Correct repair of both slip-outs would leave the tract length stable, with removal of the CTG slip-out and retention of the CAG hairpin. Repair of the CTG slip-outs in vitro is less efficient than repair of CAG slip-outs[22,27,56,57,112]; therefore CTG slip-outs will often be retained, leading to an expansion bias. Repair would be facilitated by RPA-melting of slip-outs[62,65] and RPA-enhanced polymerase gap-filling across the repeat[61,62] (Fig. 5d). Thus, in the absence of NA, spontaneous mutations would lead to expansions. In contrast, NA-binding will drive most of the transcriptionally displaced CAG repeats into long hairpins, essentially increasing the size and number of repeat units forming the NA-bound CAG slip-outs, which may be enhanced by MutSβ binding. The poor ability of RPA to melt the NA-bound CAG slip-outs would block the ability of polymerases to gap-fill[113] (Fig. 5c), and ultimately block repair of NA-bound CAG slip-outs, leading to a contraction bias. Since NA bound more CAG repeats than were displaced from the CTG strand, this leads to fewer repeats incorporated in the repair product and hence contractions. That NA targets active repeat expansions is consistent with its increased efficacy in striatum over cultured fibroblasts, where the former displays higher levels of spontaneous expansions. That NA binds to and inhibits repair of long, but not short, CAG slip-outs is consistent with NA preferentially affecting repeats in striatum, which undergo large saltatory expansions (Fig. 6).
Figure 8 |

Schematic of the plausible mechanisms through which NA may induce contractions of expanded CAG tracts.

Schematic of transcription dependent slip-out formation and repair in the presence (green panel) or absence (white panel) of NA. See text for details.

NA efficiently induced contractions of expanded CAG repeats in most striatal cells, a tissue in HD individuals that shows selective neuronal vulnerability. Our findings over a four-week period of NA treatment in mice with ~150 CAG repeats shows that neurons en masse incurred contractions of approximately ~0.5 repeats lost/week[76,77]. Repeated administration led to continued contractions. Extrapolating this to an HD-affected human, we speculate that applying a drug like NA prior to rapid onset of somatic CAG expansions could effectively block expansions and induce contractions of the inherited expanded allele to shorter lengths, where treatments spanning one year could be estimated to contract the repeat by 5–25 repeats. For an HD allele of 36–70 repeats, such changes could be clinically relevant. Mutant HTT-polyQ inclusions are biomarkers of Huntington’s disease, where inclusion size and numbers of neurons expressing inclusions are directly linked to CAG repeat length and disease progression[86-91]. Reducing levels of mHTT-polyQ aggregates in striatum of HD mice can increase cell viability, decrease neurodegeneration, and improve behavioral phenotypes[92-97]. Complete ablation of somatic CAG expansions (but no induced contractions) caused by genetic knock-out of MSH2, MSH3, or MLH1 delayed the detectable formation of mHTT-polyQ aggregates[114,115]. This effect was detectable after repair-deficient mice with (CAG)110–117 had lived 5–10 months without any somatic CAG expansions. Our treatment of mice with NA for only 4 weeks was able to induce CAG contractions, and reduced both the size and frequency of mHTT-aggregates in medium spiny neurons (Fig. 7), the cells in which NA is most effective in reducing CAG expansions (Fig. 6a). Other approaches have been used to target expanded repeats in vivo. Targeting the HD locus has a history that predates discovery of the HTT gene or the disease-causing mutation[116]. CRISPR/Cas9 targeting of the mutant HTT allele in HD cells deleted ~44 kb DNA spanning the promoter region, transcription start site, and CAG expansion, resulting in haploinsufficiency with a functional non-mutant allele[110]. A study using the CRISPR/Cas9 nickase that produced single strand nicks within a transgenic repeat led to contractions of the repeat in an artificial selectable cell model[105]. Extension to an endogenous disease locus in cells or relevant tissues has yet to be demonstrated. CRISPR/Cas9 targeting only the expanded repeats at a disease locus proved not possible due to the absence of suitable PAM sequences and a likelihood of targeting other CAG/CTG repeats elsewhere in the genome. Inactivation of mutant HTT resulting in haploinsufficiency is possible[102-104]. Preferential targeting of fully duplexed expanded repeats by zinc-finger nucleases[117] or zinc-finger repressors[118] has been demonstrated. A small molecule approach may overcome some of the in vivo hurdles of enzyme-mediated paths (delivery, editing efficiency, persistent nuclease/off-targets, immune response, haploinsufficiency, etc.)[107,119-121]. Previous studies using cell models of CAG/CTG instability demonstrated that exogenously added compounds, including DNA damaging agents, can modulate repeat instability[122]. However, these compounds lack specificity for expanded repeats, and would induce mutations in non-mutant alleles and throughout the genome[123]. In a separate strategy, a (CAG)6 antisense oligonucleotide was able to reduce CAG expansions, but did not induce contractions of an expanded repeat[49]. A mitochondria-directed compound, XJB-5–131, by unknown processes, mildly suppressed CAG expansions rather than inducing contractions[124]. Similarly, an inhibitor of histone deacetylase 3 suppressed somatic CAG expansions through unknown processes[106]. A sequence-specific polyamide directed to the fully paired duplex GAA/TTC repeat that is expanded in Friedreich’s ataxia patients prevented triple-stranded structure formation and suppressed GAA repeat expansions in FRDA cells, but did not induce contractions[107]. The earliest non-enzymatic chemical approaches to target the HD locus aimed to modulate gene regulation using ligands targeted to fully paired duplex DNAs[116,125]. In contrast, NA, through rational design, targets the unusual DNA structures formed by CAG repeats that are actively unstable, and can induce en masse repeat contractions rather than only suppressing expansions. Moreover, NA is specific for the mutant HD allele in a disease vulnerable brain region. These attributes of NA – its specificity for CAG repeat slipped-DNA structures, its preferential effect on disease-causing expanded repeats, and its ability to induce contractions in vivo in an affected brain region – make it a first-in-class example showing the potential of small molecule DNA-binding compounds to impact somatic CAG repeat instability. Administration of such small molecules, once optimized for therapy, to human brains might effectively target the root cause of repeat expansion diseases and address downstream deleterious effects.

METHODS

NA synthesis is described in the Supplementary Note.

Cell culture.

Construction of the HT1080–(CAG)850 cell model was previously described[20]. The HT1080 model cells, the HD primary fibroblasts, GM09197 (Coriell Biorepository) with (CAG)180/21[5], and the HD primary fibroblasts, GM02191 (Coriell Biorepository) with (CAG)43/19, were cultured at 37 °C with 5% CO2 in DMEM supplemented with 10% fetal bovine serum. Cells were treated with or without continuous exposure to 50 μM NA for 30 days (HT1080 cells) or for 40 days (primary fibroblasts). WST-1 assay was performed according to the manufacturer’s instructions (Roche). All cell lines were tested to be mycoplasma-free. We performed three independent experiments for each cell line. Proliferation of HT1080-(CAG)850 cells was inhibited by treatment with 0.5 μM Palbociclib, as described[126]. Degree of proliferation arrest was assessed in living cells by counting BrdU-positive cells after 24 h incubation with BrdU in proliferating or Palbociclib-treated HT1080-(CAG)850 cells. Sustained knockdown of MSH3 by siRNA was as described[69].

Microscopy.

HT1080–(CAG)850 cells were incubated with 50 μM NBD-labeled NA for 48 h and Cell Light Plasma Membrane-RFP, BacMam 2.0 (Life Technologies, Cat # C10608), then fixed for 15 min at room temperature with 4% paraformaldehyde and washed twice for 10 min in PBS. Cells were mounted with Vectashield hard-set mounting media that contains DAPI (Vector Laboratories, Cat #: H-1500). Fluorescence images were obtained using Olympus FV1000D confocal laser scanning microscope (Olympus).

Repeat length analysis.

The (CTG)•(CAG) repeats were sized by small-pool PCR (spPCR) with the input of 1.4–1.7 genome equivalents, followed by Southern blot[20]. Repeat size differences in models is at most 3,000 bp; therefore, being strictly conservative, a bias to amplifying shorter alleles can be possible even under optimized PCR conditions in our study. For HD primary fibroblasts, spPCR and Southern blots were performed as described[20] where DNA was diluted to 1.0–1.6 genome equivalents. Amplified products (PCR primers, Supplementary Table 2) were detected by Southern blot using a DIG-labeled (CAG)7 locked nucleic acid probe[127]. We analyzed at least 50 alleles for each of the three experiment (more than 150 alleles/experiment). Repeat analyses are summarized in Supplementary Table 1. The “% of repeat population” was calculated by determining the proportion of >50 individual spPCR reactions across the CAG repeat tract that harbored a certain size of repeat product[18]. A χ2-test was performed to compare the frequencies of expanded, unchanged, and contracted alleles in each set of experiments[20,49]. The trinucleotide and dinucleotide tract lengths of the HTT, CASK, ATXN8, Mfd15, TBP, and mouse Mapkap1, Fgd4, and Tbp loci were PCR amplified (primers listed in Supplementary Table 2) following amplification conditions described elsewhere[30,31,128-132]. Human TBP CAG length variability was assessed by spPCR.

mRNA quantification.

RNA was harvested using the RNeasy Plus Micro Kit (Qiagen). Total RNA was primed with random hexamers and reverse transcribed with Superscript III (Life Technologies), followed by RNAseH-treatment. Quantitative reverse transcription (RT)–PCR was performed using TaqMan Gene Expression assays or PrimeTime qPCR assays on an ABI PRISM 7900HT (Life Technologies). Level of transgene-derived mRNA was normalized to 18S rRNA. Results were statistically analyzed using paired t-test. Primer sequences are listed in Supplementary Table 2.

Western blot analysis.

Lysates were prepared from HD patient fibroblasts in RIPA buffer. 20 μg of lysate was electrophoresed at 100 V for 4 h at room temperature in 1x MOPS buffer (Thermo Fisher Scientific, Cat #NP0001) in 3–8% Tris-Acetate pre-cast protein gel (Invitrogen, Cat #EA0375BOX), transferred to PVDF Western Blotting Membranes (Sigma-Aldrich, Cat #3010040001) overnight at 4 °C at a constant voltage of 20 V. Membranes were blocked in 10% milk in TBS + 0.1% Tween-20 (TBST), incubated with primary antibody at room temperature for 2 h, and with secondary antibody at room temperature for 1 h, and then detected with ECL (GE Healthcare, Cat #RPN2232). Primary antibodies: Anti-Huntingtin Protein Antibody a.a. 181–810 Clone 1HU-4C8 (1:1,000, mouse) (Millipore Sigma, Cat #MAB2166), Anti-Actin Protein Antibody (C4 clone, 1:30,000, mouse) (BD Transduction, Cat #612657). Secondary antibodies: Peroxidase-AffiniPure Sheep Anti-Mouse IgG H+L (1:2,000) (Cedarlane Labs, Cat #515-035-062).

NA-DNA binding.

Homoduplex slipped structures of 50 repeats and heteroduplex with long (CAG)20 or (CTG)20 slip-outs were formed by alkaline denaturation/renaturation, as described[16]. Briefly, plasmids containing human DM1 genomic (CTG)n•(CAG)n repeats (n = 30 or 50) were linearized by HindIII digestion. DNAs of (CAG)50 and (CTG)30 repeats, or DNAs of (CAG)30 and (CTG)50 repeats, were mixed in equimolar amounts and then heteroduplexed by alkaline denaturation/renaturation. Repeat-containing fragments were released by EcoRI digestion and electrophoretically resolved on a 4% polyacrylamide gel. Gel-purified fragments were radiolabeled with [α−32P]dNTPs on both strands by fill-in reaction, radioactivity of each structure was determined using Cerenkov counting, and an equivalent radioactive concentration of each structure was incubated with increasing concentration of NA for 30 min at room temperature with 1x hypotonic buffer. Binding products were resolved by electrophoresis on a 4% (w/v) polyacrylamide gel in 1x TBE buffer at a constant 200 V for 2.5 h.

Replication assay.

In vitro replication templates containing (CTG)79•(CAG)79 were previously described[133,134]. The SV40-origin of bidirectional replication was placed 103 or 98 bp 5’ or 3’ of the CAG repeat for pDM79EF and pDM79HF, respectively. These templates and an SV40-ori template with no-repeats (pKN16) were replicated in vitro by HeLa cell extract adding [α−32P]dCTP and T-Antigen, as described[133,134]. Replication was performed in presence or absence of NA (7.5 μM or 15 μM). Radioactive replication products were purified, linearized (BamHI) and treated with DpnI. Equal amounts of unreplicated pKN16 DNA was treated with DpnI to show the complete digestion of unreplicated plasmid DNA. Equal quantities of reaction products were resolved by electrophoresis on a 15-cm 1% agarose gel run for 16 h at 4 V/cm in TBE buffer, dried, and exposed to Kodak film.

Repair assay.

To determine the effect of NA binding upon slipped-DNA repair, a series of circular slipped-DNA substrates were prepared with an excess of repeats with a nick located 5′ or 3′ of the slip-out[22,56]. G-T mismatched substrate was prepared as described[22,56]. All DNA substrates were processed in vitro by HeLa extracts. Analysis of repair products was performed by Southern blotting, comparing with starting material. Each repair assay has been performed in three independent experiments.

R-loop formation and processing.

Plasmids bearing (CAG)79•(CTG)79 tract with convergent T3 and T7 RNA polymerase promoters were described[17,135]. Transcription reactions were performed as described[17]. Briefly, 500 ng of template DNA in 1x transcription buffer (Roche) and 1x bovine serum albumin (New England Biolabs) were mixed for 1 h with 20 U of the appropriate RNA polymerase: T7, T3 or T7+T3 (Roche), with or without NA 120 μM. Products were purified and treated with either 1 g of RNaseA (Roche) alone or mixed with 1 U of E. coli RNaseH (Roche), at room temperature for 30 min, in presence or absence of NA 120 μM. All in vitro transcription reaction products were analyzed on 1% agarose gels run in 1x TBE buffer at 80 V for 3 h. Gel-borne products were visualized with ethidium bromide (0.5 mg/ml) under ultraviolet (UV) light. R-loop templates prepared from in vitro transcription and RNaseA treatments were incubated with NA and then processed by extracts of HeLa or SH-SY5Y neuroblastoma cells, where the latter were terminally differentiated by retinoic acid[56]. Nucleic acids were purified and transformed into bacteria for STRIP analysis as described[17]. Briefly, products of human cell extract processing were transformed into E. coli XL1-MutS (Agilent). Individual bacterial colonies (each representing one processed template) were picked and cultured for 6 h. Magnitudes of repeat length changes were determined by electrophoretic sizing of the repeat-containing fragments on 4% polyacrylamide gels relative to the starting length material and a size marker. Data are derived from three independent in vitro transcription and human cell extract processing reactions with ~150 colonies (~50 colonies per replicate) representing 150 individual products of cell extract treatment analyzed for each R-loop configuration. Individual experiments were compared using the χ2-test.

MutSβ binding assay.

MutSβ was purified from baculovirus-infected Sf9 cells expressing His-tagged hMSH2 and hMSH3 as described[22]. Binding reactions were performed at room temperature. Slipped-DNA with long (CAG)20 was prepared and end-labeled as described above. Protein was incubated with DNA for 30 min in a buffer containing 10 mM HEPES-KOH pH 7.5, 110 mM KCl, 1 mM EDTA, and 1 mM DTT with or without ATP in the buffer, as indicated. Reactions were loaded onto a 4% native polyacrylamide gel with non-denaturing loading dye (20 mM Tris-HCl pH 7.4, 4% glycerol, bromophenol blue). Gel was run in 1x TBE buffer for 2 h.

Mung bean nuclease footprinting.

Experimental conditions used were single-hit kinetics, as described[16].

RPA DNA-binding.

Recombinant RPA protein was expressed in BL21(DE3) cells and binding was performed as described[136].

Polymerase extension assay.

Recombinant human polymerase δ (polδ) was prepared in insect cells using a baculovirus vector and purified by immunoaffinity column chromatography, as described[137]. Polδ extension assay was performed as described[138], using an oligo containing (CAG)10 repeats, as described[24]. Briefly, 0.1 μM primer and 0.1 μM oligo were denatured at 95 °C for 3 min, annealed for 30 min at room temperature, and incubated with NA for 30 min at room temperature. RPA and/or polδ were added and the reactions was started by adding 0.1 mM dNTPs in 10 μl reaction volume, and incubated at 37 °C for 15 min. The reaction was stopped by adding 20 mM EDTA and purified by extraction with phenol/chloroform/isoamyl alcohol (25:24:1, v/v/v) followed by ethanol precipitation. Pellets were resuspended in formamide buffer, denatured at 95 °C for 10 min, and run on a 6% sequencing gel at 2,000 V and 90 W for 40 min.

Stereotaxic injections into R6/2 mice.

Mouse handling and experimental procedures were conducted in accordance with the Osaka University guidelines for the welfare of animals. A single NA application involved six separate stereotactic injections (three injections of NA or saline into three different striatal regions of either the left or right striatum, respectively). Under sterile conditions, six male R6/2 mice (B6CBA-Tg(HDexon1)62Gpb/1J, Jackson Laboratory, cat# 002810), 6-week-old, were anesthetized with 50 mg/kg pentobarbital sodium, and stereotaxically injected with 5 μL of saline (PBS solution) (right side) or 500 μM NA dissolved in saline (left side). The control for NA-treatment is therefore the contralateral side of the striatum of each mouse. Mice received injections once or twice bi-weekly, or weekly for 4 consecutive weeks. Stereotaxic injections were delivered to three sites within the striatum with the following coordinates: (anterior-posterior (AP) = 0.0 mm, medial-lateral (ML) = 1.5 mm from bregma, dorsal-ventral (DV) = 2.5 mm below the dural surface; AP = 1.0 mm, ML = 1.5 mm, DV = 2.5 mm; and AP = 0.5 mm, ML = 1.5 mm, DV = 2.5 mm), using a 10-μl Hamilton microsyringe at a rate of 0.5-μL/min. Both right and left striatum were assessed for repeat length at the HD CAG transgene and at endogenous CAG tracts. For the mice treated four times, DNAs from the tail prior to and following NA administration, as well as left and right frontal cortex and cerebellum, were harvested. Coronal sections of the striatum were cut serially at 20 μm thickness using a cryostat (CM1850UV; Leica Microsystems). Striatum sections of 0.62, 1.10, and 1.58 mm posterior from bregma were washed in PBS containing 0.05% Triton X-100 (PBS-T). They were then immersed in a solution of 3% H2O2, 10% methanol in PBS for 10 min. After washing 3 times with PBS-T, the sections were incubated with primary antibody (mouse anti-NeuN, 1:100, Millipore Bedford, MA; rabbit anti-doublecortin, 1:200, Abcam,) diluted in blocking solution of 10% Block-Ace blocking solution (Yukijirushi Co.) overnight at 4 °C and then with biotinylated secondary antibody (1:500, VECTOR Laboratories) in PBS-T for 1 h at room temperature followed by VECTASTAIN ABC regents (1:100, VECTOR Laboratories) in PBS for 1 h at room temperature. The sections were washed 3 times with PBS-T in between the steps. The bound complex was visualized with 3,3’-diaminobenzidine (Sigma).

Genescan analysis.

At 4 weeks after the first injection, DNA was isolated from mouse brain tissue as described[127]. PCR was performed as described previously[139], and PCR products were sized on an ABI310 Gene Analyser using GENESCAN 3.1 software (Life Technologies).

Instability Index calculation.

Instability index was calculated as described[76,77] with modifications as outlined in Extended Data Figure 7.

Immunofluorescence.

Medium spiny neuron (MSN) specific mutant HTT aggregates were assessed using a multiplexed immunofluorescence approach staining for HTT protein (EM48 antibody, cat.# MAB5374, Millipore-Sigma) and anti-DARPP-32 (19A3 antibody, cat.# 2306S, New England Biolabs - expressed almost exclusively by striatal medium spiny neurons), as described[140]. After rehydration, slides were subjected to antigen retrieval in 10 mM sodium citrate for 20 min in a steamer and cooled to room temperature for 1 h. Slides were washed twice in 1x PBS + 0.05% Tween-20 (1x PBST) (2 min, 2 times). Slides were blocked in 10% normal goat serum for 1 h at room temperature and incubated with primary antibody overnight at 4 °C. Secondary antibody incubation was performed for 1 h at room temperature. Slides were mounted with Hardset VectaShield with DAPI. Aggregates were then blindly assessed in two ways: Whole striatal assessment of red pixel intensity (mHTT aggregates) via ImageJ. Intensity was measured from 20x images obtained via a 3DHistech Pannoramic 250 Flash II Slide Scanner. Each individual striatal intensity per slice was normalized to the total striatal intensity of the slice and then plotted. Three separate brain slices were assessed per mouse, assessing a total of 4 mice (each treated 4 times over 4 weeks). Counting MSN nuclei containing mHTT aggregates from confocal images obtained from a Quorum Spinning Disk Confocal (Olympus IX81 microscope). Ten 40x (water) confocal images were taken at 10 different locations per striatum with ~100 cells per image (~1,000 cells per striatum, or ~2,000 cells total per brain slice) being assessed. 3 striatal slices were assessed per mouse (4 mice total, ~3,000 cells assessed per striatum per mouse). Percentages of cells containing mHTT aggregates were taken relative to the total number of cells in the image.

Whole-genome sequencing, alignment, and variant calling.

WGS was performed using established protocols on Illumina instruments and paired-end FASTQ files were aligned to the human genome (hg19/GRCh37) using BWA-MEM (v.0.7.8), with Picard MarkDuplicates (v.1.108) being used to mark PCR duplicates. Indel realignment and base quality scores were recalibrated using the Genome Analysis Toolkit (v.2.8.1). Somatic mutations were detected using MuTect2 (part of GATK v.3.5)[141]. Mutations present only in treated cells (as opposed to non-treated cells) were retained. For substitutions, we also removed common single-nucleotide polymorphisms (SNPs) as described[44]. To remove common germline variants, we used an in-house panel of controls and removed any putative substitutions present in ≥ 2 controls. Putative substitutions were also removed if they were overlapped with a highly repetitive sequence (using DUST[142] with score > 60) or were located in excessively high-depth alignments in difficult to align regions of the genome, as described[143]. We apply a maximum depth threshold of d+4*sqrt(d), where d is the average normal mean read depth of the chromosome in the corresponding untreated cell. Mutational signatures shown in COSMIC, were screened for using a further elaborated version (v0.0.5.75[40]) of an established nonnegative matrix factorization-based method, SigProfiler[144-146]. This framework works in two steps: (1) de novo extraction of signatures on the catalogue of substitutions using non-negative matrix factorization approach; and (2) comparing deciphered de novo signatures to a set of previously described mutational signatures (COSMIC Mutational Signatures v3) using cosine similarity to identify the etiology of the underlying mutagenic process. It is common practice to only consider mutational signatures with >100 mutations as reliable.

HPRT1 sequence analysis by SMRT-CCS sequencing.

DNAs were extracted from three independent NA- or saline-treatments of the HD primary fibroblasts (GM09197) and three independent NA-treated and control striata of three R6/2 mice. Details of the sequencing results are described in the Supplementary Note. Mutation rates were not significantly different between NA- and saline- sequences (two-sample Kolmogorov-Smirnov test).

Microsatellite instability assay.

Microsatellite instability was assessed as per Gallon et al.[32]. Briefly, >20 mononucleotide repeats were amplified from 50 ng of template DNA using single molecule molecular inversion probes, and sequenced to a median (Q1-Q3) depth of 1,007 (733–1,515) reads/marker/sample using a MiSeq (Illumina). Sequencing reads were aligned to the human reference genome (build hg19) using BWA[147]. Frequencies of microsatellite length variants were extracted for each marker in each sample, and converted to a probability by comparison to a reference distribution generated from 40 microsatellite stable, peripheral blood leukocyte controls. A sample score is calculated by combining probabilities from each marker using Fisher’s method, and multiplying the decadic logarithm of this combined probability by −1. Scores >1.3 are therefore equivalent to a >95% probability that the observed frequency of microsatellite length variants for a sample is greater than the reference set, and is used as a classification threshold. Residual samples from Gallon et al.[32], comprising peripheral blood leukocyte DNA of 3 constitutional mismatch repair deficiency and 8 control patients, were used as controls.

TUNEL assay.

TUNEL assay was conducted using the Click-iT™ Plus TUNEL Assay for In Situ Apoptosis Detection, Alexa Fluor™ 488 dye (Invitrogen, cat #: C10617), multiplexed with anti-DARPP-32 (19A3 antibody, cat.# 2306S, New England Biolabs), as outlined by the manufacturer.

DATA AVAILABILITY STATEMENT

Raw sequencing data have been deposited at the Sequence Read Archive (SRA) under accession number SUB6615083.

NA does not affect replication efficiency or replication fork progression

Three circular plasmids containing the SV40 origin of replication, and an expanded (CAG)79•(CTG)79 repeat tract (pDM79EF and pDM79HF) or no repeats (pKN16), were replicated in vitro by human (HeLa) cell extracts without or with NA (7.5 μM or 15 μM) treatment. The location of SV40-ori determines the replication direction and which strand will be used as the leading or the lagging strand template. pDM79HF uses the CAG strand as the lagging strand template, while pDM79EF uses the CTG strand as the lagging strand template (schematic on the top of the gel panel). Replication products were purified and linearized with BamHI. An equal portion of the reaction material was also digested with BamHI and DpnI as DpnI digests un-replicated and partially-replicated material, as shown in the schematic (top figure). The digestion products were electrophoresed on a 1% agarose gel to resolve completely replicated and un-replicated material (bottom figure). Equal amount of unreplicated plasmid DNA was digested with DpnI and stained with Ethidium Bromide to show the complete digestion of unreplicated plasmid DNA (Bottom panel). Panel I, ethidium bromide stained, Panel II, autorad: marker (lane 1); DpnI undigested plasmid DNA (lane 2); DpnI digested unreplicated plasmid DNA (lane 3–4); replicated plasmid DNA, DpnI resistant (lane 5). No difference in DpnI resistant material is observed between replication in the presence or absence of NA, in all the three templates tested (panel III, IV, V). Blots have been cropped and the corresponding full blots are available in the Source Data files.

NA does not affect non-mutant genetically stable repeats

a,b, Representative data showing small-pool PCR for the non-expanded CAG/CTG repeat length of CASK and Mdf15 in HD primary fibroblast cells (a) and small-pool PCR for the non-expanded CAG/CTG repeat length of CASK, Mdf15, ATXN8 and the non-expanded HTT allele genes in HT1080–(CAG)850 cells (b). Even under the sensitive mutation detection capacity of spPCR, length variation was not observed in either NA treated- and untreated-cells. Notably, some reactions did not show any product as is typical of the low genomic DNA template dilutions used in small-pool PCR. c, Repeat-tract lengths of the CASK, ATXN8, and Mfd15 loci in HT1080–(CAG)850 cells (initial clone and cells after 30 days incubation with or without NA). Length variation was not observed at any of these repeats of normal length loci in HT1080-(CAG)850 cells (after 30 days incubation with or without NA). Three independent experiments were performed. d, Small-pool PCR for the non-expanded CAG tracts in TBP alleles in HD patient fibroblasts treated with or without NA for 40 days. e, Small-pool PCR for the non-expanded CAG tracts in TBP alleles in HD R6/2 mouse striatum with four injections of NA or saline. f, Microsatellite instability assay. Assay scores >1.3 indicate increased MSI relative to a control sample set from peripheral blood leukocytes. Both NA positive and NA negative HD cells with (CAG)43 or (CAG)180 scored <1.3, indicating no effect of treatment on MSI. 8 known CMMRD-negative controls and 3 known CMMRD-positive controls were included in the assay.

NA is not a general mutagen

Towards assessing whether NA-treatment acted as a general mutagen to sequences other than CAG slip-outs, we harnessed the high read accuracy and depth of single molecule, real time, circular consensus sequencing (SMRT-CCS). Single-molecule sequencing was done on the HPRT1 gene – widely used as a surrogate indicator of the global effect of induced genetic variation. For each replicate, we calculated the relative mutation rate between NA- and saline-treated cells as the mutation rate for NA-treated cells minus the rate for saline-treated cells and identified excess mutation rates based on an absolute relative rate >0.5%. a, Schematic of HPRT1 sequencing for mutation detection. Briefly, cells were grown under identical conditions differing only by the addition of NA (50 μM) or saline, DNAs were isolated, HPRT1 exons 2 and 3 PCR amplified and sequenced. b, Quality control for our analysis. c,d, Comparison of sequence variations between NA-treated and saline treated is presented. We chose to compare the single-molecule sequence reads of individual X chromosome-linked HPRT1 alleles (exons 2 and 3) from our male HD patient-derived cells (c), and our male R6/2 mice (d), thsat had been NA- or saline-treated. Each read represents a single cell (Supplementary Note). Graphs show the distribution of sequence variants by relative mutation rate between three experimental replicates of NA-treated and saline-treated cells sequenced with PacBio single-molecule long reads.

NA does not affect HTT transcription or translation

a, NA does not affect transcription across expanded repeats in HTT in HD patient cells, determined by quantitative real-time reverse transcriptase (qRT)-PCR and normalized to U6 RNA. Data are indicated as the mean ± s.d. of independent triplicates. b, Western blot showing that NA does not affect HTT translation in HD patient cells with (CAG)43. Western blots were repeated 4 times with similar results. Blots have been cropped and the corresponding full blots are available in the Source Data files.

NA induces contractions during R-loops processing.

a, Schematic of R-loop formation, processing, and analysis. Pre-formed double-R-loops were processed by terminally differentiated (retinoic acid) human neuron-like cell extracts (SH-SY5Y) in the absence or presence of NA (50 μM), as described and DNA repeat lengths scored as expansions, contractions, or stable, by the STRIP assay (Methods). b, Representative example of STRIP analysis. Transcription products were isolated, processed and transformed in E. coli cells, previously shown to stably maintain the (CAG)79•(CTG)79 lengths (Methods). Plasmids isolated from individual bacterial colonies were digested with restriction enzymes to release the repeat containing fragment, resolved on 4% polyacrylamide gels and scored for instability. c, Graphical analysis of STRIP results. Two-sided χ2 test was performed to compare 191 untreated colonies vs. 100 NA-treated colonies.

Dosing regimen

A single drug administration involved six separate stereotactic injections (three injections of drug in saline or saline into three different striatal regions of either the left or right striatum, respectively). At the onset mice were 6-weeks old.

Instability Index calculation

Instability index determination was as described[76,77], using a relative peak height threshold, with modifications. To quantify the levels of instability from GeneMapper traces peak height was used to determine a relative threshold of 20% based upon the main peak in the shorter mode of the control striatum (see points 1 & 2 in the figure). We used a conservative threshold factor (20%) as this detects peaks with good signal intensity and is more resistant to amplification variation than lower thresholds. Lower thresholds (10%, 5%) can provide more sensitive quantification. Peaks falling below his threshold were excluded from analysis. Peak heights were scored (see point 3) and normalized to the total of all peak heights in a given scan (see point 4). Since we are comparing the effect of NA versus saline upon instability in the striatum, the CAG length distribution in tail is not a factor in this comparison, but is for determining absolute instability, as in previous studies[61,65]. So as to facilitate comparison between NA and saline-treated striatum, these were normalized by multiplying the values by the change in CAG length of each peak relative to the highest peak in saline-treated striatum (see point 5), as opposed to the highest peak in the tail, as previously done[76,77]. These normalized values (see point 6) were summed to generate the instability index (see point 7). Striatum analysis for mouse vi is shown as an example R6/2, 6-weeks treated with four injections spanning 4 weeks of saline (red) or NA (blue). Peaks of the main allele in the saline-treated striatum, NA-treated striatum and tail of the same mouse, are indicated by triangle-brackets at the top (see point 1).

A total of ten HD mice revealed consistent NA-induced contractions of expanded CAG repeats

Instability Indices in striatum of ten mice (iv-xiii) treated four times with saline in the right striatum and NA in the left striatum. Indices in NA-treated striatum were significantly different from the control saline-treated striatum (Mann-Whitney, P = 0.00035). Instability Indices for mouse v and xi are positive for both NA and saline as there are less data points to the left of the highest peak compared to the points to the right. Still, after NA treatment there is a reduction in the index.

NA does not induce cell death in the CNS and cell proliferation, and does not affect transcription across the Htt locus

a, Histological study, mouse striatum with saline, NA in saline, or no injection, followed by H&E staining. Three independent experiments were performed. b, NeuN staining showing that NA does not induce cell death. Quantification of NeuN positive cells below. Data are indicated as mean ± s.d. of triplicates. c, Doublecortin staining showing that NA does not induce cell proliferation. Three independent experiments were performed. d, The effect of NA on TUNEL signal as assessed via fluorescent microscopy and immunohistochemistry. Representative 40x magnification confocal images of striatal medium spiny neurons (MSNs) of R6/2 mice treated with sale (left striata) and 50 μM NA (right striata) stained for TUNEL (red, staining apoptotic cells), and DARPP32 (green, staining MSNs). Panel locations (i-vi) correspond to the locations outlined in Figure 7 (middle panels). e, NA does not affect transcription across expanded repeats in HTT in HD patient cells and mouse striatum, determined by quantitative real-time reverse transcriptase (qRT)-PCR and normalized to U6 RNA, expressed as the ratio of NA-treated vs. PBS-treated R6/2 striatum. Data are indicated as mean ± s.d. of independent triplicates.
  139 in total

1.  Somatic instability of the expanded CTG triplet repeat in myotonic dystrophy type 1 is a heritable quantitative trait and modifier of disease severity.

Authors:  Fernando Morales; Jillian M Couto; Catherine F Higham; Grant Hogg; Patricia Cuenca; Claudia Braida; Richard H Wilson; Berit Adam; Gerardo del Valle; Roberto Brian; Mauricio Sittenfeld; Tetsuo Ashizawa; Alison Wilcox; Douglas E Wilcox; Darren G Monckton
Journal:  Hum Mol Genet       Date:  2012-05-16       Impact factor: 6.150

Review 2.  Repeat instability: mechanisms of dynamic mutations.

Authors:  Christopher E Pearson; Kerrie Nichol Edamura; John D Cleary
Journal:  Nat Rev Genet       Date:  2005-10       Impact factor: 53.242

Review 3.  Molecular genetics: unmasking polyglutamine triggers in neurodegenerative disease.

Authors:  J F Gusella; M E MacDonald
Journal:  Nat Rev Neurosci       Date:  2000-11       Impact factor: 34.870

Review 4.  Repeat instability as the basis for human diseases and as a potential target for therapy.

Authors:  Arturo López Castel; John D Cleary; Christopher E Pearson
Journal:  Nat Rev Mol Cell Biol       Date:  2010-03       Impact factor: 94.444

5.  Identification of an HD patient with a (CAG)180 repeat expansion and the propagation of highly expanded CAG repeats in lambda phage.

Authors:  K Sathasivam; I Amaechi; L Mangiarini; G Bates
Journal:  Hum Genet       Date:  1997-05       Impact factor: 4.132

6.  Age, CAG repeat length, and clinical progression in Huntington's disease.

Authors:  Adam Rosenblatt; Brahma V Kumar; Alisa Mo; Claire S Welsh; Russell L Margolis; Christopher A Ross
Journal:  Mov Disord       Date:  2011-12-15       Impact factor: 10.338

7.  Dramatic tissue-specific mutation length increases are an early molecular event in Huntington disease pathogenesis.

Authors:  Laura Kennedy; Elizabeth Evans; Chiung-Mei Chen; Lyndsey Craven; Peter J Detloff; Margaret Ennis; Peggy F Shelbourne
Journal:  Hum Mol Genet       Date:  2003-10-21       Impact factor: 6.150

Review 8.  Expandable DNA repeats and human disease.

Authors:  Sergei M Mirkin
Journal:  Nature       Date:  2007-06-21       Impact factor: 49.962

9.  Somatic expansion of the Huntington's disease CAG repeat in the brain is associated with an earlier age of disease onset.

Authors:  Meera Swami; Audrey E Hendricks; Tammy Gillis; Tiffany Massood; Jayalakshmi Mysore; Richard H Myers; Vanessa C Wheeler
Journal:  Hum Mol Genet       Date:  2009-05-23       Impact factor: 6.150

10.  DNA repair pathways underlie a common genetic mechanism modulating onset in polyglutamine diseases.

Authors:  Conceição Bettencourt; Davina Hensman-Moss; Michael Flower; Sarah Wiethoff; Alexis Brice; Cyril Goizet; Giovanni Stevanin; Georgios Koutsis; Georgia Karadima; Marios Panas; Petra Yescas-Gómez; Lizbeth Esmeralda García-Velázquez; María Elisa Alonso-Vilatela; Manuela Lima; Mafalda Raposo; Bryan Traynor; Mary Sweeney; Nicholas Wood; Paola Giunti; Alexandra Durr; Peter Holmans; Henry Houlden; Sarah J Tabrizi; Lesley Jones
Journal:  Ann Neurol       Date:  2016-05-06       Impact factor: 10.422

View more
  31 in total

Review 1.  Modifiers of CAG/CTG Repeat Instability: Insights from Mammalian Models.

Authors:  Vanessa C Wheeler; Vincent Dion
Journal:  J Huntingtons Dis       Date:  2021

Review 2.  On the wrong DNA track: Molecular mechanisms of repeat-mediated genome instability.

Authors:  Alexandra N Khristich; Sergei M Mirkin
Journal:  J Biol Chem       Date:  2020-02-14       Impact factor: 5.157

3.  Structural Basis for Targeting T:T Mismatch with Triaminotriazine-Acridine Conjugate Induces a U-Shaped Head-to-Head Four-Way Junction in CTG Repeat DNA.

Authors:  Ching-Ming Chien; Pei-Ching Wu; Roshan Satange; Cheng-Chung Chang; Zi-Lun Lai; Lauren D Hagler; Steven C Zimmerman; Ming-Hon Hou
Journal:  J Am Chem Soc       Date:  2020-06-16       Impact factor: 15.419

4.  Event-related potentials using the auditory novel paradigm in patients with myotonic dystrophy.

Authors:  Shugo Suwazono; Hiroshi Arao; Yukihiko Ueda; Shino Maedou
Journal:  J Neurol       Date:  2021-02-20       Impact factor: 4.849

Review 5.  Unnatural bases for recognition of noncoding nucleic acid interfaces.

Authors:  Shiqin Miao; Yufeng Liang; Sarah Rundell; Debmalya Bhunia; Shekar Devari; Oliver Munyaradzi; Dennis Bong
Journal:  Biopolymers       Date:  2020-09-24       Impact factor: 2.505

6.  Repeat DNA expands our understanding of autism spectrum disorder.

Authors:  Anthony J Hannan
Journal:  Nature       Date:  2021-01       Impact factor: 49.962

Review 7.  Structure-forming repeats and their impact on genome stability.

Authors:  Rebecca E Brown; Catherine H Freudenreich
Journal:  Curr Opin Genet Dev       Date:  2020-12-03       Impact factor: 5.578

8.  Propensity for somatic expansion increases over the course of life in Huntington disease.

Authors:  Radhia Kacher; François-Xavier Lejeune; Sandrine Noël; Cécile Cazeneuve; Alexis Brice; Sandrine Humbert; Alexandra Durr
Journal:  Elife       Date:  2021-05-13       Impact factor: 8.140

Review 9.  Alternative DNA Structures In Vivo: Molecular Evidence and Remaining Questions.

Authors:  Lucie Poggi; Guy-Franck Richard
Journal:  Microbiol Mol Biol Rev       Date:  2020-12-23       Impact factor: 11.056

10.  FAN1-MLH1 interaction affects repair of DNA interstrand cross-links and slipped-CAG/CTG repeats.

Authors:  Antonio Porro; Mohiuddin Mohiuddin; Christina Zurfluh; Vincent Spegg; Jingqi Dai; Florence Iehl; Virginie Ropars; Giulio Collotta; Keri M Fishwick; Nour L Mozaffari; Raphaël Guérois; Josef Jiricny; Matthias Altmeyer; Jean-Baptiste Charbonnier; Christopher E Pearson; Alessandro A Sartori
Journal:  Sci Adv       Date:  2021-07-30       Impact factor: 14.136

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.