Literature DB >> 16199754

Mechanistic features of CAG*CTG repeat contractions in cultured cells revealed by a novel genetic assay.

Richard Pelletier1, Brian T Farrell, Juan José Miret, Robert S Lahue.   

Abstract

Trinucleotide repeats (TNRs) undergo high frequency mutagenesis to cause at least 15 neurodegenerative diseases. To understand better the molecular mechanisms of TNR instability in cultured cells, a new genetic assay was created using a shuttle vector. The shuttle vector contains a promoter-TNR-reporter gene construct whose expression is dependent on TNR length. The vector harbors the SV40 ori and large T antigen gene, allowing portability between primate cell lines. The shuttle vector is propagated in cultured cells, then recovered and analyzed in yeast using selection for reporter gene expression. We show that (CAG*CTG)25-33 contracts at frequencies as high as 1% in 293T and 293 human cells and in COS-1 monkey cells, provided that the plasmid undergoes replication. Hairpin-forming capacity of the repeat sequence stimulated contractions. Evidence for a threshold was observed between 25 and 33 repeats in COS-1 cells, where contraction frequencies increased sharply (up 720%) over a narrow range of repeat lengths. Expression of the mismatch repair protein Mlh1 does not correlate with repeat instability, suggesting contractions are independent of mismatch repair in our system. Together, these findings recapitulate certain features of human genetics and therefore establish a novel cell culture system to help provide new mechanistic insights into CAG*CTG repeat instability.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16199754      PMCID: PMC1240116          DOI: 10.1093/nar/gki880

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Trinucleotide repeats (TNRs) are microsatellite sequences whose genetic instability causes at least 15 human neurological disorders, including Huntington's disease, myotonic dystrophy and fragile X syndrome (1–4). TNR instability shows an unusual dependence on the length of the tract. Alleles up to ∼30–35 repeats are stably inherited in humans (5), however beyond that size the repeats display much more frequent instability with a strong tendency towards expansions (6). This pattern can be attributed to two characteristics of TNRs that are not seen with other microsatellites. First, the 30–35 repeat length constitutes a genetic threshold, defined as a repeat span that demarcates short, relatively stable tracts from long, unstable alleles. The allele length relative to the threshold is therefore a key indicator of the likelihood of mutation. Second, once the threshold is reached, mutation spectra are shifted towards expansions rather than contractions (6). A key factor in avoiding disease therefore depends on preventing TNRs from attaining the threshold. In humans, CAGCTG alleles in the normal (subthreshold) range tend to contract more often, or at least as often, as they expand, based on single sperm typing at the HD locus (7), at the androgen receptor gene and the DM1 locus (8), and at the MJD1 gene (9). This observation also holds for subthreshold CGG•CCG repeats at the fragile X locus (10,11). A reasonable conclusion from these data is that contractions of alleles below the threshold are important for maintaining short, genetically stable alleles that do not confer disease and that are unlikely to expand in later generations. The molecular foundation for TNR instability is not well understood, but the key role of DNA structure is widely accepted (2,3). Structural studies (12) revealed that disease-causing TNRs adopt unusual configurations, such as single-strand hairpins, slipped-strand structures and triplex DNA, whose thermodynamic stability correlates with the length of the repeat tracts (13–17). These observations, coupled with the knowledge that new repeats need to be synthesized (for an expansion) or deleted (for a contraction), generated the proposal that TNR-induced structures lead to aberrant DNA synthesis (2–4,12). One prevalent line of thought is that hairpin formation on the newly synthesized strand promotes expansions. The corollary is that hairpins on the template strand are the precursors for contractions. Thus both types of TNR mutations can be ascribed to models where structural perturbations, such as hairpins, promote error-prone DNA synthetic events (2–4,12). It should be noted however that hairpins have only been observed in the test tube; there has been no direct detection of hairpin formation in cells. This difficulty can be overcome by comparing genetic instability of different TNR sequences with different hairpin-forming capacities. Nearly always, the better a TNR sequence is at forming a hairpin, the higher its instability [e.g. (18)]. Evidence from bacterial and yeast model systems supports the hypothesis that instability results from three synthesis-dependent sources, namely DNA replication (19–31), repair [summarized in (32)], and recombination (33–36). Similarly, instability in mammalian systems is probably derived from more than one cellular mechanism. For example, replication slippage as a source of CAGCTG instability was suggested in several studies (7,37–40). Also, several human cell lines were shown to have active replication initiation regions within 3.6 kb or less of SCA-7, SBMA, and both normal and expanded alleles of HD loci (41). A third line of evidence is that replication inhibitors also modulate instability at the endogenous DM1 gene in human fibroblasts (42). In contrast, at least two reports (43,44) found no simple association between the rate of cell division and instability in transgenic mice or in tissue culture cell lines established from transgenic mice. Aberrant DNA repair in mammals has also been proposed as an important contributor to instability. Available evidence, however, does not suggest a unifying model by which repair accounts for TNR mutations because several pathways have been implicated. Mismatch repair was inferred in five papers showing decreased expansion frequencies for CAGCTG repeats in transgenic mice that were also deficient in the mismatch repair genes Msh2 (45–48) or Msh3 (49). For contractions, four of these papers found little or no apparent change in contraction frequencies (45–47,49), whereas the fifth study showed a dramatic shift towards contractions in Msh2-deficient animals (48). In Pms2 knockouts, expansion frequencies were decreased by about one-half relative to wild-type littermates, but contractions were largely unaffected (50). In contrast to the mouse findings, studies in human cell lines deficient for Msh2 or Mlh1 showed no apparent instability for endogenous (normal-length) alleles of DM or the CGG•CCG repeat at FRAXA (51). Recombinational repair, perhaps by single-strand annealing, was deduced from a study of maternally derived SCA1 expansions (52). Subsequently, CTG repeats were shown to markedly influence the outcome of genetic recombination in CHO cells (53). Repair of double-strand breaks and/or stalling of replication forks can also induce instability of some CTGCAG tracts in CHO cells (40) or in HeLa cells (54). It was recently shown that changes in DNA methylation status also modulate instability (55–57), indicating an epigenetic influence on the mutagenic process. The studies summarized above suggest multiple mechanisms of CAGCTG repeat instability in mammalian systems. To understand the potential mechanisms more fully, improvements in detection methods will be required. Physical analyses of the repeat tracts (e.g. PCR monitoring) would be nicely complemented by selective genetic assays for TNR instability. In this study, we established an assay that combines the relevance and convenience of cultured primate cells with quantitative genetic selection in yeast to better understand CAGCTG repeat mutability. We found that instability in cultured cells recapitulates a number of the characteristic features from humans, suggesting this new system will be useful in helping understand the mechanisms causing TNR mutations.

MATERIALS AND METHODS

Cell lines

Human 293 and 293T (human embryonic kidney) cells, and COS-1 (African green monkey kidney) cells were cultured in DMEM supplemented with 10% fetal bovine serum, 50 U/ml penicillin-streptomycin (Gibco BRL) and anti-PPLO agent (1× final concentration; Gibco BRL) or Fungizone amphotericin B (2.5 µg/ml final concentration) (Invitrogen).

Plasmids

The shuttle vector backbone (pBL67) was constructed by subcloning the HpaII–BamH1 fragment (with ends filled-in) of the SV40 viral DNA (Gibco BRL), encompassing the origin of replication (ori) and the large T antigen gene, into the BamH1 site (blunt ended) of the pRS313 vector (58). The ∼9.8 kb shuttle vector includes the SV40 viral origin of replication (ori) and the large T antigen gene. Yeast markers include a reporter system to detect TNR contractions, which occur in human cells, via subsequent expression of the URA3 gene (18). Also present on the plasmid are the HIS3 gene, to identify yeast transformants, and the genetic elements autonomously replicating sequence and CEN, a centromere element that controls plasmid number in yeast to 1–2 copies per cell. The vector contains an Escherichia coli origin of replication and an ampicillin resistance gene. To create individual shuttle vectors (CAG)25+8, (CTG)25+8, (C,T,G)33, and (CAG)33, cassettes (for explanation of the plasmid nomenclature see Results section) containing the Padh1 promoter, the TNR tract and the URA3 reporter gene were excised from plasmids carrying the relevant TNR tracts (as described previously in (18)] using XhoI and EcoRI enzymes, and subcloned into the corresponding restriction sites of pBL67. The reporters (CAG)25+8 and (CTG)25+8 differ only in the cloning orientation of the repeat tracts. A vector backbone was also constructed in which the SV40 ori is on the opposite, or 5′, side of the TNR tract as compared with the vectors described above (3′ side in pBL67). In this vector, pBL185, a 443 bp SfaNI fragment from SV40 was subcloned into the ApaI (filled-in) of pRS313. To create plasmid (CAG)25+8 new ori, a cassette encompassing the (CAG)25+8 tract from pBL145 was subcloned into the XhoI–EcoRI sites of pBL185. The (TAG)25+8 reporter was created by replacing the (CAG)25+8 triplet repeat tract of (CAG)25+8 new ori with an oligonucleotide duplex containing the TAG repeat sequence. Finally, removal of all SV40 sequences (ori and large T antigen) from pBL67 yielded plasmid pBL207. Subcloning the XhoI–EcoRI cassette from pBL145 into the corresponding restriction sites of the ori− shuttle vector yielded (CTG)25+8 ori−. Each plasmid DNA was prepared by the Qiagen method (plasmid maxi-kit, Qiagen Inc.) from bacterial cultures grown to no more than mid-log phase, to minimize background TNR contractions (59).

Mammalian cell transfections

Supercoiled DNA (15 µg) was used to transfect exponentially growing mammalian cells (∼1–2 × 106 cells per 75 cm2 flask, one flask per transfection) by the calcium phosphate co-precipitation method (60). After 2–3 days of growth at 37°C, the cells were lysed with 3 ml of Hirt's lysis buffer (61), and the lysate was pooled into a 15 ml Corex tube. NaCl (5 M) (750 µl) was added to each lysate and stored overnight at 4°C. The lysate was then centrifuged in a Sorval SS34 rotor at 10 000 r.p.m. for 15 min at 4°C. The supernatant, containing the plasmid DNA, was collected into a fresh tube. To remove proteins, the lysate was spun on a Centriplus YM-100 column (Millipore) for 1–1.5 h at 4°C in a Sorval SS34 rotor at 5000 r.p.m., following the manufacturer's instructions. The retentate was recovered by inverting the column into a collection tube and spinning an additional 3–4 min at 4000 r.p.m. The DNA was then precipitated with two volumes of ethanol, and suspended in 100 µl of 10 mM Tris–HCl and 1 mM EDTA, pH 7.5.

Yeast transformations

DpnI digestion was used to eliminate unreplicated plasmid molecules that retained the bacterial methylation pattern. The plasmid DNA (50 µl), prepared as described above, were digested for 1 h with ∼60 U of DpnI (New England Biolabs) plus 10 µg RNase A in the buffer provided by the manufacturer at a final volume of 60 µl. In the case of the SV40 ori− control plasmid, which does not replicate in human cells, the DpnI digestion was omitted. A fraction of the digest (between 50 and 100%) was used to transform yeast S150-2B cells (MATa leu2-3,112 ura3-52 his3Δ trp1-289) via a modified version (62) of the lithium acetate method. A fraction of each transformation mixture (typically 0.5%) was plated onto SC-His plates (synthetic complete, lacking histidine), and the remainder onto SC-His-Ura (synthetic complete, lacking histidine and uracil) to score for contractions. Colonies on each plate were counted after 3 days of growth at 30°C. The frequency of contraction was determined as the number of colonies obtained on SC-His-Ura divided by the total number of transformants on SC-His, with appropriate correction for dilution factors. PCR analysis was performed to determine the fraction of colonies that had taken up a plasmid with a bona fide change in TNR size. Isolated colonies were disrupted in 100 µl of 50 mM DTT plus 0.5% Triton X-100, incubated at 37°C for 30 min, heated at 95°C for 6 min and kept on ice thereafter. A portion of this material was used as a template for PCR amplification as described previously (23), with the following modifications. Amplification in the presence of ∼1.25 µCi of [α-32P]dCTP was performed for 30 cycles (1 min at 94°C, 1 min at 60°C and 1 min at 72°C), plus a final extension at 72°C for 5 min. The products were separated on a 6% denaturing polyacrylamide gel. PCR product sizes (±2 repeat units) were determined by comparison with a DNA sequencing ladder. The final, reported mutation frequency for each template was determined by multiplying the fraction of plasmids with bona fide alterations in TNR size (between 80 and 100%) times the raw mutation frequencies calculated above. Additional details regarding the assay results can be viewed in Supplementary Data. The experimental protocol for detecting TNR mutations in cultured cells is shown in Figure 1A. By measuring frequencies, where the denominator reflects the total number of plasmid molecules available for testing, each experiment is internally controlled for DNA recovery, transformation efficiency and so on. A critical concern with this assay is to ensure that the TNR mutation events occurred in the tissue culture cells, and not during plasmid propagation in E.coli (59) or immediately following transformation in yeast, before selection becomes effective. To determine the background frequency at which TNR tracts mutate in yeast and/or E.coli, 0.5–5 µg of each DNA plasmid preparation was used to transform yeast cells directly, without passaging through mammalian cells (Figure 1A). The cells were plated on selective media as described above and the background mutation frequency was calculated by dividing the number of colonies on selective media by the total number of transformants. E.coli strain HB101 and yeast strain S150-2B provided the lowest achievable background frequencies among several strains tested. As noted by others (19), we found instability in E.coli varied with the repeat orientation. This explains why our background values are different between the CAG and CTG orientations of the reporter.
Figure 1

Experimental procedure to detect TNR tract changes in human tissue cultured cells and the DNA shuttle vector. Details are provided in Materials and Methods. (A) Each TNR-containing vector was transfected into mammalian cells, the cells were cultured for 2–3 days, then the plasmid DNA was transformed into yeast for analysis. Contraction frequency was calculated by dividing the number of yeast colonies with a contraction by the total number of transformants. Background contraction frequencies were measured as described in Materials and Methods. (B) The selection for contractions in yeast is based on spacing sensitivity of the yeast promoter from the preferred transcription initiation site, labeled ‘I’. Starting TNR lengths of 25 + 8 (e.g. 25 repeats of CTG plus non-repeating DNA equivalent in length to 8 repeats) inhibit expression of the URA3 reporter gene and yield a Ura− phenotype (18). TNR contractions that remove at least five repeats restore promoter activity and result in a Ura+ phenotype. See Supplementary Figure S1 for detailed sequence information of the TNR-containing region. (C) The ∼9.8 kb shuttle vector allows replication in primate cells, yeast and E.coli. The TNR-URA3 reporter provides a readout of TNR tract length via URA3 expression, as described above. (D) Plasmid (CAG)25+8 new ori is a variant with the SV40 origin moved to the opposite side of the TNR tract, plus deletion of the large T antigen gene (denoted by the cross). (E) (CTG)25+8 ori− has deletions of both the SV40 origin and the large T antigen gene. Details of plasmid constructions are provided in Materials and Methods.

Western blot analysis

Mlh1 protein expression was analyzed in 293 and 293T cells and compared with HeLa controls by western blotting. Cell lysates (40 µg protein per sample) were separated by 8% SDS–PAGE, transferred to a nitrocellulose membrane (Amersham), and blocked overnight in Odyssey Blocking Buffer (LI-COR). Mlh1 protein was detected with mouse anti-human Mlh1 monoclonal antibody (PharMingen), followed by secondary antibody AlexaFluor 680 goat anti-mouse IgG (Molecular Probes), then the membrane was scanned using the Odyssey System according to the manufacturer's specifications.

RESULTS

The shuttle vector assay

A new genetic assay was established that combines the availability of many useful tissue culture lines with yeast genetics. A DNA shuttle vector is transfected into cultured mammalian cells, the cells are propagated for a few days to allow contractions of the TNR tract to occur, and then the plasmid DNA is recovered and transformed into yeast (Figure 1A). Since the vector contains a selective reporter system, described in the figure legend (Figure 1B), only those yeast cells that have incorporated a plasmid missing at least five repeats from the TNR tract survive on selective medium. Thus contractions that occur in the mammalian cells are revealed selectively in yeast. While the current study focuses on contractions, future work will include experiments to examine TNR expansions. This system also complements existing, frameshift-based assays for microsatellite instability in tissue culture cells (63,64) by providing a way to select for mutations that affect 3 bp repeating units. The essential features of the shuttle vector are shown in Figure 1C. The plasmid can replicate by virtue of an SV40 viral origin of replication (ori), which drives high-copy production of the plasmid in just a few days. Host factors from many cell types can mediate replication from this origin; the only required exogenous protein is SV40 large T antigen, which is encoded by the shuttle vector. The plasmid is therefore portable between different primate cell lines. The vector also contains a TNR-URA3 reporter system that allows detection of contractions once the plasmid is subsequently transferred into yeast (Figure 1B). In most cases, the starting TNR tract had 25 repeats appended to 24 nt of randomized sequence (‘25 + 8’, total length equivalent to 33 repeats). The perfect 25 repeat length represents a subthreshold allele, and the addition of the randomized sequence allows the yeast selection to function as designed (Figure 1B). Three classes of sequences were assayed: CAG and CTG repeats that form strong hairpin structures (13,15–17) and which are relevant to human disease; a TAG repeat which forms hairpins less well (13) to test the influence of structure on instability; and scrambled sequences lacking any repetitive nature, as a negative control. Because the bidirectional SV40 ori (65,66) is located on one side of the repeats at close distance (Figure 1C), it is possible to assign a particular repeat sequence to either the leading or lagging strand of replication. All triplet constructs as written represent the sequence on the lagging template strand. The plasmid also has elements for selection in yeast, plus E.coli sequences for cloning and propagation purposes. Therefore it is a trifunctional shuttle vector, capable of replicating in primate cells, in yeast and in E.coli.

CAG•CTG repeat instability in human 293T cells

The TNR vectors were first tested in human 293T embryonic kidney cells, which express SV40 large T antigen constitutively. These cells were chosen because they are easily transfected by the calcium phosphate co-precipitation method, and provide a high replicative yield of SV40 based plasmids (67). Furthermore, the kidney is a site of substantial somatic instability in myotonic dystrophy transgenic cell lines (44). Figure 2A shows that contractions of (CAG)25+8 occurred at a frequency of 5.7 × 10−3. This value is significantly above the background level of 1.3 × 10−3 (P = 0.047 by Student's t-test), indicating that the majority of the contraction events occurred in the human cells, not in E.coli or yeast. The frequency indicates that nearly 0.5% of all plasmid molecules contracted upon passage through 293T cells. Surprisingly the (CTG)25+8 reporter gave only about one-tenth as many contractions (0.5 × 10−3) as the CAG reporter. While a low value, the (CTG)25+8 contraction frequency was significantly above background value of 0.15 × 10−3 (P = 0.021). This orientation effect was statistically significant [P = 0.029, comparing the data from (CAG)25+8 with (CTG)25+8]. The orientation of the tract is therefore important in 293T cells. The (C,T,G)33 scrambled control, containing a 99 nt scrambled sequence with no repetitive nature, was inert in this assay (<0.01 × 10−3), as expected.
Figure 2

CAG•CTG contraction frequencies in mammalian cells. For each graph, the stippled bars represent the average frequency of contractions (±SEM) after passage through the human cells, while the open bars represent the background mutation frequency. Each transfection was performed 3–9 times. The X-axis denotes the TNR sequence on the lagging template strand relative to the SV40 origin of replication on the vector. (A) Results from 293T cells. (B) Frequencies using 293 cells. (C) Contraction frequencies in COS-1 cells. Note that the scale on the Y-axis is different in the three graphs. The frequency data are available in the Supplementary Data.

The observed orientation effect, where CAG repeats are more unstable than CTG runs, leads to the specific, testable prediction that reversing the direction of replication should reverse the instability pattern. Figure 3A shows the deduced replication fork as it moves past the TNR repeat sequence in (CAG)25+8. The position of the CAG repeats is on the lagging template strand. Similarly, Figure 3B shows that the (CTG)25+8 reporter gives rise to CTG sequences as the template for lagging strand synthesis. If the direction of DNA replication for (CTG)25+8 is reversed, such that the fork progresses in the opposite direction, then the lagging template strand now contains CAG repeats (Figure 3C). This situation was accomplished by moving the SV40 ori in the vector backbone from 3′ of the CTG tract to a 5′ location to create (CAG)25+8 new ori (Figure 1D). If CAG repeats are more probable to act as the template for contraction events, then (CAG)25+8 new ori should be more unstable than (CTG)25+8. Figure 2A shows an increase of ∼5-fold in contraction frequency for (CAG)25+8 new ori (2.6 × 10−3), compared with (CTG)25+8 (P = 0.015). The increase implies that the direction of DNA replication through the repeat tract influences instability. If true, then we also predict that removal of the SV40 origin (Figure 1E) should preclude replication and therefore contractions. This loss of instability was seen for (CTG)25+8 ori− (Figure 2A), which yielded the low value of <0.08 × 10−3. The (CAG)25+8 new ori and ori− data indicate that instability requires replication through the triplet repeat tract, however it does not prove definitively that replication per se is the source of CAGCTG contractions. In summary, CAGCTG contractions in human 293T cells occur at relatively high frequencies, contractions are dependent on the triplet repeat sequence, they require replication of the repeat DNA, and there are ∼10-fold more contractions for runs of CAG rather than CTG on the lagging strand template.
Figure 3

Schematic representation of replication forks proceeding through TNR regions. Each panel shows a replication fork as it passes through the TNR site. The direction of replication was deduced from the position of the SV40 ori sequences relative to the TNR (Figure 1C and D). The 5′ and 3′ polarities are indicated for the template strands. The leading strand product is shown as a long continuous arrow, while the newly replicated lagging strand is shown as discontinuous Okazaki fragments prior to processing and ligation. (A) For the substrate (CAG)25+8, replication moves from right to left. This places the CAG repeat sequence on the lagging template strand. (B) When the TNR is cloned in the opposite orientation, denoted as (CTG)25+8, the CTG repeat now occupies the lagging template position. (C) The SV40 origin in (CTG)25+8 was moved to the opposite side of the TNR to create (CAG)25+8 new ori, as diagrammed in Figure 1D. Accordingly, replication now proceeds from left to right. The change in fork direction makes the ‘top’ strand, with CAG repeats, the lagging template, hence the designation as a CAG plasmid.

Contraction sizes in 293T cells

PCR analysis of 48 contracted alleles from the (CAG)25+8 plasmid was performed to determine the extent of individual contraction events in 293T cells (Figure 4). For the (CAG)25+8 reporter, deletions from −5 to −25 repeats are identifiable with our assay (18). Contracted alleles were observed from −10 to −21 repeats, with a median of −15 (Figure 4). The majority of these events must be genetically independent because the results represent several independent transfections from three different plasmid preparations, and because most contractions from a given experiment were of distinct sizes. In contrast, if the majority of contractions resulted from a few early events that were subsequently replicated, there would be predominance of a few allele sizes, rather than the dispersed pattern observed. There is a possibility that a particular plasmid molecule contracted more than once, thus giving multiple size products, but this seems unlikely based on our frequency estimates. Taken together, these factors indicate that genetically independent contractions make up the majority of observed events.
Figure 4

Summary of contraction events for (CAG)25+8 in 293T cells. Contraction sizes were determined by single-colony PCR amplification of the repeat tract, followed by size measurements on DNA sequencing gels. Sizes of the contractions were determined (±1–2 repeats) by comparison with an M13 DNA sequencing ladder.

In two other tissue culture systems for TNR instability (40,54), deletion of the flanking sequences sometimes accompanies contractions. To evaluate the possibility of flanking sequence deletions in our system, PCR amplification was performed on 24 yeast colonies (from four separate 293T cell transfection experiments). The PCR primer sites end 41 nt upstream and 53 nt downstream from the ends of the CTG tract. In 23 of 24 samples, a shortened PCR product was clearly identified, indicating that the primer sites were intact. The one sample that failed to amplify at the TNR site was also negative for amplification at the URA3 and HIS3 genes on the plasmid. We conclude that the DNA template preparation from that one sample was defective, and therefore it was excluded from further consideration. Among the 23 amplified samples, DNA sequence analysis was performed on the PCR products from eight randomly chosen representatives (corresponding to all four independent 293T transfections). In all eight cases, the deletion was entirely within the CTG tract; there were no examples of deletions extending into flanking sequences, including the randomized (C,T,G)8 sequence immediately adjacent to the repeat. All deletions precisely removed 3 bp CTG units, ranging from −10 to −18 repeats. This size range is consistent with the spectrum seen in Figure 4. We conclude that contractions within the CTG repeats comprise the large majority of deletions in our system.

Mlh1 expression does not correlate with TNR instability

It was reported that expression of the human mismatch repair gene Mlh1 is impaired in 293T cells owing to promoter hypermethylation (68). Extracts from 293T cells are accordingly deficient for in vitro mismatch repair activity, but this deficiency is reversed when the cells are transfected with a wild-type Mlh1 gene (68). These results indicate that the mismatch repair deficiency in 293T cells is solely owing to lack of Mlh1 protein expression. Previous reports from transgenic mouse studies implicated the mismatch repair genes Msh2, Msh3 and Pms2 in TNR instability (45–50). It is conceivable that Mlh1 expression levels, and therefore mismatch repair status, might influence contraction frequencies, orientation effects or both. Also, based on a recent study with Msh2-deficient mice (48), contraction frequencies may also be heavily dependent on the mismatch repair status of the cell. To determine if differences in Mlh1 expression affected contraction frequencies or orientation effects, we examined Mlh1 expression by western blot analysis. The results in Figure 5 reveal that Mlh1 was undetectable in the 293T cell extract, while it is clearly expressed in 293 and in control HeLa extracts. The apparent molecular weight corresponds well to the predicted mass of 84.5 kDa. Because the frequencies of CAGCTG instability in 293T cells were similar to that of 293 cells (see next section), these data suggest that the presence or absence of Mlh1 does not detectably alter CAG or CTG contraction frequencies, nor do differences in Mlh1 steady-state levels correlate with the orientation effects described above.
Figure 5

Western blot analysis of Mlh1 in several human cell lines. Extracts were prepared from the indicated cell lines and 40 µg of total protein were subjected to SDS–PAGE separation and immunoblotting, as described in Materials and Methods. The primary antibody was anti-human Mlh1. The notations at the left indicate the position of the 50 and 100 kDa molecular weight markers run on the same gel.

Instability in other cell lines

Our shuttle vector replicates efficiently in 293T cells, which constitutively express the large T antigen necessary for initiation of replication. Because the gene for the T antigen is present on the shuttle vector, its expression should be sufficient to drive replication of the plasmid to levels high enough to detect TNR contractions. The assay should therefore be portable in cell lines that do not express T antigen. To test this idea, we chose human 293 embryonic kidney cells, which are immortalized with adenovirus 5 DNA instead of T antigen (60). We transfected these cells with a subset of shuttle vectors and measured their contraction frequencies. The genetic instability in 293 cells (Figure 2B) is reproduced at levels similar to the 293T cells. Although less plasmid DNA (in absolute terms) was recovered from the 293 cells, the yield is sufficient to reveal genetic instability at 8.3 × 10−3 for the vector bearing the (CAG)25+8 tract (Figure 2B). This value was significantly above the background value of 1.3 × 10−3 (P = 0.001). As seen previously in 293T cells, the (CTG)25+8 tract showed a low but significant level of contractions (0.42 × 10−3) compared with background frequency of 0.15 × 10−3 (P = 0.023). The (CTG)25+8 reporter in 293 cells was therefore more stable than its (CAG)25+8 complement (P < 0.001) by ∼20-fold. This finding suggests that the orientation dependence is not a unique feature of the 293T cells tested earlier, and the orientation effect can be reproduced in 293 cells as well. As was the case previously, a plasmid bearing the scrambled control sequence (C,T,G)33 was genetically stable in 293 cells. To determine if TNR instability occurs in a non-human cell type, African green monkey kidney cells (COS-1) were chosen. These cells are easily transfectable by the calcium phosphate method, and they also express T antigen constitutively. The vector bearing (CAG)25+8 displayed contraction levels (Figure 2C) similar to 293 and 293T cells (3.4 × 10−3, P = 0.030 compared with background of 1.3 × 10−3). However, the TNR tract in the opposite orientation, (CTG)25+8, was equally unstable as the CAG test tract (also 3.4 × 10−3; P = 0.034 compared with background of 0.15 × 10−3). Clearly there is no orientation effect in COS-1 cells [P = 0.996 comparing (CAG)25+8 with (CTG)25+8]. The scrambled repeat vector, (C,T,G)33, was stable in the monkey cells as expected. The control repeat (TAG)25+8, predicted to have reduced hairpin-forming capacity, showed a lower frequency of contractions [1.2 × 10−3; P = 0.02 versus (CAG)25+8]. This result indicates that the ability of CAG and CTG repeats to form hairpins increases their instability compared with TAG repeats.

CAG contractions exhibit an apparent threshold

Although thresholds have been identified or inferred for a number of human TNR expansion diseases, this feature has not been directly demonstrated in mammalian cells. Our genetic assay seemed suitably sensitive to detect a threshold effect. To address this issue, we created a series of reporter constructs with CAG repeat lengths from 0 to 33 repeats. Scrambled sequences, containing no repeating nature, were added to equalize the total tract length to 99 bp in all cases. For example, the (CAG)15 construct also contained 54 bp of scrambled sequences (equivalent in length to 18 repeats). Each construct was tested for instability in COS-1 cells, background frequencies were also determined, and the difference (contraction frequency in COS-1 minus background frequency) is shown in Figure 6. Contractions were observed at low levels for repeat lengths of 15 and 17, with a somewhat higher value observed at 25 repeats. However, the response increased sharply for starting tracts of 30 and 33 repeats. Compared with the frequency at 25 repeats (2.1 × 10−3), contractions occurred 3.2-fold more often for 30 repeats, and 7.2-times more often for 33 repeats. Thus, lengthening the starting tract from 25 to 33 repeats (a 32% increase in DNA sequence) resulted in a 720% increase in instability. This result is inconsistent with a simple stochastic function of tract length. Instead, the findings in Figure 6 lead us to conclude that CAG contractions in COS-1 cells are governed by a threshold between 25 and 33 repeats. This value is in concordance with the accepted range of 30–35 repeats in human disease genes (5). Additional experiments with our system will be necessary to define the threshold more definitively.
Figure 6

Response of CAG repeat contraction frequency on initial tract length. Contraction frequencies were measured in triplicate in COS-1 cells for the following triplet repeat reporters: 0 repeats, (C,A,G)33; 15 repeats, (CAG)15+18; 17 repeats, (CAG)17+16; 25 repeats, (CAG)25+8; 30 repeats, (CAG)30+3; and 33 repeats, (CAG)33. Background frequencies were also measured, and the difference (COS-1 frequency minus background frequency) are plotted. Error bars indicate ±1 SEM.

DISCUSSION

The shuttle vector assay described here provides a new way to investigate TNR contractions in cultured primate cells. The assay combines convenient features (genetic selection, easy manipulation of the input TNR sequence and molecular analysis of the mutated tract) and thereby adds to the repertoire of tools available to study mechanisms of TNR instability. The shuttle vector assay is also demonstrably portable between primate cell lines. We confirmed replication-dependent instability described in other studies, plus our experiments discovered an apparent threshold for contractions of CAGCTG tracts that coincides with the expansion threshold in humans. Furthermore, we found that contractions apparently occur independently of the mismatch repair protein Mlh1. Each of these findings is discussed in more detail below. The threshold for TNR instability is a genetic hallmark that distinguishes triplet repeats from other microsatellites. While the threshold is well established in human genetics, previous studies in cell culture systems have not reported a clear threshold effect. Our finding of an apparent threshold was made possible by the sensitive nature of the assay plus the ability to manipulate the input TNR tracts. It is of interest that the estimated threshold value in COS-1 cells, between 25 and 33 repeats, coincides with the range of 30–35 repeats generally observed in human TNR disease genes (5). This concordance implies that the mechanism of TNR instability may be accurately reflected in tissue culture. This also may suggest that whatever molecular features dictate thresholds may apply to both expansions and contractions for CAGCTG repeats. The experiments presented here were designed to model the situation of TNR alleles just below the threshold length in mammalian cells. The findings suggest that contractions play an important, if perhaps under appreciated, role in disease prevention by maintaining TNR lengths below the critical threshold where expansions become frequent and predominant. A similar conclusion was drawn from transgenic mouse studies (69), where deletions in early development help safeguard the genome against expansions. In our studies, (CAG)25 subthreshold alleles contract at relatively high frequencies of 0.3–0.8% in three separate cell lines. These frequency measurements are in agreement with studies of similar length alleles measured by single sperm analysis at the HD locus (7), at the androgen receptor gene and DM1 locus(8), at the MJD1 gene (9) and at the fragile X locus (10,11). When results from the sperm analyses are considered together, contraction frequencies ranged from 0.2 to 3% for CAGCTG alleles of 15–31 repeats (7–9), and from 0.2 to 2% for fragile X CGG•CCG tracts of 19–29 repeats (10,11). The sizes of the contractions we observed (−10 to −21 repeats) tend to be larger than what is seen in human sperm, in part owing to the fact that our assay requires loss of at least five repeats, whereas PCR assays in human tissue samples can record smaller changes as well. We cannot rule out that our plasmid-based system might have some differences from TNRs in a chromosomal location. TNR instability in our assay is dependent on many of the same elements as in human families, including TNR sequence, length and the repeating nature of the tract. This dependence suggests that the contractions seen in our system reasonably mimic TNR instability in humans. Hairpins are implicated as important intermediates by the fact that CAG and CTG repeats, which form hairpins readily, also contract at appreciable frequencies. In contrast, TAG repeats contract less frequently and scrambled sequences are genetically stable. The data also indicate that DNA replication is the most probably mechanistic source for contractions in our system, since removal of the SV40 ori sequence from the reporter plasmid eliminated contractions. It is also possible that contractions result from an event following replication, such as recombination between newly replicated regions of the plasmid. Clarification on this point awaits further investigation, such as the use of cell lines deficient in specific recombination or repair pathways. Possible effects on p53, which is inactivated by large T antigen, are largely ruled out by previous studies that found no connection between p53 status and TNR instability (70,71). The orientation dependence seen in 293T and 293 cells also supports a replication model of TNR contractions, since the instability of the tract depends on the direction of the replication fork. The most persuasive evidence was that relocation of the SV40 origin from one side of the TNR tract to the other resulted in a 5-fold increase in contraction frequency. Thus a relatively stable tract was converted to a more unstable configuration when replication occurred from the opposite direction. Orientation effects for TNR instability have been reported from numerous assays performed in E.coli, yeast and mammalian cells (19,20,22,23,30,38), although some reports indicated no obvious orientation effect (72). In these other studies, however, placement of the CTG repeat on the lagging template produces more instability than when CAG repeats occupy the lagging template. The surprise in our 293T and 293 cell experiments was that CAG template sequences trigger more contraction events than for CTG repeat templates. In contrast, about equal levels of instability were observed in COS-1 cells. The orientation effect did not correlate with level of vector replication, as both high-replicating 293T cells and low-replicating 293 cells displayed the orientation effect, whereas high-replicating COS-1 cells did not. Cleary et al. (38) found an orientation effect in COS-1 cells, while we observed this phenomenon only in 293/293T cells. The different repeat lengths tested in the two laboratories might contribute to this disparity. Another difference was that we did not systematically vary the distance of the replication ori relative to the TNR in our vectors. We offer two possible explanations for our orientation data. First, the threshold for (CTG)25+8 tract instability may be lower in COS-1 cells than in 293 or 293T cells. In other words, the minimum number of CTG repeats necessary to bring about instability may be lower in COS-1 cells than in 293/293T cells. This way, while the (CTG)25+8 tract is genetically unstable in the former cell line, it is too short to trigger contractions in the latter two lines. Second, specific trans-acting factors may exist in COS-1 or 293/293T cells that can modulate TNR instability, in addition to the other factors common to both cell types. (Because the repeats are located on an identical shuttle vector used for all cell lines, cis-acting modifiers are unlikely.) These cellular factors may help define the genetic threshold, for example. Further investigation is needed to identify such trans-acting modifiers in COS-1 and 293T cells using our assay. One factor that did not correlate with instability was the expression of the important mismatch repair protein Mlh1. Both instability and replication orientation effects were very similar in 293 cells that show substantial steady-state levels of Mlh1 protein, and in 293T cells, which have undetectable levels of Mlh1 and are accordingly deficient in mismatch repair activity (68). To our knowledge, our data is the first to assess a potential role of Mlh1 in TNR instability. In mismatch repair, heterodimers containing Mlh1 are important modulators of the mismatch binding properties of the Msh2/Msh3 complex [summarized in (73–75)]. Thus transgenic mice lacking Mlh1 will provide an important test in helping understand the role of mismatch repair on TNR instability. If it turns out that TNR mutagenesis in mice is unaffected by loss of Mlh1, it would imply that whatever role Msh2/Msh3 heterodimer plays in TNR instability (45–49) is separable from Mlh1 and therefore unlikely to be mismatch repair per se. How does our selection system compare with others? Two recent publications described genetic selections in mammalian cells for CAGCTG contractions (40) and expansions (76). When embedded into an intron of either the APRT or HPRT gene, CAG tracts of ∼34 or more repeats interfere with mRNA splicing and therefore yield an APRT− or HPRT− phenotype (40). Selection for APRT+ or HPRT+ cells showed that the majority of such isolates had contracted the CAG allele to 33 repeats or less. In some cases, deletion of flanking sequences was also observed. The sensitivity of this assay approaches 10−6 per cell generation. One limitation of this assay is that it only works for CAG repeats; when CTG repeats were tested, the phenotype of the cells was always APRT+. Expansions that increase a (CAG)31 tract to 39 or more repeats can be identified by selection to APRT− (76). Thus our genetic assay and that of Wilson and co-workers (40,76) provide alternative, complementary methods, each with different strengths and limitations.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.
  76 in total

1.  Analysis of germline mutation spectra at the Huntington's disease locus supports a mitotic mutation mechanism.

Authors:  E P Leeflang; S Tavaré; P Marjoram; C O Neal; J Srinidhi; H MacFarlane; M E MacDonald; J F Gusella; M de Young; N S Wexler; N Arnheim
Journal:  Hum Mol Genet       Date:  1999-02       Impact factor: 6.150

2.  The effect of DNA replication mutations on CAG tract stability in yeast.

Authors:  J K Schweitzer; D M Livingston
Journal:  Genetics       Date:  1999-07       Impact factor: 4.562

3.  Stability of the human fragile X (CGG)(n) triplet repeat array in Saccharomyces cerevisiae deficient in aspects of DNA metabolism.

Authors:  P J White; R H Borts; M C Hirst
Journal:  Mol Cell Biol       Date:  1999-08       Impact factor: 4.272

4.  Somatic deletion events occur during early embryonic development and modify the extent of CAG expansion in subsequent generations.

Authors:  I V Kovtun; A R Thornhill; C T McMurray
Journal:  Hum Mol Genet       Date:  2004-10-20       Impact factor: 6.150

5.  Role of replication and CpG methylation in fragile X syndrome CGG deletions in primate cells.

Authors:  Kerrie Nichol Edamura; Michelle R Leonard; Christopher E Pearson
Journal:  Am J Hum Genet       Date:  2004-12-29       Impact factor: 11.025

6.  Genome-wide demethylation destabilizes CTG.CAG trinucleotide repeats in mammalian cells.

Authors:  Vera Gorbunova; Andrei Seluanov; David Mittelman; John H Wilson
Journal:  Hum Mol Genet       Date:  2004-09-30       Impact factor: 6.150

7.  Influence of hairpins on template reannealing at trinucleotide repeat duplexes: a model for slipped DNA.

Authors:  A M Gacy; C T McMurray
Journal:  Biochemistry       Date:  1998-06-30       Impact factor: 3.162

8.  Orientation-dependent and sequence-specific expansions of CTG/CAG trinucleotide repeats in Saccharomyces cerevisiae.

Authors:  J J Miret; L Pessoa-Brandão; R S Lahue
Journal:  Proc Natl Acad Sci U S A       Date:  1998-10-13       Impact factor: 11.205

9.  Genomic instability associated with myotonic dystrophy does not involve p53 expression and activity.

Authors:  M Gennarelli; M Lucarelli; P Amicucci; S Soddu; G Novelli; B Dallapiccola
Journal:  Cell Biochem Funct       Date:  1998-06       Impact factor: 3.685

10.  GAA instability in Friedreich's Ataxia shares a common, DNA-directed and intraallelic mechanism with other trinucleotide diseases.

Authors:  A M Gacy; G M Goellner; C Spiro; X Chen; G Gupta; E M Bradbury; R B Dyer; M J Mikesell; J Z Yao; A J Johnson; A Richter; S B Melançon; C T McMurray
Journal:  Mol Cell       Date:  1998-03       Impact factor: 17.970

View more
  17 in total

Review 1.  Non-B DNA structure-induced genetic instability and evolution.

Authors:  Junhua Zhao; Albino Bacolla; Guliang Wang; Karen M Vasquez
Journal:  Cell Mol Life Sci       Date:  2009-09-01       Impact factor: 9.261

Review 2.  Modifiers of CAG/CTG Repeat Instability: Insights from Mammalian Models.

Authors:  Vanessa C Wheeler; Vincent Dion
Journal:  J Huntingtons Dis       Date:  2021

Review 3.  On the wrong DNA track: Molecular mechanisms of repeat-mediated genome instability.

Authors:  Alexandra N Khristich; Sergei M Mirkin
Journal:  J Biol Chem       Date:  2020-02-14       Impact factor: 5.157

4.  DNA mismatch repair complex MutSβ promotes GAA·TTC repeat expansion in human cells.

Authors:  Anasheh Halabi; Scott Ditch; Jeffrey Wang; Ed Grabczyk
Journal:  J Biol Chem       Date:  2012-07-11       Impact factor: 5.157

5.  Coordinated processing of 3' slipped (CAG)n/(CTG)n hairpins by DNA polymerases β and δ preferentially induces repeat expansions.

Authors:  Nelson L S Chan; Jinzhen Guo; Tianyi Zhang; Guogen Mao; Caixia Hou; Fenghua Yuan; Jian Huang; Yanbin Zhang; Jianxin Wu; Liya Gu; Guo-Min Li
Journal:  J Biol Chem       Date:  2013-04-12       Impact factor: 5.157

6.  Zinc-finger directed double-strand breaks within CAG repeat tracts promote repeat instability in human cells.

Authors:  David Mittelman; Christopher Moye; Jason Morton; Kristen Sykoudis; Yunfu Lin; Dana Carroll; John H Wilson
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-29       Impact factor: 11.205

Review 7.  Methods to determine DNA structural alterations and genetic instability.

Authors:  Guliang Wang; Junhua Zhao; Karen M Vasquez
Journal:  Methods       Date:  2009-02-24       Impact factor: 3.608

8.  Human mismatch repair protein hMutLα is required to repair short slipped-DNAs of trinucleotide repeats.

Authors:  Gagan B Panigrahi; Meghan M Slean; Jodie P Simard; Christopher E Pearson
Journal:  J Biol Chem       Date:  2012-10-18       Impact factor: 5.157

9.  Proofreading and secondary structure processing determine the orientation dependence of CAG x CTG trinucleotide repeat instability in Escherichia coli.

Authors:  Rabaab Zahra; John K Blackwood; Jill Sales; David R F Leach
Journal:  Genetics       Date:  2007-03-04       Impact factor: 4.562

10.  Instability of CTG repeats is governed by the position of a DNA base lesion through base excision repair.

Authors:  Yanhao Lai; Meng Xu; Zunzhen Zhang; Yuan Liu
Journal:  PLoS One       Date:  2013-02-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.