Literature DB >> 27386516

Mutation at a distance caused by homopolymeric guanine repeats in Saccharomyces cerevisiae.

Michael J McDonald1, Yen-Hsin Yu1, Jheng-Fen Guo1, Shin Yen Chong2, Cheng-Fu Kao3, Jun-Yi Leu1.   

Abstract

Mutation provides the raw material from which natural selection shapes adaptations. The rate at which new mutations arise is therefore a key factor that determines the tempo and mode of evolution. However, an accurate assessment of the mutation rate of a given organism is difficult because mutation rate varies on a fine scale within a genome. A central challenge of evolutionary genetics is to determine the underlying causes of this variation. In earlier work, we had shown that repeat sequences not only are prone to a high rate of expansion and contraction but also can cause an increase in mutation rate (on the order of kilobases) of the sequence surrounding the repeat. We perform experiments that show that simple guanine repeats 13 bp (base pairs) in length or longer (G 13+ ) increase the substitution rate 4- to 18-fold in the downstream DNA sequence, and this correlates with DNA replication timing (R = 0.89). We show that G 13+ mutagenicity results from the interplay of both error-prone translesion synthesis and homologous recombination repair pathways. The mutagenic repeats that we study have the potential to be exploited for the artificial elevation of mutation rate in systems biology and synthetic biology applications.

Entities:  

Keywords:  Mutation at a distance; evolution; homologous recombination; mutator; repeat sequence; translesion synthesis

Mesh:

Substances:

Year:  2016        PMID: 27386516      PMCID: PMC4928981          DOI: 10.1126/sciadv.1501033

Source DB:  PubMed          Journal:  Sci Adv        ISSN: 2375-2548            Impact factor:   14.136


INTRODUCTION

Mutation is the source of both adaptive genetic variation and deleterious genetic load. As such, the mutation rates that we observe in nature represent a balance between opposing selective forces. Because extant organisms are relatively well adapted, most mutations that occur will be deleterious (). It is this aversion to mutation that has driven the evolution of complex systems for high-fidelity replication and maintenance of genome integrity. However, mutation rates vary greatly; RNA viruses have an error rate of 10−4 substitutions per nucleotide per cell infection (), whereas bacteria make only a single error for every billion nucleotides synthesized (). Although it is perhaps not surprising that widely diverged species exhibit large differences in mutation rate, this observation raises questions about the degree of mutation rate variation within and across genomes. Studies of clinical and experimental populations have revealed that mutation rates can evolve on time scales observable to experimentalists and provided insights into the dynamics of mutation rate evolution (). In microbes, mutator strains arise via mutations that inactivate DNA repair enzymes, resulting in elevated mutation rates. These strains can become established in adapting populations because of the higher rate at which they produce beneficial mutations (). However, after the population has become better adapted, the increased rate of deleterious mutation experienced by a mutator will be selected against, providing selective pressure to drive mutation rates back down (, ). The effects of mutator alleles that diminish polymerase accuracy extend to the whole genome. However, mutation rates can also vary within a single genome. Highly transcribed genes have been found to have elevated mutation rates proportional to the rate of transcription (, ), probably because DNA that is highly transcribed is more often in a single-stranded state and thus more vulnerable to mutagens (, ). Another well-established correlate of mutation rate variation is DNA replication timing. Experimental () and bioinformatic (, ) studies have shown that later replicated DNA has a higher mutation rate than the earliest replicated regions. A proposed explanation for this is that DNA that is copied later during cell division will have less time for slower, high-fidelity repair mechanisms and will rely on error-prone DNA repair (). Primary DNA sequence can also influence mutation rates. Homopolymeric repeats of nucleotides are prone to increase and decrease in length at a high frequency (, ) and have been found to play an important role in genome evolution (, ), especially genetic switching mechanisms in pathogenic bacteria (, ). Microsatellites or other short repeats have been implicated in several human diseases (), and the high degree of polymorphism that results from their instability has been exploited as genetic markers (). In some cases, the causes of genetic instability have been directly linked to physical properties of DNA sequence. For instance, the twist angle between two adjacent bases in a DNA double helix is predetermined by their identity: some combinations of nucleotides have twist angles that are fragile and thus more prone to breakage (). The repeated evolution of mutations in the promoter of pitX1 in multiple independent populations of sticklebacks has been hypothesized to be due to both the strong selection upon these mutations and the proclivity of this region to sustain double-strand breaks. The part of the pitX1 promoter in which these adaptive mutations occur is the most fragile site in the stickleback genome, as predicted by DNA twist angles (). The mutagenic effects of some DNA sequences have been found to extend to flanking regions of DNA, with stretches of DNA that have the capacity to form secondary structures often being the culprit. Tang and colleagues () found that runs of 230 Friedreich’s ataxia repeats (GAA•TTC) were able to induce large deletions [>50 bp (base pairs)] and point mutations in a reporter gene more than a kilobase downstream of the repeat itself. Others working with the same repeat unit found that this repeat sequence could induce mutagenesis in sequences up to 8 kb away (). Directly inducing double-strand breaks using HO () or I-SCE1 () sharply increases not only the mutation rate in the surrounding 2 kb of sequence but also has a detectable but weak long-range mutator effect that decays exponentially across 60 kb of DNA sequence (). In all of these cases, a combination of double-strand break and translesion repair was directly implicated (–). In previous work, we found that short repeat sequences are positively correlated with the substitution rate in the surrounding DNA sequence (), distinct from the well-known repeat length polymorphism associated with repetitive DNA sequences, and that the experimental insertion of repeat sequences in yeast could elevate mutation rates in the downstream sequence. We have proposed that repeat sequences are more likely than other sequences to recruit error-prone DNA repair polymerases, leading to an increased mutation rate in DNA sequence surrounding the repeat. Here, we test our hypothesis by investigating one particular type of repeat in more detail: homopolymeric runs of guanine nucleotides. We demonstrate which DNA replication repair pathways are necessary for mutagenesis and show that these sequences interact with other known causes of mutation rate variation.

RESULTS

Homopolymeric runs of guanines 13 bp or longer (G) cause an increase in mutation rate

Using Saccharomyces cerevisiae, we engineered runs of 11 to 14 guanine nucleotides four bases upstream of the URA3 coding region (Fig. 1A), and measured mutation rates (Fig. 2A). The mutation rate in the URA3 coding region, downstream of a G or G repeat sequence, increased by up to fourfold. We measured the mutation rate at another locus encoding the CAN1 gene (Fig. 2B), confirming that mutation rates obtained at a site that did not have a G sequence upstream did not increase relative to the wild type. Moving G from the coding strand to the template strand and from upstream of the URA3 translation start site to downstream of the transcription termination site abolished the mutagenic effect of G (Fig. 2C).
Fig. 1

Experimental approach to quantifying mutagenicity of G13+ DNA sequences.

(A) Poly-G sequences were engineered 4 bp upstream of the URA3 translation start site (the 5′UTR region). The G14-ORF (open reading frame) construct (G14-ORF) allowed detection of loss-of-function mutations in the URA3 reading frame by plating on medium containing 5-fluoroorotic acid (5-FOA), which selects for individual cells that contain mutations that inactivate URA3. The red asterisk indicates the mutation site. (B) Using a weak URA3 allele (URA3-w), this construct (G14-repeat) facilitated the detection of the polyguanine repeat expansion mutation, because the mutation, G14 to G15 or longer, results in the 5′-FOA–resistant phenotype. wt, wild type.

Fig. 2

Polyguanine sequences cause a localized, directional effect on mutation rate.

(A) Mutation rates of homopolymeric guanine repeat sequences of increasing length. The estimated phenotypic mutation rate of G0-ORF, G13-ORF, and G14-ORF is 5.4 × 10−7, 13.5 × 10−7, and 20.3 × 10−7, respectively. G11 and G12 had no detectable increase in mutation rate. (B) Mutation rate was measured using CAN1, at a site distal from the URA3 locus. (C) The G14 repeat does not cause an increase in mutation rate if engineered on the template strand (C14-URA3) or on either the coding strand (ORF-G14) or the template strand (ORF-C14) downstream of the URA3 terminator sequence. Significant differences were calculated using t tests. *P < 0.05; **P < 0.005. Error bars show 95% confidence intervals.

Experimental approach to quantifying mutagenicity of G13+ DNA sequences.

(A) Poly-G sequences were engineered 4 bp upstream of the URA3 translation start site (the 5′UTR region). The G14-ORF (open reading frame) construct (G14-ORF) allowed detection of loss-of-function mutations in the URA3 reading frame by plating on medium containing 5-fluoroorotic acid (5-FOA), which selects for individual cells that contain mutations that inactivate URA3. The red asterisk indicates the mutation site. (B) Using a weak URA3 allele (URA3-w), this construct (G14-repeat) facilitated the detection of the polyguanine repeat expansion mutation, because the mutation, G14 to G15 or longer, results in the 5′-FOA–resistant phenotype. wt, wild type.

Polyguanine sequences cause a localized, directional effect on mutation rate.

(A) Mutation rates of homopolymeric guanine repeat sequences of increasing length. The estimated phenotypic mutation rate of G0-ORF, G13-ORF, and G14-ORF is 5.4 × 10−7, 13.5 × 10−7, and 20.3 × 10−7, respectively. G11 and G12 had no detectable increase in mutation rate. (B) Mutation rate was measured using CAN1, at a site distal from the URA3 locus. (C) The G14 repeat does not cause an increase in mutation rate if engineered on the template strand (C14-URA3) or on either the coding strand (ORF-G14) or the template strand (ORF-C14) downstream of the URA3 terminator sequence. Significant differences were calculated using t tests. *P < 0.05; **P < 0.005. Error bars show 95% confidence intervals.

Sequence analysis of G- and G-ORF mutants reveals similar spectrum of mutations

We sequenced 113 independent G-ORF ura3 mutants and 101 G-ura3 mutants and analyzed the distribution and identity of the mutations (Fig. 3). Although both sets of mutations were significantly different from a uniform distribution (Kolmogorov-Smirnov test, G, P = 0.023; G, P = 0.002), they were not different from each other (Kolmogorov-Smirnov test, P = 0.99). The identities of the mutations were highly similar except for a slight enrichment of indels relative to substitutions (Fisher’s exact test, P = 0.048) and transitions relative to transversions (Fisher’s exact test, P = 0.035) in the set of G-ORF ura3 mutants. Our sequence data incorporated the G repeat as well as the entire URA3 ORF, confirming that changes in guanine repeat length (repeat expansion or contraction) and large deletions were not responsible for any of the elevated mutation rate detected in this assay (table S1).
Fig. 3

Mutational spectrum of 183 5′-FOA–resistant mutants at the URA3 locus.

5′-FOA–resistant mutants were collected from G0-URA3 (brown) and G14-ORF (blue) strains. Each point mutation is shown directly above the wild-type sequence, and indels and complex mutations are shown directly below.

Mutational spectrum of 183 5′-FOA–resistant mutants at the URA3 locus.

5′-FOA–resistant mutants were collected from G0-URA3 (brown) and G14-ORF (blue) strains. Each point mutation is shown directly above the wild-type sequence, and indels and complex mutations are shown directly below.

DNA replication timing correlates with the degree of G mutagenicity

To test whether genome position would influence G mutagenicity, we engineered G-URA3 and G-ORF genes into different positions on chromosomes XII and XV (Fig. 4 and table S2). We found that G-URA3 inserts sustained a higher mutation rate than G-URA3 inserts at the same site. The change in mutation rate associated with G ranged from a 4-fold to an 18-fold increase, supporting the fact that G mutagenicity occurs regardless of genome position.
Fig. 4

The effect of G14 mutagenicity is correlated with DNA replication timing.

(A and B) The regression of mutation rate and replication timing for G alleles inserted at six different sites on chromosome XII and two sites on chromosome XV (Spearman’s R = 0.89, P = 0.007) (A) and for insertion of G at the same sites (Spearman’s R = 0.63, P = 0.086) (B). Replication timing is shown in minutes after the release of cells into synchronized S phase, as reported by Nieduszynski et al. (). Error bars show 95% confidence intervals.

The effect of G14 mutagenicity is correlated with DNA replication timing.

(A and B) The regression of mutation rate and replication timing for G alleles inserted at six different sites on chromosome XII and two sites on chromosome XV (Spearman’s R = 0.89, P = 0.007) (A) and for insertion of G at the same sites (Spearman’s R = 0.63, P = 0.086) (B). Replication timing is shown in minutes after the release of cells into synchronized S phase, as reported by Nieduszynski et al. (). Error bars show 95% confidence intervals. The direction of DNA replication for the chromosome V version of the G-ORF construct indicates that the replication fork moves in the same direction as transcription for URA3. Because moving the G14 sequence to the template strand or downstream of the transcription termination site abolishes the mutagenic effect, this suggests a relationship between G14 mutagenicity and DNA replication timing. DNA replication timing correlates with mutation rate variation in organisms ranging from bacteria to humans (, ). We calculated correlation coefficients for mutation rates of the G-ORF inserts and replication timing [from Nieduszynski et al. ()] and found a significant positive correlation (Fig. 4A; Spearman’s R = 0.89, P = 0.007). The G-URA3 inserts in equivalent positions showed a nonsignificant positive correlation (Fig. 4B; Spearman’s R = 0.638, P = 0.086) consistent with previous work ().

G13 repeats and transcription promote the accumulation of replication fork intermediates

Repeat sequences are known to incur an increased risk of replication fork stalling (). Upon fork stalling, replication reinitiates downstream, leaving a single-stranded gap that is filled in using either homologous recombination (HR) or translesion synthesis (TLS) (–), with a bias toward TLS for gaps requiring repair later in S phase (, ). TLS, mediated by the Rev1/Polζ complex, often introduces errors, even when synthesizing undamaged DNA in vivo (, ), and is responsible for approximately 50% of mutations in wild-type S. cerevisiae (, ). Replication fork intermediates can be visualized using the two-dimensional (2D) gel technique (). We made the doxycycline-repressible constructs tet-G-URA3 and tet-G-URA3 for 2D analysis (we were unable to generate a tet-G-URA3 construct). These were designed so that when running the 2D gel, replication fork intermediates that stall at the beginning of URA3 (Y molecules) should accumulate at the point indicated in Fig. 5B. We observed a 2.5-fold increase of Y molecules in G constructs compared to G constructs when cells were grown in the absence of doxycycline (Fig. 5, A to C).
Fig. 5

G13 repeats are implicated in the accumulation of replication fork intermediates.

(A) 2D gels for tet-G0-URA3 and tet-G13-URA3 under the control of a repressible tetR promoter () for time points taken at 15, 30, 45, and 60 min after synchronized cells were released from G1 cell cycle arrest. (B) The circled area show replication fork intermediates that are putatively stalled at the beginning of URA3 (Y molecules, blue circle). The reference region (red hexagon) is used to normalize between samples and determine the relative amounts of Y molecules in each treatment. (C) When transcription is enabled (−dox), the G13 construct has ~2.5 as much putative fork stalling as the G0 construct (calculated at the 30-min time point). When transcription is repressed, both G13 and G0 constructs appeared to have more Y molecules, although G13 has only ~1.13 as much replication fork intermediates as G0 (calculated at the 30-min time point). (D) Mutation rates for G0-URA3 and G13-ORF under the control of a repressible tetR promoter. −dox indicates the high-expression treatment, whereas +dox indicates repression of transcription. Significant differences were calculated using t tests; *P < 0.05. Error bars show 95% confidence intervals.

G13 repeats are implicated in the accumulation of replication fork intermediates.

(A) 2D gels for tet-G0-URA3 and tet-G13-URA3 under the control of a repressible tetR promoter () for time points taken at 15, 30, 45, and 60 min after synchronized cells were released from G1 cell cycle arrest. (B) The circled area show replication fork intermediates that are putatively stalled at the beginning of URA3 (Y molecules, blue circle). The reference region (red hexagon) is used to normalize between samples and determine the relative amounts of Y molecules in each treatment. (C) When transcription is enabled (−dox), the G13 construct has ~2.5 as much putative fork stalling as the G0 construct (calculated at the 30-min time point). When transcription is repressed, both G13 and G0 constructs appeared to have more Y molecules, although G13 has only ~1.13 as much replication fork intermediates as G0 (calculated at the 30-min time point). (D) Mutation rates for G0-URA3 and G13-ORF under the control of a repressible tetR promoter. −dox indicates the high-expression treatment, whereas +dox indicates repression of transcription. Significant differences were calculated using t tests; *P < 0.05. Error bars show 95% confidence intervals. Previous studies have shown that plasmid-based G and G repeats cause transcription-dependent replication fork stalling (, ). When we repressed transcription by supplementing the growth media with doxycycline, we found that the enrichment of Y molecules in G compared to G was reduced compared to the treatment without doxycycline (Fig. 5C). The ratio of G to G decreased, whereas the amount of actual detected replication fork intermediates increased; possible explanations for this will be discussed below. The change in replication fork intermediates caused by repression of transcription raises the possibility that the mutagenic effect of G is also changed. We measured the effect of transcriptional repression on mutation rates in tet-G-URA3 and tet-G-URA3. The mutation rates of tet-G-URA3 and tet-G-URA3 were reduced when URA3 transcription was repressed, although not significantly. Even after repression of transcription by doxycycline, the mutation rate of tet-G-URA3 remained significantly higher than that of tet-G-URA3 (Fig. 5D).

Mutagenesis downstream of G repeats is Rev1-dependent

Previously, we had proposed that repeat sequence–mediated increases in downstream mutation rate were caused by frequent recruitment of error-prone translesion DNA polymerases by sequences prone to stall the high-fidelity, housekeeping DNA polymerase (). To test this, we deleted REV1, which is required for all TLS in yeast (, ). We found that ablation of REV1 significantly reduced the mutation rate of both G-ORF (t test, P < 0.005) and GURA3 (t test, P < 0.005) (Fig. 6A). That deletion of REV1 decreases mutation rate suggests that the replication fork interruptions that are typically accommodated by Rev1-mediated TLS DNA synthesis are lethal in the rev1 mutant (). If this is the case, G sequences would cause an increased likelihood of mutation in the surrounding DNA sequence by increasing the rate of error-prone translesion DNA synthesis in that region.
Fig. 6

Expansion of polyguanine repeats occurs at a high rate and is RAD52-dependent.

(A) Mutation rates of G0 and G14-ORF strains compared to their respective rev1 deletion mutants. (B) Mutation rates of G0, G14-repeat, and G14-ORF strains, shown relative to the G0 mutation rate. (C and D) In each panel, the mutation rates of deletion mutants are shown relative to their respective nondeletion progenitor, either G0 (C), G14-ORF (C), or G14-repeat (D). Significant differences calculated using t tests are indicated by asterisks: *P < 0.05 and **P < 0.005. Error bars represent 95% confidence intervals.

Expansion of polyguanine repeats occurs at a high rate and is RAD52-dependent.

(A) Mutation rates of G0 and G14-ORF strains compared to their respective rev1 deletion mutants. (B) Mutation rates of G0, G14-repeat, and G14-ORF strains, shown relative to the G0 mutation rate. (C and D) In each panel, the mutation rates of deletion mutants are shown relative to their respective nondeletion progenitor, either G0 (C), G14-ORF (C), or G14-repeat (D). Significant differences calculated using t tests are indicated by asterisks: *P < 0.05 and **P < 0.005. Error bars represent 95% confidence intervals.

Expansion of homopolymeric repeats also occurs at the G repeat and is Rad52-dependent

It has long been established that homopolymeric repeat sequences are unstable, increasing and decreasing in repeat length at a high rate (, ). In the experiment described above, only mutations that occur in the ORF of URA3 can be recovered, even though mutations that change repeat length almost certainly occur in the G sequences of some of the individuals within the populations of yeast cells used to measure mutation rate. This is because mutations changing the length of the G repeat, which is in the 5′ untranslated region (5′UTR), do not cause the loss of URA3 function that is selected in the assay. To facilitate the capture of mutations that change the number of G’s in the G repeat, we constructed a new strain containing the G sequence, engineered upstream of an alternative URA3 sequence (URA3-w), whose function is mildly compromised (G-repeat, Fig. 1B). We had previously observed that a G repeat in the 5′UTR region of URA3-w could cause a reduction in protein translation (fig. S1). Although G-ORF-w exhibits the Ura+ phenotype of the wild-type allele, a mutation from G to G (or longer) results in the assayable loss of URA3 function, probably due to a combined effect of impaired function and reduced translation (Fig. 1B). We used this construct (G-repeat) to directly measure the mutation rate of repeat length increase from G to G or longer (contraction of the G repeat would not generate the phenotype). We found that repeat length–dependent increases in mutation rate were higher than the downstream mutation rate, with a 45-fold difference between G-repeat and G cells (Fig. 6B). Sequencing of 105 independent ura3 mutant clones of G-repeat confirmed that most of them carried an increased polyguanine repeat (G) but there were no mutations in the coding region and no large deletions (table S1).

Repeat expansion occurs independently of REV1 and RAD30

Although deletion of REV1 reduced the mutation rate within the URA3 ORF (as detected by the G-ORF construct), we found that deletion of REV1 had no effect on the mutation rate in the homopolymeric repeat as measured using the G-repeat construct (Fig. 6D). To confirm that another translesion DNA synthesis pathway was not involved, a gene essential for another translesion pathway, RAD30, was deleted and also had no effect on mutation rate. We next turned to the alternative mechanism for rescuing the stalled replication fork, HR. Rad52 is essential for the annealing of DNA strands during HR (), and its ablation causes an increase in mutation rate of approximately fivefold in G cells (Fig. 6D). The reason for this increase is that rad52 mutants depend on error-prone DNA polymerases to synthesize over the single-stranded gaps resulting from replication fork stalling (, ). Conversely, we found that deletion of RAD52 in the G-repeat strain markedly reduced URA3 mutation rates (Fig. 6D), consistent with previous work examining recombination and frameshifts underlying “adaptive mutation” in Escherichia coli (–). We checked whether rad52 deletion was able to reduce the mutation rate in G-ORF cells. However, similar to G cells, the mutation rate was increased approximately fivefold (Fig. 6C), showing that Rad52-mediated HR affects only the change of G length, not the downstream mutagenic effect of G.

G14 mutagenesis is not caused by formation of a G-quadruplex structure

G-quadruplex structures, which can induce double-strand breaks and replication fork pausing in Yeast pif1 (G-quadruplex resolvase) deletion mutants, are another potential cause of DNA replication stress (). To test whether our polyguanine sequences could form G-quadruplex structures, we compared a known G-quadruplex–forming sequence from Tetrahymena () to G, G, G, and G sequences. We designed five oligomers (given in Materials and Methods) that included either the G-quadruplex control sequence or 11 to 14 guanines in a row, each integrated into the same sequence context as the URA3 constructs used for fluctuation tests in this study. We first performed circular dichroism analysis of the oligos in ionic solutions that promote G-quadruplex formation. Circular dichroism analysis showed that the control was able to form a G-quadruplex, whereas the G11 to G14 repeat sequences could not (fig. S2, A to E). We then used DNA polymerase stop assays to test whether DNA polymerase could synthesize the complementary DNA across the single-stranded template, based on the principle that a stable secondary structure should inhibit DNA synthesis. The results show that the G-quadruplex control blocked the polymerase, whereas G to G sequences did not have the same effect (fig. S2F).

DISCUSSION

“Mutation at a distance” has been shown to occur in a number of different sequence contexts. For instance, DNA sequences that are prone to double-strand breaks have been indirectly linked to kataegis, a catastrophic mutational event that causes large clusters of mutations dispersed over tens of kilobases in cancer genomes (, ). Break-induced replication (BIR) is often initiated in response to double-strand breaks () and produces long regions of DNA in single-stranded form, which are more susceptible to mutation than double-stranded DNA. Experiments in yeast have shown that the widely distributed clusters of point mutations characteristic of kataegis result from the attack by mutagenic agents upon extended tracts of single-stranded DNA (), whereas studies in yeast (, ) and mammalian cells () have implicated the deregulation of AID/APOBEC deaminases as an alternative cause of kataegis-like mutational events. The elevated mutation rate that we observed in this study is unlikely to be caused by a similar mechanism because G-ORF mutagenesis requires REV1 whereas BIR functions independently of any translesion polymerase (). Double-strand breaks have also been implicated as the cause of the mutagenicity of long repeats known to form hairpin secondary structures (, ). The importance of double-strand breaks as a trigger for induced mutagenesis has been further reinforced by studies that directly induce double-strand breaks using HO () or I-SCE1 (). In all of these systems, increases in mutation rate are detectable in the surrounding 1 to 2 kb of sequence and dependent on TLS (–, ). Conversely, the repeat-induced mutagenesis system studied by Shishkin et al. () and Shah et al. () occurs independently of TLS and can induce mutations both upstream and downstream of the repeat sequence. Mutagenesis in this system only occurs when repeat tracts exceed the length of the Okazaki fragment. Here, we provide a tentative mechanism for how G repeats can lead to elevated mutation rates in the surrounding DNA sequence. The 2D gel results suggest that the replication fork is more prone to stall at the G-URA3 sequence than at the G-URA3 sequence. Mutation rates of the C-ORF, ORF-C, and ORF-G shown in Fig. 2C suggest that the mutagenic effect of G was dependent on the direction of replication. In this experiment, the guanines are seemingly required to be in the leading strand, and the mutagenic effect is only conferred on sequences that are replicated downstream of the guanine repeat sequence. However, this result is also consistent with the direction of transcription playing a role. G mutagenesis only occurs when transcription proceeds through G with the URA3 downstream. The question of the relative importance of replication or transcription is addressed by the insertion of G-ORF constructs in various positions in chromosomes XII and XV (Fig. 4 and table S2). Although all of these constructs are transcribed through G to URA3, the direction of replication and whether the genes are on the Watson or Crick strand vary by chromosome position (table S2). All constructs had a 4- to 18-fold increase in mutation rate, regardless of the direction of replication or the position of G on the leading or lagging strand. These results concur with previous work that has shown that G20 to G32 repeats on plasmids caused high rates of replication fork stalling that depend on transcription (, ). In agreement with our results, the equivalent cytosine repeats (C20 to C32) did not have an effect on replication fork stalling (, ). Mutation rates were not measured in these previous experiments, but in our study, even with repressed transcription, the mutation rate remained elevated (Fig. 5D). The tet repressor is known to be extremely effective at repressing expression; thus, it is unlikely that leaky expression can account for the remaining mutagenic effect in the noninduced tet-G-URA3. However, it has been shown that RNA polymerase II (RNAPII) (sufficient for hindering the DNA replication fork) can bind to the tet promoter even in the absence of induction (). Figure 5 shows that tet repression of transcription causes the G strain to accumulate more Y replication fork intermediates. This is consistent with RNAPII binding the repressed tet promoter. However, these accumulated Y fork intermediates do not translate into an increased mutation rate for G. That G does not have an elevated mutation rate despite an increase in replication fork intermediates suggests that the G sequence does more than stall the replication fork; it may also interfere with homologous repair, biasing toward TLS repair. The precise role of transcription in G13+ mutagenesis requires further work to be fully resolved. One model proposed by Lopes and co-workers (), and later by Huang et al. (), postulates that after replication fork stalling, the replication complex decouples from the fork and reinitiates replication downstream. This leaves a patch of single-stranded DNA to be filled using either HR or TLS. We propose that G repeats confer a higher probability of replication fork stalling than other DNA sequences (Fig. 7). This leads to more TLS activity in the region and, therefore, a higher mutation rate. There is mounting evidence that in early S phase, high-fidelity HR is the synthesis mechanism of choice, whereas in late S phase and early G2 phase, TLS is preferred (, , ). Our finding of a strong correlation between G-induced mutation and DNA replication timing adds support to this hypothesis by suggesting that G repeats in later-replicating DNA are more likely to depend on TLS for repair, leading to a higher average mutation rate in this region.
Fig. 7

Model for the outcome of G13+-induced replication fork stalling.

(I) The replication fork can proceed in either direction, but the transcription complex must encounter the G13+ repeat before the transcribed gene. (II) Transcription stalls at the G13+ sequence. (III) The replication fork stalls at the G13+ sequence; stalling is more likely if there is a stalled transcription complex already present. (IV) The replication fork detaches from the template and reinitiates replication downstream, leaving a patch of singlestranded DNA that is 800 to 3000 bp in length. (V) The DNA complementary to the single-stranded gap is synthesized using either Rad52-dependent HR (detected using the G14-repeat construct) or Rev1-dependent TLS (detected using the G14-ORF construct) to bypass the difficult-to-replicate region.

Model for the outcome of G13+-induced replication fork stalling.

(I) The replication fork can proceed in either direction, but the transcription complex must encounter the G13+ repeat before the transcribed gene. (II) Transcription stalls at the G13+ sequence. (III) The replication fork stalls at the G13+ sequence; stalling is more likely if there is a stalled transcription complex already present. (IV) The replication fork detaches from the template and reinitiates replication downstream, leaving a patch of singlestranded DNA that is 800 to 3000 bp in length. (V) The DNA complementary to the single-stranded gap is synthesized using either Rad52-dependent HR (detected using the G14-repeat construct) or Rev1-dependent TLS (detected using the G14-ORF construct) to bypass the difficult-to-replicate region. In one recent study, mismatch repair efficiency was found to vary with time of replication (). We show that most of the difference in mutation rates between G-URA3 and G-URA3 is dependent on having a functional REV1. We also show that G inserts have a stronger correlation between replication timing and mutation rate than G inserts. It follows that this stronger correlation is also REV1-dependent, and in support of this, a previous comprehensive analysis of G-URA3 inserts at 49 sites on chromosome 6 found that the deletion of REV1 abolished the correlation between replication timing and mutation rate (). These results suggest that mismatch repair plays a background role in the correlation between mutation rate and replication timing that we observed in this study. The G repeats that we study do not require an artificial increase in mutation rate or polymerase deletion for the effect to be detectable. Moreover, G14 repeats are short, simple, and present in most genomes, suggesting that G14 repeats could play an important role in genome evolution. The tight correlation between repeat-induced mutation and replication timing (R = 0.89, P = 0.007) suggests that the degree to which a repeat sequence will alter mutation rate can be directly combined with knowledge of replication timing. Because such repeats are common throughout all genomes, it is not implausible that these repeats could have implications as locally acting mutator sequences. In addition, short repeat sequences are easy to engineer and could be applied to synthetic biology applications for speeding up evolution of a target gene.

MATERIALS AND METHODS

Strain construction

All strains were constructed in a strain isogenic with W303 (MAT his3-11,15 leu2-3,112 trp1-1 ura3 ade2-1). Homopolymeric nucleotide strains were constructed by amplifying URA3, with primers containing a homopolymeric nucleotide tract at the position between −4 and −5 of URA3, and the resultant polymerase chain reaction (PCR) product was transformed into urayeast cells using the LiAc transformation method. The URA3 genes of transformants were amplified using PCR, and the sequences were confirmed by Sanger sequencing. Different mutant strains were constructed by amplifying the G418 insertion mutant for each gene of interest from the whole-genome deletion collection. Strains were transformed with PCR products and deletion mutants selected based on their resistance to G418. G-repeat was constructed using an alternative URA3 sequence that carries a point mutation at position 736 (val247ile), which has slightly reduced function compared to the wild-type URA3 gene. Change in repeat length from G to G in the G-repeat construct reduced protein translation such that cells containing this mutation were 5-FOA–resistant and detectable using the mutation rate assay.

Fluctuation assays

Strains to be assayed were grown overnight in 3-ml complete supplement mixture (CSM)–uracil medium, diluted 10−4, and then inoculated into 100-μl cultures so that there were approximately 1000 cells per culture. At least 24 independent cultures were used per assay, and each assay was repeated at least three times. Cultures were left overnight at 30°C until the cultures were assessed to have reached a suitable density, and then the entire culture, except for 5 μl, was plated onto predried 5-FOA plates to detect ura3 mutants that were 5-FOA–resistant. The remaining culture was pooled and diluted, and then the cell count was assayed using a Scepter cell counter. Mutation rates were calculated using the maximum likelihood method (). To measure the background mutation rate at a site distal from the URA3 locus, the wild-type CAN1 locus was restored in G-URA3 and G-ORF strains. Mutation rate assays were carried out the same as above except that mutations in the CAN1 gene were detected by plating on CSM-arginine plates supplemented with canavanine (60 μg/ml).

2D gel analysis

Strains for the 2D gel analysis were constructed by amplifying tet-G-URA3 fragments from a plasmid for insertion by HR into the intergenic region between YCL052C and YCL054W at position 34,028 on chromosome III. After we introduced the tet-G-URA3 fragments into S. cerevisiae, we plated the cells on selection plates lacking uracil. Colonies were picked and transformation was confirmed by PCR and Sanger sequencing. For the 2D gel experiments, log-phase cells [OD600 (optical density at 600 nm) ≈ 0.5] were synchronized at G1 by α factor (10 μg/ml) [with or without doxycycline (10 μg/ml)] for 2.5 hours and then released in YPD (yeast extract, peptone, and dextrose) medium (with or without doxycycline) at 25°C. Cells were harvested at 15, 30, 45, and 60 min and terminated immediately using sodium azide, and then in vivo psoralen cross-linking was undertaken as previously described (, ). Genomic DNA was digested using Hind III. First-dimension gels were made of 0.35% agarose and second-dimension gels were made of 1% agarose. A DNA fragment correlated to the yeast genome positions 34,028 to 34,999 was used to make the probe. Replication intermediate signals were quantified as previously described ().

DNA synthesis stop assay

To determine whether G11 to G14 sequences could form G-quadruplex structures, we conducted experiments comparing a known G-quadruplex–forming sequence from Tetrahymena (GGGTTGGGTTGGGTTGGGTT) () to G, G, G, and G sequences. We designed oligonucleotides composed of either homopolymeric runs of 11 to 14 G’s in a row or the G-quadruplex sequence, integrated into the same sequence context as the genetic constructs used to measure mutation rate in this study. Following Han and co-workers (), a radiolabeled primer (Γ-32P), shown below, was annealed with template DNA (10 nM) in buffer containing 5 mM KCl. To initiate the sequencing reactions, MgCl (3 μM), Taq polymerase (2.5 U per reaction), and deoxynucleotide triphosphates (final concentration of 100 μM) were added and the mix was incubated at either 37° or 55°C. The reactions were stopped and then run on 12% polyacrylamide gel. If the template forms a G-quadruplex, then DNA synthesis will not be completed, and no band can be visualized on the polyacrylamide gel. Primer: [CTGCACAGAACAAAAACCTGCAGGAAACG]. Templates: G-quadruplex control, GCTTTCGACATGATT(GGGTTGGGTTGGGTTGGGTT)TATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAG; G11, d[GCTTTCGACATGATTGGGGGGGGGGGTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAG; G12, GCTTTCGACATGATTGGGGGGGGGGGGTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAG; G13, GCTTTCGACATGATTGGGGGGGGGGGGGTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAG; and G14, GCTTTCGACATGATTGGGGGGGGGGGGGGTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAG.

Circular dichroism

Following the work by Dexheimer et al. (), we incubated cuvettes containing 5 μM of oligomeric DNA dissolved in tris-HCl (50 mM, pH 7.6) containing either 100 mM KCl or 100 mM NaCl for 5 min at 90°C and then let them slowly cool to 25°C. Circular dichroism spectra were measured on a spectropolarimeter (J-815, JASCO) using a 1-cm path length quartz cuvette, over a range of 200 to 320 nm, with a response time of 1 s and a scanning speed of 100 nm min−1. Three replicate measurements were taken and measured at 25°C.
  67 in total

Review 1.  The distribution of rates of spontaneous mutation over viruses, prokaryotes, and eukaryotes.

Authors:  J W Drake
Journal:  Ann N Y Acad Sci       Date:  1999-05-18       Impact factor: 5.691

2.  Tight control of gene expression in mammalian cells by tetracycline-responsive promoters.

Authors:  M Gossen; H Bujard
Journal:  Proc Natl Acad Sci U S A       Date:  1992-06-15       Impact factor: 11.205

3.  A sodium-potassium switch in the formation of four-stranded G4-DNA.

Authors:  D Sen; W Gilbert
Journal:  Nature       Date:  1990-03-29       Impact factor: 49.962

4.  Analysis of replication intermediates by two-dimensional agarose gel electrophoresis.

Authors:  K L Friedman; B J Brewer
Journal:  Methods Enzymol       Date:  1995       Impact factor: 1.600

5.  Adaptive reversion of a frameshift mutation in Escherichia coli by simple base deletions in homopolymeric runs.

Authors:  P L Foster; J M Trimarchi
Journal:  Science       Date:  1994-07-15       Impact factor: 47.728

6.  Adaptive mutation by deletions in small mononucleotide repeats.

Authors:  S M Rosenberg; S Longerich; P Gee; R S Harris
Journal:  Science       Date:  1994-07-15       Impact factor: 47.728

7.  The preference for error-free or error-prone postreplication repair in Saccharomyces cerevisiae exposed to low-dose methyl methanesulfonate is cell cycle dependent.

Authors:  Dongqing Huang; Brian D Piening; Amanda G Paulovich
Journal:  Mol Cell Biol       Date:  2013-02-04       Impact factor: 4.272

8.  Human mutation rate associated with DNA replication timing.

Authors:  John A Stamatoyannopoulos; Ivan Adzhubei; Robert E Thurman; Gregory V Kryukov; Sergei M Mirkin; Shamil R Sunyaev
Journal:  Nat Genet       Date:  2009-03-15       Impact factor: 38.330

9.  Break-induced replication is a source of mutation clusters underlying kataegis.

Authors:  Cynthia J Sakofsky; Steven A Roberts; Ewa Malc; Piotr A Mieczkowski; Michael A Resnick; Dmitry A Gordenin; Anna Malkova
Journal:  Cell Rep       Date:  2014-05-29       Impact factor: 9.423

10.  Heterogeneous polymerase fidelity and mismatch repair bias genome variation and composition.

Authors:  Scott A Lujan; Anders R Clausen; Alan B Clark; Heather K MacAlpine; David M MacAlpine; Ewa P Malc; Piotr A Mieczkowski; Adam B Burkholder; David C Fargo; Dmitry A Gordenin; Thomas A Kunkel
Journal:  Genome Res       Date:  2014-09-12       Impact factor: 9.043

View more
  2 in total

1.  Constraints and consequences of the emergence of amino acid repeats in eukaryotic proteins.

Authors:  Sreenivas Chavali; Pavithra L Chavali; Guilhem Chalancon; Natalia Sanchez de Groot; Rita Gemayel; Natasha S Latysheva; Elizabeth Ing-Simmons; Kevin J Verstrepen; Santhanam Balaji; M Madan Babu
Journal:  Nat Struct Mol Biol       Date:  2017-08-14       Impact factor: 15.369

2.  GIGYF1 disruption associates with autism and impaired IGF-1R signaling.

Authors:  Guodong Chen; Bin Yu; Senwei Tan; Jieqiong Tan; Xiangbin Jia; Qiumeng Zhang; Xiaolei Zhang; Qian Jiang; Yue Hua; Yaoling Han; Shengjie Luo; Kendra Hoekzema; Raphael A Bernier; Rachel K Earl; Evangeline C Kurtz-Nelson; Michaela J Idleburg; Suneeta Madan-Khetarpal; Rebecca Clark; Jessica Sebastian; Alberto Fernandez-Jaen; Sara Alvarez; Staci D King; Luiza Lp Ramos; Mara Lucia Sf Santos; Donna M Martin; Dan Brooks; Joseph D Symonds; Ioana Cutcutache; Qian Pan; Zhengmao Hu; Ling Yuan; Evan E Eichler; Kun Xia; Hui Guo
Journal:  J Clin Invest       Date:  2022-10-03       Impact factor: 19.456

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.