In addition to environmental factors and intrinsic variations in base substitution rates, specific genome-destabilizing mutations can shape the mutational trajectory of genomes. How specific alleles influence the nature and position of accumulated mutations in a genomic context is largely unknown. Understanding the impact of genome-destabilizing alleles is particularly relevant to cancer genomes where biased mutational signatures are identifiable. We first created a more complete picture of cellular pathways that impact mutation rate using a primary screen to identify essential Saccharomyces cerevisiae gene mutations that cause mutator phenotypes. Drawing primarily on new alleles identified in this resource, we measure the impact of diverse mutator alleles on mutation patterns directly by whole-genome sequencing of 68 mutation-accumulation strains derived from wild-type and 11 parental mutator genotypes. The accumulated mutations differ across mutator strains, displaying base-substitution biases, allele-specific mutation hotspots, and break-associated mutation clustering. For example, in mutants of POLα and the Cdc13-Stn1-Ten1 complex, we find a distinct subtelomeric bias for mutations that we show is independent of the target sequence. Together our data suggest that specific genome-instability mutations are sufficient to drive discrete mutational signatures, some of which share properties with mutation patterns seen in tumors. Thus, in a population of cells, genome-instability mutations could influence clonal evolution by establishing discrete mutational trajectories for genomes.
In addition to environmental factors and intrinsic variations in base substitution rates, specific genome-destabilizing mutations can shape the mutational trajectory of genomes. How specific alleles influence the nature and position of accumulated mutations in a genomic context is largely unknown. Understanding the impact of genome-destabilizing alleles is particularly relevant to cancer genomes where biased mutational signatures are identifiable. We first created a more complete picture of cellular pathways that impact mutation rate using a primary screen to identify essential Saccharomyces cerevisiae gene mutations that cause mutator phenotypes. Drawing primarily on new alleles identified in this resource, we measure the impact of diverse mutator alleles on mutation patterns directly by whole-genome sequencing of 68 mutation-accumulation strains derived from wild-type and 11 parental mutator genotypes. The accumulated mutations differ across mutator strains, displaying base-substitution biases, allele-specific mutation hotspots, and break-associated mutation clustering. For example, in mutants of POLα and the Cdc13-Stn1-Ten1 complex, we find a distinct subtelomeric bias for mutations that we show is independent of the target sequence. Together our data suggest that specific genome-instability mutations are sufficient to drive discrete mutational signatures, some of which share properties with mutation patterns seen in tumors. Thus, in a population of cells, genome-instability mutations could influence clonal evolution by establishing discrete mutational trajectories for genomes.
EVOLUTIONARY analysis and laboratory studies show that mutation rates vary across chromosomes in unperturbed cells in association with factors such as base composition, repair efficiency, transcription, or replication timing (reviewed in Nishant ). More recently, observations of mutation clustering in long stretches of single-stranded DNA (ssDNA) have been posited as conserved mutational mechanisms operating in tumors and methylmethane sulfonate (MMS)-treated yeast cells (Roberts ). However, the influence of specific mutations on mutational patterns in the genome is usually not well characterized. Nonetheless, the spectrum of mutations will strongly influence the rates at which important phenotypes evolve. Recent work in Escherichia coli nicely demonstrates this concept by showing that different mutator alleles more potently adapt to different antibiotics based on the penetrance of the resistance mutations conferred by each mutator (Couce ). A handful of studies in models such as Caenorhabditis elegans and Saccharomyces cerevisiae have linked mutator alleles to mutation patterns on a genome-wide scale (e.g., dog-1 deficiency mutating G-quadruplexes, mismatch-repair-deficiency mutating homopolymers, DNA polymerase δ variants preferentially mutating lagging strands) (Cheung ; Larrea ; Zanders ). However, the influence of most mutator alleles on mutation patterns is not understood at this level. Thus, not only the mutation rate increase conferred by a mutator allele, but also its unique mutational signature (i.e., preferred type of mutation and bias to a genomic context, if any) generates its effect on genome stability and evolution.Tumor genomes accumulate alterations during oncogenesis. The accumulation of mutations, aneuploidies, or epigenetic changes frequently exceeds the normal rate of such events due to a predisposing mutation or environmental state that increases genetic instability. This in turn acts as an enabling characteristic of oncogenesis in most tumors (Hanahan and Weinberg 2011). Increased genetic instability, either in the form of increased mutation rate or chromosome instability (CIN), may promote selection of oncogenic cells from preoncogenic populations by increasing the likelihood that the requisite constellation of oncogenic mutations will occur in fewer cell divisions (Stratton ; Loeb 2011). Thus, early genome-destabilizing mutations have the potential to shape the evolutionary trajectory of precancerous cells and intratumoral heterogeneity in large tumor cell populations.Analysis of tumor genomes has uncovered discrete mutational signatures operating within and between tumor types (Nik-Zainal ; Alexandrov ). These powerful computational approaches have demonstrated that multiple mutational processes operate simultaneously in tumors (Nik-Zainal ; Alexandrov ). Mutational signatures can result from environmental exposure to genotoxins such as ultraviolet light or components of cigarette smoke (Pleasance ,b). Other signatures show the signs of endogenous processes such as the inappropriate action of APOBEC family cytosine deaminases (Nik-Zainal ; Alexandrov ; Burns ). Indeed, APOBECs from various sources are sufficient to induce breast-cancer-like patterns of mutations (i.e., clustered hypermutation of C:G basepairs or “kataegis”) when expressed ectopically in yeast (Lada ; Alexandrov ; Taylor ). Importantly, mutant alleles in genome-stability factors such as the DNA repair proteins BRCA1 and BRCA2 also correlate with a characteristic pattern of mutations (Nik-Zainal ).Screens in yeast have identified dozens of genome-destabilizing “mutator” alleles that function in a handful of cellular pathways (Huang ; Smith ). However, these efforts have focused on nonessential gene deletions available from the yeast knockout collection and thus are missing the fraction of mutators encoded by essential genes. In this study we probe mutant alleles of >500 essential yeast genes for those that increase the forward mutation rate, highlighting the role of the DNA replication machinery in suppressing mutation accumulation. Analyzing whole genome sequences for 68 mutation accumulation lines derived from 12 parental genotypes, reveals examples of mutator-allele-associated base-substitution biases, clustered mutations, and locus-specific patterns of mutations. These observations show that different genome destabilizing lesions can result in discrete mutational trajectories for a genome and thus, for a population of cells, could influence clonal evolution.
Materials and Methods
Yeast growth, microscopy, and fluctuation analyses
Strains used are listed in Supporting Information, Table S7. Yeast were grown on rich media except for , fluctuation, or recombination analyses, which were conducted as described (Lang and Murray 2008; Stirling ). Rates per generation were calculated from at least 12 (recombination rate) or 18 (mutation rate) independent cultures using the FALCOR program (Hall ). For microscopy, logarithmic cultures in synthetic medium were shifted to 37° for 2 hr mounted on concanavalin-coated slides, imaged using Metamorph (Molecular Devices), and scored using ImageJ (rsbweb.nih.gov/ij/) as described (Stirling ).
Mutation accumulation
Overnight cultures of parent clones were diluted into parallel cultures in 96 DeepWell plates. Cultures saturated over 3 days were diluted 10,000-fold in fresh medium and the process was repeated until ∼195 generations elapsed. Temperature sensitivity (ts) was confirmed by spot assays at 37° for the ts-alleles to remove wells with revertants. For evolved wild type (WT), Δ, Δ, and ts-wells, single colonies were isolated and genomic DNA was prepared using two rounds of phenol-chloroform extraction and ethanol precipitation with an intervening RNaseA step (75 μg/ml RNaseA at 37°, 30 min).
Sequencing, data analysis, and mutation identification
Whole-genome sequencing was done using the Illumina Hi-Seq2000 platform. Sequence files are deposited at the National Center for Biotechnology Information (NCBI) Sequence Read Archive (http://www.ncbi.nlm.nih.gov/sra) (no. PRJNA219315). After whole-genome sequencing on 12 parental strains and 4–6 evolved progeny strains, postquality control reads were aligned to the S. cerevisiae reference genome (University of California Santa Cruz version sacCer2) using Burrows-Wheeler Aligner 0.5.7 (Li and Durbin 2009), using default parameters. For each pair of parent–progeny, joint variant calling was carried out using SAMtools 0.1.13 utilities (Li ), with the parameters −C50 to decrease false calls from reads with excessive mismatches to the reference. A score based on log ratio of genotype likelihoods was used to identify variants in the progeny, with a threshold of 20 (Li ). Positions of variants determined in the parental strains were ignored in our analysis regardless of their quality score to reduce false positive calls. The remaining variants were annotated with SnpEff 3.0 (Cingolani ), using the Ensembl sacCer2.61 database (Flicek ). Copy-number variants were detected using a modified version of CNAseq developed at the Michael Smith Genome Sciences Centre, as described (Shah ; Jones ). As an additional quality control, we manually checked variants with Phred-scaled variant quality scores <20 using the Integrated Genomics Viewer (Broad Institute) and discarded those with poor read support.
Base substitution and flanking nucleotide bias
Base substitution biases were calculated using the Fisher exact test to compare with a wild-type mutation spectrum derived from the literature (Lynch ; Lang ). As in the literature, each mutation, regardless of strand, is expressed as a change originating in a C or T (Nik-Zainal ). A Holm–Bonferroni approach was applied to Fisher’s test P-values to correct for multiple hypothesis testing; corrected P-values are reported.
Mutation clustering analysis
Clustering was assessed essentially as described (Roberts ). Intermutation distances for variants in the genome of a single isolate were calculated and those appearing within a 100-kb window were tested statistically. For the purposes of this analysis, mutations within 10 bp were considered “complex mutations” and treated as a single event, thus not constituting clusters unto themselves. Using the intermutation distance (i.e., cluster size) in base pairs, the number of mutations observed in a candidate cluster and the probability of finding a mutation at a given location in the genome, we calculated the cumulative probability on a negative binomial distribution of observing the intermutation distance at random. The probability of identifying a mutation at a given location was determined by dividing the total number of mutations (n) for a particular genotype by the number of base pairs sequenced for that strain (no. of genomes sequenced × 12,162,995 bp). Calculations were done using MatLab V7.12.0.635. To account for multiple hypothesis testing, we divided our original significance threshold (P < 0.01) by the number of putative clusters analyzed (103), establishing a corrected significance threshold of P < 9.7 × 10−5.
Results and Discussion
Systematic identification of mutator alleles in essential genes
Our previous efforts to catalog yeastCIN genes revealed a bias toward essential genes; thus we reasoned that there were likely to be unrecognized essential genes operating to suppress mutations. We surveyed 813 alleles in 525 essential genes (∼50% of total) (Ben-Aroya ; Breslow ; Li ) for increased mutation frequencies, identifying 47 mutator alleles in 38 genes (Figure 1A). Only those alleles that retested as mutators across five independent patches are included (Table S1). To assess the sensitivity of our assay, we quantified mutation rates for alleles of 33 mutator genes (Figure 1B). Fluctuation analysis revealed that all but two alleles had a twofold or higher increase in mutation rate at 30°, suggesting that our primary screen was as sensitive to changes in mutation rate as previous screens and that our retesting effectively removed false positives (Huang ).
Figure 1
A catalog of yeast mutator alleles. (A) Schematic of essential mutator allele screen. Representative canavanine-resistant papillae are shown in the center panel. Right, cellular functional groups identified in the screen. (B) Quantification of mutation rate by fluctuation analysis of newly identified mutator alleles. Error bars represent 95% confidence intervals. (C) Functional groupings of all known yeast mutator alleles from the literature and this study (Table S1).
A catalog of yeast mutator alleles. (A) Schematic of essential mutator allele screen. Representative canavanine-resistant papillae are shown in the center panel. Right, cellular functional groups identified in the screen. (B) Quantification of mutation rate by fluctuation analysis of newly identified mutator alleles. Error bars represent 95% confidence intervals. (C) Functional groupings of all known yeast mutator alleles from the literature and this study (Table S1).Combining our dataset with the Saccharomyces Genome Database phenotype term “mutation frequency: increased,” we compiled the list of yeast genes with reported mutator phenotypes (Table S1). These 127 genes represent a community survey of >5000 yeast genes (∼85% of total) for mutator phenotypes, thus most cellular pathways that suppress mutations are probably represented. Functional enrichment analysis of the mutator alleles highlights the dominant role of the DNA repair and replication machinery in suppressing mutations (Figure 1C). Gene Ontology analysis showed that essentially all DNA repair pathways are significantly enriched (i.e., P < 10−8 for base-excision repair, nucleotide excision repair, mismatch repair, break-induced replication, homologous recombination, and nonhomologous end joining). Nearly 90% of the mutators likely function at least partially in the nucleus, consistent with more direct mechanisms in preventing a mutator phenotype compared with CIN, for which ∼40% of CIN genes function outside the nucleus (Stirling ). Mutants affecting the nucleotide pool, oxidative stress response, or mitochondrial function make up the only nonnuclear functional groups, consistent with the established potential for these pathways to influence nuclear genome integrity indirectly through their effects on oxidative base damage and Fe-S cluster formation (Huang ; Veatch ). A total of 72 of 127 mutator alleles compiled here also have a reported CIN phenotype, highlighting the considerable overlap of the two phenotypes. Nonetheless, this list of essential mutator alleles enhances the resource of genome-destabilizing mutations available in yeast.
Determining the mutation spectra driven by different mutator alleles
The marker used as a primary screen for mutator alleles has been sequenced extensively to describe the mutational spectrum of mutator alleles (Huang ). However, only tests mutations in ∼1.8 kb of the ∼12-Mb yeast genome and isolating CANr strains requires loss-of-function mutations, biasing the mutation spectrum toward frameshifts, stop codons, and a subset of amino acid substitutions. Therefore, it is plausible that different mutator alleles will selectively drive mutation in the genome based on a set of parameters that may or may not be represented in . A well-characterized example of this phenomenon is the association of mismatch repair deficiency with increased mutation rates in homopolymeric or microsatellite sequences (Zanders ). Thus, while is a convenient marker for identifying mutator alleles, analysis of diverse sequence contexts is more likely to reveal any allele-specific locus biases that exist.To explore the mutational spectrum for diverse mutator alleles in a whole-genome context we devised a mutation accumulation experiment. Eleven mutator alleles were chosen for this analysis: two strong and well-characterized mutators from the literature, namely deletions of the homologous recombination protein and the peroxiredoxin (Huang and Kolodner 2005; Mortensen ), and nine mutator alleles identified in our primary screen of essential genes (Figure 1). The alleles were chosen partly to represent diverse cellular pathways [e.g., homologous recombination (Δ), oxidative stress tolerance (Δ), splicing (), transcription (), mitochondrial function (), and telomere capping ()] and also acknowledge the inherent bias toward DNA replication by querying different perturbations in replication machinery (i.e., , , , , and ). For each strain, a single parental clone was split into parallel cultures, grown for ∼195 generations by repeated dilution, and single colonies were derived from the endpoint cultures. Genomic DNA from the parent and four to six independently evolved progeny was whole-genome sequenced for a total of 12 starting strain backgrounds, producing 80 haploid whole-yeast-genome sequences at an average of 57-fold sequence coverage (schematized in Figure 2A; Table S2). Subtracting the parental genotype from the evolved strains identifies accumulated variants. This experiment captured single-nucleotide variants (SNVs), copy-number variants (CNVs, i.e., deletions or amplifications of large chromosomal regions), small insertion/deletion mutations (indels), and a few other types of chromosomal rearrangements (Figure 2B and Table S3). Based on published mutation rates (Lang and Murray 2008; Lynch ), we predicted zero to two mutations per WT genome and observed a slightly higher but comparable rate (median = 2.5 mutations/genome; range, two to four mutations). In contrast, the mutator alleles produced ∼2- to 10-fold more mutations on average (Figure 2B). SNVs were the most frequently identified mutation. One limitation of short read sequencing is difficulty in detecting indels between ∼15 and 50 bp. However, the indels we did detect were likely bona fide since the genomes of parents and evolved strains were compared relatively, using the same technology.
Figure 2
Determining the genomic mutation spectrum of mutator alleles. (A) Schematic of mutation accumulation experiment (see main text). Colors denote independent genomes. (B) Summary of mutations detected for mutator alleles. Fold increase in mutations/genome compared to WT are indicated above each bar. WGS, whole-genome sequencing; SV, structural variant; CNV, copy-number variant; SNV, single-nucleotide variant; indel, small insertion or deletion.
Determining the genomic mutation spectrum of mutator alleles. (A) Schematic of mutation accumulation experiment (see main text). Colors denote independent genomes. (B) Summary of mutations detected for mutator alleles. Fold increase in mutations/genome compared to WT are indicated above each bar. WGS, whole-genome sequencing; SV, structural variant; CNV, copy-number variant; SNV, single-nucleotide variant; indel, small insertion or deletion.CNVs were not observed in the WT isolates but 7 of 11 mutator allele sets accumulated at least one predicted CNV during the experiment, including whole chromosome gains and segmental gains and losses (Table S3 and Table S4). Subtelomeric gains and losses were the most common type of CNV, presumably because large deletions in other parts of the genome would be lethal to a haploid cell and because telomeres represent repetitive, difficult-to-replicate regions prone to mutation (Nishant ). Other types of structural variants were rarely detected but included large insertions, deletions, and inversions (Table S3 and Table S4).
Copy-number variants linked to hyperrecombination
In analyzing CNVs, we noted some common features of segmental changes internal to the chromosome. These CNVs were usually <20 kb, were invariably flanked by repetitive sequences (e.g., paralogues, paralogues, ncRNAs, and Ty1 retrotransposons) and often contained an origin of replication. The nature of the CNVs indicated that unscheduled recombination events were likely taking place in these mutator strains, potentially linked to defective replication fork progression. Indeed some CNVs showed direct evidence of recombination (e.g., homozygosity of and paralogues, Figure 3A) and others are known to be recombination hotspots (St Charles and Petes 2013). It is worth noting that additional events would almost certainly be evident in diploids where larger deletions or intrachromosomal rearrangements, lethal to the haploid cells in our experiment, could be tolerated.
Figure 3
Linking segmental copy-number changes to recombination. (A) Example of segmental deletion in pol1-ts. Integrative Genomics Viewer of mapped reads shows a deletion encompassing ARS432, flanked by the HXT6/HXT7 paralogues. Loss of sequence heterogeneity between HXT6 and -7 (dotted boxes) supports a deletion by intrachromosomal recombination. (B) Direct repeat recombination rates in selected mutator alleles. Error bars represent 95% confidence intervals. (C) Rad52-YFP foci accumulation in selected mutator alleles. Blue bars were significantly different from WT (P < 0.01). Error bars are the standard error of the mean.
Linking segmental copy-number changes to recombination. (A) Example of segmental deletion in pol1-ts. Integrative Genomics Viewer of mapped reads shows a deletion encompassing ARS432, flanked by the HXT6/HXT7 paralogues. Loss of sequence heterogeneity between HXT6 and -7 (dotted boxes) supports a deletion by intrachromosomal recombination. (B) Direct repeat recombination rates in selected mutator alleles. Error bars represent 95% confidence intervals. (C) Rad52-YFP foci accumulation in selected mutator alleles. Blue bars were significantly different from WT (P < 0.01). Error bars are the standard error of the mean.Together these observations led us to hypothesize that many observed CNVs could be due to increases in homologous recombination. Using a plasmid-based assay for direct repeat recombination, we found that strains with the most segmental CNVs (i.e., , , and Δ) had significant increases in direct repeat recombination (Figure 3B). Consistently, several of these strains (e.g., , and mutants) have reported increases in Rad52 foci, indicative of ongoing DNA repair, whereas the Rad52 foci status of is unknown (Ragu ; Stirling ). We scored a subset of mutator alleles from our screen for Rad52-YFP foci and found that , along with other replication and transcription mutants, cause significant increases in Rad52 foci (Figure 3C and Table S1; Stirling ). We suggest that some of the identified CNVs are driven by inappropriate recombination events due to increased DNA damage. While surprising for a circular plasmid given the role of Stn1 in telomere capping, the observed increase in plasmid-based recombination rate in cells supports its recently described role in global DNA replication fork restart via interactions with the POLα complex (Stewart ).
Varied base-substitution patterns originating in different mutator alleles
We pooled accumulated mutations for a given parental genotype to assess allele-associated mutation signatures. Variants were classified as originating in the pyrimidine base to simplify SNVs into six categories (i.e., C to T or T to C transition and T to A, T to G, C to A, and C to G transversions) (Figure 4A and Table S5). Since we did not accumulate enough mutations in our WT strain to confidently set a baseline, we relied on the whole-genome and marker sequencing data compiled in the literature to set a standard of the yeast mutational spectrum (Figure 4B) (Lynch ; Lang ).
Figure 4
Mutator allele-driven biases in mutation type and position. (A) Base substitution types across mutator allele backgrounds. Shown are the raw number of each transversion (reds) and transition (blues) detected in the mutator genomes. (B) Proportion of mutation types. Selected data from A, expressed as a proportion, and its comparison with a WT spectrum (Lynch , 2013), reveal significant variations from the WT. P-values represent Holm–Bonferroni corrected results of a Fisher’s exact test (*<0.05). (C) Mutation position as a percentage of chromosome length. Relative mutation position along the 16 chromosomes in stn1-13 and pol1-ts differ significantly from rad52Δ (Kolmogorov–Smirnov P < 0.05). The x-axis (mutation rank) reflects the position of each mutation in an ordered list, expressed as a proportion of the total.
Mutator allele-driven biases in mutation type and position. (A) Base substitution types across mutator allele backgrounds. Shown are the raw number of each transversion (reds) and transition (blues) detected in the mutator genomes. (B) Proportion of mutation types. Selected data from A, expressed as a proportion, and its comparison with a WT spectrum (Lynch , 2013), reveal significant variations from the WT. P-values represent Holm–Bonferroni corrected results of a Fisher’s exact test (*<0.05). (C) Mutation position as a percentage of chromosome length. Relative mutation position along the 16 chromosomes in stn1-13 and pol1-ts differ significantly from rad52Δ (Kolmogorov–Smirnov P < 0.05). The x-axis (mutation rank) reflects the position of each mutation in an ordered list, expressed as a proportion of the total.Even combining all the genomes sequenced for a given strain, we accumulated relatively few base changes for each strain and could not universally predict mutation biases. Previous studies with similar numbers of variants have focused on mutators that create very well-defined signatures (e.g., indels in homopolymeric sequence for mismatch repair defects) (Zanders ; Ma ). Other studies have used strains with extremely high mutation rates due to multiple mutator alleles (Larrea ). Remarkably, we did observe significant shifts in the observed mutation type for , , and alleles using Fisher’s exact test (Holm–Bonferroni corrected P < 0.05) (Figure 4B). Δ exhibited candidate variation in the Fisher test but did not meet our significance threshold after correction for multiple hypothesis testing (Figure 4B). The driving force of mutation bias for and appears to be a proportionate increase in transition frequency, while the spectrum of is more complex (e.g., double the number of T to A transversions). Additional refinement and analyses of the context of base-substitution bias in such genome-wide studies will require larger numbers of variants and a continual refinement of the true WT mutation spectrum.Limiting analysis to strains with >50 genic SNVs, we also observed a weak transcriptional strand bias in Δ mutations (i.e., increased C to T mutations in the transcribed strand), suggesting that inability to complete transcription-associated recombination may lead to an error-prone mechanism of repair in these cells (Figure S1). While we do not understand the mechanisms in each case, as has been seen in analysis of tumor genomes (Nik-Zainal ), our data suggest that transcription could have a significant allele-specific influence on mutation spectra.
Regional and functional genomic elements dictate mutation position
Biased mutation position could reflect specific chromatin environments or functional features of the genome that interact with a mutator allele to drive mutations in a specific locus. Relationships with features such as transcription, nucleotide content, or replication timing have previously been noted in cancer and model organisms (Lang and Murray 2011; Drier ). We first analyzed the relative position of mutations along the length of chromosomes irrespective of sequence features, normalizing mutation positions to the position along the chromosome from 0 to 100%. Kolmogorov–Smirnov tests showed that only the and mutation patterns differed significantly from an idealized distribution and from other alleles patterns (Figure 4C). The and mutation profiles showed hotspots of mutation in subtelomeric regions not seen in other alleles. For this bias was expected since previous work demonstrated that cells accumulate ssDNA in telomeric regions (Grandin ).We investigated potential correlations between mutation position in a given mutator allele and a variety of functional features of the genome. Generally, we did not see significant associations with GC content or with transcribed regions of the genome. The one exception to this was for , which showed increased mutations outside of genes, likely associated with mutation bias to gene-poor subtelomeres (Figure S2 and see below). Similar to observations in human evolutionary studies and in cancer genomes, we found that mutations in 5 of 11 mutators occur in regions with later average replication timing compared to the rest of the genome (Figure S2) (Raghuraman ; Stamatoyannopoulos ; Lang and Murray 2011; Drier ). Possible explanations include accumulation of damage prone ssDNA in late replicating regions (Stamatoyannopoulos ) or may relate to the fact that some translesion polymerases are not expressed until late in S phase (Waters and Walker 2006). It has been suggested that late in replication error-prone translesion synthesis becomes more important to complete replication of DNA associated with stalled replisomes (Lang and Murray 2011). These observations support the notion that late replicating regions may be sensitized sites that are revealed in certain mutator allele genetic backgrounds.
Increases in mutation clustering and region-specific mutation rates
Mutation clustering may arise due to the coordination of mutational events or as part of hotspots of mutation. It has been observed that yeast cells accumulate clusters of mutations when treated with the DNA alkylating chemical MMS and that tumor cells accumulate clusters of mutations in a variety of contexts, some related to the action of cytosine deaminases (Nik-Zainal ; Roberts ; Burns ). However, the contribution of mutator alleles to these phenomena has not been widely assessed.We defined 28 mutation clusters across the 68 evolved genomes that were unlikely to have arisen by chance (P < 9.7 × 10−5) using the criteria laid out in the literature for MMS-induced mutation clusters, namely, any two or more SNV, indel, or complex mutations within 100 kb of each other that meet our significance threshold (Roberts ). WT, , and did not produce any clusters, while the other mutator genomes contained between one and nine mutation clusters (Figure S1 and Table S6). Typically clusters contained two to three mutations but there were examples of larger clusters in , , and (Figure S1). Significantly clustered mutations spanned as few as 21 bp and as many as 126 kb, with a median of ∼3.6 kb (Figure S1). One candidate mechanism for mutation clustering in the absence of genotoxins or APOBECs, is exposure of resected ssDNA during prolonged repair reactions. Local mutation rates increase in this scenario and we found that at least and have an increased frequency of DNA breaks as measured by Rad52 foci (Figure 3C) (Hicks ; Nik-Zainal ; Roberts ; Stirling ). In support of this model, for nearly all clusters, the mutation density fell within the range of expected resection length for repair of a DNA double-strand break (i.e., up to ∼10 kb) (Chung ).To examine cluster position, we generated rainfall plots (Nik-Zainal ). Strikingly, clusters of mutations in and strains occurred almost exclusively in subtelomeric regions, suggesting that the observed telomeric bias may relate to clusters (Figure 5A). At least for , this observation supports literature showing that uncapped telomeres lead to hypermutability in association with ssDNA and that accumulates ssDNA in telomeres (Grandin ; Yang ). It is notable that some clusters in or were near repetitive sequences known to be recombination hotspots (e.g., Ty elements, gene cluster) that may be more likely to initiate inappropriate repair reactions involving resection (Table S6) (St Charles and Petes 2013). Alternatively, many of these clusters occurred adjacent to predicted CNVs (Figure 5A), suggesting that, as in other systems, break-associated mutation clustering might be driving the observed pattern (Hicks ; Deem ; Roberts ; Drier ). Indeed, break-induced replication itself has been found to be highly error prone and could contribute to clustering in the mutator genomes (Deem ).
Figure 5
Allele-specific mutation clustering behavior and locus-specific mutation rates. (A) Rainfall plot analysis of mutations in pol1-ts and stn1-13. Mutations from unique isolates are colored differently. Arrowheads indicate CNVs adjacent to mutation clusters and vertical dotted lines indicate chromosome ends. (B) Mutation rates vary based on position and mutator allele genotype. Left, schematic of chr VI-L noting the three positions at which URA3 was inserted. Right, the URA3 mutation rate at each position. Error bars indicate a 95% confidence interval.
Allele-specific mutation clustering behavior and locus-specific mutation rates. (A) Rainfall plot analysis of mutations in pol1-ts and stn1-13. Mutations from unique isolates are colored differently. Arrowheads indicate CNVs adjacent to mutation clusters and vertical dotted lines indicate chromosome ends. (B) Mutation rates vary based on position and mutator allele genotype. Left, schematic of chr VI-L noting the three positions at which URA3 was inserted. Right, the URA3 mutation rate at each position. Error bars indicate a 95% confidence interval.Overall, mutation accumulation and sequencing suggested that local mutation rates were likely higher in subtelomeres for and strains. While we did not observe a subtelomeric bias for mutations in other strains, providing the essential control for the specificity of this observation to and , subtelomeres do exhibit lower variant scores due to their repetitive nature. Moreover, we could not determine whether the primary subtelomeric sequence or the chromosomal context itself was driving mutations. Therefore, we directly tested mutation position biases by introducing the counterselectable marker into a subset of mutator lines at loci increasing in distance from the Tel VI-L (Figure 5B). This system enabled us to measure the mutation rate at the same target sequence, , in different chromosomal contexts (Lang and Murray 2011). Fluctuation analysis of positioned at 15, 84, or 112 kb from Tel VI-L showed dramatic distance-associated decreases in mutation rate for and but not for the Δ strain, which showed an identical rate regardless of the marker position (Figure 5B). The presence of mutation prone subtelomeres in and mutants is consistent with the function of in telomere protection and in recruitment of the POLα–primase complexes to subtelomeres for efficient replication (Yang ; Wang ).
Conclusion
We identified dozens of essential mutator genes which contribute to a catalog of mutator alleles that define an expanded suite of such cellular mechanisms that suppress mutations. In addition, this work takes a powerful approach to defining mutation spectra for mutator alleles: whole-genome sequencing. These data suggest that mutational signatures vary in an allele-specific fashion and are influenced by several key features: (1) concurrent chromosome instability related to increases in recombination; (2) replication timing, chromosomal context; and (3) break-associated mutation clustering.Building on this work, and the existing literature on mutator allele-driven mutation signatures (e.g., mismatch repair, DNA polymerase δ) (Larrea ; Zanders ), will require both broader studies of more mutator conditions (e.g., genotypes, environment, ploidy) to identify novel signatures, and deeper analysis of larger numbers of variants for a single mutator to improve statistical power. These analyses will be important because tumors are almost certainly influenced by the interaction between specific early genome destabilizing mutations or genotoxins and correspondingly sensitized loci within the genome. In addition to increasing the rate of mutation, mutator alleles introduce base substitution biases or broader locus-associated biases including mutation clustering. Recent detailed analysis of mutation patterns across 21 breast tumors revealed mutation clusters associated with a mutational process termed “kataegis” that is associated with cytosine deamination by APOBEC family proteins (Nik-Zainal ; Taylor ). Our data demonstrate that specific mutator genotypes are sufficient to generate mutation clusters in the absence of genotoxins or APOBECs, probably via break-associated hypermutability as seen in models and in humancancers (Roberts ; Drier ). Expanding mutation pattern studies in model systems may therefore be key to mechanistically connect specific predisposing genotypes to aspects of mutational signatures seen in tumor genomes.
Authors: M K Raghuraman; E A Winzeler; D Collingwood; S Hunt; L Wodicka; A Conway; D J Lockhart; R W Davis; B J Brewer; W L Fangman Journal: Science Date: 2001-10-05 Impact factor: 47.728
Authors: Michael Lynch; Way Sung; Krystalynne Morris; Nicole Coffey; Christian R Landry; Erik B Dopman; W Joseph Dickinson; Kazufusa Okamoto; Shilpa Kulkarni; Daniel L Hartl; W Kelley Thomas Journal: Proc Natl Acad Sci U S A Date: 2008-06-26 Impact factor: 11.205
Authors: David K Breslow; Dale M Cameron; Sean R Collins; Maya Schuldiner; Jacob Stewart-Ornstein; Heather W Newman; Sigurd Braun; Hiten D Madhani; Nevan J Krogan; Jonathan S Weissman Journal: Nat Methods Date: 2008-07-11 Impact factor: 28.547
Authors: Romulo Segovia; Yaoqing Shen; Scott A Lujan; Steven J M Jones; Peter C Stirling Journal: Proc Natl Acad Sci U S A Date: 2017-02-21 Impact factor: 11.205
Authors: Emily Yun-Chia Chang; Carolina A Novoa; Maria J Aristizabal; Yan Coulombe; Romulo Segovia; Richa Chaturvedi; Yaoqing Shen; Christelle Keong; Annie S Tam; Steven J M Jones; Jean-Yves Masson; Michael S Kobor; Peter C Stirling Journal: J Cell Biol Date: 2017-10-17 Impact factor: 10.539
Authors: Veena Mathew; Annie S Tam; Karissa L Milbury; Analise K Hofmann; Christopher S Hughes; Gregg B Morin; Christopher J R Loewen; Peter C Stirling Journal: J Cell Biol Date: 2017-10-04 Impact factor: 10.539
Authors: Adam B Burkholder; Scott A Lujan; Christopher A Lavender; Sara A Grimm; Thomas A Kunkel; David C Fargo Journal: BMC Genomics Date: 2018-05-09 Impact factor: 3.969
Authors: Jing Li; Mark Rinnerthaler; Johannes Hartl; Manuela Weber; Thomas Karl; Hannelore Breitenbach-Koller; Michael Mülleder; Jakob Vowinckel; Hans Marx; Michael Sauer; Diethard Mattanovich; Özge Ata; Sonakshi De; Gregor P Greslehner; Florian Geltinger; Bill Burhans; Chris Grant; Victoria Doronina; Meryem Ralser; Maria Karolin Streubel; Christian Grabner; Stefanie Jarolim; Claudia Moßhammer; Campbell W Gourlay; Jiri Hasek; Paul J Cullen; Gianni Liti; Markus Ralser; Michael Breitenbach Journal: G3 (Bethesda) Date: 2020-12-03 Impact factor: 3.154