Literature DB >> 23852385

A new system for comparative functional genomics of Saccharomyces yeasts.

Amy A Caudy1, Yuanfang Guan, Yue Jia, Christina Hansen, Chris DeSevo, Alicia P Hayes, Joy Agee, Juan R Alvarez-Dominguez, Hugo Arellano, Daniel Barrett, Cynthia Bauerle, Namita Bisaria, Patrick H Bradley, J Scott Breunig, Erin Bush, David Cappel, Emily Capra, Walter Chen, John Clore, Peter A Combs, Christopher Doucette, Olukunle Demuren, Peter Fellowes, Sam Freeman, Evgeni Frenkel, Daniel Gadala-Maria, Richa Gawande, David Glass, Samuel Grossberg, Anita Gupta, Latanya Hammonds-Odie, Aaron Hoisos, Jenny Hsi, Yu-Han Huang Hsu, Sachi Inukai, Konrad J Karczewski, Xiaobo Ke, Mina Kojima, Samuel Leachman, Danny Lieber, Anna Liebowitz, Julia Liu, Yufei Liu, Trevor Martin, Jose Mena, Rosa Mendoza, Cameron Myhrvold, Christian Millian, Sarah Pfau, Sandeep Raj, Matt Rich, Joe Rokicki, William Rounds, Michael Salazar, Matthew Salesi, Rajani Sharma, Sanford Silverman, Cara Singer, Sandhya Sinha, Max Staller, Philip Stern, Hanlin Tang, Sharon Weeks, Maxwell Weidmann, Ashley Wolf, Carmen Young, Jie Yuan, Christopher Crutchfield, Megan McClean, Coleen T Murphy, Manuel Llinás, David Botstein, Olga G Troyanskaya, Maitreya J Dunham.   

Abstract

Whole-genome sequencing, particularly in fungi, has progressed at a tremendous rate. More difficult, however, is experimental testing of the inferences about gene function that can be drawn from comparative sequence analysis alone. We present a genome-wide functional characterization of a sequenced but experimentally understudied budding yeast, Saccharomyces bayanus var. uvarum (henceforth referred to as S. bayanus), allowing us to map changes over the 20 million years that separate this organism from S. cerevisiae. We first created a suite of genetic tools to facilitate work in S. bayanus. Next, we measured the gene-expression response of S. bayanus to a diverse set of perturbations optimized using a computational approach to cover a diverse array of functionally relevant biological responses. The resulting data set reveals that gene-expression patterns are largely conserved, but significant changes may exist in regulatory networks such as carbohydrate utilization and meiosis. In addition to regulatory changes, our approach identified gene functions that have diverged. The functions of genes in core pathways are highly conserved, but we observed many changes in which genes are involved in osmotic stress, peroxisome biogenesis, and autophagy. A surprising number of genes specific to S. bayanus respond to oxidative stress, suggesting the organism may have evolved under different selection pressures than S. cerevisiae. This work expands the scope of genome-scale evolutionary studies from sequence-based analysis to rapid experimental characterization and could be adopted for functional mapping in any lineage of interest. Furthermore, our detailed characterization of S. bayanus provides a valuable resource for comparative functional genomics studies in yeast.

Entities:  

Keywords:  comparative genomics; gene expression; yeast

Mesh:

Substances:

Year:  2013        PMID: 23852385      PMCID: PMC3761308          DOI: 10.1534/genetics.113.152918

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.562


ANALYSIS of the genome sequences of related species has provided tremendous insight into the key functional elements of genomes as revealed by patterns of DNA sequence conservation. The Saccharomyces yeasts have been particularly well sampled by sequencing projects over the past decade (reviewed in Dujon 2010), and comparative analyses have revealed a history of gene duplication (Dietrich ; Kellis ), conservation at DNA binding sites (Cliften ; Kellis ), and coevolution of binding sites with regulators (Gasch ). However, to enable more thorough understanding of the underlying biology, sequence-based studies must be complemented by the experimental study of functional divergence. Within Saccharomyces cerevisiae, comprehensive analysis of gene expression, protein levels, and metabolite levels demonstrates the ability of gene expression rather than raw sequence data to predict phenotype (Guan ). In the yeasts, studies of promoter usage (Borneman ), transcription factor binding (Doniger ), stress sensitivity (Kvitek ), transcriptional network changes (Tsong ; Tuch ), mating (Zill and Rine 2008), replication timing (Muller and Nieduszynski 2012), protein levels (Khan ), and nucleosome occupancy (Guan ; Tsankov ) demonstrate that interesting evolutionary features emerge when processes are compared in detail within these eukaryotes. Despite this foundational work, no studies have yet attempted to experimentally characterize gene function on a systematic scale in nonmodel newly sequenced species. An ideal study of gene function in a new species would establish precise functions for all species-specific genes and allow a systematic comparison of gene function and regulation for orthologs between species. Such a study can form the groundwork for connecting functional and regulatory differences to the sequence variants that have accumulated over evolutionary time. Conversely, genes with conserved function and regulation can be used to infer DNA sequence changes that are either neutral or that coevolved to maintain the selected characters. Gene-expression analysis fits these requirements, as genes of shared functions are highly correlated in their expression, and, conversely, gene-expression correlations are highly predictive of gene function (Stuart ; van Noort ; Hibbs ; Huttenhower ). Limited comparative analyses of gene expression among different species have already been attempted and show how rapidly networks can evolve (reviewed in Whitehead and Crawford 2006). Comparisons between extremely divergent systems can discover core pathways shared over vast evolutionary differences (Stuart ; Bergmann ), while focusing on species that are less diverged permits study of more rapidly adapting processes and facilitates identification of the specific sequence changes that might be driving these differences. Furthermore, observing a phenomenon in multiple species provides solid evidence that it is not specific to a laboratory-adapted model organism but is instead an evolutionarily conserved biological response (Hess ; Zill and Rine 2008; Airoldi ). To examine the conservation and divergence of gene function, we selected the yeast Saccharomyces bayanus var. uvarum (henceforth referred to as S. bayanus for simplicity) for comparison with S. cerevisiae. The two species diverged ∼20 million years ago and have a comparable level of DNA sequence divergence as mouse and human (80% conserved in coding regions and 62% conserved in intergenic regions as compared to S. cerevisiae). We have recently used next-generation sequencing to create a high-quality assembly and gene model prediction of the S. bayanus genome, and we created an extendable genome browser to facilitate its use (Scannell ). Importantly, sequence conservation of functional elements is still detectable (for example, noncoding RNAs; Kavanaugh and Dietrich 2009). Like S. cerevisiae, S. bayanus is a species used in winemaking, and recent studies of its genome content and relationship to lager yeasts have clarified taxonomic confusion (Libkind ). The phylogenetic proximity and shared natural history with S. cerevisiae also make it possible to select specific experimental conditions for S. bayanus by reference to the vast literature available for S. cerevisiae, one of the most popular model organisms. The two species can make interspecific hybrids, allowing complementation tests with S. cerevisiae alleles. However, with a few exceptions (Serra ; Talarek ; Jones ; Zill and Rine 2008; Gallagher ; Zill ), little experimental work has been performed in S. bayanus and even less at genome scale (Bullard ; Tsankov ; Busby ; Guan ; Muller and Nieduszynski 2012). We first compared the basic growth characteristics of the two species and developed genetic tools and protocols to facilitate experimental manipulations of S. bayanus. Following this characterization of the species, we then produced a gene-expression compendium of over 300 microarrays in S. bayanus, guided by a machine-learning analysis of the entire S. cerevisiae literature that predicts an optimal set of conditions for expression analysis (Guan ), and assembled a set of published expression experiments in S. cerevisiae for comparison. Similar to comparative sequence analysis, comparing the gene-expression responses of different species allows the identification of programs of conserved gene regulation and of alterations in gene-expression response. In comparing the S. bayanus and S. cerevisiae data, we noted a number of examples of divergence in gene expression between the species (Guan ). Also, because genes of like function typically have correlated gene expression (Eisen ), patterns of coexpression can be used to predict the functional roles of genes (Sharan ). Our analysis of these data sets reveals both regulatory change and evolution of gene function amid overall conservation. Specific examples include expression rewiring in the pathways controlling meiosis and galactose utilization, oxidative stress driving expression of a species-specific network, and evidence for divergence of specific functional groups.

Materials and Methods

The strains used in this study are described in Supporting Information, Table S1. Custom oligonucleotide probes specific for S. bayanus genes were designed and printed using a pin-style arraying robot. S. bayanus cells were grown and exposed to a variety of stimuli and RNA was harvested and labeled by direct incorporation of fluorescent nucleotides into cDNA. Deletion and insertion mutants were produced in diploids by homologous recombination using adaptations of standard methods for S. cerevisiae and haploids were obtained by sporulation and dissection. S. bayanus data and a compendium of S. cerevisiae data were processed for gene function prediction using support vector machines. As there were no existing biological process annotations in S. bayanus, we adopted the annotations from S. cerevisiae for training. The microarray expression data are available from GEO as GSE16544 and GSE47613. The interactive network view of the expression data and searchable prediction results are available at http://bayanusfunction.princeton.edu. Complete methods information is included as File S1.

Results

Developing S. bayanus into a new model system required an initial characterization of its growth habits and preferences, along with the development of genetic tools to enable the types of studies that are routine in established model systems.

Phenotypic analysis and genetic tools

We began our work in S. bayanus by measuring its growth and physiology. As previously reported (Goncalves ; Salvado ), in minimal media at 20°, S. bayanus grows faster than S. cerevisiae (Figure 1A). The species grew at nearly equal rates at 25°, and at 30° S. bayanus grew more slowly than S. cerevisiae (Figure 1, B and C). Accordingly, S. bayanus was more sensitive to heat shock than S. cerevisiae; transfer to 40° slowed growth of S. bayanus more than it did in S. cerevisiae (Figure 1, D and E). This heat sensitivity precludes efficient lithium acetate transformation using heat shock at 42°, so we modified our procedure to use a milder 37° heat shock for S. bayanus.
Figure 1

Growth, physiology, and molecular characteristics of S. bayanus. (A–C) Growth of S. cerevisiae and S. bayanus at different temperatures. (D) Temperature shifts in S. bayanus. Cultures were shifted at the 120-min timepoint. (E) Temperature shifts in S. cerevisiae, as in D. (F) Diauxic shift in S. bayanus. Culture density, ethanol concentration, and glucose concentration were monitored. (G) Response of S. bayanus to stress. Culture density was measured as absorbance at 600 nm in an overnight culture. Concentrations are in millimolar, except for cadmium (10× μM), ammonium (M), and bleach (%). (H) Survival during nutrient starvation of S. bayanus and S. cerevisiae. Strains were growth to saturation in chemostat medium with the indicated limiting nutrient (phosphate or sulfate), and viability was measured over time. (I) 2μ plasmid is absent in a pure S. bayanus strain but present in a hybrid. Genomic DNA was restriction digested and hybridized with a probe corresponding to the 2μ plasmid.

Growth, physiology, and molecular characteristics of S. bayanus. (A–C) Growth of S. cerevisiae and S. bayanus at different temperatures. (D) Temperature shifts in S. bayanus. Cultures were shifted at the 120-min timepoint. (E) Temperature shifts in S. cerevisiae, as in D. (F) Diauxic shift in S. bayanus. Culture density, ethanol concentration, and glucose concentration were monitored. (G) Response of S. bayanus to stress. Culture density was measured as absorbance at 600 nm in an overnight culture. Concentrations are in millimolar, except for cadmium (10× μM), ammonium (M), and bleach (%). (H) Survival during nutrient starvation of S. bayanus and S. cerevisiae. Strains were growth to saturation in chemostat medium with the indicated limiting nutrient (phosphate or sulfate), and viability was measured over time. (I) 2μ plasmid is absent in a pure S. bayanus strain but present in a hybrid. Genomic DNA was restriction digested and hybridized with a probe corresponding to the 2μ plasmid. When grown on glucose medium to the point of glucose depletion, S. bayanus underwent a diauxic shift marked by a growth arrest followed by a shift to ethanol consumption and a slower growth rate (Figure 1F), consistent with its natural history and qualitatively similar to the behavior of S. cerevisiae. We also measured the growth inhibition by a variety of transition metals, salts, and oxidants (Figure 1G). The survival of S. bayanus and S. cerevisiae was similar during starvation for the essential nutrients sulfate and phosphate (Figure 1H). Finally, we analyzed our S. bayanus strain for the presence of the 2μ plasmid and observed that it does not carry detectable levels of the plasmid, although a hybrid with S. cerevisiae prepared in our laboratories maintains this DNA element (Figure 1I). We constructed a Tn7 insertion library (Kumar ) to create a collection of S. bayanus mutant strains. We built a Tn7 transposon carrying a ClonNat resistance marker selectable in both bacteria and yeast. The transposed marker carries stop codons in all reading frames near both termini and so is expected to produce truncations when inserted within genes. Our library contained ∼50,000 unique genomic insertions, and we have used it to screen for a variety of phenotypes including auxotrophies, drug resistance, and copper resistance (see below). By transforming the library into MATα strains and using a ClonNat resistance marker, mutants isolated from this Tn7 set can be used directly in complementation assays by mating to S. cerevisiae strains from the widely used MATa deletion set that carries complementary G418 drug resistance. Insertion mutations can also be mapped using microarray or sequencing technologies (see below). We expect that this mutant collection will be a valuable resource for mutation screening in this new species.

Gene-expression data set

Just as lessons learned from early whole-genome sequencing projects led to more efficient sequencing of related genomes in subsequent projects, we can leverage the thousands of microarray experiments performed in the yeast S. cerevisiae to direct efficient expression profiling in a related organism. Given the shared history of these species, we reasoned that experiments with high predictive value of gene function in S. cerevisiae were also likely to be useful in related yeasts. We also assumed that most of these treatments were likely to target similar ranges of functional categories in the two species. With these ideas in mind, we developed a data-driven experiment recommendation system to identify the minimal set of maximally informative experiments for functional characterization of the S. bayanus genome based on the S. cerevisiae gene-expression literature (Guan ). We carried out 304 microarray measurements in 46 experimental manipulations (detailed in Table S2). Because of the many practical similarities with S. cerevisiae, the experiments were effectively prototyped for us by their original S. cerevisiae publications, in many cases needing only minor modification to adapt them for S. bayanus. Our computationally selected treatments perturbed the majority of the genes in the cell: 4828 of the 4840 S. bayanus genes measured by our array show twofold or greater change in at least one treatment. Hierarchical clustering of this S. bayanus gene-expression compendium revealed a number of groups of genes coexpressed under a variety of conditions (Figure 2, numerical data in Table S3). Although clustering was performed solely on the S. bayanus data and was not informed by the evolutionary relationships between S. bayanus and S. cerevisiae genes, we noted many groups of S. bayanus genes nevertheless showing expression patterns similar to those in S. cerevisiae. Most strikingly, two large cohorts of genes responded coordinately to multiple stresses, with one group repressed and the other induced. This large-scale response indicates that S. bayanus shows the canonical environmental stress response identified in S. cerevisiae (Gasch ) and other yeasts (Gasch 2007). Other treatments elicited gene-expression responses from smaller groups of genes. For instance, a group of genes was strongly upregulated in response to alpha-factor pheromone. This pheromone response declined as cells were released from alpha-factor arrest into the cell cycle. As another example, two other groups of genes were expressed periodically during the cell cycle with different phases of peak gene expression.
Figure 2

S. bayanus gene-expression megacluster. Forty-six S. bayanus gene-expression data sets are shown as indicated by color-coded experiment labels with genes hierarchically clustered along the other dimension. Each data set was either zero transformed or mean centered to remove the reference. By assuming that all S. bayanus genes carry the annotations of their S. cerevisiae orthologs, statistically significant GO term enrichments for clusters of 25 or more genes with a correlation coefficient above 0.7 were determined using the program GOTermFinder, using a background distribution of only orthologous genes. These are indicated with vertical bars and labeled with related terms collapsed for simplicity. Data sets are ordered to group similar conditions. The data as plotted are available in Table S3.

S. bayanus gene-expression megacluster. Forty-six S. bayanus gene-expression data sets are shown as indicated by color-coded experiment labels with genes hierarchically clustered along the other dimension. Each data set was either zero transformed or mean centered to remove the reference. By assuming that all S. bayanus genes carry the annotations of their S. cerevisiae orthologs, statistically significant GO term enrichments for clusters of 25 or more genes with a correlation coefficient above 0.7 were determined using the program GOTermFinder, using a background distribution of only orthologous genes. These are indicated with vertical bars and labeled with related terms collapsed for simplicity. Data sets are ordered to group similar conditions. The data as plotted are available in Table S3. As an initial test of whether these expression clusters reflect functional gene groupings in both species, we started with the simplest—and almost certainly incorrect—assumption that all genes in S. bayanus have the same functions as their orthologs in S. cerevisiae. Using these inferred annotations, we calculated the Gene Ontology (GO) term enrichment for correlated clusters, and we observe significant enrichment for genes of like biological process and cellular component among the clusters of genes with coherent expression (Figure 2). Further, the expression patterns in these clusters showing compartment-specific or biological process enrichment are consistent with the expression patterns of genes involved in the same biological process in similar S. cerevisiae experiments. For instance, the cluster of genes activated by mating pheromone was enriched for genes whose S. cerevisiae orthologs have experimentally validated roles in response to pheromone, conjugation, and karyogamy.

Gene-expression patterns diverge in subtle ways

Although many aspects of gene expression are conserved, we noted a number of instances of gene-expression patterns different from those observed in S. cerevisiae orthologs in response to similar treatments. In S. cerevisiae, the galactose metabolism genes were induced only to detectable levels in the presence of galactose (Gasch ). However, in S. bayanus, the orthologs of the galactose structural genes , , , and were detectably induced not only when cells were exposed to galactose, but also when cells were switched from glucose to other less-preferred carbon sources including ethanol, raffinose, sucrose, and glycerol (Figure 3A). The derepression of galactose metabolism genes on nonglucose carbon sources has been previously described in detail in S. cerevisiae (Matsumoto ; St John and Davis 1981; Yocum ), but the magnitude of this increase in gene expression on nonglucose carbon sources is much greater in S. bayanus. We verified this expression difference between S. bayanus and S. cerevisiae using quantitative PCR for (Figure S1). This activation of the galactose structural genes by multiple carbon sources suggests that S. bayanus might have evolved in an environment in which galactose becomes available at the same time as other nonglucose carbon sources.
Figure 3

Altered expression of genes in S. bayanus. (A) Galactose structural genes are induced by multiple carbon sources in S. bayanus but not in S. cerevisiae. S. cerevisiae carbon source data (Gasch ) and diauxic shift data (Brauer ) are as published. (B) A network view of gene-expression correlations with GAL1 comparing the pairwise expression correlation over all data in both species. The nodes indicate genes, and the thickness of lines indicates expression correlation. In cases where expression correlation is negative, no line is shown for that species, and the target gene is labeled. The node color indicates whether each gene is present in one species (pink or blue) or both (purple). An interactive network viewer for all genes is available at http://bayanusfunction.princeton.edu. (C) Expression of SbayIME1 and ScerIME1 are compared under conditions of alpha-factor arrest and alpha-factor release. S. cerevisiae alpha-factor arrest (Roberts ) and release (Pramila ) are as published.

Altered expression of genes in S. bayanus. (A) Galactose structural genes are induced by multiple carbon sources in S. bayanus but not in S. cerevisiae. S. cerevisiae carbon source data (Gasch ) and diauxic shift data (Brauer ) are as published. (B) A network view of gene-expression correlations with GAL1 comparing the pairwise expression correlation over all data in both species. The nodes indicate genes, and the thickness of lines indicates expression correlation. In cases where expression correlation is negative, no line is shown for that species, and the target gene is labeled. The node color indicates whether each gene is present in one species (pink or blue) or both (purple). An interactive network viewer for all genes is available at http://bayanusfunction.princeton.edu. (C) Expression of SbayIME1 and ScerIME1 are compared under conditions of alpha-factor arrest and alpha-factor release. S. cerevisiae alpha-factor arrest (Roberts ) and release (Pramila ) are as published. We created a resource that presents a network view comparing gene expression between S. cerevisiae and S. bayanus (http://bayanusfunction.princeton.edu). The gene-expression network around showed that , , and have a correlation of 0.99 in both species under all expression conditions (Figure 3B). However, the correlation of the GAL genes with other genes revealed differences in regulation between species. For instance, the ortholog of the hexose transporter had a correlation of 0.98 with the galactose genes in S. bayanus because this and other hexose transporters were upregulated whenever glucose was low. In contrast, in S. cerevisiae the correlation between and was only 0.19 because was upregulated in response to declining glucose concentration while was not. Transcription factors as a group showed higher than expected divergence in expression between S. bayanus and S. cerevisiae, and the S. bayanus ortholog of (670.55, which we will refer to as SbayIME1) in particular showed exceptions to the diploid-specific expression observed in S. cerevisiae. In S. cerevisiae, expression is primarily limited to diploid cells (Kassir ), but in haploid MATa S. bayanus, SbayIME1 was induced over 10-fold by alpha-factor pheromone (Figure 3C). As observed in S. cerevisiae, SbayIME1 is required for sporulation (data not shown), and although SbayIME1 was strongly induced by alpha factor we did not observe significant changes in the pheromone response of Sbayime1 mutant cells (Figure S2). Chromatin immunoprecipitation experiments observed twofold higher levels of the pheromone response transcription factor SbaySte12 (570.3) at the SbayIME1 promoter as compared to Ste12 occupancy at the promoter in S. cerevisiae (Borneman ), supporting our observation of differential pheromone activation of SbayIME1 in S. bayanus as compared to ScerIME1. In S. cerevisiae Ime1 is subject to translational regulation (Sherman ), and the lack of an effect on transcription in response to pheromone in the Sbayime1 mutant could similarly be explained by post-transcriptional regulation. has been observed to be under selective pressure in S. cerevisiae (Gerke ), and the altered expression here may suggest that it is evolving to take on additional roles.

S. bayanus gene function predictions via machine learning are confirmed by mutational analysis

By comparing gene expression between orthologs under known conditions we were able to find examples of changes in gene expression and use these changes to infer functional differences between species. Such inferences are limited by existing knowledge of the link between expression and biological function and by the availability of directly comparable data sets in both species. These limits can be overcome using computational interpretation of expression data, which accurately predicts gene function over much larger data sets than a human can process (Huttenhower and Troyanskaya 2008). Using a support vector machine (SVM) learning method trained using the GO biological process annotations of S. cerevisiae orthologs, we predicted the functional roles of S. bayanus genes (Table S4). Many gene functions are preserved over vast evolutionary distance, as evidenced by the many examples of mammalian genes that can complement deletion mutations in yeast (reviewed in Osborn and Miller 2007). Accordingly, we found that many genes were predicted to have the same function in S. bayanus and S. cerevisiae even though the SVM does not reference protein sequence homology while making predictions. For example, we predicted a role in oxidative phosphorylation for 643.11, the ortholog of , the mitochondrial RNAseP required for processing mitochondrial tRNAs from transcripts. Consistent with this prediction, an insertion mutant in SbayRPM2 was respiratory deficient (Figure S3). Similarly, we predicted a role in cell morphogenesis for 678.66, the ortholog of . A knockout mutant of Sbayamn1 lost daughter cell adhesion (“clumpiness,” Figure S4), as has been observed for the deletion allele in S. cerevisiae (Yvert ). As a third example, we predicted a role for telomeric silencing and protein acetylation for 668.17, the ortholog of the protein acetyltransferase . In a MATa insertion mutant of Sbayard1, we observed repression of MATa haploid-specific genes, as reported for mutants (Whiteway ) (Figure S5A) and note that the mutation causes genome-wide expression changes (Figure S5B). For the whole-genome duplicate serine/protein kinases 642.24 () and 636.21 (), we predicted roles in the regulation of mitosis and the regulation of DNA damage checkpoints, similar to the established roles of the S. cerevisiae orthologs in regulating cytokinesis and mitotic exit. As in S. cerevisiae, mutations in these genes are synthetic lethal (data not shown). The functional predictions also can predict gene functions not yet known in S. cerevisiae. We carried out a screen for Tn7 mutants resistant to copper sulfate and identified a resistant mutant (Figure 4, A and B). Using an array-based method (Gabriel ), we mapped the insertion upstream of 610.13, the ortholog of (Figure 4C). Deletion analysis of SbayOPT1 and the divergently transcribed neighboring gene SbayPEX2 (610.12) confirmed that mutation of SbayOPT1 was responsible for resistance to copper (Figure 4D). The functional predictions for SbayOPT1 include cation homeostasis, the GO parent term that includes copper ion homeostasis (our functional predictions did not include GO terms with few members). ScerOPT1 (also named ) has been characterized as a high-affinity glutathione transporter induced by sulfur starvation (Bourbouloux ; Srikanth ). Copper resistance had not been investigated in this mutant, although sensitivity to cadmium had been noted (Serero ). The mutant in S. cerevisiae also showed increased resistance to copper (Figure 4E). Of note, S. bayanus is more sensitive to copper than the laboratory strain of S. cerevisiae; our screen in the sensitized background of S. bayanus likely provided added sensitivity to detect genes involved in the response to copper (Figure 4E). These results suggest the potential for a relationship between glutathione transport and copper resistance and demonstrate how the predictions of gene function in S. bayanus provide information about conserved gene function in S. cerevisiae.
Figure 4

610.13/OPT1 mediates resistance to copper toxicity in S. bayanus and S. cerevisiae. (A) A Tn7 insertion mutant was identified in a screen for mutants resistant to copper sulfate; a series of dilutions of cells were plated on YPD and YPD with 3 mM CuSO4. (B) The resistance phenotype cosegregates with the ClonNAT resistance marker carried by the transposon; the mutant strain was backcrossed to wild type, and tetrads (in columns, indicated by numbers) were phenotyped for resistance to ClonNAT and for growth on YPD with 3 mM CuSO4. (C) The site of the insertion was mapped by enriching genomic DNA for transposon DNA and using an array hybridization technique (Gabriel ). Data are mapped onto the chromosomes, which are aligned by the centromeres. Subsequent PCR amplification using primers specific to the transposon and flanking regions mapped the insertion site between the genes SbayPEX2 (610.12) and SbayOPT1 (610.13). (D) Mutation of the S. bayanus gene SbayOPT1 confers copper resistance; SbayOPT1 is divergently transcribed from SbayPEX2, so a deletion of only the 3′ distal portion of SbayOPT1 was also tested to exclude any effect on SbayPEX2. (E) Mutation of the 610.13 ortholog OPT1 in S. cerevisiae confers copper resistance.

610.13/OPT1 mediates resistance to copper toxicity in S. bayanus and S. cerevisiae. (A) A Tn7 insertion mutant was identified in a screen for mutants resistant to copper sulfate; a series of dilutions of cells were plated on YPD and YPD with 3 mM CuSO4. (B) The resistance phenotype cosegregates with the ClonNAT resistance marker carried by the transposon; the mutant strain was backcrossed to wild type, and tetrads (in columns, indicated by numbers) were phenotyped for resistance to ClonNAT and for growth on YPD with 3 mM CuSO4. (C) The site of the insertion was mapped by enriching genomic DNA for transposon DNA and using an array hybridization technique (Gabriel ). Data are mapped onto the chromosomes, which are aligned by the centromeres. Subsequent PCR amplification using primers specific to the transposon and flanking regions mapped the insertion site between the genes SbayPEX2 (610.12) and SbayOPT1 (610.13). (D) Mutation of the S. bayanus gene SbayOPT1 confers copper resistance; SbayOPT1 is divergently transcribed from SbayPEX2, so a deletion of only the 3′ distal portion of SbayOPT1 was also tested to exclude any effect on SbayPEX2. (E) Mutation of the 610.13 ortholog OPT1 in S. cerevisiae confers copper resistance.

Different rates of functional divergence characterize different gene groups

Just as genes involved in different biological pathways have been observed to evolve at the sequence level at different rates (Aris-Brosou 2005; Wolf ), certain classes of genes may show more rapid functional divergence. We examined our predictions of gene function in both species and identified cases in which a pair of orthologs showed very large changes in predicted function between species (Table 1, full data in Table S5). We observed the smallest number of changes in ribosomal biogenesis and in electron transport, and many core metabolic processes showed few changes, consistent with these genes’ typical conservation at the sequence level. Processes showing the highest amount of change included response to osmotic stress, autophagy, and organelle inheritance. Although it was not immediately obvious why these processes are changing so quickly, these results will help to guide future experiments. We also observed significant change in small GTPase mediated signal transduction and hypothesize that this may reflect the constitutive signaling through the mating pathway caused by a mutation common in laboratory strains of S. cerevisiae (Lang ) not present in the S. bayanus strains used here.
Table 1

Most conserved and diverged biological processes

GO termFraction genes changing functional prediction rank by 75% or more
Least change
 GO:0016072(rRNA metabolism)0.02
 GO:0006118(electron transport)0.03
 GO:0006508(proteolysis and peptidolysis)0.04
 GO:0007001(chromosome organization and biogenesis)0.05
 GO:0006807(nitrogen compound metabolism)0.06
Greatest change
 GO:0006914(autophagy)0.31
 GO:0006970(response to osmotic stress)0.32
 GO:0007264(small GTPase mediated signal transduction)0.34
 GO:0048308(organelle inheritance)0.43
 GO:0007568(aging)0.46

The top and bottom five GO SLIM biological process terms for changes in gene function. The fraction of genes with change in ranked prediction scores of 75% or more is shown for each term.

The top and bottom five GO SLIM biological process terms for changes in gene function. The fraction of genes with change in ranked prediction scores of 75% or more is shown for each term.

Annotations for species-specific genes

Genome sequence analysis allows comparison of gene content in different species, which can suggest the evolutionary pressures that shape specific lineages (Gordon ). Similarly, examining the functional roles predicted for genes found in one species but not another can suggest potential functions for these species-unique genes, revealing species-specific adaptations. We examined the expression data of S. bayanus genes that do not have orthologs in S. cerevisiae and found a prominent cluster of 25 genes that includes 13 genes specific to S. bayanus—including 8 with no orthologs in any surveyed yeast (Gordon ) (Figure 5A). These genes were induced 16- to 32-fold by peroxide stress, bleach, and MMS but not other stresses or any other conditions tested in our compendium. Peroxide, bleach, and MMS all increase reactive oxygen levels (Winter ; Kitanovic ), so we propose that this group of genes responds specifically to oxidative stress. Two DNA sequence motifs are enriched in the promoters of the S. bayanus genes in this cluster, and these motifs are very similar (Table S6, P < 7 × 10−5, Mahony ) to motifs established by analysis of sequence conservation among the sensu stricto yeasts (Kellis ). Furthermore, one of the motifs is similar to that of S. cerevisiae (Harbison ), a transcription factor with a role in stress response (Wu ). As the ortholog in S. bayanus has been annotated as a pseudogene (Scannell ), it is likely that some other transcription factor may be activating these genes. The stress-responsive gene has a similar binding site in S. cerevisiae and is a candidate for the oxidative stress activation we observe. The number of genes specific to S. bayanus annotated to oxidative stress suggests that S. bayanus may encounter a different spectrum of stresses.
Figure 5

Functional roles of a set of genes specific to S. bayanus. (A) A group of genes is strongly induced by oxidative stresses but not other stresses. We predicted biological process annotations for this cluster of genes and show the S. cerevisiae orthologs and their experimentally based biological process annotations as assigned by the Saccharomyces Genome database. The predicted annotations are the two highest scoring annotations; Table S4 contains a complete list, for all genes and Figure S7 describes the quality of gene function predictions across all terms. (B) The GAL80 ohnolog 670.20 regulates a set of genes unique to S. bayanus when cells are shifted from raffinose to galactose. Graphs show log2 ratios of expression data from microarrays, zero transformed to the initial timepoint in wild type cells.

Functional roles of a set of genes specific to S. bayanus. (A) A group of genes is strongly induced by oxidative stresses but not other stresses. We predicted biological process annotations for this cluster of genes and show the S. cerevisiae orthologs and their experimentally based biological process annotations as assigned by the Saccharomyces Genome database. The predicted annotations are the two highest scoring annotations; Table S4 contains a complete list, for all genes and Figure S7 describes the quality of gene function predictions across all terms. (B) The GAL80 ohnolog 670.20 regulates a set of genes unique to S. bayanus when cells are shifted from raffinose to galactose. Graphs show log2 ratios of expression data from microarrays, zero transformed to the initial timepoint in wild type cells. Our functional predictions for genes in our oxidative stress cluster included response to toxin (GO:0009636), sulfur metabolic process (GO:0006790), and response to temperature stimulus (GO:0009266) (Figure 5A). Many of these functions have been demonstrated for the 12 genes that have S. cerevisiae orthologs, and 10 of the 12 S. cerevisiae orthologs are induced by hydrogen peroxide (Gasch ; Causton ). Five of the S. cerevisiae orthologs of this cluster have been assigned the GO biological process of response to toxin (GO enrichment, P < 4.07 × 10−9, Bonferroni corrected), and two of the S. cerevisiae orthologs in this cluster have roles in sulfur metabolism: is a glutathione S-transferase, and is a cysteine transporter. The predicted role in toxin response is consistent with the activation by oxidative stress, because in S. cerevisiae, genes assigned to this biological process are induced by the mycotoxin citrinin, which causes oxidative stress (Iwahashi ). Also, the sulfur metabolic process includes genes involved in sulfur assimilation, a biochemical process that consumes reducing equivalents. Of the 12 proteins in this cluster that have S. cerevisiae homologs, 5 are proteins of unknown function. These functional predictions from S. bayanus may help to inform functional experiments on the S. cerevisiae orthologs. Gene duplicates are known to play a prominent role in yeast genome evolution. Among our functional predictions for the S. bayanus genome, we examined the seven genes present in duplicate in S. bayanus but not in S. cerevisiae and noted that our expression data had yielded a prediction of a role in galactose metabolism for one of these genes (Table S4), which had also been previously noted on the basis of comparative homology (Hittinger , 2010; Cliften ; Gordon ; Scannell ). Both duplicates of the ancestral gene are retained in S. bayanus, but only is present in S. cerevisiae. The S. bayanus ortholog 555.11 retains its function as a repressor of galactose genes, as GAL genes were no longer repressed when Sbaygal80 mutant cells were grown in glucose (Figure S6), a derepression known in Scergal80 mutants (Douglas and Hawthorne 1966; Yocum and Johnston 1984). In addition, 670.20, the ohnolog of SbayGAL80, which itself has no ortholog in S. cerevisiae, was predicted to function in galactose metabolism by our SVM. Indeed, we observed activation of 670.20 in response to galactose (Figure 3A), and Gal4 binding sites are present upstream of the gene. The galactose-specific activation of 670.20 differs from the response of the other S. bayanus GAL family genes, which are activated by growth on multiple nonglucose carbon sources. We also noted that 670.20 was derepressed in the Sbaygal80 mutant, as were other GAL genes (Figure S6). To more directly study the role of 670.20 in galactose metabolism, we measured the gene-expression response of 670.20 mutants to a shift from raffinose to galactose and observed a set of genes that failed to be activated by galactose in the 670.20 mutant (Figure 5B). These four genes are also members of the oxidative stress cluster shown in Figure 4. Notably, the genes regulated by the S. bayanus-specific 670.20 are themselves present only in S. bayanus, forming a species-specific network.

Discussion

Although the genomes of many nonmodel organisms are now sequenced, this flood of data has not been matched by functional experimental data in these species. Much of this can be attributed to the difficulty of working with unfamiliar organisms, but many other species lend themselves to laboratory study for comparative work. For example, the fly species sequenced by the 12 Drosophila species consortium (Consortium ) can all be lab reared, as can several sequenced species of nematodes (Cutter ). Yeast are of course another taxa with many lab-amenable species. Using gene-expression data we functionally annotated all the genes in S. bayanus (Table S4) and demonstrated the accuracy of our predictions using targeted mutational analysis. A sufficiently complex gene-expression data set can be used not only to compare strategies of gene regulation but also to predict biological function (Guan ). Identifying regulatory changes across different species provides interesting insight into selection and adaptation. For instance, comparing the protein sequences encoded in bacterial genomes has helped to predict the metabolic capabilities of different lineages (Downs 2006). Our measurements of gene expression under well-characterized conditions directly relevant to defined biological processes illustrate examples of altered gene regulation that suggest functional differences between species. Conversely, evidence of gene function in other species may be used to generate hypotheses about the functions of the orthologous genes of model systems, many of which still lack annotations (Peña-Castillo and Hughes 2007). Our study demonstrates the potential of computationally predicted annotations for both functional characterization and evolutionary analysis of new species. The tools we have developed are generic and could easily be applied to other nonmodel organism species of interest. Application of our comparative approach to other groups of related species, such as Candida yeasts, Drosophila species, worms, or mammals, could extend the evolutionary observations made here. Since our experimental and analytical framework are agnostic to species and platform, they should be easily transferable to other systems. This new style of comparative functional genomics will ultimately allow better understanding of conservation and divergence in gene function and regulation and allow rapid adoption of experimental systems beyond the traditional model organisms.
  81 in total

Review 1.  Comparative genomics of the environmental stress response in ascomycete fungi.

Authors:  Audrey P Gasch
Journal:  Yeast       Date:  2007-11       Impact factor: 3.239

Review 2.  Why are there still over 1000 uncharacterized yeast genes?

Authors:  Lourdes Peña-Castillo; Timothy R Hughes
Journal:  Genetics       Date:  2007-04-15       Impact factor: 4.562

3.  Evolution of alternative transcriptional circuits with identical logic.

Authors:  Annie E Tsong; Brian B Tuch; Hao Li; Alexander D Johnson
Journal:  Nature       Date:  2006-09-28       Impact factor: 49.962

4.  Systematic planning of genome-scale experiments in poorly studied species.

Authors:  Yuanfang Guan; Maitreya Dunham; Amy Caudy; Olga Troyanskaya
Journal:  PLoS Comput Biol       Date:  2010-03-05       Impact factor: 4.475

5.  Yeast bZip proteins mediate pleiotropic drug and metal resistance.

Authors:  A Wu; J A Wemmie; N P Edgington; M Goebl; J L Guevara; W S Moye-Rowley
Journal:  J Biol Chem       Date:  1993-09-05       Impact factor: 5.157

6.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

7.  The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome.

Authors:  Fred S Dietrich; Sylvia Voegeli; Sophie Brachat; Anita Lerch; Krista Gates; Sabine Steiner; Christine Mohr; Rainer Pöhlmann; Philippe Luedi; Sangdun Choi; Rod A Wing; Albert Flavier; Thomas D Gaffney; Peter Philippsen
Journal:  Science       Date:  2004-03-04       Impact factor: 47.728

Review 8.  Rescuing yeast mutants with human genes.

Authors:  Michael J Osborn; J Ross Miller
Journal:  Brief Funct Genomic Proteomic       Date:  2007-08-13

9.  Conservation of replication timing reveals global and local regulation of replication origin activity.

Authors:  Carolin A Müller; Conrad A Nieduszynski
Journal:  Genome Res       Date:  2012-07-05       Impact factor: 9.043

10.  Predicting cellular growth from gene expression signatures.

Authors:  Edoardo M Airoldi; Curtis Huttenhower; David Gresham; Charles Lu; Amy A Caudy; Maitreya J Dunham; James R Broach; David Botstein; Olga G Troyanskaya
Journal:  PLoS Comput Biol       Date:  2009-01-02       Impact factor: 4.475

View more
  12 in total

1.  Eric Lander and David Botstein on Mapping Quantitative Traits.

Authors:  Gary A Churchill
Journal:  Genetics       Date:  2016-05-05       Impact factor: 4.562

Review 2.  Population perspectives on functional genomic variation in yeast.

Authors:  Daniel A Skelly; Paul M Magwene
Journal:  Brief Funct Genomics       Date:  2015-10-14       Impact factor: 4.241

3.  Polygenic evolution of a sugar specialization trade-off in yeast.

Authors:  Jeremy I Roop; Kyu Chul Chang; Rachel B Brem
Journal:  Nature       Date:  2016-02-10       Impact factor: 49.962

Review 4.  The evolution of the GALactose utilization pathway in budding yeasts.

Authors:  Marie-Claire Harrison; Abigail L LaBella; Chris Todd Hittinger; Antonis Rokas
Journal:  Trends Genet       Date:  2021-09-15       Impact factor: 11.639

5.  Evolution of protein-protein interaction networks in yeast.

Authors:  Andrew Schoenrock; Daniel Burnside; Houman Moteshareie; Sylvain Pitre; Mohsen Hooshyar; James R Green; Ashkan Golshani; Frank Dehne; Alex Wong
Journal:  PLoS One       Date:  2017-03-01       Impact factor: 3.240

6.  Repeated Cis-Regulatory Tuning of a Metabolic Bottleneck Gene during Evolution.

Authors:  Meihua Christina Kuang; Jacek Kominek; William G Alexander; Jan-Fang Cheng; Russell L Wrobel; Chris Todd Hittinger
Journal:  Mol Biol Evol       Date:  2018-08-01       Impact factor: 16.240

7.  Comparative expression profiling reveals widespread coordinated evolution of gene expression across eukaryotes.

Authors:  Trevor Martin; Hunter B Fraser
Journal:  Nat Commun       Date:  2018-11-23       Impact factor: 14.919

8.  Differences in environmental stress response among yeasts is consistent with species-specific lifestyles.

Authors:  Christian Brion; David Pflieger; Sirine Souali-Crespo; Anne Friedrich; Joseph Schacherer
Journal:  Mol Biol Cell       Date:  2016-03-23       Impact factor: 4.138

9.  Ongoing resolution of duplicate gene functions shapes the diversification of a metabolic network.

Authors:  Meihua Christina Kuang; Paul D Hutchins; Jason D Russell; Joshua J Coon; Chris Todd Hittinger
Journal:  Elife       Date:  2016-09-30       Impact factor: 8.140

10.  Cis-Regulatory Divergence in Gene Expression between Two Thermally Divergent Yeast Species.

Authors:  Xueying C Li; Justin C Fay
Journal:  Genome Biol Evol       Date:  2017-05-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.