| Literature DB >> 22384314 |
Devin R Scannell, Oliver A Zill, Antonis Rokas, Celia Payen, Maitreya J Dunham, Michael B Eisen, Jasper Rine, Mark Johnston, Chris Todd Hittinger.
Abstract
High-quality, well-annotated genome sequences and standardized laboratory strains fuel experimental and evolutionary research. We present improved genome sequences of three species of Saccharomyces sensu stricto yeasts: S. bayanus var. uvarum (CBS 7001), S. kudriavzevii (IFO 1802(T) and ZP 591), and S. mikatae (IFO 1815(T)), and describe their comparison to the genomes of S. cerevisiae and S. paradoxus. The new sequences, derived by assembling millions of short DNA sequence reads together with previously published Sanger shotgun reads, have vastly greater long-range continuity and far fewer gaps than the previously available genome sequences. New gene predictions defined a set of 5261 protein-coding orthologs across the five most commonly studied Saccharomyces yeasts, enabling a re-examination of the tempo and mode of yeast gene evolution and improved inferences of species-specific gains and losses. To facilitate experimental investigations, we generated genetically marked, stable haploid strains for all three of these Saccharomyces species. These nearly complete genome sequences and the collection of genetically marked strains provide a valuable toolset for comparative studies of gene function, metabolism, and evolution, and render Saccharomyces sensu stricto the most experimentally tractable model genus. These resources are freely available and accessible through www.SaccharomycesSensuStricto.org.Entities:
Keywords: yeast species; Saccharomyces genome; evolutionary genetics; genome assembly; genomics; sensu stricto
Year: 2011 PMID: 22384314 PMCID: PMC3276118 DOI: 10.1534/g3.111.000273
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1 Resequencing and assembling the genomes of three Saccharomyces species. (A) Schematic showing phylogenetic relationships among nonhybrid members of the Saccharomyces sensu stricto genus plus the outgroup Kluyveromyces lactis based on (Kurtzman and Robnett 2003), (Nieduszynski and Liti 2011), and (Libkind, Hittinger et al., unpublished data). Branch lengths are not proportional to sequence divergence. The branch on which the whole-genome duplication occurred is marked. (B) Schematic depicting co-assembly of genomes from Illumina short-insert paired-end reads and mate-pair Sanger shotgun reads. Illumina reads were used to build contigs, which were stitched into scaffolds using mate-pair reads from the longer-insert Sanger libraries. Scaffolds were then joined into ultra-scaffolds (contiguous with chromosomes) using MEGABLAST and manual scaffold ordering.
Figure 2 Genes exhibiting lineage-specific rates of evolution in the Saccharomyces sensu stricto genus. (A) The three alternative hypotheses designed to test whether genes are evolving at a different rate in each of five species of the Saccharomyces sensu stricto genus. Under hypothesis H0 all branches of the tree exhibit the same ω ratio of nonsynonymous to synonymous substitutions. Under the set of H1 hypotheses, the ω ratio along a given species’ branch is different from that along all other branches of the tree. Under the H2 hypothesis, each branch exhibits its own ω ratio. (B) Numbers of genes with lineage-specific rates of evolution in the Saccharomyces sensu stricto genus.
Short-read library statistics
| Library Insert (bp) | Read Length (bp) | Assembly Kmer | Fold Coverage | |||
|---|---|---|---|---|---|---|
| Raw | Processed | Kmer | ||||
| 437 | 51 | 31 | 140.7 | 109.5 | 45.1 | |
| 203 | 114 | 61 | 272.2 | 202.5 | 95.9 | |
| 223 | 114 | 61 | 269.4 | 207.8 | 98.4 | |
| 259 | 80 | 61 | 379.2 | 267.9 | 67.0 | |
Coverage calculated assuming a genome size of 12.1Mb.
Read pool after reads failing quality criteria were trimmed, corrected, or discarded. The relevant procedure is described in Materials and Methods.
Genome assembly summary statistics, before and after manual ordering of scaffolds
| Unordered Assembly (Scaffolds) | Ordered Assembly (Ultra-scaffolds) | Percentage of Assembly | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Number | Bases | N50 | Gaps | Number | Bases | N50 | Gaps | ||
| 629 (147) | 11,668,028 | 444,551 | 380 | 16 | 11,467,582 | 905,555 | 394 | 98.3% | |
| 1455 (226) | 11,736,856 | 151,185 | 17 | 16 | 11,294,830 | 882,337 | 111 | 96.2% | |
| 1523 (164) | 11,642,553 | 100,201 | 10 | 16 | 11,185,947 | 882,203 | 162 | 96.2% | |
| 1220 (159) | 11,922,798 | 360,232 | 18 | 16 | 11,445,471 | 800,823 | 52 | 96.0% | |
Numbers in parentheses indicate scaffolds longer than 500 bp.
Percentage of base pairs in the unordered assembly that are also present in the ordered assembly. Neither contigs with an average Kmer coverage less than 20 nor gaps in scaffolds (i.e., N bases) were counted toward assembly statistics.
Counts of annotated tRNA and protein-coding genes across representative strains of five Saccharomyces species
| tRNAs | Protein-Coding Genes (by Homology) | Total | |||
|---|---|---|---|---|---|
| YGOB | SGD | Other | |||
| 275 | 5490 | 881 | 33 | 6679 | |
| 273 | 5440 | 745 | 46 | 6504 | |
| 291 | 5454 | 510 | 51 | 6306 | |
| 280 | 5450 | 409 | 48 | 6187 | |
| 279 | 5559 | 432 | 48 | 6318 | |
| Orthogroups | 229 | 5141 | 120 | 0 | 5490 |
S. kudriavzevii is represented by IFO 1802T.
Protein-coding gene counts are subdivided by homology to families in the Yeast Gene Order Browser (YGOB) (Byrne and Wolfe 2005), genes annotated in the Saccharomyces Genome Database (SGD) (Engel ), or other protein databases (Other) (see Materials and Methods).
Each column shows the number of genes for which syntenic orthologs were detected in all five species.
Figure 3 Relaxed molecular clock estimation of relative species divergence within the Saccharomyces sensu stricto genus. The top scale bar and the values above branches denote estimated substitutions per site. The bottom scale bar expresses species divergence in percentage points relative to the origin of the genus.
Genes not previously reported in the Saccharomyces sensu stricto
| Homolog | Presence Pattern | Functional Annotation | |
|---|---|---|---|
| KLTH0F00110 | 0:0:1:0:0 | ||
| Anc_5.74 | 0:0:0:0:1 | Uncharacterized | |
| ( | Anc_7.495 | 1:1:1:1:1 | Not annotated in SGD. dN/dS = 0.29; between |
| Anc_8.350 | 0:0:0:0:1 | Uncharacterized | |
| Anc_8.663 | 0:1:1:1:1 | Nonsyntenic; uncharacterized | |
| Anc_8.869 | 0:0:0:0:1 | Uncharacterized | |
| Anc_8.880 | 0:0:0:0:1 | Endoribonuclease in the RNase III family (budding yeast | |
| CAGL0J10714g | 0:0:0:0:1 | Syntenic homolog. dN/dS = 0.33; also annotated in | |
| RCFBP_mp20323 | 0:0:1:0:0 | NTF2_like superfamily; similar to RCFBP_mp20323 from | |
| CGSSp3BS71_00010 | 0:1:1:0:0 | Similar to CGSSp3BS71_00010 from | |
| Kwal_8.576 | 0:0:0:0:1 | Nitrilase superfamily | |
| SAKL0C00330g | 0:0:0:0:1 | Hyphal_reg_CWP superfamily |
Number of detected copies in S. cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii (IFO1802T), and S. bayanus, respectively.
Budding yeast Dicer was described in (Drinnenberg )
Loss of duplicate genes from the ancient whole-genome duplication in the Saccharomyces sensu stricto clade
| YGOB Locus | Retention Pattern | Functional Annotation | |
|---|---|---|---|
| Anc_1.22 | 2:1:1:1:2 | Uncharacterized | |
| Anc_1.363 | 2:2:1:2:2 | Predicted to have thiol-disulfide oxidoreductase active site | |
| Anc_1.474 | 2:2:1:2:2 | Hsp70 family; localized in mitochondrial nucleoids; plays a role in protein translocation | |
| Anc_1.500 | 1:1:1:1:2 | Inhibits transcriptional activation by Gal4p | |
| Anc_3.318 | 1:1:1/ψ:2:2 | RNA binding protein with similarity to hnRNP-K; localizes to the cytoplasm and subtelomeric DNA | |
| Anc_4.379 | 1:1:1:1:2 | Protein amino acid O-linked glycosylation | |
| Anc_5.274 | 2:2:1:2:2 | Serine/threonine MAP kinase involved in regulating the maintenance of cell wall integrity | |
| Anc_5.528 | 2:2:2:2:1 | AP-1-like basic leucine zipper (bZIP) transcriptional activator involved in stress responses, iron metabolism, and pleiotropic drug resistance | |
| Anc_5.554 | 1:2:1/ψ:1/ψ:1/ψ | Uncharacterized | |
| Anc_8.181 | 2:2:2:1:2 | Uncharacterized | |
| Anc_8.569 | 1:2:1:1:1 | Plasma membrane sulfite pump | |
| Anc_8.597 | 1/ψ:1/ψ:1:1/ψ:2 | Soluble GTPase with a role in regulation of membrane traffic | |
| Anc_8.827 | 2:2:1:2:2 | Phosphoribosyltransferase; fifth step in pyrimidine biosynthesis pathway |
Number of detected copies or pseudogenes (ψ) in S. cerevisiae, S. paradoxus, S. mikatae, S. kudriavzevii, and S. bayanus, respectively.
Construction of heterothallic haploid strains with auxotrophic markers for S. mikatae, S. kudriavzevii and S. bayanus
| Species | Strain | Original | Genotype | Reference |
|---|---|---|---|---|
| JRY9171 | IFO 1815T | This study | ||
| JRY9172 | IFO 1815T | This study | ||
| JRY9173 | IFO 1815T | This study | ||
| JRY9174 | IFO 1815T | This study | ||
| JRY9175 | IFO 1815T | This study | ||
| JRY9176 | IFO 1815T | This study | ||
| JRY9177 | IFO 1815T | This study | ||
| JRY9178 | IFO 1815T | This study | ||
| JRY9179 | IFO 1815T | This study | ||
| JRY9180 | IFO 1815T | This study | ||
| JRY9181 | IFO 1815T | This study | ||
| JRY9182 | IFO 1815T | This study | ||
| JRY9183 | IFO 1815T | This study | ||
| JRY9184 | IFO 1815T | This study | ||
| FM1097 | IFO 1802T | |||
| FM1098 | IFO 1802T | |||
| FM1363 | IFO 1802T | This study | ||
| FM1403 | IFO 1802T | This study | ||
| FM1122 | IFO 1802T | This study | ||
| FM1141 | IFO 1802T | This study | ||
| FM1388 | IFO 1802T | This study | ||
| JRY9185 | IFO 1802T | This study | ||
| JRY9186 | IFO 1802T | This study | ||
| JRY9187 | IFO 1802T | This study | ||
| JRY9188 | IFO 1802T | This study | ||
| FM1109 | ZP 591 | |||
| FM1110 | ZP 591 | |||
| FM1071 | ZP 591 | |||
| FM1158 | ZP 591 | This study | ||
| FM1400 | ZP 591 | This study | ||
| FM1340 | ZP 591 | |||
| FM1123 | ZP 591 | |||
| FM1192 | ZP 591 | This study | ||
| FM1194 | ZP 591 | This study | ||
| FM1131 | ZP 591 | |||
| FM1183 | ZP 591 | |||
| FM1193 | ZP 591 | This study | ||
| FM1389 | ZP 591 | This study | ||
| JRY9189 | CBS 7001 | This study | ||
| JRY9190 | CBS 7001 | This study | ||
| JRY8149 | CBS 7001 | |||
| JRY8150 | CBS 7001 | |||
| JRY8153 | CBS 7001 | |||
| JRY8154 | CBS 7001 | |||
| JRY8147 | CBS 7001 | |||
| JRY8148 | CBS 7001 | |||
| JRY9191 | CBS 7001 | This study | ||
| JRY9040 | CBS 7001 | |||
| JRY9192 | CBS 7001 | This study | ||
| JRY9193 | CBS 7001 | This study | ||
| JRY9194 | CBS 7001 | This study | ||
| JRY9195 | CBS 7001 | This study |
All strains are available upon request from C. T. Hittinger.