| Literature DB >> 23487186 |
Stacia R Engel1, J Michael Cherry.
Abstract
The first completed eukaryotic genome sequence was that of the yeast Saccharomyces cerevisiae, and the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the original model organism database. SGD remains the authoritative community resource for the S. cerevisiae reference genome sequence and its annotation, and continues to provide comprehensive biological information correlated with S. cerevisiae genes and their products. A diverse set of yeast strains have been sequenced to explore commercial and laboratory applications, and a brief history of those strains is provided. The publication of these new genomes has motivated the creation of new tools, and SGD will annotate and provide comparative analyses of these sequences, correlating changes with variations in strain phenotypes and protein function. We are entering a new era at SGD, as we incorporate these new sequences and make them accessible to the scientific community, all in an effort to continue in our mission of educating researchers and facilitating discovery.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23487186 PMCID: PMC3595989 DOI: 10.1093/database/bat012
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Various S. cerevisiae genomes contain ORFs that are not present in the S288C reference genome
| Strain | ORFs not in S288C | Reference |
|---|---|---|
| AWRI796 | 74 | 15 |
| CEN.PK113-7D | 83 | 16 |
| EC1118 | 77 | 15 |
| FostersB | 36 | 15 |
| FostersO | 48 | 15 |
| JAY291 | 16 | 12 |
| Kyokai No.7 | 48 | 17 |
| QA23 | 110 | 15 |
| RM11-1a | 38 | 15 |
| Sigma1278b | 75 | 14 |
| VIN13 | 45 | 15 |
| VL3 | 54 | 15 |
| YJM789 | 34 | 15 |
Figure 1Phylogram depicting relationships among S. cerevisiae strains based on Unweighted Pair Group Method with Arithmetic Mean (UPGMA) clustering of whole-genomic distances as calculated by Nijkamp et al. (16). Redrawn from Nijkamp et al. (16).
In the years since the publication of the S288C genome, dozens of yeast genome sequences have been published
| Strain | Year | Provenance | NCBI BioProject | Contig N50 | Scaffold N50 |
|---|---|---|---|---|---|
| S288C | 1996 | Laboratory strain | PRJNA128 | N/A | N/A |
| RM11-1a | 2005 | Haploid derivative of California vineyard isolate | PRJNA13674 | 263 288 | 795 018 |
| YJM789 | 2007 | Haploid derivative of opportunistic human pathogen | PRJNA13304 | 429 709 | N/A |
| M22 | 2008 | Italian vineyard isolate | PRJNA28815 | 2207 | N/A |
| YPS163 | 2008 | Pennsylvania woodland isolate | PRJNA28813 | 2901 | N/A |
| AWRI1631 | 2008 | Haploid derivative of South African commercial wine strain N96 | PRJNA30553 | 7704 | N/A |
| JAY291 | 2009 | Haploid derivative of Brazilian industrial bioethanol strain PE-2 | PRJNA32809 | 64 336 | N/A |
| EC1118 | 2009 | Commercial wine strain | PRJEA37863 | 776 014 | N/A |
| Sigma1278b | 2009 | Laboratory strain | PRJNA39317 | 365 700 | N/A |
| Foster’s O | 2010 | Commercial ale strain | PRJNA48567 | 195 316 | N/A |
| Foster’s B | 2010 | Commercial ale strain | PRJNA48569 | 204 208 | 626 897 |
| VIN13 | 2010 | South African white wine strain | PRJNA48563 | 308 189 | 700 638 |
| AWRI796 | 2010 | South African red wine strain | PRJNA48559 | 403 341 | 565 854 |
| CLIB215 | 2010 | New Zealand bakery isolate | PRJNA60143 | 16 813 | 47 217 |
| CBS7960 | 2011 | Brazilian bioethanol factory isolate | PRJNA60391 | 18 761 | 65 099 |
| CLIB324 | 2011 | Vietnamese bakery isolate | PRJNA60415 | 4260 | 24 472 |
| CLIB382 | 2011 | Irish beer isolate | PRJNA60145 | 840 | 2711 |
| EC9-8 | 2011 | Haploid derivative of Israeli canyon isolate | PRJNA73985 | 15 539 | 541 605 |
| FL100 | 2011 | Laboratory strain | PRJNA60147 | 4244 | 26 506 |
| Kyokai No.7 | 2011 | Japanese sake yeast | PRJNA45827 | 120 978 | 902 266 |
| QA23 | 2011 | Portuguese Vinho Verde white wine strain | PRJNA48561 | 182 942 | 182 942 |
| PW5 | 2011 | Nigerian Raphia palm wine isolate | PRJNA60181 | 14 234 | 393 105 |
| T7 | 2011 | Missouri oak tree exudate isolate | PRJNA60387 | 147 205 | 476 142 |
| T73 | 2011 | Spanish red wine strain | PRJNA60195 | 2945 | 36 287 |
| UC5 | 2011 | Japanese sake yeast | PRJNA60197 | 17 142 | 356 094 |
| VL3 | 2011 | French white wine strain | PRJNA48565 | 293 399 | 656 188 |
| W303 | 2011 | Laboratory strain | PRJNA167645 | 149 943 | 367 966 |
| Y10 | 2011 | Philippine coconut isolate | PRJNA60201 | 2730 | 22 204 |
| YJM269 | 2011 | Austrian Blauer Portugieser wine grapes | PRJNA60389 | 23 452 | 58 353 |
| BY4741 | 2012 | S288C-derivative laboratory strain | N/A | N/A | N/A |
| BY4742 | 2012 | S288C-derivative laboratory strain | N/A | N/A | N/A |
| CEN.PK 113-7D | 2012 | Laboratory strain | PRJNA52955 | 48 196 | 918 791 |
| ZTW1 | 2012 | Chinese corn mash bioethanol isolate | PRJNA174065 | 556 921 | N/A |
aContig and scaffold N50 lengths are common genome statistics that indicate the minimum length in the set of individual contiguous sequences (contigs or scaffolds), which contain half of all bases in the assembly.
bN/A = not available.
Figure 2Precomputed ClustalW alignments of both amino acid and coding DNA sequences and ORF-specific dendrograms are available for each ORF at http://www.yeastgenome.org/cgibin/FUNGI/alignment.pl.
Figure 3Genomes of the various S. cerevisiae strains have been incorporated into the BLAST datasets at SGD, available for searching against genomic and coding DNA, as well as protein sequences at http://www.yeastgenome.org/cgi-bin/blast-sgd.pl.