| Literature DB >> 27252399 |
Giltae Song1, Rama Balakrishnan1, Gail Binkley1, Maria C Costanzo1, Kyla Dalusag1, Janos Demeter1, Stacia Engel1, Sage T Hellerstedt1, Kalpana Karra1, Benjamin C Hitz1, Robert S Nash1, Kelley Paskov1, Travis Sheppard1, Marek Skrzypek1, Shuai Weng1, Edith Wong1, J Michael Cherry2.
Abstract
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. To provide a wider scope of genetic and phenotypic variation in yeast, the genome sequences and their corresponding annotations from 11 alternative S. cerevisiae reference strains have been integrated into SGD. Genomic and protein sequence information for genes from these strains are now available on the Sequence and Protein tab of the corresponding Locus Summary pages. We illustrate how these genome sequences can be utilized to aid our understanding of strain-specific functional and phenotypic differences.Database URL: www.yeastgenome.org.Entities:
Mesh:
Year: 2016 PMID: 27252399 PMCID: PMC4888754 DOI: 10.1093/database/baw074
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Summary information on the 11 alternative reference strains
| Strain | Description | Source (ATCC ID) | NCBI BioSample Accession | Number of ORFs | Phenotype count per strain (%) | References |
|---|---|---|---|---|---|---|
| RM11-1a | A natural isolate collected from a California vineyard | UCD 2788 (UC Davis culture collection ID) | SAMN03020228 | 5323 | 2 (0.002) | ( |
| Y55 | Laboratory strain originally isolated from wine grapes | ATCC: 52530 | SAMN03020218 | 5359 | 18 (0.015) | ( |
| FL100 | Laboratory strain | ATCC: 28383 | SAMN03020232 | 5366 | 57 (0.046) | ( |
| JK9-3d | Laboratory strain | ATCC: MYA-555 | SAMN03020238 | 5385 | 111 (0.09) | ( |
| CEN.PK | Laboratory strain popular for use in systems biology studies | ATCC: MYA-1108 | SAMN03020234 | 5379 | 213 (0.174) | ( |
| X2180-1A | S288C derivative laboratory strain | ATCC: 204504 | SAMN03020236 | 5387 | 276 (0.225) | ( |
| D273-10B | Lab strain used for mitochondrial studies | ATCC: 24657 | SAMN03020237 | 5383 | 278 (0.227) | ( |
| SEY6210 | Lab strain used in studies of autophagy and protein sorting | ATCC: 96099 | SAMN03020235 | 5400 | 414 (0.337) | ( |
| SK1 | Lab strain used for studying sporulation and meiosis | ATCC: 204720 | SAMN03020220 | 5350 | 859 (0.7) | ( |
| Sigma1278b | Used in pseudohyphal growth studies | ATCC: 42800 | SAMN03020229 | 5358 | 2170 (1.768) | ( |
| W303 | Laboratory strain used for research into aging | ATCC: 20060 | SAMN03020233 | 5397 | 3158 (2.573) | ( |
We chose 11 non-S288C strains as alternative reference strains based on the number of phenotype annotations curated in SGD. Phenotype studies are most frequently reported using the S288C reference strain, i.e. 84.78% of phenotypic counts in SGD are based on work in S288C. Other than S288C, the 11 alternative reference strains have been most frequently used for yeast phenotype studies. The alternative strains are used for specific areas of biology (e.g. CEN.PK for systems biology, D273-10B for mitochondrial studies, SEY6210 for autophagy and protein sorting, SK1 for sporulation and meiosis, Sigma1278b for pseudohyphal growth, and W303 for aging). The source of the sequenced strain genome is summarized with ATCC ID. The assembly and raw sequence data of each strain has been deposited in NCBI and can be found with NCBI BioSample accession numbers.
Figure 1.Visualization of the new sequence data for AQY1 in the database. The Sequence page can be accessed by selecting the tab at the top of the AQY1 LSP, labeled ‘Sequence.’ There are six sections within the Sequence page including the ‘Sequence Overview’ for descriptive information, ‘Alternative Reference Strains’ for viewing the DNA or protein sequence in a selected alternative reference strain, and ‘Resources’ for access to processed results using sequence analysis tools (e.g. a new tool called SGD Variant Viewer (28), updated BLAST search, and ClustalW multiple sequence alignments). Selection of SK1 as an alternative strain is illustrated in this figure. Neighbor genes of AQY1 in SK1 are also shown in the visualization. When the user clicks a sequence analysis tool (URL in blue) in the resources tab, they will view the gene-specific results using that tool.
Figure 2.Mutant phenotypes for AQY1 gene in SGD. By querying for ‘AQY1’ using the SGD search box, and selecting the Phenotype tab on the AQY1 LSP, phenotype information for AQY1 can be viewed. A bar chart summarizes how many phenotypic annotations have been curated in different strain backgrounds (e.g. two mutant phenotypes in the SK1 strain background). If the box for SK1 in the bar chart is selected, the details of the two mutant phenotypes for AQY1 in the SK1 strain background will be listed in the Annotations section. Users also can refer to the relevant literature (27) that describes studies on the mutant phenotypes resulted from polymorphisms in the AQY1 gene in strain SK1 as shown in the “Reference” column of the table. Users can also access more information of the alternative strain (SK1) by selecting the strain name (highlighted in yellow) in the Annotations table (see Figure 3).
Figure 3.The SGD SK1 Strain page. Strain pages are available for each of the alternative strains. The Strain page includes an Overview section for descriptive information of each strain, and a Genome Sequence section for brief statistics of the new sequence integrated in the database (NCBI accession number, the number of scaffolds, and the number of ORFs) and a table for information on each contig.
Figure 4.Visualization of genetic variation within AQY1 gene across alternative reference strains in SGD. Variations within AQY1 among 11 alternative strains and the S288C reference strain are depicted in the SGD Variant Viewer. Non-synonymous mutations and deletions/insertions with their location information in AQY1 are shown in the viewer. We can see that AQY1 is conserved in most laboratory strains. SK1 shows the most non-synonymous mutations relative to the S288C reference unlike the other strains. There are two common mutations that appear in four strains Sigma1278b, RM11-1a, SK1 and Y55: V121 (guided in a yellow line in the viewer) and P255. SK1 has a shorter C-terminus than other strains whereas Sigma1278b, RM11-1a and Y55 have longer C-termini than the eight laboratory strains. Other than the C-terminus and two mutations (V121 and P255), Sigma1278b, RM11-1a and Y55 show conserved amino acids with the eight laboratory strains unlike SK1.