| Literature DB >> 20157474 |
Maria C Costanzo1, Marek S Skrzypek, Robert Nash, Edith Wong, Gail Binkley, Stacia R Engel, Benjamin Hitz, Eurie L Hong, J Michael Cherry.
Abstract
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org/) organizes and displays molecular and genetic information about the genes and proteins of baker's yeast, Saccharomyces cerevisiae. Mutant phenotype screens have been the starting point for a large proportion of yeast molecular biological studies, and are still used today to elucidate the functions of uncharacterized genes and discover new roles for previously studied genes. To greatly facilitate searching and comparison of mutant phenotypes across genes, we have devised a new controlled-vocabulary system for capturing phenotype information. Each phenotype annotation is represented as an 'observable', which is the entity, or process that is observed, and a 'qualifier' that describes the change in that entity or process in the mutant (e.g. decreased, increased, or abnormal). Additional information about the mutant, such as strain background, allele name, conditions under which the phenotype is observed, or the identity of relevant chemicals, is captured in separate fields. For each gene, a summary of the mutant phenotype information is displayed on the Locus Summary page, and the complete information is displayed in tabular format on the Phenotype Details Page. All of the information is searchable and may also be downloaded in bulk using SGD's Batch Download Tool or Download Data Files Page. In the future, phenotypes will be integrated with other curated data to allow searching across different types of functional information, such as genetic and physical interaction data and Gene Ontology annotations.Database URL:http://www.yeastgenome.org/Entities:
Year: 2009 PMID: 20157474 PMCID: PMC2790299 DOI: 10.1093/database/bap001
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Observables and qualifiers in the yeast phenotype ontology (YPO)
| Observable |
| • Cellular processes |
| ◦ Chromosome/plasmid maintenance |
| ◦ Intracellular transport |
| ◦ Mitotic cell cycle |
| ◦ Stress resistance |
| • Development |
| ◦ Apoptosis |
| ◦ Budding |
| ◦ Filamentous growth |
| ◦ Lifespan |
| ◦ Necrotic cell death |
| ◦ Sexual cycle |
| ◦ Virulence |
| • Essentiality |
| ◦ Inviable |
| ◦ Viable |
| •Fitness |
| ◦ Competitive fitness |
| ◦ Haploinsufficient |
| ◦ Viability |
| •Metabolism and growth |
| ◦ RNA accumulation |
| ◦ RNA modification |
| ◦ Anaerobic metabolism |
| ◦ Chemical compound accumulation |
| ◦ Chemical compound excretion |
| ◦ Fermentative metabolism |
| ◦ Nutrient utilization |
| ◦ Protein activity |
| ◦ Protein/peptide accumulation |
| ◦ Protein/peptide distribution |
| ◦ Protein/peptide modification |
| ◦ Redox state |
| ◦ Respiratory metabolism |
| ◦ Vegetative growth |
| • Morphology |
| ◦ Cellular morphology |
| ◦ Culture appearance |
| Qualifier |
| • No qualifier |
| • Abnormal |
| ◦ Arrested |
| ◦ Decreased |
| ▪ Absent |
| ◦ Decreased duration |
| Qualifier |
| ◦ Decreased rate |
| ◦ Delayed |
| ◦Increased |
| ◦ Increased duration |
| ◦ Increased rate |
| ◦ Premature |
| • Normal |
| ◦Normal rate |
Only the two highest levels of the observables ontology are shown; most of the terms have more specific child terms. The entire list of observables may be viewed at http://www.yeastgenome.org/cache/PhenotypeTree.html. The entire set of qualifiers is shown. Qualifiers are required with most observables, except those which implicitly include a qualifier (e.g. ‘inviable’).
Experiment types in the YPO
| Phenotype assays |
| • Classical genetics |
| ◦ Heterozygous diploid |
| ◦ Homozygous diploid |
| • Large-scale survey |
| ◦ Competitive growth |
| ▪ Heterozygous diploid, competitive growth |
| ▪ Homozygous diploid, competitive growth |
| ◦ Heterozygous diploid, large-scale survey |
| ◦ Homozygous diploid, large-scale survey |
| ◦ Systematic mutation set |
| ▪ Heterozygous diploid, systematic mutation set |
| ▪ Homozygous diploid, systematic mutation set |
The complete list is shown. Haploidy is implicit unless otherwise specified.
Mutant types in the YPO
| Mutant type | Definition |
|---|---|
| Activation | The mutation increases the normal activity of the gene product |
| Conditional | The activity of the gene product is normal under some conditions and altered under others |
| Dominant negative | The mutation results in a gene product that interferes with the function of the normal, wild-type gene product |
| Gain of function | The mutation confers a new activity on the gene product |
| Misexpression | The mutation results in expression of the gene product at a developmental stage, in a cell type, or at a subcellular location different from that at which the wild-type gene is expressed |
| Null | The mutation completely abolishes the function of the gene product |
| Overexpression | The mutation causes expression of the otherwise normal gene product to higher levels than wild type |
| Reduction of function | The mutation reduces the activity of the gene product |
| Repressible | The mutation causes a reduction in levels of the gene product, often through the use of a repressible promoter |
| Unspecified | A curator has tried to determine the mutant type but it is not specified in the article |
The complete list is shown, with definitions.
Figure 1.Database schema for mutant phenotype-related data. Top, the controlled-vocabulary tables adapted from the CHADO schema. Bottom, the phenotype data tables.
Figure 2.Phenotype section of SGD's Locus Summary page for the ALG8 gene. The figure represents only a portion of the SGD locus summary for ALG8, http://www.yeastgenome.org/cgi-bin/locus.fpl?locus=ALG8. The observable, qualifier, and high-level experiment type are shown for all curated mutant phenotypes of ALG8. For phenotypes that include the observable ‘resistance to chemicals’, the name of the chemical is substituted for the word ‘chemical’ in this summary display; several examples of this are shown here. For the observable ‘protein/peptide modification’, the name of the Reporter is substituted for ‘protein/peptide’—in this instance, ‘carboxypeptidase Y (Prc1p)’.
Figure 3.Phenotype details page for the ALG8 gene. Only a portion of the table is shown. Columns of the table contain: experiment type; mutant type and allele information, if any; strain background; phenotype (observable: qualifier); chemical, if any; other details, including conditions, reporters, or details; and the reference. Each observable name in the phenotype column is hyperlinked to a list of other phenotype annotations using that observable. Each chemical name in the chemical column is hyperlinked to a list of other phenotype annotations involving that chemical. This table of phenotypes is on a page that contains the standard SGD toolbar containing links to major tools and resources in SGD; below the table are links to other resources, external to SGD, which offer mutant phenotype information or provide mutant strains (data not shown).
Figure 4.Searching phenotype data. The basic SGD Search is entered via a text box at the top of most SGD pages. Keywords entered into this search are used to search 15 major categories of information in SGD, and a summary of the results is presented on the SGD search results page (center). The link to ‘Phenotype annotations’ leads to a summary page listing the number of matches found in observables (lower left). The link to ‘Expanded phenotype search’ launches a search of all phenotype data. Results of the search are presented on the Expanded Phenotype Search Results Summary page (lower right). The top section displays matches to observables, and the bottom section displays matches to other phenotype data. On both types of results summary page, each number of matches is linked to a page displaying a table of the individual annotations.
Format of the ‘phenotype_data.tab’ file available from SGD's download data files page (http://www.yeastgenome.org/cgi-bin/batchDownload/)
| Column number | Column name | Mandatory? | Column contents |
|---|---|---|---|
| 1 | Feature name | Yes | The systematic name of the gene |
| 2 | Feature type | Yes | The feature type of the gene |
| 3 | Gene name | No | The SGD Standard Name of the gene |
| 4 | SGDID | Yes | The SGDID, unique database identifier, for the gene |
| 5 | Reference | Yes | PubMed ID | SGD Reference Identifier |
| 6 | Experiment type | Yes | The method used to detect and analyze the phenotype |
| 7 | Mutant type | Yes | Description of the impact of the mutation on activity of the gene product |
| 8 | Allele | No | Allele name and description |
| 9 | Strain background | Noa | Genetic background in which the phenotype was analyzed |
| 10 | Phenotype | Yes | Observable: qualifier |
| 11 | Chemical | Nob | Any chemical relevant to the phenotype |
| 12 | Condition | No | Condition under which the phenotype is observed |
| 13 | Details | No | Details about the phenotype |
| 14 | Reporter | Noc | The protein(s) or RNA(s) used in an experiment to track a process |
SGD's download data files page contains all phenotype data in SGD and is updated weekly. Its format is the same as the files created by the Batch Download tool (http://www.yeastgenome.org/cgi-bin/batchDownload), which may be used to retrieve phenotype and other data for a user-specified set of genes. aStrain background is mandatory in all current curation but was not entered for some of the first phenotypes curated. bA chemical is required for the following observables, because the name of the chemical is substituted for ‘chemical compound’ or ‘chemical’ on the Locus Summary page: chemical compound accumulation, chemical compound excretion, and resistance to chemicals. cA reporter is required for the following observables, because the name of the reporter is substituted for ‘protein’, ‘protein/peptide’ or ‘RNA’ on the Locus Summary page: protein activity, protein/peptide accumulation, protein/peptide distribution, protein/peptide modification, RNA accumulation, RNA localization, and RNA modification.