| Literature DB >> 23749449 |
Patrick A Gibney1, Mark J Hickman, Patrick H Bradley, John C Matese, David Botstein.
Abstract
The genome of budding yeast (Saccharomyces cerevisiae) contains approximately 5800 protein-encoding genes, the majority of which are associated with some known biological function. Yet the extent of amino acid sequence conservation of these genes over all phyla has only been partially examined. Here we provide a more comprehensive overview and visualization of the conservation of yeast genes and a means for browsing and exploring the data in detail, down to the individual yeast gene, at http://yeast-phylogroups.princeton.edu. We used data from the OrthoMCL database, which has defined orthologs from approximately 150 completely sequenced genomes, including diverse representatives of the archeal, bacterial, and eukaryotic domains. By clustering genes based on similar patterns of conservation, we organized and visualized all the protein-encoding genes in yeast as a single heat map. Most genes fall into one of eight major clusters, called "phylogroups." Gene ontology analysis of the phylogroups revealed that they were associated with specific, distinct trends in gene function, generalizations likely to be of interest to a wide range of biologists.Entities:
Keywords: evolution; genome; orthology; phylogeny; yeast
Mesh:
Substances:
Year: 2013 PMID: 23749449 PMCID: PMC3737173 DOI: 10.1534/g3.113.006585
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1The S. cerevisiae genome grouped into distinct phylogenetic categories based on genetic conservation. Green intensity indicates percent of species in a specified taxonomic category (listed along the top with number of species analyzed in each category shown in parentheses) with orthologs to the S. cerevisiae genes (along the y-axis) (see legend below for color-to-percent conversion). Genes were ordered as described in the section Materials and Methods. Distinct phylogenetic categories, which we call phylogroups, are listed descriptively to the right of the color-map. The “minor phylogroups” are examined in finer detail in Figure S2. The “No Ortholog Data” phylogroup refers to a set of yeast genes that were not curated by the OrthoMCL database and so were not analyzed in this manuscript.
Figure 2GO term enrichment of phylogroups. GO-Slim Mapper was used to identify GO terms that are enriched in each phylogroup. The most significant results are presented in a color-map with yellow intensity corresponding to significance of enrichment (see legend; the color intensity scale was defined using our significance threshold of P < 10−7). Phylogroups analyzed are listed across the top of the color-map. GO-Slim Categories are shown in the indicated order: process (top), component (bottom). The data for GO-Slim Category: function is presented in Figure S3.