| Literature DB >> 26578556 |
Travis K Sheppard1, Benjamin C Hitz1, Stacia R Engel1, Giltae Song1, Rama Balakrishnan1, Gail Binkley1, Maria C Costanzo1, Kyla S Dalusag1, Janos Demeter1, Sage T Hellerstedt1, Kalpana Karra1, Robert S Nash1, Kelley M Paskov1, Marek S Skrzypek1, Shuai Weng1, Edith D Wong1, J Michael Cherry2.
Abstract
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer.Entities:
Mesh:
Year: 2015 PMID: 26578556 PMCID: PMC4702884 DOI: 10.1093/nar/gkv1250
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Genomes included in the initial release of the Variant Viewer
| Strain | SGD strain page URL | Provenance | Accession |
|---|---|---|---|
| CEN.PK2-1Ca | Laboratory strain | JRIV00000000 | |
| D273-10B | Laboratory strain | JRIY00000000 | |
| FL100 | Laboratory strain | JRIT00000000 | |
| JK9-3d | Laboratory strain | JRIZ00000000 | |
| RM11-1A | Derivative of California vineyard isolate | JRIP00000000 | |
| Σ1278b-10569-6B | Laboratory strain | JRIQ00000000 | |
| SEY6210 | Laboratory strain | JRIW00000000 | |
| SK1 | Laboratory strain | JRIH00000000 | |
| W303 | Laboratory strain | JRIU00000000 | |
| X2180-1A | S288C-derived laboratory strain | JRIX00000000 | |
| Y55 | Laboratory strain | JRIF00000000 |
All genome sequences were determined using the AGAPE pipeline (4).
Figure 1.Pipeline of Variant Viewer data. The pipeline of variant data begins by fetching sequences for the reference strain and 11 alternative strains. The sequences for each feature are aligned using Clustal Omega (7) and stored in SGD's database. The alignment data is then combined with GO annotations, protein domains, introns for the reference genome and location in the reference genome to a JSON object that is indexed into Elasticsearch. When a user visits the site, the SGD server responds with the same JSON object, which the browser can use to render the interface.
Figure 2.Matrix visualization of alignments. (A) A matrix is provided showing the available strains with a dendrogram illustrating their sequence relationships. The rows represent the ORFs and their variation relative to the reference. The table allows users to scroll through visible features. (B) The matrix is colored with a scale that corresponds to their alignment to S288C (C) User is allowed to toggle between DNA or protein variation view. (D) A search bar allows users to limit the features that appear in the display.
Figure 3.ORF view. The lollipop variant visualization highlights variation within a single ORF. In this example the ECM33 ORF nucleotide variation between 11 strain sequences is shown. SNPs, insertions and deletions are represented as symbols called lollipops. Below the lollipop visualization, the varied segments of the aligned sequences are shown. Mousing over a lollipop in the upper portion of the display highlights the corresponding sequence across the visible strains. In this view, users can also toggle between DNA and protein modes.