Literature DB >> 15608214

The Ashbya Genome Database (AGD)--a tool for the yeast community and genome biologists.

Leandro Hermida1, Sophie Brachat, Sylvia Voegeli, Peter Philippsen, Michael Primig.   

Abstract

The Ashbya Genome Database (AGD) is a comprehensive online source of information covering genes from the filamentous fungus Ashbya gossypii. The database content is based upon comparative genome annotation between A.gossypii and the closely related budding yeast Saccharomyces cerevisiae taking both sequence similarity and synteny (conserved order and orientation) into account. Release 2 of AGD contains 4718 protein-encoding loci located across seven chromosomes. Information can be retrieved using systematic or standard locus names from A.gossypii as well as budding and fission yeast. Approximately 90% of the genes in the genome of A.gossypii are homologous and syntenic to loci of budding yeast. Therefore, AGD is a useful tool not only for the various yeast communities in general but also for biologists who are interested in evolutionary aspects of genome research and comparative genome annotation. The database provides scientists with a convenient graphical user interface that includes various locus search and genome browsing options, data download and export functionalities and numerous reciprocal links to external databases including SGD, MIPS, GeneDB, KEGG, GermOnline and Swiss-Prot/TrEMBL. AGD is accessible at http://agd.unibas.ch/.

Entities:  

Mesh:

Year:  2005        PMID: 15608214      PMCID: PMC539963          DOI: 10.1093/nar/gki009

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Since the first eukaryotic genome sequence of budding yeast Saccharomyces cerevisiae was published in 1996 (1), similar projects have been completed for other yeast species including Schizosaccharomyces pombe and Ashbya gossypii (2–7). The identification of open reading frames (ORFs) and loci that encode RNAs was substantially facilitated by comparative genomics because this approach exploits the conservation of gene order and orientation (synteny) in several related genomes. It is therefore possible to distinguish genuine ORFs from annotation artifacts and to assign start codons and intron/exon boundaries in a more reliable manner. Moreover, aligning promoter sequences from closely related species yields novel conserved elements that might be involved in transcriptional regulation (2,3). The genome sequence of A.gossypii has proven to be extremely useful for refining the annotation of the budding yeast genome (8) and, most importantly, has provided the ultimate proof that S.cerevisiae has undergone a whole-genome duplication event (5). The sequence and annotation data are therefore highly interesting from an evolutionary standpoint because they permit reconstruction of genome rearrangements and gene deletions in budding yeast. Furthermore, A.gossypii is an excellent model system to study important biological problems like polar growth using genetic, biochemical and genomic experimental approaches (9–13). It should be emphasized that high-density oligonucleotide microarrays will become available for A.gossypii in the future, opening up an avenue to extensive profiling studies of the poorly understood transcriptional program underlying the life cycle of this interesting filamentous fungus (P. Philippsen, unpublished data). The annotation information stored in Ashbya Genome Database (AGD) is constantly experimentally verified and updated. Researchers using the database are encouraged to provide feedback on errors and inconsistencies to further improve the reliability of AGD's entries. Here, we describe Release 2 of the AGD, a comprehensive and cross-referenced source of manually verified genome annotation data accessible at http://agd.unibas.ch/.

DATABASE DEVELOPMENT AND CROSS-CONNECTION

The database and web site were constructed using the Ensembl application programming interface (API) and base web code version 14 (14–16). Details of the hardware and software required to create an Ensembl development environment can be found at http://www.ensembl.org/Docs/wiki/html/EnsemblDocs/InstallEnsemblWebsite.html. The MySQL database schema and structure was created to contain A.gossypii data in an Ensembl-compliant format. MySQL database creation and data population was accomplished with a series of Perl (http://www.perl.org/) scripts using the source GenBank sequence data and annotation files for A.gossypii provided by Dietrich et al. (5). The loading scripts also facilitate quick and seamless re-population of the database when updated GenBank files are made available. A.gossypii GenBank files for AGD Release 1 are presently accessible on the supplementary download page http://agd.unibas.ch/Ashbya_gossypii/download_ashbya. Customized web pages were created specifically for AGD using the Ensembl API and base web site structure. The database schema and scripts are available upon request. Hyperlinks into AGD from external sources are possible via the uniform resource locator (URL) http://agd.unibas.ch/Ashbya_gossypii/unisearch?type=Gene&q=[KEYWORD] where [KEYWORD] can be a budding or fission yeast gene systematic name (e.g. YPR175W, SPBP8B7.14C) or standard name (DPB2) or an A.gossypii systematic name (e.g. AEL267C).

THE SCOPE AND QUALITY OF AGD

The goal of AGD Release 2 is to cover all 4718 currently annotated loci from A.gossypii that are based on the most upstream ATG start triplet of a given ORF. The database provides convenient access to annotation information available for homologous genes from budding and fission yeasts. Note that the annotation approach in rare cases leads to start codons being wrongly assigned. This is in part due to the GC-content being higher in A.gossypii than in budding yeast (53% versus 38%) (5). We are in the process of experimentally verifying the annotation of ORFs and start codons in a number of questionable cases including some reported by users. The results of this ongoing work comprising DNA re-sequencing and/or verification of transcriptional start sites by RACE–PCR are immediately incorporated into update files submitted to Genbank and AGD. Lists of loci for which updated ORFs and/or start codons are available between database releases are accessible via the supplementary download page of AGD.

INFORMATION AND DATA RETRIEVAL

Search options

Currently, it is possible to search AGD using A.gossypii's systematic locus name (e.g. ADR058C) and the systematic or standard names from budding or fission yeast, respectively (Figure 1A). Note that fission yeast genes are linked to A.gossypii loci either directly by an automated BLAST similarity search (S. Brachat and P. Philippsen, unpublished data) or via their budding yeast homologs (V. Wood, personal communication). A popup menu includes several refined options like gene, mRNA, peptide or contig/clone. Moreover, a wildcard search using truncated gene names (e.g. CDC*, RAD*, SRB* and SPO*) allows the retrieval of several loci from A.gossypii that are shown together with their putative homologs from budding and fission yeast. It is also possible to display chromosomal regions by defining the chromosome number and the base coordinates (Figure 1B).
Figure 1

Search and browsing options in AGD. (A) The search form and (B) the chromosomal browsing function. (C and D) The Overview and the Detailed View of the ContigView page, respectively.

Genome browsing and various levels of views

The welcome page provides a whole-genome view via an interactive image of the seven A.gossypii chromosomes that enables scientists to home in on any region of interest. By clicking on a chromosome, the user gains access to the MapView page, which gives some general information and statistics (gene content and chromosome length) about the chromosome. From the MapView page, the user can click anywhere on the chromosome ideogram to jump to the contig-level view of A.gossypii features at that point. For the selection of specific chromosome regions, users can click on the link to the AnchorView page, which allows the selection of two features on a chromosome as anchor points to display the contig-level region between them. Both methods of browsing lead to the ContigView page, which comprises four sections: Chromosome, Overview (Figure 1C), Detailed View (Figure 1D) and Basepair View. Each of these sections can be hidden by clicking on the − tick box. The views are fully interactive and enable the user to click on any of the indicated genes, transcripts and legends to call up the respective locus report page. The Detailed View and Basepair View sections include zoom, image size and window jump functions allowing flexible genome navigation.

AGD report pages

The AGD Gene Report page begins with sections that indicate the gene name, database ID and the genomic location (Figure 2A). The Description section contains information on the homologous S.cerevisiae and S.pombe loci (indicating systematic and standard names) and the External Links section provides access to annotation databases for budding yeast [SGD (17), MIPS (18)], fission yeast [GeneDB (19)] and the relevant sections of databases covering multiple species [KEGG (20), GermOnline (21–23), Swiss-Prot/TrEMBL (24)]. These links enable users to access information obtained about homologous genes in related species that may be relevant for the locus in A.gossypii as well. The Predicted Transcripts section contains a graphical display of the target gene's chromosomal localization and links to the transcript, exon and protein report pages. The Transcript/Translation Summary section also gives access to transcript, exon and protein information, and contains a graphical display of the mRNA and protein structures (Figure 2B–D). The [View transcript information] link leads the user to the AGD Transcript Report page, which contains additional information regarding the mRNA sequence. Various markups can be added to the mRNA sequence display, which show the user the transcript's underlying exons, protein codons and peptide sequence. The AGD Exon Report page adds detailed information about the exon/intron structure of the gene and displays flanking 5′ and 3′ untranslated regions. The AGD Protein Report page can be called up via the [View protein information] link and shows peptide sequence and property information.
Figure 2

The locus report page of AGD. (A and B) The Gene Report and Transcript/Translation sections of the Gene Report page for the A.gossypii homolog of KIN28 (S.cerevisiae) and SPBC19F8.07 (S.pombe). (C and D) Examples of the Transcript and Peptide view, and (E) the data ExportView interface that is accessible via links in the welcome page and each Gene Report page.

Data export

From the Download AGD Release 2 link in the welcome page and the export data link in the AGD Gene Report pages, the individual mRNA or peptide sequences can be retrieved in FASTA format via an appropriate form (Figure 2E). A user can also download entire regions of the A.gossypii genome complete with annotations in EMBL, GenBank or FASTA format, or region feature lists for uploading into other databases in GFF, tab-delimited and comma-delimited formats. Finally, it is possible to retrieve the complete source AGD database content as GenBank files, the complete set of ORF translations in FASTA format and a correspondence list of A.gossypii and S.cerevisiae locus names by following the link to the supplementary download page.

FUTURE DEVELOPMENT

We are currently implementing a BLAST functionality that will enable users to query A.gossypii genes or gene products against the S.cerevisiae genome. At a later stage, this feature will include the genomes of other species as well. In addition, we plan to provide a two-way interactive synteny viewer that displays the chromosomal organization of loci from A.gossypii together with their counterparts from budding yeast. Future releases of AGD will contain a number of additional features including non-coding RNAs and GeneOntology assignments to improve annotation and query options. High-density oligonucleotide microarrays covering the entire genome will become available in the foreseeable future, and it is therefore our intention to extend the database's functionalities to cover high-throughput expression data. Finally, results from large-scale gene deletion studies produced by functional genomics will also be incorporated and information on BAC clones and plasmids containing A.gossypii genes will be made available.

CONCLUSION

AGD is a highly useful online source of information for the extensive budding and fission yeast communities and also targets scientists interested in the evolution of genomes. AGD's content is based upon comparative genome annotation between S.cerevisiae and its evolutionary ancestor A.gossypii whereby every single locus identified by automatic annotation was manually inspected. Gene annotation is further improved by a sustained effort to experimentally verify all questionable cases especially those reported by researchers who use AGD.
  24 in total

1.  GermOnline, a new cross-species community annotation database on germ-line development and gametogenesis.

Authors:  M Primig; C Wiederkehr; R Basavaraj; C Sarrauste de Menthière; L Hermida; R Koch; U Schlecht; H G Dickinson; M Fellous; J A Grootegoed; R S Hawley; B Jégou; B Maro; A Nicolas; T Orr-Weaver; T Schedl; A Villeneuve; D J Wolgemuth; M Yamamoto; D Zickler; N Lamb; R E Esposito
Journal:  Nat Genet       Date:  2003-12       Impact factor: 38.330

2.  The KEGG resource for deciphering the genome.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Yasushi Okuno; Masahiro Hattori
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Authors:  Brigitte Boeckmann; Amos Bairoch; Rolf Apweiler; Marie-Claude Blatter; Anne Estreicher; Elisabeth Gasteiger; Maria J Martin; Karine Michoud; Claire O'Donovan; Isabelle Phan; Sandrine Pilbout; Michel Schneider
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

4.  PCR-based gene targeting in the filamentous fungus Ashbya gossypii.

Authors:  J Wendland; Y Ayad-Durieux; P Knechtle; C Rebischung; P Philippsen
Journal:  Gene       Date:  2000-01-25       Impact factor: 3.688

5.  MIPS: analysis and annotation of proteins from whole genomes.

Authors:  H W Mewes; C Amid; R Arnold; D Frishman; U Güldener; G Mannhaupt; M Münsterkötter; P Pagel; N Strack; V Stümpflen; J Warfsmann; A Ruepp
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

6.  Maximal polar growth potential depends on the polarisome component AgSpa2 in the filamentous fungus Ashbya gossypii.

Authors:  Philipp Knechtle; Fred Dietrich; Peter Philippsen
Journal:  Mol Biol Cell       Date:  2003-08-22       Impact factor: 4.138

Review 7.  Life with 6000 genes.

Authors:  A Goffeau; B G Barrell; H Bussey; R W Davis; B Dujon; H Feldmann; F Galibert; J D Hoheisel; C Jacq; M Johnston; E J Louis; H W Mewes; Y Murakami; P Philippsen; H Tettelin; S G Oliver
Journal:  Science       Date:  1996-10-25       Impact factor: 47.728

8.  Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

Authors:  Paul Cliften; Priya Sudarsanam; Ashwin Desikan; Lucinda Fulton; Bob Fulton; John Majors; Robert Waterston; Barak A Cohen; Mark Johnston
Journal:  Science       Date:  2003-05-29       Impact factor: 47.728

9.  Sequencing and comparison of yeast species to identify genes and regulatory elements.

Authors:  Manolis Kellis; Nick Patterson; Matthew Endrizzi; Bruce Birren; Eric S Lander
Journal:  Nature       Date:  2003-05-15       Impact factor: 49.962

10.  Reinvestigation of the Saccharomyces cerevisiae genome annotation by comparison to the genome of a related fungus: Ashbya gossypii.

Authors:  Sophie Brachat; Fred S Dietrich; Sylvia Voegeli; Zhihong Zhang; Larissa Stuart; Anita Lerch; Krista Gates; Tom Gaffney; Peter Philippsen
Journal:  Genome Biol       Date:  2003-06-25       Impact factor: 13.583

View more
  5 in total

1.  The SH3/PH domain protein AgBoi1/2 collaborates with the Rho-type GTPase AgRho3 to prevent nonpolar growth at hyphal tips of Ashbya gossypii.

Authors:  Philipp Knechtle; Jürgen Wendland; Peter Philippsen
Journal:  Eukaryot Cell       Date:  2006-09-01

2.  Dandruff-associated Malassezia genomes reveal convergent and divergent virulence traits shared with plant and human fungal pathogens.

Authors:  Jun Xu; Charles W Saunders; Ping Hu; Raymond A Grant; Teun Boekhout; Eiko E Kuramae; James W Kronstad; Yvonne M Deangelis; Nancy L Reeder; Kevin R Johnstone; Meredith Leland; Angela M Fieno; William M Begley; Yiping Sun; Martin P Lacey; Tanuja Chaudhary; Thomas Keough; Lien Chu; Russell Sears; Bo Yuan; Thomas L Dawson
Journal:  Proc Natl Acad Sci U S A       Date:  2007-11-13       Impact factor: 11.205

3.  Ashbya Genome Database 3.0: a cross-species genome and transcriptome browser for yeast biologists.

Authors:  Alexandre Gattiker; Riccarda Rischatsch; Philippe Demougin; Sylvia Voegeli; Fred S Dietrich; Peter Philippsen; Michael Primig
Journal:  BMC Genomics       Date:  2007-01-09       Impact factor: 3.969

4.  Genome-wide computational prediction of tandem gene arrays: application in yeasts.

Authors:  Laurence Despons; Philippe V Baret; Lionel Frangeul; Véronique Leh Louis; Pascal Durrens; Jean-Luc Souciet
Journal:  BMC Genomics       Date:  2010-01-21       Impact factor: 3.969

5.  Fusion and fission of genes define a metric between fungal genomes.

Authors:  Pascal Durrens; Macha Nikolski; David Sherman
Journal:  PLoS Comput Biol       Date:  2008-10-24       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.