Literature DB >> 17646315

A Chado case study: an ontology-based modular schema for representing genome-associated biological information.

Christopher J Mungall1, David B Emmert.   

Abstract

MOTIVATION: A few years ago, FlyBase undertook to design a new database schema to store Drosophila data. It would fully integrate genomic sequence and annotation data with bibliographic, genetic, phenotypic and molecular data from the literature representing a distillation of the first 100 years of research on this major animal model system. In developing this new integrated schema, FlyBase also made a commitment to ensure that its design was generic, extensible and available as open source, so that it could be employed as the core schema of any model organism data repository, thereby avoiding redundant software development and potentially increasing interoperability. Our question was whether we could create a relational database schema that would be successfully reused.
RESULTS: Chado is a relational database schema now being used to manage biological knowledge for a wide variety of organisms, from human to pathogens, especially the classes of information that directly or indirectly can be associated with genome sequences or the primary RNA and protein products encoded by a genome. Biological databases that conform to this schema can interoperate with one another, and with application software from the Generic Model Organism Database (GMOD) toolkit. Chado is distinctive because its design is driven by ontologies. The use of ontologies (or controlled vocabularies) is ubiquitous across the schema, as they are used as a means of typing entities. The Chado schema is partitioned into integrated subschemas (modules), each encapsulating a different biological domain, and each described using representations in appropriate ontologies. To illustrate this methodology, we describe here the Chado modules used for describing genomic sequences. AVAILABILITY: GMOD is a collaboration of several model organism database groups, including FlyBase, to develop a set of open-source software for managing model organism data. The Chado schema is freely distributed under the terms of the Artistic License (http://www.opensource.org/licenses/artistic-license.php) from GMOD (www.gmod.org).

Entities:  

Mesh:

Year:  2007        PMID: 17646315     DOI: 10.1093/bioinformatics/btm189

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  149 in total

1.  Visualizing next-generation sequencing data with JBrowse.

Authors:  Oscar Westesson; Mitchell Skinner; Ian Holmes
Journal:  Brief Bioinform       Date:  2012-03-12       Impact factor: 11.622

2.  The ANISEED database: digital representation, formalization, and elucidation of a chordate developmental program.

Authors:  Olivier Tassy; Delphine Dauga; Fabrice Daian; Daniel Sobral; François Robin; Pierre Khoueiry; David Salgado; Vanessa Fox; Danièle Caillol; Renaud Schiappa; Baptiste Laporte; Anne Rios; Guillaume Luxardi; Takehiro Kusakabe; Jean-Stéphane Joly; Sébastien Darras; Lionel Christiaen; Magali Contensin; Hélène Auger; Clément Lamy; Clare Hudson; Ute Rothbächer; Michael J Gilchrist; Kazuhiro W Makabe; Kohji Hotta; Shigeki Fujiwara; Nori Satoh; Yutaka Satou; Patrick Lemaire
Journal:  Genome Res       Date:  2010-07-20       Impact factor: 9.043

3.  GIDL: a rule based expert system for GenBank Intelligent Data Loading into the Molecular Biodiversity Database.

Authors:  Paolo Pannarale; Domenico Catalano; Giorgio De Caro; Giorgio Grillo; Pietro Leo; Graziano Pappadà; Francesco Rubino; Gaetano Scioscia; Flavio Licciulli
Journal:  BMC Bioinformatics       Date:  2012-03-28       Impact factor: 3.169

Review 4.  Genomic resources for invertebrate vectors of human pathogens, and the role of VectorBase.

Authors:  K Megy; M Hammond; D Lawson; R V Bruggner; E Birney; F H Collins
Journal:  Infect Genet Evol       Date:  2008-01-03       Impact factor: 3.342

5.  Apollo: a community resource for genome annotation editing.

Authors:  Ed Lee; Nomi Harris; Mark Gibson; Raymond Chetty; Suzanna Lewis
Journal:  Bioinformatics       Date:  2009-05-13       Impact factor: 6.937

6.  A community-based annotation framework for linking solanaceae genomes with phenomes.

Authors:  Naama Menda; Robert M Buels; Isaak Tecle; Lukas A Mueller
Journal:  Plant Physiol       Date:  2008-06-06       Impact factor: 8.340

Review 7.  Genome and proteome annotation: organization, interpretation and integration.

Authors:  Gabrielle A Reeves; David Talavera; Janet M Thornton
Journal:  J R Soc Interface       Date:  2009-02-06       Impact factor: 4.118

8.  JBrowse: a next-generation genome browser.

Authors:  Mitchell E Skinner; Andrew V Uzilov; Lincoln D Stein; Christopher J Mungall; Ian H Holmes
Journal:  Genome Res       Date:  2009-07-01       Impact factor: 9.043

9.  Quantitative measures for the management and comparison of annotated genomes.

Authors:  Karen Eilbeck; Barry Moore; Carson Holt; Mark Yandell
Journal:  BMC Bioinformatics       Date:  2009-02-23       Impact factor: 3.169

10.  rCAD: A Novel Database Schema for the Comparative Analysis of RNA.

Authors:  Stuart Ozer; Kishore J Doshi; Weijia Xu; Robin R Gutell
Journal:  Proc IEEE Int Conf Escience       Date:  2011-12-31
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.