MetaFam: a unified classification of protein families. II. Schema and query capabilities.

E Shoop, K A Silverstein, J E Johnson, E F Retzel.

Abstract

MOTIVATION: Protein sequence and family data is accumulating at such a rapid rate that state-of-the-art databases and interface tools are required to aid curators with their classifications. We have designed such a system, MetaFam, to facilitate the comparison and integration of public protein sequence and family data. This paper presents the global schema, integration issues, and query capabilities of MetaFam.
RESULTS: MetaFam is an integrated data warehouse of information about protein families and their sequences. This data has been collected into a consistent global schema, and stored in an Oracle relational database. The warehouse implementation allows for quick removal of outdated data sets. In addition to the relational implementation of the primary schema, we have developed several derived tables that enable efficient access from data visualization and exploration tools. Through a series of straightforward SQL queries, we demonstrate the usefulness of this data warehouse for comparing protein family classifications and for functional assignment of new sequences.
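The abstract does not reproduce the schema or the queries themselves. As a rough illustration of the kind of SQL comparison it describes, the sketch below builds a tiny in-memory database and counts the sequences shared between families from two source classifications. The table names (`family`, `membership`), column names, source labels (`db_a`, `db_b`), and all identifiers are hypothetical and are not taken from MetaFam; SQLite stands in for Oracle only so that the example is self-contained.

```python
import sqlite3

# Hypothetical, heavily simplified schema. This is NOT the MetaFam schema;
# every table, column, and identifier here is invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE family (
    family_id TEXT PRIMARY KEY,
    source_db TEXT NOT NULL          -- which classification the family comes from
);
CREATE TABLE membership (
    family_id   TEXT NOT NULL REFERENCES family(family_id),
    sequence_id TEXT NOT NULL,
    PRIMARY KEY (family_id, sequence_id)
);
""")

# Toy data: two families from each of two made-up source classifications.
conn.executemany("INSERT INTO family VALUES (?, ?)", [
    ("A1", "db_a"), ("A2", "db_a"),
    ("B1", "db_b"), ("B2", "db_b"),
])
conn.executemany("INSERT INTO membership VALUES (?, ?)", [
    ("A1", "s1"), ("A1", "s2"), ("A1", "s3"),
    ("A2", "s4"),
    ("B1", "s1"), ("B1", "s2"),
    ("B2", "s3"), ("B2", "s4"), ("B2", "s5"),
])


def shared_members(conn, db_x, db_y):
    """Return (family_x, family_y, n_shared) for every pair of families,
    one from each source, that share at least one sequence."""
    return conn.execute("""
        SELECT mx.family_id, my.family_id, COUNT(*) AS n_shared
        FROM membership mx
        JOIN family fx     ON fx.family_id = mx.family_id
        JOIN membership my ON my.sequence_id = mx.sequence_id
        JOIN family fy     ON fy.family_id = my.family_id
        WHERE fx.source_db = ? AND fy.source_db = ?
        GROUP BY mx.family_id, my.family_id
        ORDER BY mx.family_id, my.family_id
    """, (db_x, db_y)).fetchall()


overlaps = shared_members(conn, "db_a", "db_b")
for fam_x, fam_y, n in overlaps:
    print(fam_x, fam_y, n)
```

With this toy data, family A1 overlaps both B1 and B2, which is the kind of disagreement between classifications that a curator would want to inspect.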

Year: 2001    PMID: 11294791    DOI: 10.1093/bioinformatics/17.3.262

Source DB: PubMed    Journal: Bioinformatics    ISSN: 1367-4803    Impact factor: 6.937


  4 in total

1.  The MetaFam Server: a comprehensive protein family resource.

Authors:  K A Silverstein; E Shoop; J E Johnson; A Kilian; J L Freeman; T M Kunau; I A Awad; M Mayer; E F Retzel
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  A perspective for biomedical data integration: design of databases for flow cytometry.

Authors:  John Drakos; Marina Karakantza; Nicholas C Zoumbos; John Lakoumentas; George C Nikiforidis; George C Sakellaropoulos
Journal:  BMC Bioinformatics       Date:  2008-02-14       Impact factor: 3.169

3.  DWARF--a data warehouse system for analyzing protein families.

Authors:  Markus Fischer; Quan K Thai; Melanie Grieb; Jürgen Pleiss
Journal:  BMC Bioinformatics       Date:  2006-11-09       Impact factor: 3.169

4.  Microarrays for global expression constructed with a low redundancy set of 27,500 sequenced cDNAs representing an array of developmental stages and physiological conditions of the soybean plant.

Authors:  Lila O Vodkin; Anupama Khanna; Robin Shealy; Steven J Clough; Delkin Orlando Gonzalez; Reena Philip; Gracia Zabala; Françoise Thibaud-Nissen; Mark Sidarous; Martina V Strömvik; Elizabeth Shoop; Christina Schmidt; Ernest Retzel; John Erpelding; Randy C Shoemaker; Alicia M Rodriguez-Huete; Joseph C Polacco; Virginia Coryell; Paul Keim; George Gong; Lei Liu; Jose Pardinas; Peter Schweitzer
Journal:  BMC Genomics       Date:  2004-09-29       Impact factor: 3.969
