MetaFam: a unified classification of protein families. II. Schema and query capabilities.

E Shoop, K A Silverstein, J E Johnson, E F Retzel.

Abstract

MOTIVATION: Protein sequence and family data is accumulating at such a rapid rate that state-of-the-art databases and interface tools are required to aid curators with their classifications. We have designed such a system, MetaFam, to facilitate the comparison and integration of public protein sequence and family data. This paper presents the global schema, integration issues, and query capabilities of MetaFam.
RESULTS: MetaFam is an integrated data warehouse of information about protein families and their sequences. This data has been collected into a consistent global schema, and stored in an Oracle relational database. The warehouse implementation allows for quick removal of outdated data sets. In addition to the relational implementation of the primary schema, we have developed several derived tables that enable efficient access from data visualization and exploration tools. Through a series of straightforward SQL queries, we demonstrate the usefulness of this data warehouse for comparing protein family classifications and for functional assignment of new sequences.
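
As a rough illustration of the kind of SQL comparison the abstract describes, the sketch below counts the proteins shared between families drawn from two classifications. It is not the MetaFam schema or any query from the paper: the table `family_membership`, its columns, the source names `DB_A` and `DB_B`, and all identifiers are hypothetical, and SQLite stands in for the Oracle database purely so the example is self-contained.

```python
import sqlite3

# Hypothetical, simplified schema for illustration only; the actual
# MetaFam global schema is described in the paper, not reproduced here.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE family_membership (
    source_db  TEXT NOT NULL,   -- which family classification (made-up names)
    family_id  TEXT NOT NULL,   -- family identifier within that source
    protein_id TEXT NOT NULL,   -- shared protein sequence identifier
    PRIMARY KEY (source_db, family_id, protein_id)
);
""")

# Made-up membership rows: two sources classifying the same four proteins.
rows = [
    ("DB_A", "A001", "P1"),
    ("DB_A", "A001", "P2"),
    ("DB_A", "A001", "P3"),
    ("DB_A", "A002", "P4"),
    ("DB_B", "B010", "P1"),
    ("DB_B", "B010", "P2"),
    ("DB_B", "B011", "P3"),
    ("DB_B", "B011", "P4"),
]
conn.executemany("INSERT INTO family_membership VALUES (?, ?, ?)", rows)


def shared_members(conn, db_x, db_y):
    """Count proteins shared by each pair of families from two sources."""
    query = """
        SELECT x.family_id, y.family_id, COUNT(*) AS shared
        FROM family_membership AS x
        JOIN family_membership AS y ON x.protein_id = y.protein_id
        WHERE x.source_db = ? AND y.source_db = ?
        GROUP BY x.family_id, y.family_id
        ORDER BY x.family_id, y.family_id
    """
    return conn.execute(query, (db_x, db_y)).fetchall()


overlaps = shared_members(conn, "DB_A", "DB_B")
for fam_x, fam_y, shared in overlaps:
    print(fam_x, fam_y, shared)
```

With this toy data, family `A001` overlaps both `B010` and `B011`, which is the sort of disagreement between classifications that a self-join on a shared protein identifier can surface.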

Substances:
Proteins

Year: 2001 PMID: 11294791 DOI: 10.1093/bioinformatics/17.3.262

Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937

Cited

4 in total

1. The MetaFam Server: a comprehensive protein family resource.

Authors: K A Silverstein; E Shoop; J E Johnson; A Kilian; J L Freeman; T M Kunau; I A Awad; M Mayer; E F Retzel
Journal: Nucleic Acids Res Date: 2001-01-01 Impact factor: 16.971

2. A perspective for biomedical data integration: design of databases for flow cytometry.

Authors: John Drakos; Marina Karakantza; Nicholas C Zoumbos; John Lakoumentas; George C Nikiforidis; George C Sakellaropoulos
Journal: BMC Bioinformatics Date: 2008-02-14 Impact factor: 3.169

3. DWARF--a data warehouse system for analyzing protein families.

Authors: Markus Fischer; Quan K Thai; Melanie Grieb; Jürgen Pleiss
Journal: BMC Bioinformatics Date: 2006-11-09 Impact factor: 3.169

4. Microarrays for global expression constructed with a low redundancy set of 27,500 sequenced cDNAs representing an array of developmental stages and physiological conditions of the soybean plant.

Authors: Lila O Vodkin; Anupama Khanna; Robin Shealy; Steven J Clough; Delkin Orlando Gonzalez; Reena Philip; Gracia Zabala; Françoise Thibaud-Nissen; Mark Sidarous; Martina V Strömvik; Elizabeth Shoop; Christina Schmidt; Ernest Retzel; John Erpelding; Randy C Shoemaker; Alicia M Rodriguez-Huete; Joseph C Polacco; Virginia Coryell; Paul Keim; George Gong; Lei Liu; Jose Pardinas; Peter Schweitzer
Journal: BMC Genomics Date: 2004-09-29 Impact factor: 3.969