Literature DB >> 9730922

A model system for studying the integration of molecular biology databases.

J Macauley1, H Wang, N Goodman.   

Abstract

MOTIVATION: Integration of molecular biology databases remains limited in practice despite its practical importance and considerable research effort. The complexity of the problem is such that an experimental approach is mandatory, yet this very complexity makes it hard to design definitive experiments. This dilemma is common in science, and one tried-and-true strategy is to work with model systems. We propose a model system for this problem, namely a database of genes integrating diverse data across organisms, and describe an experiment using this model.
RESULTS: We attempted to construct a database of human and mouse genes integrating data from GenBank and the human and mouse genome-databases. We discovered numerous errors in these well-respected databases: approximately 15% of genes are apparently missing from the genome-databases; links between the sequence and genome-databases are missing for another 5-10% of the cases; about a third of likely homology links are missing between the genome-databases; 10-20% of entries classified as 'genes' are apparently misclassified. By using a model system, we were able to study the problems caused by anomalous data without having to face all the hard problems of database integration. CONTACT: nat@jax.org

Entities:  

Mesh:

Year:  1998        PMID: 9730922     DOI: 10.1093/bioinformatics/14.7.575

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  Remote data retrieval for bioinformatics applications: an agent migration approach.

Authors:  Lei Gao; Hua Dai; Tong-Liang Zhang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-06-20       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.