Literature DB >> 7804870

SORTEZ: a relational translator for NCBI's ASN.1 database.

K W Hart1, D B Searls, G C Overton.   

Abstract

The National Center for Biotechnology Information (NCBI) has created a database collection that includes several protein and nucleic acid sequence databases, a biosequence-specific subset of MEDLINE, as well as value-added information such as links between similar sequences. Information in the NCBI database is modeled in Abstract Syntax Notation 1 (ASN.1) an Open Systems Interconnection protocol designed for the purpose of exchanging structured data between software applications rather than as a data model for database systems. While the NCBI database is distributed with an easy-to-use information retrieval system, ENTREZ, the ASN.1 data model currently lacks an ad hoc query language for general-purpose data access. For that reason, we have developed a software package, SORTEZ, that transforms the ASN.1 database (or other databases with nested data structures) to a relational data model and subsequently to a relational database management system (Sybase) where information can be accessed through the relational query language, SQL. Because the need to transform data from one data model and schema to another arises naturally in several important contexts, including efficient execution of specific applications, access to multiple databases and adaptation to database evolution this work also serves as a practical study of the issues involved in the various stages of database transformation. We show that transformation from the ASN.1 data model to a relational data model can be largely automated, but that schema transformation and data conversion require considerable domain expertise and would greatly benefit from additional support tools.

Entities:  

Mesh:

Year:  1994        PMID: 7804870     DOI: 10.1093/bioinformatics/10.4.369

Source DB:  PubMed          Journal:  Comput Appl Biosci        ISSN: 0266-7061


  3 in total

1.  BIOSPIDA: A Relational Database Translator for NCBI.

Authors:  Matthew S Hagen; Eva K Lee
Journal:  AMIA Annu Symp Proc       Date:  2010-11-13

2.  Proteomic survey of metabolic pathways in rice.

Authors:  Antonius Koller; Michael P Washburn; B Markus Lange; Nancy L Andon; Cosmin Deciu; Paul A Haynes; Lara Hays; David Schieltz; Ryan Ulaszek; Jing Wei; Dirk Wolters; John R Yates
Journal:  Proc Natl Acad Sci U S A       Date:  2002-08-05       Impact factor: 11.205

3.  Identification of minimal eukaryotic introns through GeneBase, a user-friendly tool for parsing the NCBI Gene databank.

Authors:  Allison Piovesan; Maria Caracausi; Marco Ricci; Pierluigi Strippoli; Lorenza Vitale; Maria Chiara Pelleri
Journal:  DNA Res       Date:  2015-11-17       Impact factor: 4.458

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.