QGB: a system for querying sequence database fields and features.

G C Overton, J S Aaronson, J Haas, J Adams.

Abstract

We have developed a general system, QGB, for performing complex queries on the information in the DDBJ/EMBL/GenBank databases, including queries over the structural features of sequences implied in the FEATURE TABLE. Queries are formed in a Structured Query Language (SQL)-like syntax with language extensions to support complex types (e.g., sets, ordered sets, and records) appropriate for representing and querying sequence data. A novel aspect of QGB is its ability to deduce missing features and infer relationships among features as a consequence of constructing a parse tree of sequence structure from information described in the FEATURE TABLE. The grammar for the parse tree is implemented in a customized form of the Definite Clause Grammar syntax of the logic programming language Prolog. The logic grammar formalism was chosen because it provides a perspicuous representation for features and constraints, and Prolog provides an execution model for the grammar rules. Construction of the parse tree also identifies inconsistencies and errors in the FEATURE TABLE that can in some cases be corrected automatically and used to generate an augmented version of the table.
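The idea of deducing missing features from those that are annotated can be illustrated with a minimal Python sketch. This is not QGB's grammar, which the abstract describes as a customized Definite Clause Grammar in Prolog. The `Feature` record, the `infer_introns` function, and the rule that a gap between two consecutive exons is an intron are all hypothetical, chosen only to show the kind of inference and consistency check involved.

```python
from typing import List, NamedTuple


class Feature(NamedTuple):
    """A simplified feature-table entry (hypothetical, not the real format)."""
    key: str    # feature key, e.g. "exon" or "intron"
    start: int  # 1-based inclusive start coordinate
    end: int    # 1-based inclusive end coordinate


def infer_introns(features: List[Feature]) -> List[Feature]:
    """Return introns implied by gaps between consecutive exons.

    Assumption for illustration only: any gap between two neighbouring
    exons is an intron. Introns that are already annotated are not
    reported again. Overlapping exons are treated as an inconsistency.
    """
    exons = sorted((f for f in features if f.key == "exon"),
                   key=lambda f: f.start)
    annotated = {(f.start, f.end) for f in features if f.key == "intron"}

    inferred = []
    for left, right in zip(exons, exons[1:]):
        if right.start <= left.end:
            # Consistency check: two exons may not overlap.
            raise ValueError(f"overlapping exons: {left} and {right}")
        gap = (left.end + 1, right.start - 1)
        # An empty gap (adjacent exons) implies no intron.
        if gap[0] <= gap[1] and gap not in annotated:
            inferred.append(Feature("intron", gap[0], gap[1]))
    return inferred
```

Given exons at 1..100, 201..300, and 401..500 with only the first intron annotated, the function reports the second intron (301..400) as missing. A full grammar-based approach would cover many more feature types and constraints than this single rule.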

Year: 1994   PMID: 8790449   DOI: 10.1089/cmb.1994.1.3

Source DB: PubMed   Journal: J Comput Biol   ISSN: 1066-5277   Impact factor: 1.479


  1 in total

1.  EpoDB: a database of genes expressed during vertebrate erythropoiesis.

Authors:  F Salas; J Haas; B Brunk; C J Stoeckert; G C Overton
Journal:  Nucleic Acids Res       Date:  1998-01-01       Impact factor: 16.971

