Literature DB >> 11038333

Object-oriented parsing of biological databases with Python.

C Ramu1, C Gemünd, T J Gibson.   

Abstract

MOTIVATION: While database activities in the biological area are increasing rapidly, rather little is done in the area of parsing them in a simple and object-oriented way.
RESULTS: We present here an elegant, simple yet powerful way of parsing biological flat-file databases. We have taken EMBL, SWISSPROT and GENBANK as examples. EMBL and SWISS-PROT do not differ much in the format structure. GENBANK has a very different format structure than EMBL and SWISS-PROT. Extracting the desired fields in an entry (for example a sub-sequence with an associated feature) for later analysis is a constant need in the biological sequence-analysis community: this is illustrated with tools to make new splice-site databases. The interface to the parser is abstract in the sense that the access to all the databases is independent from their different formats, since parsing instructions are hidden.

Mesh:

Year:  2000        PMID: 11038333     DOI: 10.1093/bioinformatics/16.7.628

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  3 in total

1.  SIRW: A web server for the Simple Indexing and Retrieval System that combines sequence motif searches with keyword searches.

Authors:  Chenna Ramu
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  BLAST2SRS, a web server for flexible retrieval of related protein sequences in the SWISS-PROT and SPTrEMBL databases.

Authors:  Konstantinos Bimpikis; Aidan Budd; Rune Linding; Toby J Gibson
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Phospho.ELM: a database of phosphorylation sites--update 2008.

Authors:  Francesca Diella; Cathryn M Gould; Claudia Chica; Allegra Via; Toby J Gibson
Journal:  Nucleic Acids Res       Date:  2007-10-25       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.