Literature DB >> 17044405

Automated structure extraction and XML conversion of life science database flat files.

Stephan Philippi1, Jacob Köhler.   

Abstract

In the light of the increasing number of biological databases, their integration is a fundamental prerequisite for answering complex biological questions. Database integration, therefore, is an important area of research in bioinformatics. Since most of the publicly available life science databases are still exclusively exchanged by means of proprietary flat files, database integration requires parsers for very different flat file formats. Unfortunately, the development and maintenance of database specific flat file parsers is a nontrivial and time-consuming task, which takes considerable effort in large-scale integration scenarios. This paper introduces heuristically based concepts for automatic structure extraction from life science database flat files. On the basis of these concepts the FlatEx prototype is developed for the automatic conversion of flat files into XML representations.

Mesh:

Year:  2006        PMID: 17044405     DOI: 10.1109/titb.2006.875653

Source DB:  PubMed          Journal:  IEEE Trans Inf Technol Biomed        ISSN: 1089-7771


  1 in total

1.  Dynamic integration of biological data sources using the data concierge.

Authors:  Peng Gong
Journal:  Health Inf Sci Syst       Date:  2013-02-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.