| Literature DB >> 17044405 |
Stephan Philippi1, Jacob Köhler.
Abstract
In the light of the increasing number of biological databases, their integration is a fundamental prerequisite for answering complex biological questions. Database integration, therefore, is an important area of research in bioinformatics. Since most of the publicly available life science databases are still exclusively exchanged by means of proprietary flat files, database integration requires parsers for very different flat file formats. Unfortunately, the development and maintenance of database specific flat file parsers is a nontrivial and time-consuming task, which takes considerable effort in large-scale integration scenarios. This paper introduces heuristically based concepts for automatic structure extraction from life science database flat files. On the basis of these concepts the FlatEx prototype is developed for the automatic conversion of flat files into XML representations.Mesh:
Year: 2006 PMID: 17044405 DOI: 10.1109/titb.2006.875653
Source DB: PubMed Journal: IEEE Trans Inf Technol Biomed ISSN: 1089-7771