
BIOFFORC: tool development for biological file format conversion.

Chinnaiah Swaminathan Vinobha, Maruthamuthu Rajadurai, Ekambaram Rajasekaran.

Abstract

Bioinformatics tools require sequences in different formats, and each tool accepts only a specific set of formats. A sequence stored in one format is therefore often needed in another, which creates a need for sequence format conversion. A number of such tools are available in the public domain. Here, we describe BIOFFORC, a file format converter developed in PERL with a graphical user interface. AVAILABILITY: http://www.winningpath.com/biofforc/

Keywords:  bioinformatics; conversion; format; read; sequence; tools; write

Year:  2008        PMID: 19238194      PMCID: PMC2639671          DOI: 10.6026/97320630003098

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

The basic format underlying the information in DDBJ/EMBL/GenBank is a flat file. The correspondence between the individual flat file formats facilitates the exchange of data among these databases [1]. GenBank describes each sequence entry with literature references, functional data, the locations of mRNA coding regions and the sequence itself [2]. Similarly, EMBL [3] and DDBJ [4] are also resources for biological and medical research data. Sequence file format conversion with tools such as READSEQ [5], FMTSEQ [6] and SeqVerter [7] has been described in detail. Here, we describe BIOFFORC as a file format converter.

Methodology

Process flow

A process flow for the tool is shown in Figure 1.
Figure 1. Process flow diagram for the BIOFFORC tool.

Web interface

The current version of BIOFFORC is a web-based tool that uses a common gateway interface (CGI) developed in PERL.

Input and output

Figure 2 shows the tool converting a GenBank-format record to FASTA format. The top panel shows the input in GenBank format and the bottom panel shows the output in FASTA format.
Figure 2. Sample input to and output from BIOFFORC.
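
To make the GenBank-to-FASTA conversion concrete, the following is a minimal sketch in Python of how such a conversion can work. It is not BIOFFORC's own code (the tool itself is written in PERL and its source is not reproduced here). The function name `genbank_to_fasta`, the choice of header fields (accession followed by definition) and the sample record are assumptions made purely for illustration. The sketch handles a single record and relies only on the standard GenBank flat-file keywords DEFINITION, ACCESSION and ORIGIN and the `//` end-of-record marker.

```python
# Illustrative sketch only: a hypothetical converter, not BIOFFORC's implementation.

# A made-up GenBank-style record used purely as demo input.
SAMPLE_RECORD = "\n".join([
    "LOCUS       DEMO0001                  24 bp    DNA     linear   SYN 01-JAN-2008",
    "DEFINITION  Hypothetical demo sequence for",
    "            illustration only.",
    "ACCESSION   DEMO0001",
    "ORIGIN",
    "        1 gatcctccat atacaacggt atct",
    "//",
])


def genbank_to_fasta(genbank_text: str, width: int = 60) -> str:
    """Convert one GenBank flat-file record to a FASTA record."""
    accession = ""
    definition_parts = []
    sequence_parts = []
    section = None  # which multi-line section we are currently inside

    for line in genbank_text.splitlines():
        if line.startswith("//"):  # end-of-record marker
            break
        if line.startswith("ORIGIN"):  # sequence data starts on the next line
            section = "origin"
            continue
        if section == "origin":
            # Sequence lines look like "        1 gatcctccat atacaacggt";
            # keep the letters, drop the position number and spaces.
            sequence_parts.append("".join(ch for ch in line if ch.isalpha()))
        elif line.startswith("DEFINITION"):
            section = "definition"
            definition_parts.append(line[len("DEFINITION"):].strip())
        elif line.startswith("ACCESSION"):
            section = None
            fields = line.split()
            accession = fields[1] if len(fields) > 1 else ""
        elif section == "definition" and line.startswith(" "):
            definition_parts.append(line.strip())  # continuation line
        else:
            section = None  # any other keyword ends the current section

    sequence = "".join(sequence_parts).upper()
    if not sequence:
        raise ValueError("no ORIGIN sequence found in record")

    description = " ".join(definition_parts)
    header = ">" + " ".join(part for part in (accession, description) if part)
    # Wrap the sequence into fixed-width lines, as is conventional for FASTA.
    chunks = [sequence[i:i + width] for i in range(0, len(sequence), width)]
    return header + "\n" + "\n".join(chunks) + "\n"


if __name__ == "__main__":
    print(genbank_to_fasta(SAMPLE_RECORD, width=10), end="")
```

Run as a script, this prints a FASTA header line built from the accession and definition, followed by the sequence in upper case wrapped at ten residues per line. A production converter would also need to handle multi-record files, feature tables and protein records, none of which this sketch attempts.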

Caveats and Future development

The present version of BIOFFORC supports conversion among four formats. We propose to extend its conversion capability to several other required formats.
  3 in total

1.  The EMBL Nucleotide Sequence and Genome Reviews Databases.

Authors:  Peter Sterk; Tamara Kulikova; Paul Kersey; Rolf Apweiler
Journal:  Methods Mol Biol       Date:  2007

2.  Sequence file format conversion with command-line readseq.

Authors:  Don Gilbert
Journal:  Curr Protoc Bioinformatics       Date:  2003-02

3.  GenBank.

Authors:  Dennis A Benson; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; David L Wheeler
Journal:  Nucleic Acids Res       Date:  2007-12-11       Impact factor: 16.971
