Literature DB >> 29941895

The Biological Object Notation (BON): a structured file format for biological data.

Jan P Buchmann1, Mathieu Fourment2, Edward C Holmes3.   

Abstract

The large size and high complexity of biological data can represent a major methodological challenge for the analysis and exchange of data sets between computers and applications. There has also been a substantial increase in the amount of metadata associated with biological data sets, which is being increasingly incorporated into existing data formats. Despite the existence of structured formats based on XML, biological data sets are mainly formatted using unstructured file formats, and the incorporation of metadata results in increasingly complex parsing routines such that they become more error prone. To overcome these problems, we present the "biological object notation" (BON) format, a new way to exchange and parse nearly all biological data sets more efficiently and with less error than other currently available formats. Based on JavaScript Object Notation (JSON), BON simplifies parsing by clearly separating the biological data from its metadata and reduces complexity compared to XML based formats. The ability to selectively compress data up to 87% compared to other file formats and the reduced complexity results in improved transfer times and less error prone applications.

Entities:  

Year:  2018        PMID: 29941895      PMCID: PMC6018389          DOI: 10.1038/s41598-018-28016-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  13 in total

1.  NEXUS: an extensible file format for systematic information.

Authors:  D R Maddison; D L Swofford; W P Maddison
Journal:  Syst Biol       Date:  1997-12       Impact factor: 15.683

2.  LFQC: a lossless compression algorithm for FASTQ files.

Authors:  Marius Nicolae; Sudipta Pathak; Sanguthevar Rajasekaran
Journal:  Bioinformatics       Date:  2015-06-20       Impact factor: 6.937

3.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

4.  Jalview Version 2--a multiple sequence alignment editor and analysis workbench.

Authors:  Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton
Journal:  Bioinformatics       Date:  2009-01-16       Impact factor: 6.937

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

Review 6.  The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Authors:  Peter J A Cock; Christopher J Fields; Naohisa Goto; Michael L Heuer; Peter M Rice
Journal:  Nucleic Acids Res       Date:  2009-12-16       Impact factor: 16.971

7.  BioXSD: the common data-exchange format for everyday bioinformatics web services.

Authors:  Matús Kalas; Pål Puntervoll; Alexandre Joseph; Edita Bartaseviciūte; Armin Töpfer; Prabakar Venkataraman; Steve Pettifer; Jan Christian Bryne; Jon Ison; Christophe Blanchet; Kristoffer Rapacki; Inge Jonassen
Journal:  Bioinformatics       Date:  2010-09-15       Impact factor: 6.937

8.  NeXML: rich, extensible, and verifiable representation of comparative data and metadata.

Authors:  Rutger A Vos; James P Balhoff; Jason A Caravas; Mark T Holder; Hilmar Lapp; Wayne P Maddison; Peter E Midford; Anurag Priyam; Jeet Sukumaran; Xuhua Xia; Arlin Stoltzfus
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

9.  Integrating genomic information with protein sequence and 3D atomic level structure at the RCSB protein data bank.

Authors:  Andreas Prlic; Tara Kalro; Roshni Bhattacharya; Cole Christie; Stephen K Burley; Peter W Rose
Journal:  Bioinformatics       Date:  2016-08-22       Impact factor: 6.937

10.  A Critical Review on the Use of Support Values in Tree Viewers and Bioinformatics Toolkits.

Authors:  Lucas Czech; Jaime Huerta-Cepas; Alexandros Stamatakis
Journal:  Mol Biol Evol       Date:  2017-06-01       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.