Literature DB >> 24352525

Parser Combinators: a Practical Application for Generating Parsers for NMR Data.

Matthew Fenwick1, Gerard Weatherby2, Heidi Jc Ellis, Michael R Gryk.   

Abstract

Nuclear Magnetic Resonance (NMR) spectroscopy is a technique for acquiring protein data at atomic resolution and determining the three-dimensional structure of large protein molecules. A typical structure determination process results in the deposition of a large data sets to the BMRB (Bio-Magnetic Resonance Data Bank). This data is stored and shared in a file format called NMR-Star. This format is syntactically and semantically complex making it challenging to parse. Nevertheless, parsing these files is crucial to applying the vast amounts of biological information stored in NMR-Star files, allowing researchers to harness the results of previous studies to direct and validate future work. One powerful approach for parsing files is to apply a Backus-Naur Form (BNF) grammar, which is a high-level model of a file format. Translation of the grammatical model to an executable parser may be automatically accomplished. This paper will show how we applied a model BNF grammar of the NMR-Star format to create a free, open-source parser, using a method that originated in the functional programming world known as "parser combinators". This paper demonstrates the effectiveness of a principled approach to file specification and parsing. This paper also builds upon our previous work [1], in that 1) it applies concepts from Functional Programming (which is relevant even though the implementation language, Java, is more mainstream than Functional Programming), and 2) all work and accomplishments from this project will be made available under standard open source licenses to provide the community with the opportunity to learn from our techniques and methods.

Entities:  

Keywords:  Java; NMR; NMR-Star; component; functional-programming; parsing

Year:  2013        PMID: 24352525      PMCID: PMC3859343          DOI: 10.1109/ITNG.2013.39

Source DB:  PubMed          Journal:  Proc Int Conf Inf Technol New Gener


  4 in total

1.  Iterative Development of an Application to Support Nuclear Magnetic Resonance Data Analysis of Proteins.

Authors:  Heidi J C Ellis; Ronald J Nowling; Jay Vyas; Timothy O Martyn; Michael R Gryk
Journal:  Proc Int Conf Inf Technol New Gener       Date:  2011-04-11

2.  Extensions to the STAR File syntax.

Authors:  Nick Spadaccini; Sydney R Hall
Journal:  J Chem Inf Model       Date:  2012-07-31       Impact factor: 4.956

3.  An Open-Source Sandbox for Increasing the Accessibility of Functional Programming to the Bioinformatics and Scientific Communities.

Authors:  Matthew Fenwick; Colbert Sesanker; Martin R Schiller; Heidi Jc Ellis; M Lee Hinman; Jay Vyas; Michael R Gryk
Journal:  Proc Int Conf Inf Technol New Gener       Date:  2012

4.  BioMagResBank.

Authors:  Eldon L Ulrich; Hideo Akutsu; Jurgen F Doreleijers; Yoko Harano; Yannis E Ioannidis; Jundong Lin; Miron Livny; Steve Mading; Dimitri Maziuk; Zachary Miller; Eiichi Nakatani; Christopher F Schulte; David E Tolmie; R Kent Wenger; Hongyang Yao; John L Markley
Journal:  Nucleic Acids Res       Date:  2007-11-04       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.