
Molecular query language (MQL)--a context-free grammar for substructure matching.

Ewgenij Proschak, Jörg K Wegner, Andreas Schüller, Gisbert Schneider, Uli Fechner.

Abstract

We have developed a Java library for substructure matching that features easy-to-read syntax and extensibility. This molecular query language (MQL) is based on a context-free grammar, which allows for straightforward modification and extension. The formal description of MQL is provided in this paper. Molecule primitives are atoms, bonds, properties, branching, and rings. User-defined features can be added via a Java interface. In MQL, molecules are represented as graphs. Substructure matching was implemented using the Ullmann algorithm because of its favorable run-time performance. The Ullmann algorithm carries out a fast subgraph isomorphism search by combining backtracking with effective forward checking. The MQL software design was driven by the aim of facilitating the use of various cheminformatics toolkits. Two Java interfaces provide a bridge from our MQL package to an external toolkit: the first provides the matching rules for every feature of a particular toolkit; the second converts a found match from the internal format of MQL to the format of the external toolkit. We have already implemented these interfaces for the Chemistry Development Toolkit.
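
The combination of backtracking and forward checking mentioned in the abstract can be illustrated with a minimal Java sketch of Ullmann-style matching. This is not the MQL library's code or API: the class name UllmannSketch, its methods, and the representation of molecules as element-symbol arrays plus boolean adjacency matrices are assumptions made for illustration, and bond orders, atom properties, and ring primitives are ignored.

```java
/** Illustrative Ullmann-style substructure matcher (hypothetical, not the MQL API). */
final class UllmannSketch {

    /** Builds a symmetric adjacency matrix from a list of bonds {a, b}. */
    static boolean[][] adjacency(int size, int[][] bonds) {
        boolean[][] adj = new boolean[size][size];
        for (int[] b : bonds) { adj[b[0]][b[1]] = true; adj[b[1]][b[0]] = true; }
        return adj;
    }

    /** Returns map[queryAtom] = targetAtom for the first match found, or null. */
    static int[] findMatch(String[] qLabels, boolean[][] qAdj,
                           String[] tLabels, boolean[][] tAdj) {
        int n = qLabels.length, m = tLabels.length;
        // Candidate matrix: same element and enough neighbours in the target.
        boolean[][] cand = new boolean[n][m];
        for (int i = 0; i < n; i++)
            for (int j = 0; j < m; j++)
                cand[i][j] = qLabels[i].equals(tLabels[j])
                        && degree(qAdj[i]) <= degree(tAdj[j]);
        int[] map = new int[n];
        return search(0, cand, qAdj, tAdj, map) ? map : null;
    }

    private static int degree(boolean[] row) {
        int d = 0;
        for (boolean b : row) if (b) d++;
        return d;
    }

    /** Backtracking: assign query atom 'depth', prune candidates, then recurse. */
    private static boolean search(int depth, boolean[][] cand,
                                  boolean[][] qAdj, boolean[][] tAdj, int[] map) {
        int n = qAdj.length, m = tAdj.length;
        if (depth == n) return true;
        for (int j = 0; j < m; j++) {
            if (!cand[depth][j]) continue;
            boolean[][] next = new boolean[n][];
            for (int i = 0; i < n; i++) next[i] = cand[i].clone();
            for (int k = 0; k < m; k++) next[depth][k] = (k == j); // fix depth -> j
            for (int i = 0; i < n; i++)
                if (i != depth) next[i][j] = false;                // keep mapping injective
            if (refine(next, qAdj, tAdj)) {                        // forward checking
                map[depth] = j;
                if (search(depth + 1, next, qAdj, tAdj, map)) return true;
            }
        }
        return false; // no candidate worked: backtrack
    }

    /** Removes candidates whose neighbours cannot be matched; false if a row empties. */
    private static boolean refine(boolean[][] cand, boolean[][] qAdj, boolean[][] tAdj) {
        boolean changed = true;
        while (changed) {
            changed = false;
            for (int i = 0; i < qAdj.length; i++) {
                boolean rowHasCandidate = false;
                for (int j = 0; j < tAdj.length; j++) {
                    if (!cand[i][j]) continue;
                    if (supported(i, j, cand, qAdj, tAdj)) {
                        rowHasCandidate = true;
                    } else {
                        cand[i][j] = false;
                        changed = true;
                    }
                }
                if (!rowHasCandidate) return false;
            }
        }
        return true;
    }

    /** Every query neighbour of i needs a candidate among the target neighbours of j. */
    private static boolean supported(int i, int j, boolean[][] cand,
                                     boolean[][] qAdj, boolean[][] tAdj) {
        for (int x = 0; x < qAdj.length; x++) {
            if (!qAdj[i][x]) continue;
            boolean found = false;
            for (int y = 0; y < tAdj.length && !found; y++)
                found = tAdj[j][y] && cand[x][y];
            if (!found) return false;
        }
        return true;
    }
}
```

As a usage example under the same assumptions, matching the query C-O against the heavy-atom graph of ethanol (atoms C, C, O with bonds 0-1 and 1-2) via UllmannSketch.findMatch returns the mapping [1, 2]: the query carbon maps to the carbon bonded to oxygen, and the query oxygen maps to the target oxygen. The refinement step rejects the terminal carbon without exploring any deeper assignments, which is the forward-checking effect the abstract refers to.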

Year:  2007        PMID: 17381167     DOI: 10.1021/ci600305h

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  6 in total

1.  Jmol SMILES and Jmol SMARTS: specifications and applications.

Authors:  Robert M Hanson
Journal:  J Cheminform       Date:  2016-09-26       Impact factor: 5.514

2.  PatternQuery: web application for fast detection of biomacromolecular structural patterns in the entire Protein Data Bank.

Authors:  David Sehnal; Lukáš Pravda; Radka Svobodová Vařeková; Crina-Maria Ionescu; Jaroslav Koča
Journal:  Nucleic Acids Res       Date:  2015-05-26       Impact factor: 16.971

3.  Biological time series analysis using a context free language: applicability to pulsatile hormone data.

Authors:  Dennis A Dean; Gail K Adler; David P Nguyen; Elizabeth B Klerman
Journal:  PLoS One       Date:  2014-09-03       Impact factor: 3.240

4.  Ambit-SMIRKS: a software module for reaction representation, reaction search and structure transformation.

Authors:  Nikolay Kochev; Svetlana Avramova; Nina Jeliazkova
Journal:  J Cheminform       Date:  2018-08-20       Impact factor: 5.514

5.  Structure-based classification and ontology in chemistry.

Authors:  Janna Hastings; Despoina Magka; Colin Batchelor; Lian Duan; Robert Stevens; Marcus Ennis; Christoph Steinbeck
Journal:  J Cheminform       Date:  2012-04-05       Impact factor: 5.514

6.  PubChem atom environments.

Authors:  Volker D Hähnke; Evan E Bolton; Stephen H Bryant
Journal:  J Cheminform       Date:  2015-08-19       Impact factor: 5.514
