Literature DB >> 23167287

Models for identification of erroneous atom-to-atom mapping of reactions performed by automated algorithms.

Christophe Muller1, Gilles Marcou, Dragos Horvath, João Aires-de-Sousa, Alexandre Varnek.   

Abstract

Machine learning (SVM and JRip rule learner) methods have been used in conjunction with the Condensed Graph of Reaction (CGR) approach to identify errors in the atom-to-atom mapping of chemical reactions produced by an automated mapping tool by ChemAxon. The modeling has been performed on the three first enzymatic classes of metabolic reactions from the KEGG database. Each reaction has been converted into a CGR representing a pseudomolecule with conventional (single, double, aromatic, etc.) bonds and dynamic bonds characterizing chemical transformations. The ChemAxon tool was used to automatically detect the matching atom pairs in reagents and products. These automated mappings were analyzed by the human expert and classified as "correct" or "wrong". ISIDA fragment descriptors generated for CGRs for both correct and wrong mappings were used as attributes in machine learning. The learned models have been validated in n-fold cross-validation on the training set followed by a challenge to detect correct and wrong mappings within an external test set of reactions, never used for learning. Results show that both SVM and JRip models detect most of the wrongly mapped reactions. We believe that this approach could be used to identify erroneous atom-to-atom mapping performed by any automated algorithm.

Entities:  

Mesh:

Year:  2012        PMID: 23167287     DOI: 10.1021/ci300418q

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  5 in total

1.  Structure-reactivity modeling using mixture-based representation of chemical reactions.

Authors:  Pavel Polishchuk; Timur Madzhidov; Timur Gimadiev; Andrey Bodrov; Ramil Nugmanov; Alexandre Varnek
Journal:  J Comput Aided Mol Des       Date:  2017-07-27       Impact factor: 3.686

2.  Assessment of tautomer distribution using the condensed reaction graph approach.

Authors:  T R Gimadiev; T I Madzhidov; R I Nugmanov; I I Baskin; I S Antipin; A Varnek
Journal:  J Comput Aided Mol Des       Date:  2018-01-29       Impact factor: 3.686

3.  Comparison of logP and logD correction models trained with public and proprietary data sets.

Authors:  Ignacio Aliagas; Alberto Gobbi; Man-Ling Lee; Benjamin D Sellers
Journal:  J Comput Aided Mol Des       Date:  2022-04-01       Impact factor: 3.686

4.  Atom mapping with constraint programming.

Authors:  Martin Mann; Feras Nahar; Norah Schnorr; Rolf Backofen; Peter F Stadler; Christoph Flamm
Journal:  Algorithms Mol Biol       Date:  2014-11-29       Impact factor: 1.405

Review 5.  Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction.

Authors:  Esther Heid; William H Green
Journal:  J Chem Inf Model       Date:  2021-11-04       Impact factor: 6.162

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.