Literature DB >> 20666407

Tautomer identification and tautomer structure generation based on the InChI code.

Torsten Thalheim1, Armin Vollmer, Ralf-Uwe Ebert, Ralph Kühne, Gerrit Schüürmann.   

Abstract

An algorithm is introduced that enables a fast generation of all possible prototropic tautomers resulting from the mobile H atoms and associated heteroatoms as defined in the InChI code. The InChI-derived set of possible tautomers comprises (1,3)-shifts for open-chain molecules and (1,n)-shifts (with n being an odd number >3) for ring systems. In addition, our algorithm includes also, as extension to the InChI scope, those larger (1,n)-shifts that can be constructed from joining separate but conjugated InChI sequences of tautomer-active heteroatoms. The developed algorithm is described in detail, with all major steps illustrated through explicit examples. Application to approximately 72,500 organic compounds taken from EINECS (European Inventory of Existing Commercial Chemical Substances) shows that around 11% of the substances occur in different heteroatom-prototropic tautomeric forms. Additional QSAR (quantitative structure-activity relationship) predictions of their soil sorption coefficient and water solubility reveal variations across tautomers up to more than two and 4 orders of magnitude, respectively. For a small subset of nine compounds, analysis of quantum chemically predicted tautomer energies supports the view that among all tautomers of a given compound, those restricted to H atom exchanges between heteroatoms usually include the thermodynamically most stable structures.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20666407     DOI: 10.1021/ci1001179

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  5 in total

1.  Many InChIs and quite some feat.

Authors:  Wendy A Warr
Journal:  J Comput Aided Mol Des       Date:  2015-06-17       Impact factor: 3.686

2.  Towards a Universal SMILES representation - A standard method to generate canonical SMILES based on the InChI.

Authors:  Noel M O'Boyle
Journal:  J Cheminform       Date:  2012-09-18       Impact factor: 5.514

3.  Interoperable chemical structure search service.

Authors:  Miroslav Kratochvíl; Jiří Vondrášek; Jakub Galgonek
Journal:  J Cheminform       Date:  2019-06-28       Impact factor: 5.514

4.  Applications of the InChI in cheminformatics with the CDK and Bioclipse.

Authors:  Ola Spjuth; Arvid Berg; Samuel Adams; Egon L Willighagen
Journal:  J Cheminform       Date:  2013-03-13       Impact factor: 5.514

5.  Experimental and Chemoinformatics Study of Tautomerism in a Database of Commercially Available Screening Samples.

Authors:  Laura Guasch; Waruna Yapamudiyansel; Megan L Peach; James A Kelley; Joseph J Barchi; Marc C Nicklaus
Journal:  J Chem Inf Model       Date:  2016-10-16       Impact factor: 4.956

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.