
Design and evaluation of bonded atom pair descriptors.

Hany E A Ahmed, Martin Vogt, Jürgen Bajorath.

Abstract

Atom pairs have been among the first systematically derived fragment-type topological descriptors and have been one of the origins of two-dimensional fingerprint searching. These descriptors continue to be popular and widely used to this date. Herein we introduce a new type of atom pair descriptors, bonded atom pairs, that exclusively capture short-range atom environment information and, thus, depart in their design from other topological descriptors that enumerate bond paths of varying length. Bonded atom pairs combine different types of structural information including element type, hybridization state, aliphatic/aromatic character, and cyclic/acyclic arrangement. Systematic design led to a set of 117 bonded atom pairs, all of which exist in synthetic compounds. A further expanded bonded atom pair set accounting for specific halogen atoms and including a total of 159 descriptors is also provided. Atom pair distribution and frequency analysis in sets of compounds having different selectivity reveals that both conventional and bonded atom pairs capture complementary structural information. In similarity searching, bonded atom pairs meet or exceed the performance of standard atom pairs and structural fragment fingerprints. The complementary nature of structural information captured by atom pairs of different design is also reflected by individual search calculations. Taken together, our findings indicate that bonded atom pairs extend the current repertoire of topological molecular descriptors.
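To illustrate the descriptor idea described in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes each atom is already annotated with the four properties the abstract names (element type, hybridization state, aliphatic/aromatic character, cyclic/acyclic arrangement) and counts label pairs over directly bonded atoms only. The `Atom` type, the label string format, and the function names are hypothetical, and the 117-descriptor set itself is not reproduced. The abstract does not name a similarity metric, so the Tanimoto coefficient used here is an assumption.

```python
from collections import Counter
from typing import NamedTuple


class Atom(NamedTuple):
    element: str        # e.g. "C", "N", "O"
    hybridization: str  # e.g. "sp2", "sp3"
    aromatic: bool      # aromatic vs. aliphatic character
    in_ring: bool       # cyclic vs. acyclic arrangement


def atom_label(atom):
    """Encode one atom's properties as a string (hypothetical format)."""
    kind = "ar" if atom.aromatic else "al"
    ring = "cyc" if atom.in_ring else "acyc"
    return f"{atom.element}.{atom.hybridization}.{kind}.{ring}"


def bonded_atom_pairs(atoms, bonds):
    """Count order-independent label pairs over directly bonded atoms only."""
    counts = Counter()
    for i, j in bonds:
        pair = tuple(sorted((atom_label(atoms[i]), atom_label(atoms[j]))))
        counts[pair] += 1
    return counts


def tanimoto(a, b):
    """Tanimoto coefficient on the sets of pair types present (assumed metric)."""
    keys_a, keys_b = set(a), set(b)
    union = keys_a | keys_b
    return len(keys_a & keys_b) / len(union) if union else 1.0


# Hand-annotated heavy-atom graphs; a real workflow would derive these
# annotations with a cheminformatics toolkit.
ring_c = Atom("C", "sp2", True, True)
sp3_c = Atom("C", "sp3", False, False)
sp3_o = Atom("O", "sp3", False, False)

ethanol_atoms = [sp3_c, sp3_c, sp3_o]
ethanol_bonds = [(0, 1), (1, 2)]

ethylbenzene_atoms = [ring_c] * 6 + [sp3_c, sp3_c]
ethylbenzene_bonds = [(i, (i + 1) % 6) for i in range(6)] + [(0, 6), (6, 7)]

ethanol_pairs = bonded_atom_pairs(ethanol_atoms, ethanol_bonds)
ethylbenzene_pairs = bonded_atom_pairs(ethylbenzene_atoms, ethylbenzene_bonds)
print(tanimoto(ethanol_pairs, ethylbenzene_pairs))
```

Because only bonded neighbors are paired, the descriptor captures short-range environment information and ignores longer bond paths, which is the design difference from conventional atom pairs that the abstract emphasizes.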

Year:  2010        PMID: 20232887     DOI: 10.1021/ci900512g

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  4 in total

1.  2D-Qsar for 450 types of amino acid induction peptides with a novel substructure pair descriptor having wider scope.

Authors:  Tsutomu Osoda; Satoru Miyano
Journal:  J Cheminform       Date:  2011-11-02       Impact factor: 5.514

2.  An alphabetic code based atomic level molecular similarity search in databases.

Authors:  Nallusamy Saranya; Samuel Selvaraj
Journal:  Bioinformation       Date:  2012-06-16

3.  Introducing a Chemically Intuitive Core-Substituent Fingerprint Designed to Explore Structural Requirements for Effective Similarity Searching and Machine Learning.

Authors:  Tiago Janela; Kosuke Takeuchi; Jürgen Bajorath
Journal:  Molecules       Date:  2022-04-04       Impact factor: 4.411

4.  Systematic benchmark of substructure search in molecular graphs - From Ullmann to VF2.

Authors:  Hans-Christian Ehrlich; Matthias Rarey
Journal:  J Cheminform       Date:  2012-07-31       Impact factor: 5.514
