Literature DB >> 24044748

Noncontiguous atom matching structural similarity function.

Ana L Teixeira1, Andre O Falcao.   

Abstract

Measuring similarity between molecules is a fundamental problem in cheminformatics. Given that similar molecules tend to have similar physical, chemical, and biological properties, the notion of molecular similarity plays an important role in the exploration of molecular data sets, query-retrieval in molecular databases, and in structure-property/activity modeling. Various methods to define structural similarity between molecules are available in the literature, but so far none has been used with consistent and reliable results for all situations. We propose a new similarity method based on atom alignment for the analysis of structural similarity between molecules. This method is based on the comparison of the bonding profiles of atoms on comparable molecules, including features that are seldom found in other structural or graph matching approaches like chirality or double bond stereoisomerism. The similarity measure is then defined on the annotated molecular graph, based on an iterative directed graph similarity procedure and optimal atom alignment between atoms using a pairwise matching algorithm. With the proposed approach the similarities detected are more intuitively understood because similar atoms in the molecules are explicitly shown. This noncontiguous atom matching structural similarity method (NAMS) was tested and compared with one of the most widely used similarity methods (fingerprint-based similarity) using three difficult data sets with different characteristics. Despite having a higher computational cost, the method performed well being able to distinguish either different or very similar hydrocarbons that were indistinguishable using a fingerprint-based approach. NAMS also verified the similarity principle using a data set of structurally similar steroids with differences in the binding affinity to the corticosteroid binding globulin receptor by showing that pairs of steroids with a high degree of similarity (>80%) tend to have smaller differences in the absolute value of binding activity. Using a highly diverse set of compounds with information about the monoamine oxidase inhibition level, the method was also able to recover a significantly higher average fraction of active compounds when the seed is active for different cutoff threshold values of similarity. Particularly, for the cutoff threshold values of 86%, 93%, and 96.5%, NAMS was able to recover a fraction of actives of 0.57, 0.63, and 0.83, respectively, while the fingerprint-based approach was able to recover a fraction of actives of 0.41, 0.40, and 0.39, respectively. NAMS is made available freely for the whole community in a simple Web based tool as well as the Python source code at http://nams.lasige.di.fc.ul.pt/.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24044748     DOI: 10.1021/ci400324u

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  5 in total

1.  Many InChIs and quite some feat.

Authors:  Wendy A Warr
Journal:  J Comput Aided Mol Des       Date:  2015-06-17       Impact factor: 3.686

2.  Dietary and Microbial Oxazoles Induce Intestinal Inflammation by Modulating Aryl Hydrocarbon Receptor Responses.

Authors:  Shankar S Iyer; Thomas Gensollen; Amit Gandhi; Sungwhan F Oh; Joana F Neves; Frederic Collin; Richard Lavin; Carme Serra; Jonathan Glickman; Punyanganie S A de Silva; R Balfour Sartor; Gurdyal Besra; Russell Hauser; Anthony Maxwell; Amadeu Llebaria; Richard S Blumberg
Journal:  Cell       Date:  2018-05-17       Impact factor: 41.582

3.  A visual approach for analysis and inference of molecular activity spaces.

Authors:  Samina Kausar; Andre O Falcao
Journal:  J Cheminform       Date:  2019-10-22       Impact factor: 5.514

4.  A rotation-translation invariant molecular descriptor of partial charges and its use in ligand-based virtual screening.

Authors:  Francois Berenger; Arnout Voet; Xiao Yin Lee; Kam Yj Zhang
Journal:  J Cheminform       Date:  2014-05-10       Impact factor: 5.514

5.  Analysis and Comparison of Vector Space and Metric Space Representations in QSAR Modeling.

Authors:  Samina Kausar; Andre O Falcao
Journal:  Molecules       Date:  2019-04-30       Impact factor: 4.411

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.