Literature DB >> 12870906

Design and evaluation of a molecular fingerprint involving the transformation of property descriptor values into a binary classification scheme.

Ling Xue1, Jeffrey W Godden, Florence L Stahura, Jürgen Bajorath.   

Abstract

A new fingerprint design concept is introduced that transforms molecular property descriptors into two-state descriptors and thus permits binary encoding. This transformation is based on the calculation of statistical medians of descriptor distributions in large compound collections and alleviates the need for value range encoding of these descriptors. For binary encoded property descriptors, bit positions that are set off capture as much information as bit positions that are set on, different from conventional fingerprint representations. Accordingly, a variant of the Tanimoto coefficient has been defined for comparison of these fingerprints. Following our design idea, a prototypic fingerprint termed MP-MFP was implemented by combining 61 binary encoded property descriptors with 110 structural fragment-type descriptors. The performance of this fingerprint was evaluated in systematic similarity search calculations in a database containing 549 molecules belonging to 38 different activity classes and 5000 background molecules. In these calculations, MP-MFP correctly recognized approximately 34% of all similarity relationships, with only 0.04% false positives, and performed better than previous designs and MACCS keys. The results suggest that combinations of simplified two-state property descriptors have predictive value in the analysis of molecular similarity.

Mesh:

Substances:

Year:  2003        PMID: 12870906     DOI: 10.1021/ci030285+

Source DB:  PubMed          Journal:  J Chem Inf Comput Sci        ISSN: 0095-2338


  15 in total

Review 1.  Molecular similarity and diversity in chemoinformatics: from theory to applications.

Authors:  Ana G Maldonado; J P Doucet; Michel Petitjean; Bo-Tao Fan
Journal:  Mol Divers       Date:  2006-02       Impact factor: 2.943

2.  JEDA: Joint entropy diversity analysis. An information-theoretic method for choosing diverse and representative subsets from combinatorial libraries.

Authors:  Melissa R Landon; Scott E Schaus
Journal:  Mol Divers       Date:  2006-09-21       Impact factor: 2.943

3.  Reverse fingerprinting, similarity searching by group fusion and fingerprint bit importance.

Authors:  Chris Williams
Journal:  Mol Divers       Date:  2006-09-21       Impact factor: 2.943

4.  eFindSite: improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands.

Authors:  Michal Brylinski; Wei P Feinstein
Journal:  J Comput Aided Mol Des       Date:  2013-07-10       Impact factor: 3.686

5.  Comparison of structure-based and threading-based approaches to protein functional annotation.

Authors:  Michal Brylinski; Jeffrey Skolnick
Journal:  Proteins       Date:  2010-01

Review 6.  In silico methods for drug repurposing and pharmacology.

Authors:  Rachel A Hodos; Brian A Kidd; Khader Shameer; Ben P Readhead; Joel T Dudley
Journal:  Wiley Interdiscip Rev Syst Biol Med       Date:  2016-04-15

7.  Computer-Aided Drug Design Methods.

Authors:  Wenbo Yu; Alexander D MacKerell
Journal:  Methods Mol Biol       Date:  2017

8.  S2DV: converting SMILES to a drug vector for predicting the activity of anti-HBV small molecules.

Authors:  Jinsong Shao; Qineng Gong; Zeyu Yin; Wenjie Pan; Sanjeevi Pandiyan; Li Wang
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

9.  Discovering patterns in drug-protein interactions based on their fingerprints.

Authors:  Weimin Luo; Keith C C Chan
Journal:  BMC Bioinformatics       Date:  2012-06-11       Impact factor: 3.169

10.  FINDSITE: a threading-based approach to ligand homology modeling.

Authors:  Michal Brylinski; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-06-05       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.