Literature DB >> 19391629

Influence relevance voting: an accurate and interpretable virtual high throughput screening method.

S Joshua Swamidass1, Chloé-Agathe Azencott, Ting-Wan Lin, Hugo Gramajo, Shiou-Chuan Tsai, Pierre Baldi.   

Abstract

Given activity training data from high-throughput screening (HTS) experiments, virtual high-throughput screening (vHTS) methods aim to predict in silico the activity of untested chemicals. We present a novel method, the Influence Relevance Voter (IRV), specifically tailored for the vHTS task. The IRV is a low-parameter neural network which refines a k-nearest neighbor classifier by nonlinearly combining the influences of a chemical's neighbors in the training set. Influences are decomposed, also nonlinearly, into a relevance component and a vote component. The IRV is benchmarked using the data and rules of two large, open, competitions, and its performance compared to the performance of other participating methods, as well as of an in-house support vector machine (SVM) method. On these benchmark data sets, IRV achieves state-of-the-art results, comparable to the SVM in one case, and significantly better than the SVM in the other, retrieving three times as many actives in the top 1% of its prediction-sorted list. The IRV presents several other important advantages over SVMs and other methods: (1) the output predictions have a probabilistic semantic; (2) the underlying inferences are interpretable; (3) the training time is very short, on the order of minutes even for very large data sets; (4) the risk of overfitting is minimal, due to the small number of free parameters; and (5) additional information can easily be incorporated into the IRV architecture. Combined with its performance, these qualities make the IRV particularly well suited for vHTS.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19391629      PMCID: PMC2750043          DOI: 10.1021/ci8004379

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  29 in total

1.  Novel variable selection quantitative structure--property relationship approach based on the k-nearest-neighbor principle

Authors: 
Journal:  J Chem Inf Comput Sci       Date:  2000-01

2.  Using general regression and probabilistic neural networks to predict human intestinal absorption with topological descriptors derived from two-dimensional chemical structures.

Authors:  Tomoko Niwa
Journal:  J Chem Inf Comput Sci       Date:  2003 Jan-Feb

Review 3.  Comparison of fingerprint-based methods for virtual screening using multiple bioactive reference structures.

Authors:  Jérôme Hert; Peter Willett; David J Wilton; Pierre Acklin; Kamal Azzaoui; Edgar Jacoby; Ansgar Schuffenhauer
Journal:  J Chem Inf Comput Sci       Date:  2004 May-Jun

4.  Screening for dihydrofolate reductase inhibitors using MOLPRINT 2D, a fast fragment-based method employing the naïve Bayesian classifier: limitations of the descriptor and the importance of balanced chemistry in training and test sets.

Authors:  Andreas Bender; Hamse Y Mussa; Robert C Glen
Journal:  J Biomol Screen       Date:  2005-09-16

5.  Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity.

Authors:  S Joshua Swamidass; Jonathan Chen; Jocelyne Bruand; Peter Phung; Liva Ralaivola; Pierre Baldi
Journal:  Bioinformatics       Date:  2005-06       Impact factor: 6.937

6.  Assessing different classification methods for virtual screening.

Authors:  Dariusz Plewczynski; Stéphane A H Spieser; Uwe Koch
Journal:  J Chem Inf Model       Date:  2006 May-Jun       Impact factor: 4.956

7.  The pharmacophore kernel for virtual screening with support vector machines.

Authors:  Pierre Mahé; Liva Ralaivola; Véronique Stoven; Jean-Philippe Vert
Journal:  J Chem Inf Model       Date:  2006 Sep-Oct       Impact factor: 4.956

8.  Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time.

Authors:  S Joshua Swamidass; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2007-02-28       Impact factor: 4.956

9.  Lossless compression of chemical fingerprints using integer entropy codes improves storage and retrieval.

Authors:  Pierre Baldi; Ryan W Benz; Daniel S Hirschberg; S Joshua Swamidass
Journal:  J Chem Inf Model       Date:  2007-10-30       Impact factor: 4.956

10.  Managing bias in ROC curves.

Authors:  Robert D Clark; Daniel J Webster-Clark
Journal:  J Comput Aided Mol Des       Date:  2008-02-07       Impact factor: 3.686

View more
  17 in total

1.  A CROC stronger than ROC: measuring, visualizing and optimizing early retrieval.

Authors:  S Joshua Swamidass; Chloé-Agathe Azencott; Kenny Daily; Pierre Baldi
Journal:  Bioinformatics       Date:  2010-04-07       Impact factor: 6.937

2.  An economic framework to prioritize confirmatory tests after a high-throughput screen.

Authors:  S Joshua Swamidass; Joshua A Bittker; Nicole E Bodycombe; Sean P Ryder; Paul A Clemons
Journal:  J Biomol Screen       Date:  2010-06-14

Review 3.  Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.

Authors:  Kevin Maik Jablonka; Daniele Ongari; Seyed Mohamad Moosavi; Berend Smit
Journal:  Chem Rev       Date:  2020-06-10       Impact factor: 60.622

4.  Managing missing measurements in small-molecule screens.

Authors:  Michael R Browning; Bradley T Calhoun; S Joshua Swamidass
Journal:  J Comput Aided Mol Des       Date:  2013-04-13       Impact factor: 3.686

5.  Computational structural enzymology methodologies for the study and engineering of fatty acid synthases, polyketide synthases and nonribosomal peptide synthetases.

Authors:  Andrew J Schaub; Gabriel O Moreno; Shiji Zhao; Hau V Truong; Ray Luo; Shiou-Chuan Tsai
Journal:  Methods Enzymol       Date:  2019-04-22       Impact factor: 1.600

Review 6.  Fatty acid biosynthesis in actinomycetes.

Authors:  Gabriela Gago; Lautaro Diacovich; Ana Arabolaza; Shiou-Chuan Tsai; Hugo Gramajo
Journal:  FEMS Microbiol Rev       Date:  2011-01-19       Impact factor: 16.408

7.  Molecular graph convolutions: moving beyond fingerprints.

Authors:  Steven Kearnes; Kevin McCloskey; Marc Berndl; Vijay Pande; Patrick Riley
Journal:  J Comput Aided Mol Des       Date:  2016-08-24       Impact factor: 3.686

8.  Interpreting linear support vector machine models with heat map molecule coloring.

Authors:  Lars Rosenbaum; Georg Hinselmann; Andreas Jahn; Andreas Zell
Journal:  J Cheminform       Date:  2011-03-25       Impact factor: 5.514

9.  Accurate and efficient target prediction using a potency-sensitive influence-relevance voter.

Authors:  Alessandro Lusci; Michael Browning; David Fooshee; Joshua Swamidass; Pierre Baldi
Journal:  J Cheminform       Date:  2015-12-29       Impact factor: 5.514

Review 10.  Opportunities and obstacles for deep learning in biology and medicine.

Authors:  Travers Ching; Daniel S Himmelstein; Brett K Beaulieu-Jones; Alexandr A Kalinin; Brian T Do; Gregory P Way; Enrico Ferrero; Paul-Michael Agapow; Michael Zietz; Michael M Hoffman; Wei Xie; Gail L Rosen; Benjamin J Lengerich; Johnny Israeli; Jack Lanchantin; Stephen Woloszynek; Anne E Carpenter; Avanti Shrikumar; Jinbo Xu; Evan M Cofer; Christopher A Lavender; Srinivas C Turaga; Amr M Alexandari; Zhiyong Lu; David J Harris; Dave DeCaprio; Yanjun Qi; Anshul Kundaje; Yifan Peng; Laura K Wiley; Marwin H S Segler; Simina M Boca; S Joshua Swamidass; Austin Huang; Anthony Gitter; Casey S Greene
Journal:  J R Soc Interface       Date:  2018-04       Impact factor: 4.293

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.