Literature DB >> 16157471

Graph kernels for chemical informatics.

Liva Ralaivola1, Sanjay J Swamidass, Hiroto Saigo, Pierre Baldi.   

Abstract

Increased availability of large repositories of chemical compounds is creating new challenges and opportunities for the application of machine learning methods to problems in computational chemistry and chemical informatics. Because chemical compounds are often represented by the graph of their covalent bonds, machine learning methods in this domain must be capable of processing graphical structures with variable size. Here, we first briefly review the literature on graph kernels and then introduce three new kernels (Tanimoto, MinMax, Hybrid) based on the idea of molecular fingerprints and counting labeled paths of depth up to d using depth-first search from each possible vertex. The kernels are applied to three classification problems to predict mutagenicity, toxicity, and anti-cancer activity on three publicly available data sets. The kernels achieve performances at least comparable, and most often superior, to those previously reported in the literature reaching accuracies of 91.5% on the Mutag dataset, 65-67% on the PTC (Predictive Toxicology Challenge) dataset, and 72% on the NCI (National Cancer Institute) dataset. Properties and tradeoffs of these kernels, as well as other proposed kernels that leverage 1D or 3D representations of molecules, are briefly discussed.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16157471     DOI: 10.1016/j.neunet.2005.07.009

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  49 in total

1.  CHEMICAL COMPOUND CLASSIFICATION WITH AUTOMATICALLY MINED STRUCTURE PATTERNS.

Authors:  A M Smalter; J Huan; G H Lushington
Journal:  Proc Asia Pac Bioinform Conf       Date:  2008

2.  Bounds and algorithms for fast exact searches of chemical fingerprints in linear and sublinear time.

Authors:  S Joshua Swamidass; Pierre Baldi
Journal:  J Chem Inf Model       Date:  2007-02-28       Impact factor: 4.956

3.  EndNote: Feature-based classification of networks.

Authors:  Ian Barnett; Nishant Malik; Marieke L Kuijjer; Peter J Mucha; Jukka-Pekka Onnela
Journal:  Netw Sci (Camb Univ Press)       Date:  2019-09-23

4.  Evaluation of different virtual screening strategies on the basis of compound sets with characteristic core distributions and dissimilarity relationships.

Authors:  Tomoyuki Miyao; Swarit Jasial; Jürgen Bajorath; Kimito Funatsu
Journal:  J Comput Aided Mol Des       Date:  2019-08-21       Impact factor: 3.686

5.  Protein-ligand interaction prediction: an improved chemogenomics approach.

Authors:  Laurent Jacob; Jean-Philippe Vert
Journal:  Bioinformatics       Date:  2008-08-01       Impact factor: 6.937

Review 6.  Machine learning for in silico virtual screening and chemical genomics: new strategies.

Authors:  Jean-Philippe Vert; Laurent Jacob
Journal:  Comb Chem High Throughput Screen       Date:  2008-09       Impact factor: 1.339

7.  A constructive approach for discovering new drug leads: Using a kernel methodology for the inverse-QSAR problem.

Authors:  William Wl Wong; Forbes J Burkowski
Journal:  J Cheminform       Date:  2009-04-28       Impact factor: 5.514

8.  Estimation of the applicability domain of kernel-based machine learning models for virtual screening.

Authors:  Nikolas Fechner; Andreas Jahn; Georg Hinselmann; Andreas Zell
Journal:  J Cheminform       Date:  2010-03-11       Impact factor: 5.514

9.  Combinatorial algorithm for counting small induced graphs and orbits.

Authors:  Tomaž Hočevar; Janez Demšar
Journal:  PLoS One       Date:  2017-02-09       Impact factor: 3.240

10.  Graphlet kernels for prediction of functional residues in protein structures.

Authors:  Vladimir Vacic; Lilia M Iakoucheva; Stefano Lonardi; Predrag Radivojac
Journal:  J Comput Biol       Date:  2010-01       Impact factor: 1.479

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.