Literature DB >> 20428463

GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Aaron Smalter1, Jun Huan, Gerald Lushington.   

Abstract

Classifying chemical compounds is an active topic in drug design and other cheminformatics applications. Graphs are general tools for organizing information from heterogenous sources and have been applied in modelling many kinds of biological data. With the fast accumulation of chemical structure data, building highly accurate predictive models for chemical graphs emerges as a new challenge.In this paper, we demonstrate a novel technique called Graph Pattern Matching kernel (GPM). Our idea is to leverage existing frequent pattern discovery methods and explore their application to kernel classifiers (e.g. support vector machine) for graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the database and use a diffusion process to label nodes in the graphs. Finally the kernel is computed using a set matching algorithm. We performed experiments on 16 chemical structure data sets and have compared our methods to other major graph kernels. The experimental results demonstrate excellent performance of our method.

Entities:  

Year:  2008        PMID: 20428463      PMCID: PMC2860184          DOI: 10.1109/BIBE.2008.4696654

Source DB:  PubMed          Journal:  Proc IEEE Int Symp Bioinformatics Bioeng        ISSN: 2159-5410


  9 in total

1.  Accurate classification of protein structural families using coherent subgraph analysis.

Authors:  J Huan; W Wang; A Washington; J Prins; R Shah; A Tropsha
Journal:  Pac Symp Biocomput       Date:  2004

2.  NIH Molecular Libraries Initiative.

Authors:  Christopher P Austin; Linda S Brady; Thomas R Insel; Francis S Collins
Journal:  Science       Date:  2004-11-12       Impact factor: 47.728

3.  CHEMICAL COMPOUND CLASSIFICATION WITH AUTOMATICALLY MINED STRUCTURE PATTERNS.

Authors:  A M Smalter; J Huan; G H Lushington
Journal:  Proc Asia Pac Bioinform Conf       Date:  2008

4.  Graph kernels for chemical informatics.

Authors:  Liva Ralaivola; Sanjay J Swamidass; Hiroto Saigo; Pierre Baldi
Journal:  Neural Netw       Date:  2005-09-12

5.  Virtual screening of molecular databases using a support vector machine.

Authors:  Robert N Jorissen; Michael K Gilson
Journal:  J Chem Inf Model       Date:  2005 May-Jun       Impact factor: 4.956

6.  Systematic discovery of functional modules and context-specific functional annotation of human genome.

Authors:  Yu Huang; Haifeng Li; Haiyan Hu; Xifeng Yan; Michael S Waterman; Haiyan Huang; Xianghong Jasmine Zhou
Journal:  Bioinformatics       Date:  2007-07-01       Impact factor: 6.937

7.  GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Authors:  Aaron Smalter; Jun Huan; Gerald Lushington
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2008-12-08

8.  Small molecules, big players: the National Cancer Institute's Initiative for Chemical Genetics.

Authors:  Nicola Tolliday; Paul A Clemons; Paul Ferraiolo; Angela N Koehler; Timothy A Lewis; Xiaohua Li; Stuart L Schreiber; Daniela S Gerhard; Scott Eliasof
Journal:  Cancer Res       Date:  2006-09-15       Impact factor: 12.701

9.  BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities.

Authors:  Tiqing Liu; Yuhmei Lin; Xin Wen; Robert N Jorissen; Michael K Gilson
Journal:  Nucleic Acids Res       Date:  2006-12-01       Impact factor: 16.971

  9 in total
  1 in total

1.  GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Authors:  Aaron Smalter; Jun Huan; Gerald Lushington
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2008-12-08
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.