Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Literature DB >> 20428463

GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Aaron Smalter¹, Jun Huan, Gerald Lushington.

Abstract

Classifying chemical compounds is an active topic in drug design and other cheminformatics applications. Graphs are general tools for organizing information from heterogenous sources and have been applied in modelling many kinds of biological data. With the fast accumulation of chemical structure data, building highly accurate predictive models for chemical graphs emerges as a new challenge.In this paper, we demonstrate a novel technique called Graph Pattern Matching kernel (GPM). Our idea is to leverage existing frequent pattern discovery methods and explore their application to kernel classifiers (e.g. support vector machine) for graph classification. In our method, we first identify all frequent patterns from a graph database. We then map subgraphs to graphs in the database and use a diffusion process to label nodes in the graphs. Finally the kernel is computed using a set matching algorithm. We performed experiments on 16 chemical structure data sets and have compared our methods to other major graph kernels. The experimental results demonstrate excellent performance of our method.

Entities: Disease Gene Species

Year: 2008 PMID： 20428463 PMCID： PMC2860184 DOI： 10.1109/BIBE.2008.4696654

Source DB: PubMed Journal: Proc IEEE Int Symp Bioinformatics Bioeng ISSN： 2159-5410

9 in total

1 in total

1. GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Authors: Aaron Smalter; Jun Huan; Gerald Lushington
Journal: Proc IEEE Int Symp Bioinformatics Bioeng Date: 2008-12-08

1 in total

GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

1. Accurate classification of protein structural families using coherent subgraph analysis.

2. NIH Molecular Libraries Initiative.

3. CHEMICAL COMPOUND CLASSIFICATION WITH AUTOMATICALLY MINED STRUCTURE PATTERNS.

4. Graph kernels for chemical informatics.

5. Virtual screening of molecular databases using a support vector machine.

6. Systematic discovery of functional modules and context-specific functional annotation of human genome.

7. GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

8. Small molecules, big players: the National Cancer Institute's Initiative for Chemical Genetics.

9. BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities.

1. GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.