

GPD: a graph pattern diffusion kernel for accurate graph classification with applications in cheminformatics.

Aaron Smalter, Jun Luke Huan, Yi Jia, Gerald Lushington.

Abstract

Graph data mining is an active research area. Graphs are general modeling tools for organizing information from heterogeneous sources and have been applied in many scientific, engineering, and business fields. With the fast accumulation of graph data, building highly accurate predictive models for graph data has emerged as a new challenge that has not been fully explored in the data mining community. In this paper, we present a novel technique called the graph pattern diffusion (GPD) kernel. Our idea is to leverage existing frequent pattern discovery methods and to explore the application of kernel classifiers (e.g., support vector machines) in building highly accurate graph classifiers. In our method, we first identify all frequent patterns in a graph database. We then map these subgraphs to the graphs in the database and use a process we call "pattern diffusion" to label nodes in the graphs. Finally, we design a graph alignment algorithm to compute the inner product of two graphs. We have tested our algorithm on a number of chemical structure data sets. The experimental results demonstrate that our method is significantly better than competing methods such as kernel functions based on paths, cycles, and subgraphs.
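
The pipeline in the abstract (pattern-based node labels, diffusion, alignment) can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract does not give the diffusion rule or the alignment procedure, so everything below is an assumption made for illustration: the function names `diffuse` and `alignment_score`, the parameters `steps` and `alpha`, the rule that each node passes a fraction `alpha` of its label mass equally to its neighbors, and the brute-force search over node alignments. Frequent-pattern mining is assumed to have been done already, so each node starts with one indicator value per pattern.

```python
from itertools import permutations


def diffuse(adj, labels, steps=2, alpha=0.5):
    """Spread per-pattern label mass along edges (assumed diffusion rule).

    adj:    {node: [neighbor, ...]} for an undirected graph
    labels: {node: [float, ...]}, one entry per frequent pattern
    """
    current = {v: list(vec) for v, vec in labels.items()}
    for _ in range(steps):
        # Each node with neighbors keeps (1 - alpha) of its mass;
        # isolated nodes keep everything.
        nxt = {
            v: [(1.0 - alpha) * x if adj[v] else x for x in vec]
            for v, vec in current.items()
        }
        # The remaining alpha is split equally among the neighbors.
        for v, vec in current.items():
            if not adj[v]:
                continue
            share = alpha / len(adj[v])
            for u in adj[v]:
                for i, x in enumerate(vec):
                    nxt[u][i] += share * x
        current = nxt
    return current


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def alignment_score(vecs_a, vecs_b):
    """Best one-to-one node alignment score (brute force, tiny graphs only)."""
    small, large = sorted(
        [list(vecs_a.values()), list(vecs_b.values())], key=len
    )
    best = 0.0
    # Try every injective mapping from the smaller node set to the larger.
    for perm in permutations(range(len(large)), len(small)):
        score = sum(dot(small[i], large[j]) for i, j in enumerate(perm))
        best = max(best, score)
    return best


# Hypothetical example: a three-node path a-b-c where one pattern covers node a.
adj = {"a": ["b"], "b": ["a", "c"], "c": ["b"]}
labels = {"a": [1.0], "b": [0.0], "c": [0.0]}
diffused = diffuse(adj, labels, steps=2, alpha=0.5)
score = alignment_score(diffused, diffused)
```

The brute-force alignment grows factorially with graph size and is only meant to make the idea concrete; for graphs of realistic size an assignment solver would be the natural substitute.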


Year: 2010 PMID: 20431140 PMCID: PMC3058227 DOI: 10.1109/TCBB.2009.80

Source DB: PubMed Journal: IEEE/ACM Trans Comput Biol Bioinform ISSN: 1545-5963 Impact factor: 3.710




