Literature DB >> 20448828

CHEMICAL COMPOUND CLASSIFICATION WITH AUTOMATICALLY MINED STRUCTURE PATTERNS.

A M Smalter1, J Huan, G H Lushington.   

Abstract

In this paper we propose new methods of chemical structure classification based on the integration of graph database mining from data mining and graph kernel functions from machine learning. In our method, we first identify a set of general graph patterns in chemical structure data. These patterns are then used to augment a graph kernel function that calculates the pairwise similarity between molecules. The obtained similarity matrix is used as input to classify chemical compounds via a kernel machines such as the support vector machine (SVM). Our results indicate that the use of a pattern-based approach to graph similarity yields performance profiles comparable to, and sometimes exceeding that of the existing state-of-the-art approaches. In addition, the identification of highly discriminative patterns for activity classification provides evidence that our methods can make generalizations about a compound's function given its chemical structure. While we evaluated our methods on molecular structures, these methods are designed to operate on general graph data and hence could easily be applied to other domains in bioinformatics.

Year:  2008        PMID: 20448828      PMCID: PMC2864492          DOI: 10.1901/jaba.2008.6-39

Source DB:  PubMed          Journal:  Proc Asia Pac Bioinform Conf


  5 in total

1.  Anchor-GRIND: filling the gap between standard 3D QSAR and the GRid-INdependent descriptors.

Authors:  Fabien Fontaine; Manuel Pastor; Ismael Zamora; Ferran Sanz
Journal:  J Med Chem       Date:  2005-04-07       Impact factor: 7.446

2.  Graph kernels for chemical informatics.

Authors:  Liva Ralaivola; Sanjay J Swamidass; Hiroto Saigo; Pierre Baldi
Journal:  Neural Netw       Date:  2005-09-12

3.  Virtual screening of molecular databases using a support vector machine.

Authors:  Robert N Jorissen; Michael K Gilson
Journal:  J Chem Inf Model       Date:  2005 May-Jun       Impact factor: 4.956

4.  Prediction of human intestinal absorption of drug compounds from molecular structure.

Authors:  M D Wessel; P C Jurs; J W Tolan; S M Muskal
Journal:  J Chem Inf Comput Sci       Date:  1998 Jul-Aug

5.  Neighborhood behavior: a useful concept for validation of "molecular diversity" descriptors.

Authors:  D E Patterson; R D Cramer; A M Ferguson; R D Clark; L E Weinberger
Journal:  J Med Chem       Date:  1996-08-02       Impact factor: 7.446

  5 in total
  4 in total

1.  Feature Selection in the Tensor Product Feature Space.

Authors:  Aaron Smalter; Jun Huan; Gerald Lushington
Journal:  Proc IEEE Int Conf Data Min       Date:  2009

2.  G-Hash: Towards Fast Kernel-based Similarity Search in Large Graph Databases.

Authors:  Xiaohong Wang; Aaron Smalter; Jun Huan; Gerald H Lushington
Journal:  Adv Database Technol       Date:  2009

3.  GPM: A Graph Pattern Matching Kernel with Diffusion for Chemical Compound Classification.

Authors:  Aaron Smalter; Jun Huan; Gerald Lushington
Journal:  Proc IEEE Int Symp Bioinformatics Bioeng       Date:  2008-12-08

4.  Graph wavelet alignment kernels for drug virtual screening.

Authors:  Aaron Smalter; Jun Huan; Gerald Lushington
Journal:  J Bioinform Comput Biol       Date:  2009-06       Impact factor: 1.122

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.