| Literature DB >> 25871768 |
Jean-Philippe Métivier1,2, Alban Lepailleur1,3, Aleksey Buzmakov4,5, Guillaume Poezevara1,2,3, Bruno Crémilleux1,2, Sergei O Kuznetsov5, Jérémie Le Goff6, Amedeo Napoli4, Ronan Bureau1,3, Bertrand Cuissart1,2.
Abstract
This study is dedicated to the introduction of a novel method that automatically extracts potential structural alerts from a data set of molecules. These triggering structures can be further used for knowledge discovery and classification purposes. Computation of the structural alerts results from an implementation of a sophisticated workflow that integrates a graph mining tool guided by growth rate and stability. The growth rate is a well-established measurement of contrast between classes. Moreover, the extracted patterns correspond to formal concepts; the most robust patterns, named the stable emerging patterns (SEPs), can then be identified thanks to their stability, a new notion originating from the domain of formal concept analysis. All of these elements are explained in the paper from the point of view of computation. The method was applied to a molecular data set on mutagenicity. The experimental results demonstrate its efficiency: it automatically outputs a manageable number of structural patterns that are strongly related to mutagenicity. Moreover, a part of the resulting structures corresponds to already known structural alerts. Finally, an in-depth chemical analysis relying on these structures demonstrates how the method can initiate promising processes of chemical knowledge discovery.Mesh:
Substances:
Year: 2015 PMID: 25871768 DOI: 10.1021/ci500611v
Source DB: PubMed Journal: J Chem Inf Model ISSN: 1549-9596 Impact factor: 4.956