Literature DB >> 24830925

Weighted voting-based consensus clustering for chemical structure databases.

Faisal Saeed1, Ali Ahmed, Mohd Shahir Shamsir, Naomie Salim.   

Abstract

The cluster-based compound selection is used in the lead identification process of drug discovery and design. Many clustering methods have been used for chemical databases, but there is no clustering method that can obtain the best results under all circumstances. However, little attention has been focused on the use of combination methods for chemical structure clustering, which is known as consensus clustering. Recently, consensus clustering has been used in many areas including bioinformatics, machine learning and information theory. This process can improve the robustness, stability, consistency and novelty of clustering. For chemical databases, different consensus clustering methods have been used including the co-association matrix-based, graph-based, hypergraph-based and voting-based methods. In this paper, a weighted cumulative voting-based aggregation algorithm (W-CVAA) was developed. The MDL Drug Data Report (MDDR) benchmark chemical dataset was used in the experiments and represented by the AlogP and ECPF_4 descriptors. The results from the clustering methods were evaluated by the ability of the clustering to separate biologically active molecules in each cluster from inactive ones using different criteria, and the effectiveness of the consensus clustering was compared to that of Ward's method, which is the current standard clustering method in chemoinformatics. This study indicated that weighted voting-based consensus clustering can overcome the limitations of the existing voting-based methods and improve the effectiveness of combining multiple clusterings of chemical structures.

Mesh:

Year:  2014        PMID: 24830925     DOI: 10.1007/s10822-014-9750-2

Source DB:  PubMed          Journal:  J Comput Aided Mol Des        ISSN: 0920-654X            Impact factor:   3.686


  11 in total

1.  Clustering files of chemical structures using the fuzzy k-means clustering method.

Authors:  John D Holliday; Sarah L Rodgers; Peter Willett; Min-You Chen; Mahdi Mahfouf; Kevin Lawson; Graham Mullier
Journal:  J Chem Inf Comput Sci       Date:  2004 May-Jun

2.  Combining multiple classifications of chemical structures using consensus clustering.

Authors:  Chia-Wei Chu; John D Holliday; Peter Willett
Journal:  Bioorg Med Chem       Date:  2012-03-10       Impact factor: 3.641

3.  Extended-connectivity fingerprints.

Authors:  David Rogers; Mathew Hahn
Journal:  J Chem Inf Model       Date:  2010-05-24       Impact factor: 4.956

Review 4.  Editorial opinion: chemoinformatics - a ten year update.

Authors:  Frank Brown
Journal:  Curr Opin Drug Discov Devel       Date:  2005-05

5.  Cumulative voting consensus method for partitions with variable number of clusters.

Authors:  Hanan G Ayad; Mohamed S Kamel
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2008-01       Impact factor: 6.226

6.  Clustering files of chemical structures using the Székely-Rizzo generalization of Ward's method.

Authors:  Thibault Varin; Ronan Bureau; Christoph Mueller; Peter Willett
Journal:  J Mol Graph Model       Date:  2009-07-04       Impact factor: 2.518

7.  3D Pharmacophore, hierarchical methods, and 5-HT4 receptor binding data.

Authors:  Thibault Varin; Nicolas Saettel; Jonathan Villain; Aurelien Lesnard; François Dauphin; Ronan Bureau; Sylvain Rault
Journal:  J Enzyme Inhib Med Chem       Date:  2008-10       Impact factor: 5.051

8.  Consensus methods for combining multiple clusterings of chemical structures.

Authors:  Faisal Saeed; Naomie Salim; Ammar Abdo
Journal:  J Chem Inf Model       Date:  2013-04-26       Impact factor: 4.956

9.  Graph-Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures.

Authors:  Faisal Saeed; Naomie Salim; Ammar Abdo; Hamza Hentabli
Journal:  Mol Inform       Date:  2013-02-05       Impact factor: 3.353

10.  Information Theory and Voting Based Consensus Clustering for Combining Multiple Clusterings of Chemical Structures.

Authors:  Faisal Saeed; Naomie Salim; Ammar Abdo
Journal:  Mol Inform       Date:  2013-05-15       Impact factor: 3.353

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.