| Literature DB >> 27481767 |
Faisal Saeed1,2, Naomie Salim3, Ammar Abdo4,5.
Abstract
Many consensus clustering methods have been applied in different areas such as pattern recognition, machine learning, information theory and bioinformatics. However, few methods have been used for chemical compounds clustering. In this paper, an information theory and voting based algorithm (Adaptive Cumulative Voting-based Aggregation Algorithm A-CVAA) was examined for combining multiple clusterings of chemical structures. The effectiveness of clusterings was evaluated based on the ability of the clustering method to separate active from inactive molecules in each cluster, and the results were compared with Ward's method. The chemical dataset MDL Drug Data Report (MDDR) and the Maximum Unbiased Validation (MUV) dataset were used. Experiments suggest that the adaptive cumulative voting-based consensus method can improve the effectiveness of combining multiple clusterings of chemical structures.Keywords: Compound selection; Cumulative voting; Ensemble clustering; Molecular datasets
Year: 2013 PMID: 27481767 DOI: 10.1002/minf.201300004
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353