Literature DB >> 18000332

Cumulative voting consensus method for partitions with variable number of clusters.

Hanan G Ayad1, Mohamed S Kamel.   

Abstract

Over the past few years, there has been a renewed interest in the consensus clustering problem. Several new methods have been proposed for finding a consensus partition for a set of n data objects that optimally summarizes an ensemble. In this paper, we propose new consensus clustering algorithms with linear computational complexity in n. We consider clusterings generated with random number of clusters, which we describe by categorical random variables. We introduce the idea of cumulative voting as a solution for the problem of cluster label alignment, where, unlike the common one-to-one voting scheme, a probabilistic mapping is computed. We seek a first summary of the ensemble that minimizes the average squared distance between the mapped partitions and the optimal representation of the ensemble, where the selection criterion of the reference clustering is defined based on maximizing the information content as measured by the entropy. We describe cumulative vote weighting schemes and corresponding algorithms to compute an empirical probability distribution summarizing the ensemble. Given the arbitrary number of clusters of the input partitions, we formulate the problem of extracting the optimal consensus as that of finding a compressed summary of the estimated distribution that preserves maximum relevant information. An efficient solution is obtained using an agglomerative algorithm that minimizes the average generalized Jensen-Shannon divergence within the cluster. The empirical study demonstrates significant gains in accuracy and superior performance compared to several recent consensus clustering algorithms.

Mesh:

Year:  2008        PMID: 18000332     DOI: 10.1109/TPAMI.2007.1138

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  6 in total

1.  Weighted voting-based consensus clustering for chemical structure databases.

Authors:  Faisal Saeed; Ali Ahmed; Mohd Shahir Shamsir; Naomie Salim
Journal:  J Comput Aided Mol Des       Date:  2014-05-15       Impact factor: 3.686

2.  Voting-based consensus clustering for combining multiple clusterings of chemical structures.

Authors:  Faisal Saeed; Naomie Salim; Ammar Abdo
Journal:  J Cheminform       Date:  2012-12-17       Impact factor: 5.514

3.  Optimized data fusion for K-means Laplacian clustering.

Authors:  Shi Yu; Xinhai Liu; Léon-Charles Tranchevent; Wolfgang Glänzel; Johan A K Suykens; Bart De Moor; Yves Moreau
Journal:  Bioinformatics       Date:  2010-10-26       Impact factor: 6.937

4.  Consensus-Based Sorting of Neuronal Spike Waveforms.

Authors:  Julien Fournier; Christian M Mueller; Mark Shein-Idelson; Mike Hemberger; Gilles Laurent
Journal:  PLoS One       Date:  2016-08-18       Impact factor: 3.240

5.  Hierarchical spectral clustering reveals brain size and shape changes in asymptomatic carriers of C9orf72.

Authors:  Rose Bruffaerts; Dorothy Gors; Alicia Bárcenas Gallardo; Mathieu Vandenbulcke; Philip Van Damme; Paul Suetens; John C van Swieten; Barbara Borroni; Raquel Sanchez-Valle; Fermin Moreno; Robert Laforce; Caroline Graff; Matthis Synofzik; Daniela Galimberti; James B Rowe; Mario Masellis; Maria Carmela Tartaglia; Elizabeth Finger; Alexandre de Mendonça; Fabrizio Tagliavini; Chris R Butler; Isabel Santana; Alexander Gerhard; Simon Ducharme; Johannes Levin; Adrian Danek; Markus Otto; Jonathan D Rohrer; Patrick Dupont; Peter Claes; Rik Vandenberghe
Journal:  Brain Commun       Date:  2022-07-18

6.  Gene prioritization and clustering by multi-view text mining.

Authors:  Shi Yu; Leon-Charles Tranchevent; Bart De Moor; Yves Moreau
Journal:  BMC Bioinformatics       Date:  2010-01-14       Impact factor: 3.169

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.