Literature DB >> 16355656

Clustering ensembles: models of consensus and weak partitions.

Alexander Topchy1, Anil K Jain, William Punch.   

Abstract

Clustering ensembles have emerged as a powerful method for improving both the robustness as well as the stability of unsupervised classification solutions. However, finding a consensus clustering from multiple partitions is a difficult problem that can be approached from graph-based, combinatorial, or statistical perspectives. This study extends previous research on clustering ensembles in several respects. First, we introduce a unified representation for multiple clusterings and formulate the corresponding categorical clustering problem. Second, we propose a probabilistic model of consensus using a finite mixture of multinomial distributions in a space of clusterings. A combined partition is found as a solution to the corresponding maximum-likelihood problem using the EM algorithm. Third, we define a new consensus function that is related to the classical intraclass variance criterion using the generalized mutual information definition. Finally, we demonstrate the efficacy of combining partitions generated by weak clustering algorithms that use data projections and random data splits. A simple explanatory model is offered for the behavior of combinations of such weak clustering components. Combination accuracy is analyzed as a function of several parameters that control the power and resolution of component partitions as well as the number of partitions. We also analyze clustering ensembles with incomplete information and the effect of missing cluster labels on the quality of overall consensus. Experimental results demonstrate the effectiveness of the proposed methods on several real-world data sets.

Mesh:

Year:  2005        PMID: 16355656     DOI: 10.1109/TPAMI.2005.237

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  21 in total

1.  Development and validation of consensus clustering-based framework for brain segmentation using resting fMRI.

Authors:  Srikanth Ryali; Tianwen Chen; Aarthi Padmanabhan; Weidong Cai; Vinod Menon
Journal:  J Neurosci Methods       Date:  2014-11-29       Impact factor: 2.390

2.  Fast and interpretable consensus clustering via minipatch learning.

Authors:  Luqin Gan; Genevera I Allen
Journal:  PLoS Comput Biol       Date:  2022-10-03       Impact factor: 4.779

3.  A Scalable Framework For Cluster Ensembles.

Authors:  Prodip Hore; Lawrence O Hall; Dmitry B Goldgof
Journal:  Pattern Recognit       Date:  2009-05       Impact factor: 7.740

4.  Protein complex detection via weighted ensemble clustering based on Bayesian nonnegative matrix factorization.

Authors:  Le Ou-Yang; Dao-Qing Dai; Xiao-Fei Zhang
Journal:  PLoS One       Date:  2013-05-02       Impact factor: 3.240

5.  Optimized data fusion for K-means Laplacian clustering.

Authors:  Shi Yu; Xinhai Liu; Léon-Charles Tranchevent; Wolfgang Glänzel; Johan A K Suykens; Bart De Moor; Yves Moreau
Journal:  Bioinformatics       Date:  2010-10-26       Impact factor: 6.937

6.  Robust consensus clustering for identification of expressed genes linked to malignancy of human colorectal carcinoma.

Authors:  Gatot Wahyudi; Ito Wasito; Tisha Melia; Indra Budi
Journal:  Bioinformation       Date:  2011-06-23

7.  Consensus clustering in complex networks.

Authors:  Andrea Lancichinetti; Santo Fortunato
Journal:  Sci Rep       Date:  2012-03-27       Impact factor: 4.379

8.  Identification of gene co-expression clusters in liver tissues from multiple porcine populations with high and low backfat androstenone phenotype.

Authors:  Sudeep Sahadevan; Ernst Tholen; Christine Große-Brinkhaus; Karl Schellander; Dawit Tesfaye; Martin Hofmann-Apitius; Mehmet Ulas Cinar; Asep Gunawan; Michael Hölker; Christiane Neuhoff
Journal:  BMC Genet       Date:  2015-02-28       Impact factor: 2.797

9.  Gene prioritization and clustering by multi-view text mining.

Authors:  Shi Yu; Leon-Charles Tranchevent; Bart De Moor; Yves Moreau
Journal:  BMC Bioinformatics       Date:  2010-01-14       Impact factor: 3.169

10.  A formal concept analysis approach to consensus clustering of multi-experiment expression data.

Authors:  Anna Hristoskova; Veselka Boeva; Elena Tsiporkova
Journal:  BMC Bioinformatics       Date:  2014-05-19       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.