Literature DB >> 20444838

LCE: a link-based cluster ensemble method for improved gene expression data analysis.

Natthakan Iam-on1, Tossapon Boongoen, Simon Garrett.   

Abstract

MOTIVATION: It is far from trivial to select the most effective clustering method and its parameterization, for a particular set of gene expression data, because there are a very large number of possibilities. Although many researchers still prefer to use hierarchical clustering in one form or another, this is often sub-optimal. Cluster ensemble research solves this problem by automatically combining multiple data partitions from different clusterings to improve both the robustness and quality of the clustering result. However, many existing ensemble techniques use an association matrix to summarize sample-cluster co-occurrence statistics, and relations within an ensemble are encapsulated only at coarse level, while those existing among clusters are completely neglected. Discovering these missing associations may greatly extend the capability of the ensemble methodology for microarray data clustering.
RESULTS: The link-based cluster ensemble (LCE) method, presented here, implements these ideas and demonstrates outstanding performance. Experiment results on real gene expression and synthetic datasets indicate that LCE: (i) usually outperforms the existing cluster ensemble algorithms in individual tests and, overall, is clearly class-leading; (ii) generates excellent, robust performance across different types of data, especially with the presence of noise and imbalanced data clusters; (iii) provides a high-level data matrix that is applicable to many numerical clustering techniques; and (iv) is computationally efficient for large datasets and gene clustering. AVAILABILITY: Online supplementary and implementation are available at: http://users.aber.ac.uk/nii07/bioinformatics2010. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2010        PMID: 20444838     DOI: 10.1093/bioinformatics/btq226

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  9 in total

1.  Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities.

Authors:  Marinka Zitnik; Francis Nguyen; Bo Wang; Jure Leskovec; Anna Goldenberg; Michael M Hoffman
Journal:  Inf Fusion       Date:  2018-09-21       Impact factor: 12.975

Review 2.  Machine learning: its challenges and opportunities in plant system biology.

Authors:  Mohsen Hesami; Milad Alizadeh; Andrew Maxwell Phineas Jones; Davoud Torkamaneh
Journal:  Appl Microbiol Biotechnol       Date:  2022-05-16       Impact factor: 4.813

3.  A new locally weighted K-means for cancer-aided microarray data analysis.

Authors:  Natthakan Iam-On; Tossapon Boongoen
Journal:  J Med Syst       Date:  2012-10-28       Impact factor: 4.460

4.  Semi-supervised consensus clustering for gene expression data analysis.

Authors:  Yunli Wang; Youlian Pan
Journal:  BioData Min       Date:  2014-05-08       Impact factor: 2.522

5.  Critical limitations of consensus clustering in class discovery.

Authors:  Yasin Șenbabaoğlu; George Michailidis; Jun Z Li
Journal:  Sci Rep       Date:  2014-08-27       Impact factor: 4.379

6.  Predicting implementation of active learning by tenure-track teaching faculty using robust cluster analysis.

Authors:  Austin L Zuckerman; Rebecca A Hardesty; Adriana Signorini; Andrea Aebersold; Mayank Verma; Kameryn Denaro; Petra Kranzfelder; Melinda T Owens; Brian Sato; Stanley M Lo
Journal:  Int J STEM Educ       Date:  2022-07-28

7.  Cloud based metalearning system for predictive modeling of biomedical data.

Authors:  Milan Vukićević; Sandro Radovanović; Miloš Milovanović; Miroslav Minović
Journal:  ScientificWorldJournal       Date:  2014-04-14

8.  diceR: an R package for class discovery using an ensemble driven approach.

Authors:  Derek S Chiu; Aline Talhouk
Journal:  BMC Bioinformatics       Date:  2018-01-15       Impact factor: 3.169

9.  A Random Walk Based Cluster Ensemble Approach for Data Integration and Cancer Subtyping.

Authors:  Chao Yang; Yu-Tian Wang; Chun-Hou Zheng
Journal:  Genes (Basel)       Date:  2019-01-18       Impact factor: 4.096

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.