Literature DB >> 16882653

Evaluation and comparison of gene clustering methods in microarray analysis.

Anbupalam Thalamuthu1, Indranil Mukhopadhyay, Xiaojing Zheng, George C Tseng.   

Abstract

MOTIVATION: Microarray technology has been widely applied in biological and clinical studies for simultaneous monitoring of gene expression in thousands of genes. Gene clustering analysis is found useful for discovering groups of correlated genes potentially co-regulated or associated to the disease or conditions under investigation. Many clustering methods including hierarchical clustering, K-means, PAM, SOM, mixture model-based clustering and tight clustering have been widely used in the literature. Yet no comprehensive comparative study has been performed to evaluate the effectiveness of these methods.
RESULTS: In this paper, six gene clustering methods are evaluated by simulated data from a hierarchical log-normal model with various degrees of perturbation as well as four real datasets. A weighted Rand index is proposed for measuring similarity of two clustering results with possible scattered genes (i.e. a set of noise genes not being clustered). Performance of the methods in the real data is assessed by a predictive accuracy analysis through verified gene annotations. Our results show that tight clustering and model-based clustering consistently outperform other clustering methods both in simulated and real data while hierarchical clustering and SOM perform among the worst. Our analysis provides deep insight to the complicated gene clustering problem of expression profile and serves as a practical guideline for routine microarray cluster analysis.

Entities:  

Mesh:

Year:  2006        PMID: 16882653     DOI: 10.1093/bioinformatics/btl406

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  83 in total

1.  Module-based prediction approach for robust inter-study predictions in microarray data.

Authors:  Zhibao Mi; Kui Shen; Nan Song; Chunrong Cheng; Chi Song; Naftali Kaminski; George C Tseng
Journal:  Bioinformatics       Date:  2010-08-17       Impact factor: 6.937

2.  Comparing the performance of biomedical clustering methods.

Authors:  Christian Wiwie; Jan Baumbach; Richard Röttger
Journal:  Nat Methods       Date:  2015-09-21       Impact factor: 28.547

3.  Analysis of time-series gene expression data: methods, challenges, and opportunities.

Authors:  I P Androulakis; E Yang; R R Almon
Journal:  Annu Rev Biomed Eng       Date:  2007       Impact factor: 9.590

4.  Class-specific correlations of gene expressions: identification and their effects on clustering analyses.

Authors:  Jigang Zhang; Jian Li; Hongwen Deng
Journal:  Am J Hum Genet       Date:  2008-08       Impact factor: 11.025

5.  Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Bioinformatics       Date:  2009-12-23       Impact factor: 6.937

Review 6.  A ground truth based comparative study on clustering of gene expression data.

Authors:  Yitan Zhu; Zuyi Wang; David J Miller; Robert Clarke; Jianhua Xuan; Eric P Hoffman; Yue Wang
Journal:  Front Biosci       Date:  2008-05-01

7.  Performance comparison of gene family clustering methods with expert curated gene family data set in Arabidopsis thaliana.

Authors:  Kuan Yang; Liqing Zhang
Journal:  Planta       Date:  2008-05-21       Impact factor: 4.116

8.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

9.  Bayesian model-based tight clustering for time course data.

Authors:  Yongsung Joo; G Casella; J Hobert
Journal:  Comput Stat       Date:  2010-03       Impact factor: 1.000

10.  Extracting gene expression patterns and identifying co-expressed genes from microarray data reveals biologically responsive processes.

Authors:  Jeff W Chou; Tong Zhou; William K Kaufmann; Richard S Paules; Pierre R Bushel
Journal:  BMC Bioinformatics       Date:  2007-11-02       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.