Literature DB >> 19381534

Comparing algorithms for clustering of expression data: how to assess gene clusters.

Golan Yona1, William Dirks, Shafquat Rahman.   

Abstract

Clustering is a popular technique commonly used to search for groups of similarly expressed genes using mRNA expression data. There are many different clustering algorithms and the application of each one will usually produce different results. Without additional evaluation, it is difficult to determine which solutions are better.In this chapter we discuss methods to assess algorithms for clustering of gene expression data. In particular, we present a new method that uses two elements: an internal index of validity based on the MDL principle and an external index of validity that measures the consistency with experimental data. Each one is used to suggest an effective set of models, but it is only the combination of both that is capable of pinpointing the best model overall. Our method can be used to compare different clustering algorithms and pick the one that maximizes the correlation with functional links in gene networks while minimizing the error rate. We test our methods on several popular clustering algorithms as well as on clustering algorithms that are specially tailored to deal with noisy data. Finally, we propose methods for assessing the significance of individual clusters and study the correspondence between gene clusters and biochemical pathways.

Mesh:

Year:  2009        PMID: 19381534     DOI: 10.1007/978-1-59745-243-4_21

Source DB:  PubMed          Journal:  Methods Mol Biol        ISSN: 1064-3745


  5 in total

1.  Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering.

Authors:  Dikla Dotan-Cohen; Simon Kasif; Avraham A Melkman
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

2.  Reducing the time requirement of k-means algorithm.

Authors:  Victor Chukwudi Osamor; Ezekiel Femi Adebiyi; Jelilli Olarenwaju Oyelade; Seydou Doumbia
Journal:  PLoS One       Date:  2012-12-11       Impact factor: 3.240

3.  Functional cohesion of gene sets determined by latent semantic indexing of PubMed abstracts.

Authors:  Lijing Xu; Nicholas Furlotte; Yunyue Lin; Kevin Heinrich; Michael W Berry; Ebenezer O George; Ramin Homayouni
Journal:  PLoS One       Date:  2011-04-14       Impact factor: 3.240

4.  Quantitative assessment of gene expression network module-validation methods.

Authors:  Bing Li; Yingying Zhang; Yanan Yu; Pengqian Wang; Yongcheng Wang; Zhong Wang; Yongyan Wang
Journal:  Sci Rep       Date:  2015-10-16       Impact factor: 4.379

5.  Measuring similarity between gene interaction profiles.

Authors:  Joëlle Barido-Sottani; Samuel D Chapman; Evsey Kosman; Arcady R Mushegian
Journal:  BMC Bioinformatics       Date:  2019-08-22       Impact factor: 3.169

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.