Literature DB >> 16141251

A new algorithm for comparing and visualizing relationships between hierarchical and flat gene expression data clusterings.

Aurora Torrente1, Misha Kapushesky, Alvis Brazma.   

Abstract

MOTIVATION: Clustering is one of the most widely used methods in unsupervised gene expression data analysis. The use of different clustering algorithms or different parameters often produces rather different results on the same data. Biological interpretation of multiple clustering results requires understanding how different clusters relate to each other. It is particularly non-trivial to compare the results of a hierarchical and a flat, e.g. k-means, clustering.
RESULTS: We present a new method for comparing and visualizing relationships between different clustering results, either flat versus flat, or flat versus hierarchical. When comparing a flat clustering to a hierarchical clustering, the algorithm cuts different branches in the hierarchical tree at different levels to optimize the correspondence between the clusters. The optimization function is based on graph layout aesthetics or on mutual information. The clusters are displayed using a bipartite graph where the edges are weighted proportionally to the number of common elements in the respective clusters and the weighted number of crossings is minimized. The performance of the algorithm is tested using simulated and real gene expression data. The algorithm is implemented in the online gene expression data analysis tool Expression Profiler. AVAILABILITY: http://www.ebi.ac.uk/expressionprofiler

Mesh:

Year:  2005        PMID: 16141251     DOI: 10.1093/bioinformatics/bti644

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  8 in total

1.  Clustering of High Throughput Gene Expression Data.

Authors:  Harun Pirim; Burak Ekşioğlu; Andy Perkins; Cetin Yüceer
Journal:  Comput Oper Res       Date:  2012-12       Impact factor: 4.008

2.  A CD8+ T cell transcription signature predicts prognosis in autoimmune disease.

Authors:  Eoin F McKinney; Paul A Lyons; Edward J Carr; Jane L Hollis; David R W Jayne; Lisa C Willcocks; Maria Koukoulaki; Alvis Brazma; Vojislav Jovanovic; D Michael Kemeny; Andrew J Pollard; Paul A Macary; Afzal N Chaudhry; Kenneth G C Smith
Journal:  Nat Med       Date:  2010-04-18       Impact factor: 53.440

3.  Gene selection and classification for cancer microarray data based on machine learning and similarity measures.

Authors:  Qingzhong Liu; Andrew H Sung; Zhongxue Chen; Jianzhong Liu; Lei Chen; Mengyu Qiao; Zhaohui Wang; Xudong Huang; Youping Deng
Journal:  BMC Genomics       Date:  2011-12-23       Impact factor: 3.969

4.  VisHiC--hierarchical functional enrichment analysis of microarray data.

Authors:  Darya Krushevskaya; Hedi Peterson; Jüri Reimand; Meelis Kull; Jaak Vilo
Journal:  Nucleic Acids Res       Date:  2009-05-29       Impact factor: 16.971

5.  Feature selection and classification of MAQC-II breast cancer and multiple myeloma microarray gene expression data.

Authors:  Qingzhong Liu; Andrew H Sung; Zhongxue Chen; Jianzhong Liu; Xudong Huang; Youping Deng
Journal:  PLoS One       Date:  2009-12-11       Impact factor: 3.240

6.  A general strategy to determine the congruence between a hierarchical and a non-hierarchical classification.

Authors:  Antonio Marco; Ignacio Marín
Journal:  BMC Bioinformatics       Date:  2007-11-15       Impact factor: 3.169

7.  Global considerations in hierarchical clustering reveal meaningful patterns in data.

Authors:  Roy Varshavsky; David Horn; Michal Linial
Journal:  PLoS One       Date:  2008-05-21       Impact factor: 3.240

8.  clustComp, a bioconductor package for the comparison of clustering results.

Authors:  Aurora Torrente; Alvis Brazma
Journal:  Bioinformatics       Date:  2017-12-15       Impact factor: 6.937

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.