Literature DB >> 19925686

HAMSTER: visualizing microarray experiments as a set of minimum spanning trees.

Raymond Wan1, Larisa Kiseleva, Hajime Harada, Hiroshi Mamitsuka, Paul Horton.   

Abstract

BACKGROUND: Visualization tools allow researchers to obtain a global view of the interrelationships between the probes or experiments of a gene expression (e.g. microarray) data set. Some existing methods include hierarchical clustering and k-means. In recent years, others have proposed applying minimum spanning trees (MST) for microarray clustering. Although MST-based clustering is formally equivalent to the dendrograms produced by hierarchical clustering under certain conditions; visually they can be quite different.
METHODS: HAMSTER (Helpful Abstraction using Minimum Spanning Trees for Expression Relations) is an open source system for generating a set of MSTs from the experiments of a microarray data set. While previous works have generated a single MST from a data set for data clustering, we recursively merge experiments and repeat this process to obtain a set of MSTs for data visualization. Depending on the parameters chosen, each tree is analogous to a snapshot of one step of the hierarchical clustering process. We scored and ranked these trees using one of three proposed schemes. HAMSTER is implemented in C++ and makes use of Graphviz for laying out each MST.
RESULTS: We report on the running time of HAMSTER and demonstrate using data sets from the NCBI Gene Expression Omnibus (GEO) that the images created by HAMSTER offer insights that differ from the dendrograms of hierarchical clustering. In addition to the C++ program which is available as open source, we also provided a web-based version (HAMSTER+) which allows users to apply our system through a web browser without any computer programming knowledge.
CONCLUSION: Researchers may find it helpful to include HAMSTER in their microarray analysis workflow as it can offer insights that differ from hierarchical clustering. We believe that HAMSTER would be useful for certain types of gradient data sets (e.g time-series data) and data that indicate relationships between cells/tissues. Both the source and the web server variant of HAMSTER are available from http://hamster.cbrc.jp/.

Entities:  

Year:  2009        PMID: 19925686      PMCID: PMC2784758          DOI: 10.1186/1751-0473-4-8

Source DB:  PubMed          Journal:  Source Code Biol Med        ISSN: 1751-0473


  19 in total

1.  Large-scale clustering of cDNA-fingerprinting data.

Authors:  R Herwig; A J Poustka; C Müller; C Bull; H Lehrach; J O'Brien
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

2.  Reconstructing the temporal ordering of biological samples using microarray data.

Authors:  Paul M Magwene; Paul Lizardi; Junhyong Kim
Journal:  Bioinformatics       Date:  2003-05-01       Impact factor: 6.937

3.  CUBIC: identification of regulatory binding sites through data clustering.

Authors:  Victor Olman; Dong Xu; Ying Xu
Journal:  J Bioinform Comput Biol       Date:  2003-04       Impact factor: 1.122

4.  Open source clustering software.

Authors:  M J L de Hoon; S Imoto; J Nolan; S Miyano
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

5.  Parallel clustering algorithm for large data sets with applications in bioinformatics.

Authors:  Victor Olman; Fenglou Mao; Hongwei Wu; Ying Xu
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2009 Apr-Jun       Impact factor: 3.710

6.  Cell identity mediates the response of Arabidopsis roots to abiotic stress.

Authors:  José R Dinneny; Terri A Long; Jean Y Wang; Jee W Jung; Daniel Mace; Solomon Pointer; Christa Barron; Siobhan M Brady; John Schiefelbein; Philip N Benfey
Journal:  Science       Date:  2008-04-24       Impact factor: 47.728

7.  The Repeat Pattern Toolkit (RPT): analyzing the structure and evolution of the C. elegans genome.

Authors:  P Agarwal; D J States
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1994

8.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  Creatine improves health and survival of mice.

Authors:  A Bender; J Beckers; I Schneider; S M Hölter; T Haack; T Ruthsatz; D M Vogt-Weisenhorn; L Becker; J Genius; D Rujescu; M Irmler; T Mijalski; M Mader; L Quintanilla-Martinez; H Fuchs; V Gailus-Durner; M Hrabé de Angelis; W Wurst; J Schmidt; T Klopstock
Journal:  Neurobiol Aging       Date:  2007-04-09       Impact factor: 4.673

10.  NCBI GEO: mining tens of millions of expression profiles--database and tools update.

Authors:  Tanya Barrett; Dennis B Troup; Stephen E Wilhite; Pierre Ledoux; Dmitry Rudnev; Carlos Evangelista; Irene F Kim; Alexandra Soboleva; Maxim Tomashevsky; Ron Edgar
Journal:  Nucleic Acids Res       Date:  2006-11-11       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.