Literature DB >> 17204155

FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data.

Limin Fu1, Enzo Medico.   

Abstract

BACKGROUND: Data clustering analysis has been extensively applied to extract information from gene expression profiles obtained with DNA microarrays. To this aim, existing clustering approaches, mainly developed in computer science, have been adapted to microarray data analysis. However, previous studies revealed that microarray datasets have very diverse structures, some of which may not be correctly captured by current clustering methods. We therefore approached the problem from a new starting point, and developed a clustering algorithm designed to capture dataset-specific structures at the beginning of the process.
RESULTS: The clustering algorithm is named Fuzzy clustering by Local Approximation of MEmbership (FLAME). Distinctive elements of FLAME are: (i) definition of the neighborhood of each object (gene or sample) and identification of objects with "archetypal" features named Cluster Supporting Objects, around which to construct the clusters; (ii) assignment to each object of a fuzzy membership vector approximated from the memberships of its neighboring objects, by an iterative converging process in which membership spreads from the Cluster Supporting Objects through their neighbors. Comparative analysis with K-means, hierarchical, fuzzy C-means and fuzzy self-organizing maps (SOM) showed that data partitions generated by FLAME are not superimposable to those of other methods and, although different types of datasets are better partitioned by different algorithms, FLAME displays the best overall performance. FLAME is implemented, together with all the above-mentioned algorithms, in a C++ software with graphical interface for Linux and Windows, capable of handling very large datasets, named Gene Expression Data Analysis Studio (GEDAS), freely available under GNU General Public License.
CONCLUSION: The FLAME algorithm has intrinsic advantages, such as the ability to capture non-linear relationships and non-globular clusters, the automated definition of the number of clusters, and the identification of cluster outliers, i.e. genes that are not assigned to any cluster. As a result, clusters are more internally homogeneous and more diverse from each other, and provide better partitioning of biological functions. The clustering algorithm can be easily extended to applications different from gene expression analysis.

Entities:  

Mesh:

Year:  2007        PMID: 17204155      PMCID: PMC1774579          DOI: 10.1186/1471-2105-8-3

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Nonlinear dimensionality reduction by locally linear embedding.

Authors:  S T Roweis; L K Saul
Journal:  Science       Date:  2000-12-22       Impact factor: 47.728

2.  An algorithm for clustering cDNA fingerprints.

Authors:  E Hartuv; A O Schmitt; J Lange; S Meier-Ewert; H Lehrach; R Shamir
Journal:  Genomics       Date:  2000-06-15       Impact factor: 5.736

3.  Model-based clustering and data transformations for gene expression data.

Authors:  K Y Yeung; C Fraley; A Murua; A E Raftery; W L Ruzzo
Journal:  Bioinformatics       Date:  2001-10       Impact factor: 6.937

4.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts.

Authors:  E P Xing; R M Karp
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

5.  Fuzzy C-means method for clustering microarray data.

Authors:  Doulaye Dembélé; Philippe Kastner
Journal:  Bioinformatics       Date:  2003-05-22       Impact factor: 6.937

6.  Biclustering microarray data by Gibbs sampling.

Authors:  Qizheng Sheng; Yves Moreau; Bart De Moor
Journal:  Bioinformatics       Date:  2003-10       Impact factor: 6.937

7.  Supervised cluster analysis for microarray data based on multivariate Gaussian mixture.

Authors:  Yi Qu; Shizhong Xu
Journal:  Bioinformatics       Date:  2004-03-25       Impact factor: 6.937

8.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

9.  Fuzzy J-Means and VNS methods for clustering genes from microarray data.

Authors:  Nabil Belacel; Miroslava Cuperlović-Culf; Mark Laflamme; Rodney Ouellette
Journal:  Bioinformatics       Date:  2004-02-26       Impact factor: 6.937

10.  Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering.

Authors:  Audrey P Gasch; Michael B Eisen
Journal:  Genome Biol       Date:  2002-10-10       Impact factor: 13.583

View more
  44 in total

1.  Comparing the performance of biomedical clustering methods.

Authors:  Christian Wiwie; Jan Baumbach; Richard Röttger
Journal:  Nat Methods       Date:  2015-09-21       Impact factor: 28.547

2.  The thioxotriazole copper(II) complex A0 induces endoplasmic reticulum stress and paraptotic death in human cancer cells.

Authors:  Saverio Tardito; Claudio Isella; Enzo Medico; Luciano Marchiò; Elena Bevilacqua; Maria Hatzoglou; Ovidio Bussolati; Renata Franchi-Gazzola
Journal:  J Biol Chem       Date:  2009-06-26       Impact factor: 5.157

3.  Time-variant clustering model for understanding cell fate decisions.

Authors:  Wei Huang; Xiaoyi Cao; Fernando H Biase; Pengfei Yu; Sheng Zhong
Journal:  Proc Natl Acad Sci U S A       Date:  2014-10-22       Impact factor: 11.205

4.  Identification of cell types from single-cell transcriptomes using a novel clustering method.

Authors:  Chen Xu; Zhengchang Su
Journal:  Bioinformatics       Date:  2015-02-11       Impact factor: 6.937

5.  A Comparison of Fuzzy Clustering Approaches for Quantification of Microarray Gene Expression.

Authors:  Yu-Ping Wang; Maheswar Gunampally; Jie Chen; Douglas Bittel; Merlin G Butler; Wei-Wen Cai
Journal:  J Signal Process Syst       Date:  2007-08-16

6.  FUMET: a fuzzy network module extraction technique for gene expression data.

Authors:  Priyakshi Mahanta; Hasin Afzal Ahmed; Dhruba Kumar Bhattacharyya; Ashish Ghosh
Journal:  J Biosci       Date:  2014-06       Impact factor: 1.826

7.  High-throughput molecular analysis from leftover of fine needle aspiration cytology of mammographically detected breast cancer.

Authors:  Laura Annaratone; Caterina Marchiò; Tommaso Renzulli; Isabella Castellano; Daniela Cantarella; Claudio Isella; Luigia Macrì; Giovanna Mariscotti; Davide Balmativola; Elisabetta Cantanna; Cristina Deambrogio; Francesca Pietribiasi; Riccardo Arisio; Fernando Schmitt; Enzo Medico; Anna Sapino
Journal:  Transl Oncol       Date:  2012-06-01       Impact factor: 4.243

8.  Gene expression profiling of HGF/Met activation in neonatal mouse heart.

Authors:  Stefano Gatti; Christian Leo; Simona Gallo; Valentina Sala; Enrico Bucci; Massimo Natale; Daniela Cantarella; Enzo Medico; Tiziana Crepaldi
Journal:  Transgenic Res       Date:  2012-12-06       Impact factor: 2.788

9.  Fuzzy c-means clustering with prior biological knowledge.

Authors:  Luis Tari; Chitta Baral; Seungchan Kim
Journal:  J Biomed Inform       Date:  2008-05-24       Impact factor: 6.317

10.  Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments.

Authors:  Magalie Celton; Alain Malpertuy; Gaëlle Lelandais; Alexandre G de Brevern
Journal:  BMC Genomics       Date:  2010-01-07       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.