Literature DB >> 18397779

Modified fuzzy gap statistic for estimating preferable number of clusters in fuzzy k-means clustering.

Chinatsu Arima1, Kazumi Hakamada, Masahiro Okamoto, Taizo Hanai.   

Abstract

In clustering methods, the estimation of the optimal number of clusters is significant for subsequent analysis. Without detailed biological information on the genes involved, the evaluation of the number of clusters becomes difficult, and we have to rely on an internal measure that is based on the distribution of the data of the clustering result. The Gap statistic has been proposed as a superior method for estimating the number of clusters in crisp clustering. In this study, we proposed a modified Fuzzy Gap statistic (MFGS) and applied it to fuzzy k-means clustering. For estimating the number of clusters, fuzzy k-means clustering with the MFGS was applied to two artificial data sets with noise and to two experimentally observed gene expression data sets. For the artificial data sets, compared with other internal measures, the MFGS showed a higher performance in terms of robustness against noise for estimating the optimal number of clusters. Moreover, it could be used to estimate the optimal number of clusters in experimental data sets. It was confirmed that the proposed MFGS is a useful method for estimating the number of clusters for microarray data sets.

Mesh:

Year:  2008        PMID: 18397779     DOI: 10.1263/jbb.105.273

Source DB:  PubMed          Journal:  J Biosci Bioeng        ISSN: 1347-4421            Impact factor:   2.894


  2 in total

1.  PPINGUIN: Peptide Profiling Guided Identification of Proteins improves quantitation of iTRAQ ratios.

Authors:  Chris Bauer; Frank Kleinjung; Dorothea Rutishauser; Christian Panse; Alexandra Chadt; Tanja Dreja; Hadi Al-Hasani; Knut Reinert; Ralph Schlapbach; Johannes Schuchhardt
Journal:  BMC Bioinformatics       Date:  2012-02-16       Impact factor: 3.169

2.  Analysis of gene expression profiles of soft tissue sarcoma using a combination of knowledge-based filtering with integration of multiple statistics.

Authors:  Anna Takahashi; Robert Nakayama; Nanako Ishibashi; Ayano Doi; Risa Ichinohe; Yoriko Ikuyo; Teruyoshi Takahashi; Shigetaka Marui; Koji Yasuhara; Tetsuro Nakamura; Shintaro Sugita; Hiromi Sakamoto; Teruhiko Yoshida; Tadashi Hasegawa; Hiro Takahashi
Journal:  PLoS One       Date:  2014-09-04       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.