Literature DB >> 18595779

Fuzzy c-means clustering with prior biological knowledge.

Luis Tari1, Chitta Baral, Seungchan Kim.   

Abstract

We propose a novel semi-supervised clustering method called GO Fuzzy c-means, which enables the simultaneous use of biological knowledge and gene expression data in a probabilistic clustering algorithm. Our method is based on the fuzzy c-means clustering algorithm and utilizes the Gene Ontology annotations as prior knowledge to guide the process of grouping functionally related genes. Unlike traditional clustering methods, our method is capable of assigning genes to multiple clusters, which is a more appropriate representation of the behavior of genes. Two datasets of yeast (Saccharomyces cerevisiae) expression profiles were applied to compare our method with other state-of-the-art clustering methods. Our experiments show that our method can produce far better biologically meaningful clusters even with the use of a small percentage of Gene Ontology annotations. In addition, our experiments further indicate that the utilization of prior knowledge in our method can predict gene functions effectively. The source code is freely available at http://sysbio.fulton.asu.edu/gofuzzy/.

Entities:  

Mesh:

Year:  2008        PMID: 18595779      PMCID: PMC2673503          DOI: 10.1016/j.jbi.2008.05.009

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  32 in total

1.  A protein complex containing Mei5 and Sae3 promotes the assembly of the meiosis-specific RecA homolog Dmc1.

Authors:  Atsuko Hayase; Misato Takagi; Toshiko Miyazaki; Hiroyuki Oshiumi; Miki Shinohara; Akira Shinohara
Journal:  Cell       Date:  2004-12-29       Impact factor: 41.582

2.  Knowledge guided analysis of microarray data.

Authors:  Zhuo Fang; Jiong Yang; Yixue Li; Qingming Luo; Lei Liu
Journal:  J Biomed Inform       Date:  2005-09-15       Impact factor: 6.317

3.  Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data.

Authors:  Desheng Huang; Wei Pan
Journal:  Bioinformatics       Date:  2006-02-24       Impact factor: 6.937

4.  Incorporating gene functions as priors in model-based clustering of microarray gene expression data.

Authors:  Wei Pan
Journal:  Bioinformatics       Date:  2006-01-24       Impact factor: 6.937

5.  Systematic identification and functional screens of uncharacterized proteins associated with eukaryotic ribosomal complexes.

Authors:  Tracey C Fleischer; Connie M Weaver; K Jill McAfee; Jennifer L Jennings; Andrew J Link
Journal:  Genes Dev       Date:  2006-05-15       Impact factor: 11.361

6.  Combining gene annotations and gene expression data in model-based clustering: weighted method.

Authors:  Desheng Huang; Peng Wei; Wei Pan
Journal:  OMICS       Date:  2006

7.  FLAME, a novel fuzzy clustering method for the analysis of DNA microarray data.

Authors:  Limin Fu; Enzo Medico
Journal:  BMC Bioinformatics       Date:  2007-01-04       Impact factor: 3.169

8.  The novel ATP-binding cassette protein ARB1 is a shuttling factor that stimulates 40S and 60S ribosome biogenesis.

Authors:  Jinsheng Dong; Ruby Lai; Jennifer L Jennings; Andrew J Link; Alan G Hinnebusch
Journal:  Mol Cell Biol       Date:  2005-11       Impact factor: 4.272

9.  The budding yeast mei5 and sae3 proteins act together with dmc1 during meiotic recombination.

Authors:  Hideo Tsubouchi; G Shirleen Roeder
Journal:  Genetics       Date:  2004-11       Impact factor: 4.562

10.  A functional network involved in the recycling of nucleocytoplasmic pre-60S factors.

Authors:  Alice Lebreton; Cosmin Saveanu; Laurence Decourty; Jean-Christophe Rain; Alain Jacquier; Micheline Fromont-Racine
Journal:  J Cell Biol       Date:  2006-05-01       Impact factor: 10.539

View more
  12 in total

1.  Integrative clustering methods for multi-omics data.

Authors:  Xiaoyu Zhang; Zhenwei Zhou; Hanfei Xu; Ching-Ti Liu
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2021-02-07

Review 2.  Functional annotations for the Saccharomyces cerevisiae genome: the knowns and the known unknowns.

Authors:  Karen R Christie; Eurie L Hong; J Michael Cherry
Journal:  Trends Microbiol       Date:  2009-07-02       Impact factor: 17.079

3.  Semi-supervised clustering methods.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013

4.  Integrative clustering methods for high-dimensional molecular data.

Authors:  Prabhakar Chalise; Devin C Koestler; Milan Bimali; Qing Yu; Brooke L Fridley
Journal:  Transl Cancer Res       Date:  2014-06-01       Impact factor: 1.241

5.  An unsupervised machine learning method for discovering patient clusters based on genetic signatures.

Authors:  Christian Lopez; Scott Tucker; Tarik Salameh; Conrad Tucker
Journal:  J Biomed Inform       Date:  2018-07-29       Impact factor: 6.317

6.  Multiconstrained gene clustering based on generalized projections.

Authors:  Jia Zeng; Shanfeng Zhu; Alan Wee-Chung Liew; Hong Yan
Journal:  BMC Bioinformatics       Date:  2010-03-31       Impact factor: 3.169

7.  Semi-supervised consensus clustering for gene expression data analysis.

Authors:  Yunli Wang; Youlian Pan
Journal:  BioData Min       Date:  2014-05-08       Impact factor: 2.522

8.  Meta-analysis of cell- specific transcriptomic data using fuzzy c-means clustering discovers versatile viral responsive genes.

Authors:  Atif Khan; Dejan Katanic; Juilee Thakar
Journal:  BMC Bioinformatics       Date:  2017-06-06       Impact factor: 3.169

9.  Knowledge-based variable selection for learning rules from proteomic data.

Authors:  Jonathan L Lustgarten; Shyam Visweswaran; Robert P Bowser; William R Hogan; Vanathi Gopalakrishnan
Journal:  BMC Bioinformatics       Date:  2009-09-17       Impact factor: 3.169

10.  Semi-Supervised Fuzzy Clustering with Feature Discrimination.

Authors:  Longlong Li; Jonathan M Garibaldi; Dongjian He; Meili Wang
Journal:  PLoS One       Date:  2015-09-01       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.