Literature DB >> 16500932

Incorporating biological knowledge into distance-based clustering analysis of microarray gene expression data.

Desheng Huang1, Wei Pan.   

Abstract

MOTIVATION: Because co-expressed genes are likely to share the same biological function, cluster analysis of gene expression profiles has been applied for gene function discovery. Most existing clustering methods ignore known gene functions in the process of clustering.
RESULTS: To take advantage of accumulating gene functional annotations, we propose incorporating known gene functions into a new distance metric, which shrinks a gene expression-based distance towards 0 if and only if the two genes share a common gene function. A two-step procedure is used. First, the shrinkage distance metric is used in any distance-based clustering method, e.g. K-medoids or hierarchical clustering, to cluster the genes with known functions. Second, while keeping the clustering results from the first step for the genes with known functions, the expression-based distance metric is used to cluster the remaining genes of unknown function, assigning each of them to either one of the clusters obtained in the first step or some new clusters. A simulation study and an application to gene function prediction for the yeast demonstrate the advantage of our proposal over the standard method.

Entities:  

Mesh:

Year:  2006        PMID: 16500932     DOI: 10.1093/bioinformatics/btl065

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  20 in total

1.  Analysis of time-series gene expression data: methods, challenges, and opportunities.

Authors:  I P Androulakis; E Yang; R R Almon
Journal:  Annu Rev Biomed Eng       Date:  2007       Impact factor: 9.590

2.  Penalized model-based clustering with cluster-specific diagonal covariance matrices and grouped variables.

Authors:  Benhuai Xie; Wei Pan; Xiaotong Shen
Journal:  Electron J Stat       Date:  2008       Impact factor: 1.125

3.  A Network Approach to Wound Healing.

Authors:  Tomasz Arodz; Danail Bonchev; Robert F Diegelmann
Journal:  Adv Wound Care (New Rochelle)       Date:  2013-11       Impact factor: 4.730

4.  Clustering of High Throughput Gene Expression Data.

Authors:  Harun Pirim; Burak Ekşioğlu; Andy Perkins; Cetin Yüceer
Journal:  Comput Oper Res       Date:  2012-12       Impact factor: 4.008

5.  Seeing the forest for the trees: using the Gene Ontology to restructure hierarchical clustering.

Authors:  Dikla Dotan-Cohen; Simon Kasif; Avraham A Melkman
Journal:  Bioinformatics       Date:  2009-06-03       Impact factor: 6.937

6.  Fuzzy c-means clustering with prior biological knowledge.

Authors:  Luis Tari; Chitta Baral; Seungchan Kim
Journal:  J Biomed Inform       Date:  2008-05-24       Impact factor: 6.317

7.  Semi-supervised clustering methods.

Authors:  Eric Bair
Journal:  Wiley Interdiscip Rev Comput Stat       Date:  2013

8.  Multiconstrained gene clustering based on generalized projections.

Authors:  Jia Zeng; Shanfeng Zhu; Alan Wee-Chung Liew; Hong Yan
Journal:  BMC Bioinformatics       Date:  2010-03-31       Impact factor: 3.169

9.  CLEAN: CLustering Enrichment ANalysis.

Authors:  Johannes M Freudenberg; Vineet K Joshi; Zhen Hu; Mario Medvedovic
Journal:  BMC Bioinformatics       Date:  2009-07-29       Impact factor: 3.169

10.  Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.

Authors:  Peng Guan; Desheng Huang; Miao He; Baosen Zhou
Journal:  J Exp Clin Cancer Res       Date:  2009-07-18
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.