Literature DB >> 18407922

Divisive Correlation Clustering Algorithm (DCCA) for grouping of genes: detecting varying patterns in expression profiles.

Anindya Bhattacharya1, Rajat K De.   

Abstract

MOTIVATION: Cluster analysis (of gene-expression data) is a useful tool for identifying biologically relevant groups of genes that show similar expression patterns under multiple experimental conditions. Various methods have been proposed for clustering gene-expression data. However most of these algorithms have several shortcomings for gene-expression data clustering. In the present article, we focus on several shortcomings of conventional clustering algorithms and propose a new one that is able to produce better clustering solution than that produced by some others.
RESULTS: We present the Divisive Correlation Clustering Algorithm (DCCA) that is suitable for finding a group of genes having similar pattern of variation in their expression values. To detect clusters with high correlation and biological significance, we use the correlation clustering concept introduced by Bansal et al. Our proposed algorithm DCCA produces a clustering solution without taking number of clusters to be created as an input. DCCA uses the correlation matrix in such a way that all genes in a cluster have highest average correlation with genes in that cluster. To test the performance of the DCCA, we have applied DCCA and some well-known conventional methods to an artificial dataset, and nine gene-expression datasets, and compared the performance of the algorithms. The clustering results of the DCCA are found to be more significantly relevant to the biological annotations than those of the other methods. All these facts show the superiority of the DCCA over some others for the clustering of gene-expression data. AVAILABILITY: The software has been developed using C and Visual Basic languages, and can be executed on the Microsoft Windows platforms. The software may be downloaded as a zip file from http://www.isical.ac.in/~rajat. Then it needs to be installed. Two word files (included in the zip file) need to be consulted before installation and execution of the software.

Entities:  

Mesh:

Year:  2008        PMID: 18407922     DOI: 10.1093/bioinformatics/btn133

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Strategies to explore functional genomics data sets in NCBI's GEO database.

Authors:  Stephen E Wilhite; Tanya Barrett
Journal:  Methods Mol Biol       Date:  2012

2.  A semi-supervised learning approach to predict synthetic genetic interactions by combining functional and topological properties of functional gene network.

Authors:  Zhu-Hong You; Zheng Yin; Kyungsook Han; De-Shuang Huang; Xiaobo Zhou
Journal:  BMC Bioinformatics       Date:  2010-06-24       Impact factor: 3.169

3.  A GPU-accelerated algorithm for biclustering analysis and detection of condition-dependent coexpression network modules.

Authors:  Anindya Bhattacharya; Yan Cui
Journal:  Sci Rep       Date:  2017-06-23       Impact factor: 4.379

4.  Functional clustering of time series gene expression data by Granger causality.

Authors:  André Fujita; Patricia Severino; Kaname Kojima; João Ricardo Sato; Alexandre Galvão Patriota; Satoru Miyano
Journal:  BMC Syst Biol       Date:  2012-10-30

5.  Functional analyses of NSF1 in wine yeast using interconnected correlation clustering and molecular analyses.

Authors:  Kyrylo Bessonov; Christopher J Walkey; Barry J Shelp; Hennie J J van Vuuren; David Chiu; George van der Merwe
Journal:  PLoS One       Date:  2013-10-09       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.