| Literature DB >> 22697238 |
Qinghui Gao1, Christine Ho, Yingmin Jia, Jingyi Jessica Li, Haiyan Huang.
Abstract
Identifying a bicluster, or submatrix of a gene expression dataset wherein the genes express similar behavior over the columns, is useful for discovering novel functional gene interactions. In this article, we introduce a new algorithm for finding biClusters with Linear Patterns (CLiP). Instead of solely maximizing Pearson correlation, we introduce a fitness function that also considers the correlation of complementary genes and conditions. This eliminates the need for a priori determination of the bicluster size. We employ both greedy search and the genetic algorithm in optimization, incorporating resampling for more robust discovery. When applied to both real and simulation datasets, our results show that CLiP is superior to existing methods. In analyzing RNA-seq fly and worm time-course data from modENCODE, we uncover a set of similarly expressed genes suggesting maternal dependence. Supplementary Material is available online (at www.liebertonline.com/cmb).Mesh:
Year: 2012 PMID: 22697238 PMCID: PMC3375643 DOI: 10.1089/cmb.2012.0032
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479