An efficient voting algorithm for finding additive biclusters with random background.

Jing Xiao, Lusheng Wang, Xiaowen Liu, Tao Jiang

Abstract

The biclustering problem has been extensively studied in many areas, including e-commerce, data mining, machine learning, pattern recognition, statistics, and, more recently, computational biology. Given an $n \times m$ matrix $A$ ($n \geq m$), the main goal of biclustering is to identify a subset of rows (called objects) and a subset of columns (called properties) such that some objective function specifying the quality of the resulting bicluster (formed by the chosen rows and columns of $A$) is optimized. The problem has been proved or conjectured to be NP-hard for various objective functions. In this article, we study a probabilistic model for the implanted additive bicluster problem, where each element in the $n \times m$ background matrix is a random integer from $[0, L - 1]$ for some integer $L$, and a $k \times k$ implanted additive bicluster is obtained from an error-free additive bicluster by randomly changing each element to a number in $[0, L - 1]$ with probability $\theta$. We propose an $O(n^2 m)$ time algorithm based on voting to solve the problem. We show that when $k \geq \Omega(\sqrt{n \log n})$, the voting algorithm correctly finds the implanted bicluster with probability at least $1 - 9/n^2$. We also implement our algorithm as a C++ program named VOTE. The implementation incorporates several ideas for estimating the size of an implanted bicluster, adjusting the threshold in voting, dealing with small biclusters, and dealing with overlapping implanted biclusters. Our experimental results on both simulated and real datasets show that VOTE can find biclusters with high accuracy and speed.
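The abstract does not spell out the voting procedure, so the following C++ fragment is only a minimal sketch of one plausible voting step, not the VOTE program itself. It assumes that in an error-free additive bicluster two rows differ by the same constant across all bicluster columns, so each column can "vote" for the difference between a row and a fixed reference row. The names `Matrix`, `RowVote`, `voteAgainstReference`, `candidateRows`, and the `threshold` parameter are hypothetical, and all rows are assumed to have the same length.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

using Matrix = std::vector<std::vector<int>>;

// Outcome of one voting round for a single row (hypothetical helper type).
struct RowVote {
    int bestDiff;  // most frequent value of A[i][j] - A[ref][j] over columns j
    int votes;     // number of columns that produced bestDiff
};

// Every column j votes for the difference A[i][j] - A[ref][j].
// Assumption: entries lie in [0, L-1], so differences lie in [-(L-1), L-1].
RowVote voteAgainstReference(const Matrix& A, std::size_t ref,
                             std::size_t i, int L) {
    assert(ref < A.size() && i < A.size() && L > 0);
    std::vector<int> count(2 * L - 1, 0);  // one counter per possible difference
    RowVote best{0, 0};
    for (std::size_t j = 0; j < A[i].size(); ++j) {
        int d = A[i][j] - A[ref][j];
        int c = ++count[d + (L - 1)];      // shift so the index is non-negative
        if (c > best.votes) best = {d, c};
    }
    return best;
}

// Keep the rows whose winning difference collects at least `threshold` votes.
// The reference row always qualifies when threshold <= number of columns.
std::vector<std::size_t> candidateRows(const Matrix& A, std::size_t ref,
                                       int L, int threshold) {
    std::vector<std::size_t> rows;
    for (std::size_t i = 0; i < A.size(); ++i) {
        if (voteAgainstReference(A, ref, i, L).votes >= threshold) {
            rows.push_back(i);
        }
    }
    return rows;
}
```

Under these assumptions, one reference row costs $O(nm)$ time, and repeating the step for every possible reference row costs $O(n^2 m)$, which would be consistent with the bound quoted in the abstract. Rows in a purely random background rarely agree on one difference in many columns, which is the intuition behind using a vote threshold.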

Entities: Gene

Year: 2008; PMID: 19040364; PMCID: PMC3131804; DOI: 10.1089/cmb.2007.0219

Source DB: PubMed; Journal: J Comput Biol; ISSN: 1066-5277; Impact factor: 1.479
