Literature DB >> 21593133

An ensemble biclustering approach for querying gene expression compendia with experimental lists.

Riet De Smet1, Kathleen Marchal.   

Abstract

MOTIVATION: Query-based biclustering techniques allow interrogating a gene expression compendium with a given gene or gene list. They do so by searching for genes in the compendium that have a profile close to the average expression profile of the genes in this query-list. As it can often not be guaranteed that the genes in a long query-list will all be mutually coexpressed, it is advisable to use each gene separately as a query. This approach, however, leaves the user with a tedious post-processing of partially redundant biclustering results. The fact that for each query-gene multiple parameter settings need to be tested in order to detect the 'most optimal bicluster size' adds to the redundancy problem.
RESULTS: To aid with this post-processing, we developed an ensemble approach to be used in combination with query-based biclustering. The method relies on a specifically designed consensus matrix in which the biclustering outcomes for multiple query-genes and for different possible parameter settings are merged in a statistically robust way. Clustering of this matrix results in distinct, non-redundant consensus biclusters that maximally reflect the information contained within the original query-based biclustering results. The usefulness of the developed approach is illustrated on a biological case study in Escherichia coli.
AVAILABILITY AND IMPLEMENTATION: Compiled Matlab code is available from http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Information_DeSmet_2011/.

Entities:  

Mesh:

Year:  2011        PMID: 21593133     DOI: 10.1093/bioinformatics/btr307

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

Review 1.  It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data.

Authors:  Juan Xie; Anjun Ma; Anne Fennell; Qin Ma; Jing Zhao
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

2.  Coordinated Functional Divergence of Genes after Genome Duplication in Arabidopsis thaliana.

Authors:  Riet De Smet; Ehsan Sabaghian; Zhen Li; Yvan Saeys; Yves Van de Peer
Journal:  Plant Cell       Date:  2017-10-25       Impact factor: 11.277

3.  A probabilistic coevolutionary biclustering algorithm for discovering coherent patterns in gene expression dataset.

Authors:  Je-Gun Joung; Soo-Jin Kim; Soo-Yong Shin; Byoung-Tak Zhang
Journal:  BMC Bioinformatics       Date:  2012-12-13       Impact factor: 3.169

4.  COLOMBOS v2.0: an ever expanding collection of bacterial expression compendia.

Authors:  Pieter Meysman; Paolo Sonego; Luca Bianco; Qiang Fu; Daniela Ledezma-Tejeida; Socorro Gama-Castro; Veerle Liebens; Jan Michiels; Kris Laukens; Kathleen Marchal; Julio Collado-Vides; Kristof Engelen
Journal:  Nucleic Acids Res       Date:  2013-11-08       Impact factor: 16.971

5.  A loop-counting method for covariate-corrected low-rank biclustering of gene-expression and genome-wide association study data.

Authors:  Aaditya V Rangan; Caroline C McGrouther; John Kelsoe; Nicholas Schork; Eli Stahl; Qian Zhu; Arjun Krishnan; Vicky Yao; Olga Troyanskaya; Seda Bilaloglu; Preeti Raghavan; Sarah Bergen; Anders Jureus; Mikael Landen
Journal:  PLoS Comput Biol       Date:  2018-05-14       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.