Literature DB >> 24682815

Bi-Force: large-scale bicluster editing and its application to gene expression data biclustering.

Peng Sun1, Nora K Speicher2, Richard Röttger2, Jiong Guo3, Jan Baumbach4.   

Abstract

The explosion of the biological data has dramatically reformed today's biological research. The need to integrate and analyze high-dimensional biological data on a large scale is driving the development of novel bioinformatics approaches. Biclustering, also known as 'simultaneous clustering' or 'co-clustering', has been successfully utilized to discover local patterns in gene expression data and similar biomedical data types. Here, we contribute a new heuristic: 'Bi-Force'. It is based on the weighted bicluster editing model, to perform biclustering on arbitrary sets of biological entities, given any kind of pairwise similarities. We first evaluated the power of Bi-Force to solve dedicated bicluster editing problems by comparing Bi-Force with two existing algorithms in the BiCluE software package. We then followed a biclustering evaluation protocol in a recent review paper from Eren et al. (2013) (A comparative analysis of biclustering algorithms for gene expressiondata. Brief. Bioinform., 14:279-292.) and compared Bi-Force against eight existing tools: FABIA, QUBIC, Cheng and Church, Plaid, BiMax, Spectral, xMOTIFs and ISA. To this end, a suite of synthetic datasets as well as nine large gene expression datasets from Gene Expression Omnibus were analyzed. All resulting biclusters were subsequently investigated by Gene Ontology enrichment analysis to evaluate their biological relevance. The distinct theoretical foundation of Bi-Force (bicluster editing) is more powerful than strict biclustering. We thus outperformed existing tools with Bi-Force at least when following the evaluation protocols from Eren et al. Bi-Force is implemented in Java and integrated into the open source software package of BiCluE. The software as well as all used datasets are publicly available at http://biclue.mpi-inf.mpg.de.
© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Year:  2014        PMID: 24682815      PMCID: PMC5769343          DOI: 10.1093/nar/gku201

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  24 in total

1.  Biclustering of expression data.

Authors:  Y Cheng; G M Church
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  2000

2.  Extracting conserved gene expression motifs from gene expression data.

Authors:  T M Murali; Simon Kasif
Journal:  Pac Symp Biocomput       Date:  2003

3.  More powerful procedures for multiple significance testing.

Authors:  Y Hochberg; Y Benjamini
Journal:  Stat Med       Date:  1990-07       Impact factor: 2.373

4.  Biclustering of adverse drug events in the FDA's spontaneous reporting system.

Authors:  R Harpaz; H Perez; H S Chase; R Rabadan; G Hripcsak; C Friedman
Journal:  Clin Pharmacol Ther       Date:  2010-12-29       Impact factor: 6.875

5.  Biclustering algorithms for biological data analysis: a survey.

Authors:  Sara C Madeira; Arlindo L Oliveira
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2004 Jan-Mar       Impact factor: 3.710

6.  Integrated simultaneous analysis of different biomedical data types with exact weighted bi-cluster editing.

Authors:  Peng Sun; Jiong Guo; Jan Baumbach
Journal:  J Integr Bioinform       Date:  2012-07-17

7.  Iterative signature algorithm for the analysis of large-scale gene expression data.

Authors:  Sven Bergmann; Jan Ihmels; Naama Barkai
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2003-03-11

8.  Spectral biclustering of microarray data: coclustering genes and conditions.

Authors:  Yuval Kluger; Ronen Basri; Joseph T Chang; Mark Gerstein
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

9.  clusterMaker: a multi-algorithm clustering plugin for Cytoscape.

Authors:  John H Morris; Leonard Apeltsin; Aaron M Newman; Jan Baumbach; Tobias Wittkop; Gang Su; Gary D Bader; Thomas E Ferrin
Journal:  BMC Bioinformatics       Date:  2011-11-09       Impact factor: 3.307

10.  GenBank.

Authors:  Dennis A Benson; Mark Cavanaugh; Karen Clark; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; Eric W Sayers
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

View more
  2 in total

1.  Carotta: Revealing Hidden Confounder Markers in Metabolic Breath Profiles.

Authors:  Anne-Christin Hauschild; Tobias Frisch; Jörg Ingo Baumbach; Jan Baumbach
Journal:  Metabolites       Date:  2015-06-10

2.  Network-aided Bi-Clustering for discovering cancer subtypes.

Authors:  Guoxian Yu; Xianxue Yu; Jun Wang
Journal:  Sci Rep       Date:  2017-04-21       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.