Literature DB >> 19497096

A polynomial time biclustering algorithm for finding approximate expression patterns in gene expression time series.

Sara C Madeira1, Arlindo L Oliveira.   

Abstract

BACKGROUND: The ability to monitor the change in expression patterns over time, and to observe the emergence of coherent temporal responses using gene expression time series, obtained from microarray experiments, is critical to advance our understanding of complex biological processes. In this context, biclustering algorithms have been recognized as an important tool for the discovery of local expression patterns, which are crucial to unravel potential regulatory mechanisms. Although most formulations of the biclustering problem are NP-hard, when working with time series expression data the interesting biclusters can be restricted to those with contiguous columns. This restriction leads to a tractable problem and enables the design of efficient biclustering algorithms able to identify all maximal contiguous column coherent biclusters.
METHODS: In this work, we propose e-CCC-Biclustering, a biclustering algorithm that finds and reports all maximal contiguous column coherent biclusters with approximate expression patterns in time polynomial in the size of the time series gene expression matrix. This polynomial time complexity is achieved by manipulating a discretized version of the original matrix using efficient string processing techniques. We also propose extensions to deal with missing values, discover anticorrelated and scaled expression patterns, and different ways to compute the errors allowed in the expression patterns. We propose a scoring criterion combining the statistical significance of expression patterns with a similarity measure between overlapping biclusters.
RESULTS: We present results in real data showing the effectiveness of e-CCC-Biclustering and its relevance in the discovery of regulatory modules describing the transcriptomic expression patterns occurring in Saccharomyces cerevisiae in response to heat stress. In particular, the results show the advantage of considering approximate patterns when compared to state of the art methods that require exact matching of gene expression time series. DISCUSSION: The identification of co-regulated genes, involved in specific biological processes, remains one of the main avenues open to researchers studying gene regulatory networks. The ability of the proposed methodology to efficiently identify sets of genes with similar expression patterns is shown to be instrumental in the discovery of relevant biological phenomena, leading to more convincing evidence of specific regulatory mechanisms. AVAILABILITY: A prototype implementation of the algorithm coded in Java together with the dataset and examples used in the paper is available in http://kdbio.inesc-id.pt/software/e-ccc-biclustering.

Entities:  

Year:  2009        PMID: 19497096      PMCID: PMC2709627          DOI: 10.1186/1748-7188-4-8

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  19 in total

1.  Extracting conserved gene expression motifs from gene expression data.

Authors:  T M Murali; Simon Kasif
Journal:  Pac Symp Biocomput       Date:  2003

2.  Discovering local structure in gene expression data: the order-preserving submatrix problem.

Authors:  Amir Ben-Dor; Benny Chor; Richard Karp; Zohar Yakhini
Journal:  J Comput Biol       Date:  2003       Impact factor: 1.479

Review 3.  Analyzing time series gene expression data.

Authors:  Ziv Bar-Joseph
Journal:  Bioinformatics       Date:  2004-05-06       Impact factor: 6.937

4.  Mining gene expression data for positive and negative co-regulated gene clusters.

Authors:  Liping Ji; Kian-Lee Tan
Journal:  Bioinformatics       Date:  2004-05-14       Impact factor: 6.937

5.  Gene expression module discovery using gibbs sampling.

Authors:  Chang-Jiun Wu; Yutao Fu; T M Murali; Simon Kasif
Journal:  Genome Inform       Date:  2004

6.  A systematic comparison and evaluation of biclustering methods for gene expression data.

Authors:  Amela Prelić; Stefan Bleuler; Philip Zimmermann; Anja Wille; Peter Bühlmann; Wilhelm Gruissem; Lars Hennig; Lothar Thiele; Eckart Zitzler
Journal:  Bioinformatics       Date:  2006-02-24       Impact factor: 6.937

7.  Gene Ontology friendly biclustering of expression profiles.

Authors:  Jinze Liu; Wei Wang; Jiong Yang
Journal:  Proc IEEE Comput Syst Bioinform Conf       Date:  2004

8.  Biclustering in gene expression data by tendency.

Authors:  Jinze Liu; Jiong Wang; Wei Wang
Journal:  Proc IEEE Comput Syst Bioinform Conf       Date:  2004

9.  Biclustering algorithms for biological data analysis: a survey.

Authors:  Sara C Madeira; Arlindo L Oliveira
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2004 Jan-Mar       Impact factor: 3.710

10.  Genomic expression programs in the response of yeast cells to environmental changes.

Authors:  A P Gasch; P T Spellman; C M Kao; O Carmel-Harel; M B Eisen; G Storz; D Botstein; P O Brown
Journal:  Mol Biol Cell       Date:  2000-12       Impact factor: 4.138

View more
  14 in total

1.  A comparative analysis of biclustering algorithms for gene expression data.

Authors:  Kemal Eren; Mehmet Deveci; Onur Küçüktunç; Ümit V Çatalyürek
Journal:  Brief Bioinform       Date:  2012-07-06       Impact factor: 11.622

2.  Reverse engineering dynamic temporal models of biological processes and their relationships.

Authors:  Naren Ramakrishnan; Satish Tadepalli; Layne T Watson; Richard F Helm; Marco Antoniotti; Bud Mishra
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-22       Impact factor: 11.205

Review 3.  It is time to apply biclustering: a comprehensive review of biclustering applications in biological and biomedical data.

Authors:  Juan Xie; Anjun Ma; Anne Fennell; Qin Ma; Jing Zhao
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

4.  Biclustering of gene expression data by correlation-based scatter search.

Authors:  Juan A Nepomuceno; Alicia Troncoso; Jesús S Aguilar-Ruiz
Journal:  BioData Min       Date:  2011-01-24       Impact factor: 2.522

5.  A bi-ordering approach to linking gene expression with clinical annotations in gastric cancer.

Authors:  Fan Shi; Christopher Leckie; Geoff MacIntyre; Izhak Haviv; Alex Boussioutas; Adam Kowalczyk
Journal:  BMC Bioinformatics       Date:  2010-09-23       Impact factor: 3.169

6.  Identifying modules of coexpressed transcript units and their organization of Saccharopolyspora erythraea from time series gene expression profiles.

Authors:  Xiao Chang; Shuai Liu; Yong-Tao Yu; Yi-Xue Li; Yuan-Yuan Li
Journal:  PLoS One       Date:  2010-08-12       Impact factor: 3.240

7.  FABIA: factor analysis for bicluster acquisition.

Authors:  Sepp Hochreiter; Ulrich Bodenhofer; Martin Heusel; Andreas Mayr; Andreas Mitterecker; Adetayo Kasim; Tatsiana Khamiakova; Suzy Van Sanden; Dan Lin; Willem Talloen; Luc Bijnens; Hinrich W H Göhlmann; Ziv Shkedy; Djork-Arné Clevert
Journal:  Bioinformatics       Date:  2010-04-23       Impact factor: 6.937

8.  Transcriptional signatures of regulatory and toxic responses to benzo-[a]-pyrene exposure.

Authors:  Jacob J Michaelson; Saskia Trump; Susanne Rudzok; Carolin Gräbsch; Danielle J Madureira; Franziska Dautel; Juliane Mai; Sabine Attinger; Kristin Schirmer; Martin von Bergen; Irina Lehmann; Andreas Beyer
Journal:  BMC Genomics       Date:  2011-10-13       Impact factor: 3.969

9.  BiGGEsTS: integrated environment for biclustering analysis of time series gene expression data.

Authors:  Joana P Gonçalves; Sara C Madeira; Arlindo L Oliveira
Journal:  BMC Res Notes       Date:  2009-07-07

10.  Maximization of negative correlations in time-course gene expression data for enhancing understanding of molecular pathways.

Authors:  Tao Zeng; Jinyan Li
Journal:  Nucleic Acids Res       Date:  2009-10-23       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.