Defining transcription modules using large-scale gene expression data.

Jan Ihmels, Sven Bergmann, Naama Barkai.

Abstract

MOTIVATION: Large-scale gene expression data comprising a variety of cellular conditions hold the promise of a global view of the transcription program. While conventional clustering algorithms have been successfully applied to smaller datasets, the utility of many algorithms for the analysis of large-scale data is limited by their inability to capture combinatorial and condition-specific co-regulation. In addition, there is an increasing need to integrate the rapidly accumulating body of other high-throughput biological data with the expression analysis. In previous work, we introduced the signature algorithm, which overcomes the problems of conventional clustering and allows for intuitive integration of additional biological data. However, this approach is constrained by the comprehensiveness of the relevant external data and by its limited ability to capture hierarchical modularity.
METHODS: We present a novel method for the analysis of large-scale expression data, which assigns genes to context-dependent and potentially overlapping regulatory units. We introduce the notion of a transcription module as a self-consistent regulatory unit consisting of a set of co-regulated genes as well as the experimental conditions that induce their co-regulation. Self-consistency is defined by a rigorous mathematical criterion. We propose an efficient algorithm to identify such modules, which is based on the iterative application of the signature algorithm. A threshold parameter that determines the resolution of the modular decomposition is introduced.
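The iteration described in METHODS can be illustrated with a minimal sketch. The abstract does not give the scoring or normalization details, so everything below is an assumption made for illustration: the function names `zscore` and `find_module`, the z-score standardization, the two thresholds `t_gene` and `t_cond`, and the stopping rule (the gene set reproduces itself, taken here as a stand-in for self-consistency). It is not the authors' implementation.

```python
import numpy as np


def zscore(x):
    """Standardize a score vector; a constant vector maps to all zeros."""
    sd = x.std()
    return (x - x.mean()) / sd if sd > 0 else np.zeros_like(x, dtype=float)


def find_module(expr, seed_genes, t_gene=2.0, t_cond=1.0, max_iter=50):
    """Alternate gene set -> condition set -> gene set until the gene set repeats.

    expr:       genes x conditions matrix (assumed already normalized).
    seed_genes: boolean mask over genes used as the starting set.
    t_gene:     hypothetical gene-score threshold (plays the role of a resolution knob).
    t_cond:     hypothetical condition-score threshold.
    Returns (gene_mask, condition_mask, converged).
    """
    expr = np.asarray(expr, dtype=float)
    genes = np.asarray(seed_genes, dtype=bool).copy()
    for _ in range(max_iter):
        if not genes.any():
            # The set collapsed: no module found from this seed at this threshold.
            return genes, np.zeros(expr.shape[1], dtype=bool), False
        # Score each condition by the mean expression of the current genes.
        cond_scores = zscore(expr[genes].mean(axis=0))
        conds = np.abs(cond_scores) > t_cond
        # Score each gene by its expression over the selected conditions,
        # weighted by the (signed) condition scores.
        weights = np.where(conds, cond_scores, 0.0)
        gene_scores = zscore(expr @ weights)
        new_genes = gene_scores > t_gene
        if np.array_equal(new_genes, genes):
            return genes, conds, True  # fixed point: the set reproduces itself
        genes = new_genes
    return genes, conds, False


if __name__ == "__main__":
    # Synthetic data with one planted block of co-expressed genes (illustration only).
    rng = np.random.default_rng(0)
    expr = rng.normal(0.0, 0.3, size=(200, 40))
    expr[:20, :10] += 4.0
    seed = np.zeros(200, dtype=bool)
    seed[:5] = True        # a few genes from the planted block
    seed[100:105] = True   # plus a few unrelated genes
    genes, conds, converged = find_module(expr, seed)
    print(np.flatnonzero(genes), np.flatnonzero(conds), converged)
```

Because each run starts from its own seed and keeps only the genes and conditions that pass the thresholds, different seeds can converge to modules that share genes, which is how a scheme of this kind can produce overlapping units. Raising or lowering `t_gene` in this sketch changes how strict membership is.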
RESULTS: The method is applied systematically to over 1000 expression profiles of the yeast Saccharomyces cerevisiae, and the results are presented using two complementary visualization schemes we developed. The average biological coherence, as measured by the conservation of putative cis-regulatory motifs between four related yeast species, is higher for transcription modules than for clusters identified by other methods applied to the same dataset. Our method is related to singular value decomposition (SVD) and to the pairwise average linkage clustering algorithm. It extends SVD by filtering out noise in the expression data and offering variable resolution to reveal hierarchical organization. It furthermore has the advantage over both methods of capturing overlapping modules in the presence of combinatorial regulation.
SUPPLEMENTARY INFORMATION: http://www.weizmann.ac.il/~barkai/modules
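One way to read the stated relation to SVD is sketched below. It assumes that gene and condition scores are updated by alternately multiplying with the expression matrix and its transpose, which the abstract does not spell out. Under that assumption, dropping the thresholds (and any centering) turns the alternation into plain power iteration, which converges to the leading pair of singular vectors. The function name `alternate_without_threshold` is hypothetical.

```python
import numpy as np


def alternate_without_threshold(expr, n_iter=200, seed=0):
    """Alternate gene scores <-> condition scores with no thresholding.

    This is ordinary power iteration on expr @ expr.T, so the result
    approaches the leading left/right singular vectors of expr.
    """
    expr = np.asarray(expr, dtype=float)
    rng = np.random.default_rng(seed)
    g = rng.normal(size=expr.shape[0])
    g /= np.linalg.norm(g)
    for _ in range(n_iter):
        c = expr.T @ g            # condition scores from gene scores
        c /= np.linalg.norm(c)
        g = expr @ c              # gene scores from condition scores
        g /= np.linalg.norm(g)
    return g, c


if __name__ == "__main__":
    # Synthetic data with one dominant block (illustration only).
    rng = np.random.default_rng(1)
    expr = rng.normal(0.0, 0.3, size=(200, 40))
    expr[:20, :10] += 4.0
    g, c = alternate_without_threshold(expr)
    u, s, vt = np.linalg.svd(expr, full_matrices=False)
    # Singular vectors are defined up to sign, so compare absolute overlaps.
    print(abs(g @ u[:, 0]), abs(c @ vt[0]))
```

In this reading, a thresholding step is what would depart from SVD: it discards low-scoring genes and conditions at every pass instead of keeping a dense score vector over all of them.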

Year:  2004        PMID: 15044247     DOI: 10.1093/bioinformatics/bth166

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


