Literature DB >> 19825796

Detailing regulatory networks through large scale data integration.

Curtis Huttenhower1, K Tsheko Mutungu, Natasha Indik, Woongcheol Yang, Mark Schroeder, Joshua J Forman, Olga G Troyanskaya, Hilary A Coller.   

Abstract

MOTIVATION: Much of a cell's regulatory response to changing environments occurs at the transcriptional level. Particularly in higher organisms, transcription factors (TFs), microRNAs and epigenetic modifications can combine to form a complex regulatory network. Part of this system can be modeled as a collection of regulatory modules: co-regulated genes, the conditions under which they are co-regulated and sequence-level regulatory motifs.
RESULTS: We present the Combinatorial Algorithm for Expression and Sequence-based Cluster Extraction (COALESCE) system for regulatory module prediction. The algorithm is efficient enough to discover expression biclusters and putative regulatory motifs in metazoan genomes (>20,000 genes) and very large microarray compendia (>10,000 conditions). Using Bayesian data integration, it can also include diverse supporting data types such as evolutionary conservation or nucleosome placement. We validate its performance using a functional evaluation of co-clustered genes, known yeast and Escherichea coli TF targets, synthetic data and various metazoan data compendia. In all cases, COALESCE performs as well or better than current biclustering and motif prediction tools, with high accuracy in functional and TF/target assignments and zero false positives on synthetic data. COALESCE provides an efficient and flexible platform within which large, diverse data collections can be integrated to predict metazoan regulatory networks. AVAILABILITY: Source code (C++) is available at http://function.princeton.edu/sleipnir, and supporting data and a web interface are provided at http://function.princeton.edu/coalesce. CONTACT: ogt@cs.princeton.edu; hcoller@princeton.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19825796      PMCID: PMC2788929          DOI: 10.1093/bioinformatics/btp588

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  38 in total

Review 1.  Genetic and biochemical diversity in the Pax gene family.

Authors:  D A Underhill
Journal:  Biochem Cell Biol       Date:  2000       Impact factor: 3.626

Review 2.  Core promoters: active contributors to combinatorial gene regulation.

Authors:  S T Smale
Journal:  Genes Dev       Date:  2001-10-01       Impact factor: 11.361

3.  Module networks: identifying regulatory modules and their condition-specific regulators from gene expression data.

Authors:  Eran Segal; Michael Shapira; Aviv Regev; Dana Pe'er; David Botstein; Daphne Koller; Nir Friedman
Journal:  Nat Genet       Date:  2003-06       Impact factor: 38.330

4.  Weeder Web: discovery of transcription factor binding sites in a set of sequences from co-regulated genes.

Authors:  Giulio Pavesi; Paolo Mereghetti; Giancarlo Mauri; Graziano Pesole
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

5.  Predicting gene expression from sequence.

Authors:  Michael A Beer; Saeed Tavazoie
Journal:  Cell       Date:  2004-04-16       Impact factor: 41.582

6.  Revealing modularity and organization in the yeast molecular network by integrated analysis of highly heterogeneous genomewide data.

Authors:  Amos Tanay; Roded Sharan; Martin Kupiec; Ron Shamir
Journal:  Proc Natl Acad Sci U S A       Date:  2004-02-18       Impact factor: 11.205

7.  Identification of thermosensory and olfactory neuron-specific genes via expression profiling of single neuron types.

Authors:  Marc E Colosimo; Adam Brown; Saikat Mukhopadhyay; Christopher Gabel; Anne E Lanjuin; Aravinthan D T Samuel; Piali Sengupta
Journal:  Curr Biol       Date:  2004-12-29       Impact factor: 10.834

8.  Sequencing and comparison of yeast species to identify genes and regulatory elements.

Authors:  Manolis Kellis; Nick Patterson; Matthew Endrizzi; Bruce Birren; Eric S Lander
Journal:  Nature       Date:  2003-05-15       Impact factor: 49.962

9.  DISTILLER: a data integration framework to reveal condition dependency of complex regulons in Escherichia coli.

Authors:  Karen Lemmens; Tijl De Bie; Thomas Dhollander; Sigrid C De Keersmaecker; Inge M Thijs; Geert Schoofs; Ami De Weerdt; Bart De Moor; Jos Vanderleyden; Julio Collado-Vides; Kristof Engelen; Kathleen Marchal
Journal:  Genome Biol       Date:  2009-03-06       Impact factor: 13.583

10.  Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation.

Authors:  F P Roth; J D Hughes; P W Estep; G M Church
Journal:  Nat Biotechnol       Date:  1998-10       Impact factor: 54.908

View more
  33 in total

1.  Integrating heterogeneous gene expression data for gene regulatory network modelling.

Authors:  Alina Sîrbu; Heather J Ruskin; Martin Crane
Journal:  Theory Biosci       Date:  2011-09-24       Impact factor: 1.919

2.  New meta-analysis tools reveal common transcriptional regulatory basis for multiple determinants of behavior.

Authors:  Seth A Ament; Charles A Blatti; Cedric Alaux; Marsha M Wheeler; Amy L Toth; Yves Le Conte; Greg J Hunt; Ernesto Guzmán-Novoa; Gloria Degrandi-Hoffman; Jose Luis Uribe-Rubio; Gro V Amdam; Robert E Page; Sandra L Rodriguez-Zas; Gene E Robinson; Saurabh Sinha
Journal:  Proc Natl Acad Sci U S A       Date:  2012-06-12       Impact factor: 11.205

3.  A comparative analysis of biclustering algorithms for gene expression data.

Authors:  Kemal Eren; Mehmet Deveci; Onur Küçüktunç; Ümit V Çatalyürek
Journal:  Brief Bioinform       Date:  2012-07-06       Impact factor: 11.622

Review 4.  Advantages and limitations of current network inference methods.

Authors:  Riet De Smet; Kathleen Marchal
Journal:  Nat Rev Microbiol       Date:  2010-08-31       Impact factor: 60.633

5.  cMonkey2: Automated, systematic, integrated detection of co-regulated gene modules for any organism.

Authors:  David J Reiss; Christopher L Plaisier; Wei-Ju Wu; Nitin S Baliga
Journal:  Nucleic Acids Res       Date:  2015-04-14       Impact factor: 16.971

6.  SeQuery: an interactive graph database for visualizing the GPCR superfamily.

Authors:  Geng-Ming Hu; M K Secario; Chi-Ming Chen
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

Review 7.  Principles and methods of integrative genomic analyses in cancer.

Authors:  Vessela N Kristensen; Ole Christian Lingjærde; Hege G Russnes; Hans Kristian M Vollan; Arnoldo Frigessi; Anne-Lise Børresen-Dale
Journal:  Nat Rev Cancer       Date:  2014-05       Impact factor: 60.716

8.  Regulatory network inferred using expression data of small sample size: application and validation in erythroid system.

Authors:  Fan Zhu; Lihong Shi; James Douglas Engel; Yuanfang Guan
Journal:  Bioinformatics       Date:  2015-04-02       Impact factor: 6.937

9.  A semi-parametric Bayesian model for unsupervised differential co-expression analysis.

Authors:  Johannes M Freudenberg; Siva Sivaganesan; Michael Wagner; Mario Medvedovic
Journal:  BMC Bioinformatics       Date:  2010-05-07       Impact factor: 3.169

10.  Multi-species integrative biclustering.

Authors:  Peter Waltman; Thadeous Kacmarczyk; Ashley R Bate; Daniel B Kearns; David J Reiss; Patrick Eichenberger; Richard Bonneau
Journal:  Genome Biol       Date:  2010-09-29       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.