Literature DB >> 17934055

Learning regulatory programs that accurately predict differential expression with MEDUSA.

Anshul Kundaje1, Steve Lianoglou, Xuejing Li, David Quigley, Marta Arias, Chris H Wiggins, Li Zhang, Christina Leslie.   

Abstract

Inferring gene regulatory networks from high-throughput genomic data is one of the central problems in computational biology. In this paper, we describe a predictive modeling approach for studying regulatory networks, based on a machine learning algorithm called MEDUSA. MEDUSA integrates promoter sequence, mRNA expression, and transcription factor occupancy data to learn gene regulatory programs that predict the differential expression of target genes. Instead of using clustering or correlation of expression profiles to infer regulatory relationships, MEDUSA determines condition-specific regulators and discovers regulatory motifs that mediate the regulation of target genes. In this way, MEDUSA meaningfully models biological mechanisms of transcriptional regulation. MEDUSA solves the problem of predicting the differential (up/down) expression of target genes by using boosting, a technique from statistical learning, which helps to avoid overfitting as the algorithm searches through the high-dimensional space of potential regulators and sequence motifs. Experimental results demonstrate that MEDUSA achieves high prediction accuracy on held-out experiments (test data), that is, data not seen in training. We also present context-specific analysis of MEDUSA regulatory programs for DNA damage and hypoxia, demonstrating that MEDUSA identifies key regulators and motifs in these processes. A central challenge in the field is the difficulty of validating reverse-engineered networks in the absence of a gold standard. Our approach of learning regulatory programs provides at least a partial solution for the problem: MEDUSA's prediction accuracy on held-out data gives a concrete and statistically sound way to validate how well the algorithm performs. With MEDUSA, statistical validation becomes a prerequisite for hypothesis generation and network building rather than a secondary consideration.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17934055     DOI: 10.1196/annals.1407.020

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


  8 in total

1.  FastMEDUSA: a parallelized tool to infer gene regulatory networks.

Authors:  Serdar Bozdag; Aiguo Li; Stefan Wuchty; Howard A Fine
Journal:  Bioinformatics       Date:  2010-05-30       Impact factor: 6.937

2.  Inference of cell type specific regulatory networks on mammalian lineages.

Authors:  Deborah Chasman; Sushmita Roy
Journal:  Curr Opin Syst Biol       Date:  2017-04-17

Review 3.  The recurrent architecture of tumour initiation, progression and drug sensitivity.

Authors:  Andrea Califano; Mariano J Alvarez
Journal:  Nat Rev Cancer       Date:  2016-12-15       Impact factor: 60.716

4.  G =  MAT: linking transcription factor expression and DNA binding data.

Authors:  Konstantin Tretyakov; Sven Laur; Jaak Vilo
Journal:  PLoS One       Date:  2011-01-31       Impact factor: 3.240

5.  Genomic analysis of immune response against Vibrio cholerae hemolysin in Caenorhabditis elegans.

Authors:  Surasri N Sahu; Jada Lewis; Isha Patel; Serdar Bozdag; Jeong H Lee; Joseph E LeClerc; Hediye Nese Cinar
Journal:  PLoS One       Date:  2012-05-31       Impact factor: 3.240

6.  Functional characterization of somatic mutations in cancer using network-based inference of protein activity.

Authors:  Mariano J Alvarez; Yao Shen; Federico M Giorgi; Alexander Lachmann; B Belinda Ding; B Hilda Ye; Andrea Califano
Journal:  Nat Genet       Date:  2016-06-20       Impact factor: 38.330

7.  Dynamic regulatory module networks for inference of cell type-specific transcriptional networks.

Authors:  Alireza Fotuhi Siahpirani; Sara Knaack; Deborah Chasman; Morten Seirup; Rupa Sridharan; Ron Stewart; James Thomson; Sushmita Roy
Journal:  Genome Res       Date:  2022-06-15       Impact factor: 9.438

8.  Development of a novel prediction method of cis-elements to hypothesize collaborative functions of cis-element pairs in iron-deficient rice.

Authors:  Yusuke Kakei; Yuko Ogo; Reiko N Itai; Takanori Kobayashi; Takashi Yamakawa; Hiromi Nakanishi; Naoko K Nishizawa
Journal:  Rice (N Y)       Date:  2013-09-22       Impact factor: 4.783

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.