Literature DB >> 17044183

Combining sequence and time series expression data to learn transcriptional modules.

Anshul Kundaje1, Manuel Middendorf, Feng Gao, Chris Wiggins, Christina Leslie.   

Abstract

Our goal is to cluster genes into transcriptional modules--sets of genes where similarity in expression is explained by common regulatory mechanisms at the transcriptional level. We want to learn modules from both time series gene expression data and genome-wide motif data that are now readily available for organisms such as S. cereviseae as a result of prior computational studies or experimental results. We present a generative probabilistic model for combining regulatory sequence and time series expression data to cluster genes into coherent transcriptional modules. Starting with a set of motifs representing known or putative regulatory elements (transcription factor binding sites) and the counts of occurrences of these motifs in each gene's promoter region, together with a time series expression profile for each gene, the learning algorithm uses expectation maximization to learn module assignments based on both types of data. We also present a technique based on the Jensen-Shannon entropy contributions of motifs in the learned model for associating the most significant motifs to each module. Thus, the algorithm gives a global approach for associating sets of regulatory elements to "modules" of genes with similar time series expression profiles. The model for expression data exploits our prior belief of smooth dependence on time by using statistical splines and is suitable for typical time course data sets with relatively few experiments. Moreover, the model is sufficiently interpretable that we can understand how both sequence data and expression data contribute to the cluster assignments, and how to interpolate between the two data sources. We present experimental results on the yeast cell cycle to validate our method and find that our combined expression and motif clustering algorithm discovers modules with both coherent expression and similar motif patterns, including binding motifs associated to known cell cycle transcription factors.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 17044183     DOI: 10.1109/TCBB.2005.34

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  8 in total

1.  Rapid temporal changes in the expression of a set of neuromodulatory genes during alcohol withdrawal in the dorsal vagal complex: molecular evidence of homeostatic disturbance.

Authors:  Kate Freeman; Mary M Staehle; Zeynep H Gümüş; Rajanikanth Vadigepalli; Gregory E Gonye; Carmen N Nichols; Babatunde A Ogunnaike; Jan B Hoek; James S Schwaber
Journal:  Alcohol Clin Exp Res       Date:  2012-04-06       Impact factor: 3.455

2.  Discovering transcriptional modules by Bayesian data integration.

Authors:  Richard S Savage; Zoubin Ghahramani; Jim E Griffin; Bernard J de la Cruz; David L Wild
Journal:  Bioinformatics       Date:  2010-06-15       Impact factor: 6.937

Review 3.  Computational methods for analyzing dynamic regulatory networks.

Authors:  Anthony Gitter; Yong Lu; Ziv Bar-Joseph
Journal:  Methods Mol Biol       Date:  2010

4.  Modeling regulatory cascades using Artificial Neural Networks: the case of transcriptional regulatory networks shaped during the yeast stress response.

Authors:  Maria E Manioudaki; Panayiota Poirazi
Journal:  Front Genet       Date:  2013-06-20       Impact factor: 4.599

5.  Motif-guided sparse decomposition of gene expression data for regulatory module identification.

Authors:  Ting Gong; Jianhua Xuan; Li Chen; Rebecca B Riggins; Huai Li; Eric P Hoffman; Robert Clarke; Yue Wang
Journal:  BMC Bioinformatics       Date:  2011-03-22       Impact factor: 3.169

6.  Patient-specific data fusion defines prognostic cancer subtypes.

Authors:  Yinyin Yuan; Richard S Savage; Florian Markowetz
Journal:  PLoS Comput Biol       Date:  2011-10-20       Impact factor: 4.475

7.  Reconstructing dynamic regulatory maps.

Authors:  Jason Ernst; Oded Vainas; Christopher T Harbison; Itamar Simon; Ziv Bar-Joseph
Journal:  Mol Syst Biol       Date:  2007-01-16       Impact factor: 11.429

8.  Regulatory Snapshots: integrative mining of regulatory modules from expression time series and regulatory networks.

Authors:  Joana P Gonçalves; Ricardo S Aires; Alexandre P Francisco; Sara C Madeira
Journal:  PLoS One       Date:  2012-05-01       Impact factor: 3.240

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.