Literature DB >> 10977099

Mining for putative regulatory elements in the yeast genome using gene expression data.

J Vilo1, A Brazma, I Jonassen, A Robinson, E Ukkonen.   

Abstract

We have developed a set of methods and tools for automatic discovery of putative regulatory signals in genome sequences. The analysis pipeline consists of gene expression data clustering, sequence pattern discovery from upstream sequences of genes, a control experiment for pattern significance threshold limit detection, selection of interesting patterns, grouping of these patterns, representing the pattern groups in a concise form and evaluating the discovered putative signals against existing databases of regulatory signals. The pattern discovery is computationally the most expensive and crucial step. Our tool performs a rapid exhaustive search for a priori unknown statistically significant sequence patterns of unrestricted length. The statistical significance is determined for a set of sequences in each cluster with respect to a set of background sequences allowing the detection of subtle regulatory signals specific for each cluster. The potentially large number of significant patterns is reduced to a small number of groups by clustering them by mutual similarity. Automatically derived consensus patterns of these groups represent the results in a comprehensive way for a human investigator. We have performed a systematic analysis for the yeast Saccharomyces cerevisiae. We created a large number of independent clusterings of expression data simultaneously assessing the "goodness" of each cluster. For each of the over 52,000 clusters acquired in this way we discovered significant patterns in the upstream sequences of respective genes. We selected nearly 1,500 significant patterns by formal criteria and matched them against the experimentally mapped transcription factor binding sites in the SCPD database. We clustered the 1,500 patterns to 62 groups for which we derived automatically alignments and consensus patterns. Of these 62 groups 48 had patterns that have matching sites in SCPD database.

Entities:  

Mesh:

Year:  2000        PMID: 10977099

Source DB:  PubMed          Journal:  Proc Int Conf Intell Syst Mol Biol        ISSN: 1553-0833


  24 in total

1.  Discovering regulatory binding-site modules using rule-based learning.

Authors:  Torgeir R Hvidsten; Bartosz Wilczyński; Andriy Kryshtafovych; Jerzy Tiuryn; Jan Komorowski; Krzysztof Fidelis
Journal:  Genome Res       Date:  2005-06       Impact factor: 9.043

2.  Integrating regulatory motif discovery and genome-wide expression analysis.

Authors:  Erin M Conlon; X Shirley Liu; Jason D Lieb; Jun S Liu
Journal:  Proc Natl Acad Sci U S A       Date:  2003-03-07       Impact factor: 11.205

3.  Detection and preliminary analysis of motifs in promoters of anaerobically induced genes of different plant species.

Authors:  Bijayalaxmi Mohanty; S P T Krishnan; Sanjay Swarup; Vladimir B Bajic
Journal:  Ann Bot       Date:  2005-07-18       Impact factor: 4.357

4.  Discovering novel cis-regulatory motifs using functional networks.

Authors:  Laurence M Ettwiller; Johan Rung; Ewan Birney
Journal:  Genome Res       Date:  2003-05       Impact factor: 9.043

5.  Genes Induced by Reovirus Infection Have a Distinct Modular Cis-Regulatory Architecture.

Authors:  R Lapadat; R L Debiasi; G L Johnson; K L Tyler; I Shah
Journal:  Curr Genomics       Date:  2005       Impact factor: 2.236

6.  Relationships between probabilistic Boolean networks and dynamic Bayesian networks as models of gene regulatory networks.

Authors:  Harri Lähdesmäki; Sampsa Hautaniemi; Ilya Shmulevich; Olli Yli-Harja
Journal:  Signal Processing       Date:  2006-04       Impact factor: 4.662

7.  G =  MAT: linking transcription factor expression and DNA binding data.

Authors:  Konstantin Tretyakov; Sven Laur; Jaak Vilo
Journal:  PLoS One       Date:  2011-01-31       Impact factor: 3.240

8.  The word landscape of the non-coding segments of the Arabidopsis thaliana genome.

Authors:  Jens Lichtenberg; Alper Yilmaz; Joshua D Welch; Kyle Kurz; Xiaoyu Liang; Frank Drews; Klaus Ecker; Stephen S Lee; Matt Geisler; Erich Grotewold; Lonnie R Welch
Journal:  BMC Genomics       Date:  2009-10-08       Impact factor: 3.969

9.  Global transcriptional responses of fission yeast to environmental stress.

Authors:  Dongrong Chen; W Mark Toone; Juan Mata; Rachel Lyne; Gavin Burns; Katja Kivinen; Alvis Brazma; Nic Jones; Jürg Bähler
Journal:  Mol Biol Cell       Date:  2003-01       Impact factor: 4.138

10.  Searching for transcription factor binding sites in vector spaces.

Authors:  Chih Lee; Chun-Hsi Huang
Journal:  BMC Bioinformatics       Date:  2012-08-27       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.