
Motif discovery in tissue-specific regulatory sequences using directed information.

Arvind Rao, Alfred O Hero, David J States, James Douglas Engel.

Abstract

Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to the discovery of novel motifs (including transcription factor sites) with previously uncharacterized function in gene expression. Given the complexity underlying tissue-specific gene expression, several motifs are putatively responsible for expression in a given cell type. This has important implications for understanding fundamental biological processes such as development and disease progression. In this work, we present an approach to the identification of motifs (not necessarily transcription factor sites) and examine its application to some questions in current bioinformatics research. These motifs are seen to discriminate tissue-specific gene promoter or regulatory regions from those that are not tissue-specific. There are two main contributions of this work. First, we propose the use of directed information for such classification-constrained motif discovery, and then use the selected features with a support vector machine (SVM) classifier to find the tissue specificity of any sequence of interest. Such analysis yields several novel, interesting motifs that merit further experimental characterization. Second, this approach leads to a principled framework for the prospective examination of any chosen motif as a discriminatory motif for a group of coexpressed/coregulated genes, thereby integrating sequence and expression perspectives. We hypothesize that the discovery of these motifs would enable the large-scale investigation of the tissue-specific regulatory role of any conserved sequence element identified from genome-wide studies.
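The feature-selection idea in the abstract can be illustrated with a small sketch. It is not the authors' method: plain mutual information between k-mer presence and the class label stands in for directed information, the SVM step is left out, and the choice of hexamers (k = 6), the function names, and the toy sequences are all assumptions made for illustration.

```python
from collections import Counter
from math import log2


def kmer_counts(seq, k=6):
    """Count overlapping k-mers in a DNA sequence, skipping non-ACGT windows."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= set("ACGT"):
            counts[kmer] += 1
    return counts


def mutual_information(present, labels):
    """Mutual information (bits) between a binary feature and a binary label.

    Assumption: this is a simple stand-in for the directed-information
    score named in the abstract, not an implementation of it.
    """
    n = len(labels)
    joint = Counter(zip(present, labels))
    px = Counter(present)
    py = Counter(labels)
    mi = 0.0
    for (x, y), c in joint.items():
        pxy = c / n
        mi += pxy * log2(pxy / ((px[x] / n) * (py[y] / n)))
    return mi


def rank_kmers(sequences, labels, k=6):
    """Rank every observed k-mer by how well its presence separates the classes."""
    counts = [kmer_counts(s, k) for s in sequences]
    vocab = set().union(*counts)
    scores = {}
    for kmer in vocab:
        present = [int(c[kmer] > 0) for c in counts]
        scores[kmer] = mutual_information(present, labels)
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical toy data: label 1 = tissue-specific, label 0 = background.
tissue_specific = ["TTGATAAGCC", "ACGATAAGTA", "GGGATAAGAT"]
background = ["TTCCGGAACC", "ACGTTTGCTA", "GGCACACGAT"]
sequences = tissue_specific + background
labels = [1, 1, 1, 0, 0, 0]

ranked = rank_kmers(sequences, labels)
print(ranked[:3])
```

In this toy set the hexamer shared by all three tissue-specific sequences and absent from the background ranks first. In a fuller pipeline, the top-ranked k-mers would become the feature vector passed to a classifier such as an SVM.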

Year:  2007        PMID: 18340376      PMCID: PMC3171326          DOI: 10.1155/2007/13853

Source DB:  PubMed          Journal:  EURASIP J Bioinform Syst Biol        ISSN: 1687-4145


