Literature DB >> 15728117

A Gibbs sampler for identification of symmetrically structured, spaced DNA motifs with improved estimation of the signal length.

A V Favorov1, M S Gelfand, A V Gerasimova, D A Ravcheev, A A Mironov, V J Makeev.   

Abstract

MOTIVATION: Transcription regulatory protein factors often bind DNA as homo-dimers or hetero-dimers. Thus they recognize structured DNA motifs that are inverted or direct repeats or spaced motif pairs. However, these motifs are often difficult to identify owing to their high divergence. The motif structure included explicitly into the motif recognition algorithm improves recognition efficiency for highly divergent motifs as well as estimation of motif geometric parameters. RESULT: We present a modification of the Gibbs sampling motif extraction algorithm, SeSiMCMC (Sequence Similarities by Markov Chain Monte Carlo), which finds structured motifs of these types, as well as non-structured motifs, in a set of unaligned DNA sequences. It employs improved estimators of motif and spacer lengths. The probability that a sequence does not contain any motif is accounted for in a rigorous Bayesian manner. We have applied the algorithm to a set of upstream regions of genes from two Escherichia coli regulons involved in respiration. We have demonstrated that accounting for a symmetric motif structure allows the algorithm to identify weak motifs more accurately. In the examples studied, ArcA binding sites were demonstrated to have the structure of a direct spaced repeat, whereas NarP binding sites exhibited the palindromic structure. AVAILABILITY: The WWW interface of the program, its FreeBSD (4.0) and Windows 32 console executables are available at http://bioinform.genetika.ru/SeSiMCMC

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15728117     DOI: 10.1093/bioinformatics/bti336

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  33 in total

Review 1.  Comparative genomic reconstruction of transcriptional regulatory networks in bacteria.

Authors:  Dmitry A Rodionov
Journal:  Chem Rev       Date:  2007-07-18       Impact factor: 60.622

2.  Discovering gapped binding sites of yeast transcription factors.

Authors:  Chien-Yu Chen; Huai-Kuang Tsai; Chen-Ming Hsu; Mei-Ju May Chen; Hao-Geng Hung; Grace Tzu-Wei Huang; Wen-Hsiung Li
Journal:  Proc Natl Acad Sci U S A       Date:  2008-02-13       Impact factor: 11.205

3.  Tmod: toolbox of motif discovery.

Authors:  Hanchang Sun; Yuan Yuan; Yibo Wu; Hui Liu; Jun S Liu; Hongwei Xie
Journal:  Bioinformatics       Date:  2009-12-10       Impact factor: 6.937

4.  ArcA and AppY antagonize IscR repression of hydrogenase-1 expression under anaerobic conditions, revealing a novel mode of O2 regulation of gene expression in Escherichia coli.

Authors:  A D Nesbit; A S Fleischhacker; S J Teter; P J Kiley
Journal:  J Bacteriol       Date:  2012-10-12       Impact factor: 3.490

5.  Lack of conservation of bacterial type promoters in plastids of Streptophyta.

Authors:  Vassily A Lyubetsky; Lev I Rubanov; Alexandr V Seliverstov
Journal:  Biol Direct       Date:  2010-05-10       Impact factor: 4.540

6.  Promoter analysis: gene regulatory motif identification with A-GLAM.

Authors:  Leonardo Mariño-Ramírez; Kannan Tharakaraman; John L Spouge; David Landsman
Journal:  Methods Mol Biol       Date:  2009

7.  Anaerobic regulation of Shigella flexneri virulence: ArcA regulates Fur and iron acquisition genes.

Authors:  Megan L Boulette; Shelley M Payne
Journal:  J Bacteriol       Date:  2007-07-27       Impact factor: 3.490

8.  MotifAdjuster: a tool for computational reassessment of transcription factor binding site annotations.

Authors:  Jens Keilwagen; Jan Baumbach; Thomas A Kohl; Ivo Grosse
Journal:  Genome Biol       Date:  2009-05-01       Impact factor: 13.583

9.  Variable structure motifs for transcription factor binding sites.

Authors:  John E Reid; Kenneth J Evans; Nigel Dyer; Lorenz Wernisch; Sascha Ott
Journal:  BMC Genomics       Date:  2010-01-14       Impact factor: 3.969

10.  Intergenic, gene terminal, and intragenic CpG islands in the human genome.

Authors:  Yulia A Medvedeva; Marina V Fridman; Nina J Oparina; Dmitry B Malko; Ekaterina O Ermakova; Ivan V Kulakovskiy; Andreas Heinzel; Vsevolod J Makeev
Journal:  BMC Genomics       Date:  2010-01-19       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.