A transdimensional Bayesian model for pattern recognition in DNA sequences.

Sierra M Li, Jon Wakefield, Steve Self.

Abstract

Identification of transcription factor binding sites (TFBSs) is essential to elucidate gene regulatory networks. This article is focused on the recognition of overrepresented short patterns, called "motifs", that may correspond to regulatory binding sites in the DNA sequences upstream of genes. An integrated Bayesian model is proposed to incorporate all unknown characteristics in motif discovery, including the number of motifs, motif widths, motif compositions, the number of motif sites, and locations of motif sites. Reversible jump Markov chain Monte Carlo is used to obtain posterior inference in the transdimensional parameter space. We present a number of suggestions for graphical summarization of the posterior distribution over the complex parameter space. The basic model is extended using a third-order Markov structure for nonmotif bases and allowing positions within a motif to be switched between 2 types: "conserved" and "degenerate." We evaluate the prediction accuracy for the simulated data with 3 motifs and apply the model to upstream sequences in high signal-to-noise regions in a human ChIP-chip study. The performance of the Bayesian model is assessed using yeast data sets of various numbers of sequences and background structures, with and without true TFBSs. The performance is also compared to other computational methods, including 2 statistical approaches, AlignACE and multiple expectation maximization for motif elicitation (MEME), and 1 word enumeration-based approach, yeast motif finder (YMF).
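
A third-order Markov structure for nonmotif bases means that each background base is conditioned on the three bases preceding it. The following is a minimal sketch of such a background model, assuming uppercase A/C/G/T input and simple additive smoothing. The function names and the pseudocount are illustrative assumptions, not taken from the article, which embeds the background in a full Bayesian model sampled by reversible jump Markov chain Monte Carlo.

```python
import math
from collections import defaultdict
from itertools import product

ALPHABET = "ACGT"
ORDER = 3  # third-order: each base depends on the previous three bases


def fit_background(sequences, pseudocount=1.0):
    """Estimate P(base | previous 3 bases) from counts with additive smoothing.

    Hypothetical helper; assumes sequences contain only uppercase A, C, G, T.
    """
    counts = defaultdict(lambda: defaultdict(float))
    for seq in sequences:
        for i in range(ORDER, len(seq)):
            counts[seq[i - ORDER:i]][seq[i]] += 1.0

    probs = {}
    for ctx in map("".join, product(ALPHABET, repeat=ORDER)):
        total = sum(counts[ctx][b] for b in ALPHABET) + pseudocount * len(ALPHABET)
        probs[ctx] = {b: (counts[ctx][b] + pseudocount) / total for b in ALPHABET}
    return probs


def background_log_likelihood(seq, probs):
    """Log-probability of seq[3:] given its first three bases."""
    return sum(
        math.log(probs[seq[i - ORDER:i]][seq[i]])
        for i in range(ORDER, len(seq))
    )


probs = fit_background(["ACGTACGTACGT"])
score = background_log_likelihood("ACGT", probs)
```

In a motif finder, a score like this would typically be compared against the likelihood of the same bases under a motif model; how the article combines the two is not specified in the abstract.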

Year:  2008        PMID: 18349034     DOI: 10.1093/biostatistics/kxm058

Source DB:  PubMed          Journal:  Biostatistics        ISSN: 1465-4644            Impact factor:   5.899


  2 in total

1.  Exhaustive search for over-represented DNA sequence motifs with CisFinder.

Authors:  Alexei A Sharov; Minoru S H Ko
Journal:  DNA Res       Date:  2009-09-09       Impact factor: 4.458

2.  Review of Different Sequence Motif Finding Algorithms. (Review)

Authors:  Fatma A Hashim; Mai S Mabrouk; Walid Al-Atabany
Journal:  Avicenna J Med Biotechnol       Date:  2019 Apr-Jun