Literature DB >> 17068091

Adding sequence context to a Markov background model improves the identification of regulatory elements.

Nak-Kyeong Kim1, Kannan Tharakaraman, John L Spouge.   

Abstract

MOTIVATION: Many computational methods for identifying regulatory elements use a likelihood ratio between motif and background models. Often, the methods use a background model of independent bases. At least two different Markov background models have been proposed with the aim of increasing the accuracy of predicting regulatory elements. Both Markov background models suffer theoretical drawbacks, so this article develops a third, context-dependent Markov background model from fundamental statistical principles.
RESULTS: Datasets containing known regulatory elements in eukaryotes provided a basis for comparing the predictive accuracies of the different background models. Non-parametric statistical tests indicated that Markov models of order 3 constituted a statistically significant improvement over the background model of independent bases. Our model performed slightly better than the previous Markov background models. We also found that for discriminating between the predictive accuracies of competing background models, the correlation coefficient is a more sensitive measure than the performance coefficient. AVAILABILITY: Our C++ program is available at ftp://ftp.ncbi.nih.gov/pub/spouge/papers/archive/AGLAM/2006-07-19

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17068091     DOI: 10.1093/bioinformatics/btl528

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  Most of the tight positional conservation of transcription factor binding sites near the transcription start site reflects their co-localization within regulatory modules.

Authors:  Natalia Acevedo-Luna; Leonardo Mariño-Ramírez; Armand Halbert; Ulla Hansen; David Landsman; John L Spouge
Journal:  BMC Bioinformatics       Date:  2016-11-21       Impact factor: 3.169

2.  In silico identification of short nucleotide sequences associated with gene expression of pollen development in rice.

Authors:  Motohiro Mihara; Takeshi Itoh; Takeshi Izawa
Journal:  Plant Cell Physiol       Date:  2008-10-03       Impact factor: 4.927

3.  Finding sequence motifs with Bayesian models incorporating positional information: an application to transcription factor binding sites.

Authors:  Nak-Kyeong Kim; Kannan Tharakaraman; Leonardo Mariño-Ramírez; John L Spouge
Journal:  BMC Bioinformatics       Date:  2008-06-04       Impact factor: 3.169

4.  Discovering sequence motifs with arbitrary insertions and deletions.

Authors:  Martin C Frith; Neil F W Saunders; Bostjan Kobe; Timothy L Bailey
Journal:  PLoS Comput Biol       Date:  2008-05-09       Impact factor: 4.475

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.