Literature DB >> 17237046

Computing exact P-values for DNA motifs.

Jing Zhang1, Bo Jiang, Ming Li, John Tromp, Xuegong Zhang, Michael Q Zhang.   

Abstract

MOTIVATION: Many heuristic algorithms have been designed to approximate P-values of DNA motifs described by position weight matrices, for evaluating their statistical significance. They often significantly deviate from the true P-value by orders of magnitude. Exact P-value computation is needed for ranking the motifs. Furthermore, surprisingly, the complexity of the problem is unknown.
RESULTS: We show the problem to be NP-hard, and present MotifRank, software based on dynamic programming, to calculate exact P-values of motifs. We define the exact P-value on a general and more precise model. Asymptotically, MotifRank is faster than the best exact P-value computing algorithm, and is in fact practical. Our experiments clearly demonstrate that MotifRank significantly improves the accuracy of existing approximation algorithms. AVAILABILITY: MotifRank is available from http://bio.dlg.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Substances:

Year:  2007        PMID: 17237046     DOI: 10.1093/bioinformatics/btl662

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  The power of detecting enriched patterns: an HMM approach.

Authors:  Zhiyuan Zhai; Shih-Yen Ku; Yihui Luan; Gesine Reinert; Michael S Waterman; Fengzhu Sun
Journal:  J Comput Biol       Date:  2010-04       Impact factor: 1.479

2.  A new statistic for efficient detection of repetitive sequences.

Authors:  Sijie Chen; Yixin Chen; Fengzhu Sun; Michael S Waterman; Xuegong Zhang
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

3.  Compound poisson approximation of the number of occurrences of a position frequency matrix (PFM) on both strands.

Authors:  Utz J Pape; Sven Rahmann; Fengzhu Sun; Martin Vingron
Journal:  J Comput Biol       Date:  2008 Jul-Aug       Impact factor: 1.479

4.  Towards a theoretical understanding of false positives in DNA motif finding.

Authors:  Amin Zia; Alan M Moses
Journal:  BMC Bioinformatics       Date:  2012-06-27       Impact factor: 3.169

5.  P-value-based regulatory motif discovery using positional weight matrices.

Authors:  Holger Hartmann; Eckhart W Guthöhrlein; Matthias Siebert; Sebastian Luehr; Johannes Söding
Journal:  Genome Res       Date:  2012-09-18       Impact factor: 9.043

6.  FastPval: a fast and memory efficient program to calculate very low P-values from empirical distribution.

Authors:  Mulin Jun Li; Pak Chung Sham; Junwen Wang
Journal:  Bioinformatics       Date:  2010-09-21       Impact factor: 6.937

7.  Significant speedup of database searches with HMMs by search space reduction with PSSM family models.

Authors:  Michael Beckstette; Robert Homann; Robert Giegerich; Stefan Kurtz
Journal:  Bioinformatics       Date:  2009-10-14       Impact factor: 6.937

8.  Accurate recognition of cis-regulatory motifs with the correct lengths in prokaryotic genomes.

Authors:  Guojun Li; Bingqiang Liu; Ying Xu
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

9.  Efficient and accurate P-value computation for Position Weight Matrices.

Authors:  Hélène Touzet; Jean-Stéphane Varré
Journal:  Algorithms Mol Biol       Date:  2007-12-11       Impact factor: 1.405

10.  Exact p-value calculation for heterotypic clusters of regulatory motifs and its application in computational annotation of cis-regulatory modules.

Authors:  Valentina Boeva; Julien Clément; Mireille Régnier; Mikhail A Roytberg; Vsevolod J Makeev
Journal:  Algorithms Mol Biol       Date:  2007-10-10       Impact factor: 1.405

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.