Compound Poisson and Poisson process approximations for occurrences of multiple words in Markov chains.

G Reinert, S Schbath.

Abstract

We derive a Poisson process approximation for the occurrences of clumps of multiple words and a compound Poisson process approximation for the number of occurrences of multiple words in a sequence of letters generated by a stationary Markov chain. Using the Chen-Stein method, we provide a bound on the error in the approximations. For rare words, these errors tend to zero as the length of the sequence increases to infinity. Modeling a DNA sequence as a stationary Markov chain, we show as an application that the compound Poisson approximation is efficient for the number of occurrences of rare stem-loop motifs.
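The distinction the abstract draws between occurrences and clumps can be illustrated with a minimal sketch. It is not the authors' method and computes no Chen-Stein bound. It assumes a hypothetical first-order transition matrix `P` over the alphabet ACGT, a family of equal-length words, and the usual reading of a clump as a maximal run of overlapping occurrences. All function names are illustrative.

```python
import random

ALPHABET = "ACGT"

# Hypothetical transition matrix (each row sums to 1); not taken from the paper.
P = [
    [0.4, 0.2, 0.2, 0.2],
    [0.3, 0.3, 0.2, 0.2],
    [0.2, 0.3, 0.3, 0.2],
    [0.2, 0.2, 0.2, 0.4],
]

def stationary_distribution(P, iters=500):
    """Approximate the stationary distribution of P by power iteration."""
    n = len(P)
    pi = [1.0 / n] * n
    for _ in range(iters):
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return pi

def simulate_chain(P, length, rng):
    """Draw a sequence of `length` >= 1 letters, started from stationarity."""
    states = list(range(len(P)))
    state = rng.choices(states, weights=stationary_distribution(P))[0]
    letters = [ALPHABET[state]]
    for _ in range(length - 1):
        state = rng.choices(states, weights=P[state])[0]
        letters.append(ALPHABET[state])
    return "".join(letters)

def occurrence_positions(seq, words):
    """Start positions of all (possibly overlapping) occurrences of any word.

    Assumes every word in `words` has the same length.
    """
    w = len(next(iter(words)))
    return [i for i in range(len(seq) - w + 1) if seq[i:i + w] in words]

def clump_sizes(positions, word_len):
    """Group sorted occurrence positions into clumps and return each clump's size.

    Two consecutive occurrences belong to the same clump when they overlap,
    i.e. when their start positions differ by less than the word length.
    """
    sizes = []
    last = None
    for p in positions:
        if last is not None and p - last < word_len:
            sizes[-1] += 1
        else:
            sizes.append(1)
        last = p
    return sizes

if __name__ == "__main__":
    rng = random.Random(0)
    seq = simulate_chain(P, 10_000, rng)
    words = {"AAAAAA", "TTTTTT"}  # illustrative word family
    pos = occurrence_positions(seq, words)
    sizes = clump_sizes(pos, 6)
    print("occurrences:", len(pos), "clumps:", len(sizes))
```

In this sketch the total number of occurrences equals the sum of the clump sizes. That decomposition is what motivates a Poisson approximation for the number of clumps and a compound Poisson approximation for the number of occurrences.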

Year:  1998        PMID: 9672830     DOI: 10.1089/cmb.1998.5.223

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  10 in total

1.  Distributional regimes for the number of k-word matches between two random sequences.

Authors:  Ross A Lippert; Haiyan Huang; Michael S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-10-08       Impact factor: 11.205

2.  SPA: Simple web tool to assess statistical significance of DNA patterns.

Authors:  H Richard; G Nuel
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

3.  Normal and compound Poisson approximations for pattern occurrences in NGS reads.

Authors:  Zhiyuan Zhai; Gesine Reinert; Kai Song; Michael S Waterman; Yihui Luan; Fengzhu Sun
Journal:  J Comput Biol       Date:  2012-06       Impact factor: 1.479

4.  The power of detecting enriched patterns: an HMM approach.

Authors:  Zhiyuan Zhai; Shih-Yen Ku; Yihui Luan; Gesine Reinert; Michael S Waterman; Fengzhu Sun
Journal:  J Comput Biol       Date:  2010-04       Impact factor: 1.479

5.  Nonrandom clusters of palindromes in herpesvirus genomes. (Review)

Authors:  Ming-Ying Leung; Kwok Pui Choi; Aihua Xia; Louis H Y Chen
Journal:  J Comput Biol       Date:  2005-04       Impact factor: 1.479

6.  Approximation of sojourn-times via maximal couplings: motif frequency distributions.

Authors:  Manuel E Lladser; Stephen R Chestnut
Journal:  J Math Biol       Date:  2013-06-06       Impact factor: 2.259

7.  The distribution of word matches between Markovian sequences with periodic boundary conditions.

Authors:  Conrad J Burden; Paul Leopardi; Sylvain Forêt
Journal:  J Comput Biol       Date:  2013-10-26       Impact factor: 1.479

8.  Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data.

Authors:  Leslie Regad; Juliette Martin; Gregory Nuel; Anne-Claude Camproux
Journal:  Algorithms Mol Biol       Date:  2010-01-26       Impact factor: 1.405

9.  Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models.

Authors:  Mireille Régnier; Evgenia Furletova; Victor Yakovlev; Mikhail Roytberg
Journal:  Algorithms Mol Biol       Date:  2014-12-16       Impact factor: 1.405

10.  Exact p-value calculation for heterotypic clusters of regulatory motifs and its application in computational annotation of cis-regulatory modules.

Authors:  Valentina Boeva; Julien Clément; Mireille Régnier; Mikhail A Roytberg; Vsevolod J Makeev
Journal:  Algorithms Mol Biol       Date:  2007-10-10       Impact factor: 1.405
