| Literature DB >> 17049037 |
Abstract
We propose here a review of the methods available to compute pattern statistics on text generated by a Markov source. Theoretical, but also numerical aspects are detailed for a wide range of techniques (exact, Gaussian, large deviations, binomial and compound Poisson). The SPatt package (Statistics for Pattern, free software available at http://stat.genopole.cnrs.fr/spatt) implementing all these methods is then used to compare all these approaches in terms of computational time and reliability in the most complete pattern statistics benchmark available at the present time.Mesh:
Year: 2006 PMID: 17049037 DOI: 10.2202/1544-6115.1219
Source DB: PubMed Journal: Stat Appl Genet Mol Biol ISSN: 1544-6115