Literature DB >> 17049037

Numerical solutions for patterns statistics on Markov chains.

Gregory Nuel1.   

Abstract

We propose here a review of the methods available to compute pattern statistics on text generated by a Markov source. Theoretical, but also numerical aspects are detailed for a wide range of techniques (exact, Gaussian, large deviations, binomial and compound Poisson). The SPatt package (Statistics for Pattern, free software available at http://stat.genopole.cnrs.fr/spatt) implementing all these methods is then used to compare all these approaches in terms of computational time and reliability in the most complete pattern statistics benchmark available at the present time.

Mesh:

Year:  2006        PMID: 17049037     DOI: 10.2202/1544-6115.1219

Source DB:  PubMed          Journal:  Stat Appl Genet Mol Biol        ISSN: 1544-6115


  7 in total

1.  Globally, unrelated protein sequences appear random.

Authors:  Daniel T Lavelle; William R Pearson
Journal:  Bioinformatics       Date:  2009-11-30       Impact factor: 6.937

2.  Mining protein loops using a structural alphabet and statistical exceptionality.

Authors:  Leslie Regad; Juliette Martin; Gregory Nuel; Anne-Claude Camproux
Journal:  BMC Bioinformatics       Date:  2010-02-04       Impact factor: 3.169

3.  Exact distribution of a pattern in a set of random sequences generated by a Markov source: applications to biological data.

Authors:  Leslie Regad; Juliette Martin; Gregory Nuel; Anne-Claude Camproux
Journal:  Algorithms Mol Biol       Date:  2010-01-26       Impact factor: 1.405

4.  SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

Authors:  Leslie Regad; Adrien Saladin; Julien Maupetit; Colette Geneix; Anne-Claude Camproux
Journal:  Nucleic Acids Res       Date:  2011-06-10       Impact factor: 16.971

5.  Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

Authors:  Leslie Regad; Juliette Martin; Anne-Claude Camproux
Journal:  BMC Bioinformatics       Date:  2011-06-20       Impact factor: 3.169

6.  An analysis of single amino acid repeats as use case for application specific background models.

Authors:  Paweł P Łabaj; Peter Sykacek; David P Kreil
Journal:  BMC Bioinformatics       Date:  2011-05-19       Impact factor: 3.169

7.  Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models.

Authors:  Mireille Régnier; Evgenia Furletova; Victor Yakovlev; Mikhail Roytberg
Journal:  Algorithms Mol Biol       Date:  2014-12-16       Impact factor: 1.405

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.