Numerical comparison of several approximations of the word count distribution in random sequences.

S Robin, S Schbath.

Abstract

The exact distribution of word counts in random sequences and several approximations have been proposed in the past few years. The exact distribution has no theoretical limit but may require prohibitive computation time. On the other hand, approximate distributions can be rapidly calculated but, in practice, are only accurate under specific conditions. After making a survey of these distributions, we compare them according to both their accuracy and computational cost. Rules are suggested for choosing between Gaussian approximations, compound Poisson approximation, and exact distribution. This work is illustrated with the detection of exceptional words in the phage Lambda genome.

Year:  2001        PMID: 11571071     DOI: 10.1089/106652701752236179

Source DB:  PubMed          Journal:  J Comput Biol        ISSN: 1066-5277            Impact factor:   1.479


  9 in total

1.  Computational approaches to identify promoters and cis-regulatory elements in plant genomes. (Review)

Authors:  Stephane Rombauts; Kobe Florquin; Magali Lescot; Kathleen Marchal; Pierre Rouzé; Yves van de Peer
Journal:  Plant Physiol       Date:  2003-07       Impact factor: 8.340

2.  The power of detecting enriched patterns: an HMM approach.

Authors:  Zhiyuan Zhai; Shih-Yen Ku; Yihui Luan; Gesine Reinert; Michael S Waterman; Fengzhu Sun
Journal:  J Comput Biol       Date:  2010-04       Impact factor: 1.479

3.  A New Context Tree Inference Algorithm for Variable Length Markov Chain Model with Applications to Biological Sequence Analyses.

Authors:  Shaokun An; Jie Ren; Fengzhu Sun; Lin Wan
Journal:  J Comput Biol       Date:  2022-04-22       Impact factor: 1.549

4.  Compound Poisson approximation of the number of occurrences of a position frequency matrix (PFM) on both strands.

Authors:  Utz J Pape; Sven Rahmann; Fengzhu Sun; Martin Vingron
Journal:  J Comput Biol       Date:  2008 Jul-Aug       Impact factor: 1.479

5.  Detection of microRNAs in color space.

Authors:  Antonio Marco; Sam Griffiths-Jones
Journal:  Bioinformatics       Date:  2011-12-09       Impact factor: 6.937

6.  Genomic DNA k-mer spectra: models and modalities.

Authors:  Benny Chor; David Horn; Nick Goldman; Yaron Levy; Tim Massingham
Journal:  Genome Biol       Date:  2009-10-08       Impact factor: 13.583

7.  Comparative analysis of DNA word abundances in four yeast genomes using a novel statistical background model.

Authors:  Ramkumar Hariharan; Reji Simon; M Radhakrishna Pillai; Todd D Taylor
Journal:  PLoS One       Date:  2013-03-05       Impact factor: 3.240

8.  Identification of DNA motifs implicated in maintenance of bacterial core genomes by predictive modeling.

Authors:  David Halpern; Hélène Chiapello; Sophie Schbath; Stéphane Robin; Christelle Hennequet-Antier; Alexandra Gruss; Meriem El Karoui
Journal:  PLoS Genet       Date:  2007-09       Impact factor: 5.917

9.  SIGffRid: a tool to search for sigma factor binding sites in bacterial genomes using comparative approach and biologically driven statistics.

Authors:  Fabrice Touzain; Sophie Schbath; Isabelle Debled-Rennesson; Bertrand Aigle; Gregory Kucherov; Pierre Leblond
Journal:  BMC Bioinformatics       Date:  2008-01-31       Impact factor: 3.169
