Literature DB >> 12824392

SPA: Simple web tool to assess statistical significance of DNA patterns.

H Richard1, G Nuel.   

Abstract

Many statistical methods and programs are available to compute the significance of a given DNA pattern in a genome sequence. In this paper, after outlining the mathematical background of this problem, we present SPA (Statistic for PAtterns), an expert system with a simple web interface designed to be applied to two of these methods (large deviation approximations and exact computations using simple recurrences). A few results are presented, leading to a comparison between the two methods and to a simple decision rule in the choice of that to be used. Finally, future developments of SPA are discussed. This tool is available at the following address: http://stat.genopole.cnrs.fr/SPA/.

Mesh:

Year:  2003        PMID: 12824392      PMCID: PMC169190          DOI: 10.1093/nar/gkg613

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  10 in total

1.  A web site for the computational analysis of yeast regulatory sequences.

Authors:  J van Helden; B André; J Collado-Vides
Journal:  Yeast       Date:  2000-01-30       Impact factor: 3.239

2.  Patterns of variant polyadenylation signal usage in human genes.

Authors:  E Beaudoing; S Freier; J R Wyatt; J M Claverie; D Gautheret
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

3.  Fast approximate motif statistics.

Authors:  P Nicodème
Journal:  J Comput Biol       Date:  2001       Impact factor: 1.479

4.  Distribution patterns of over-represented k-mers in non-coding yeast DNA.

Authors:  Steven Hampson; Dennis Kibler; Pierre Baldi
Journal:  Bioinformatics       Date:  2002-04       Impact factor: 6.937

5.  Predicting gene regulatory elements in silico on a genomic scale.

Authors:  A Brazma; I Jonassen; J Vilo; E Ukkonen
Journal:  Genome Res       Date:  1998-11       Impact factor: 9.043

6.  Compound Poisson and Poisson process approximations for occurrences of multiple words in Markov chains.

Authors:  G Reinert; S Schbath
Journal:  J Comput Biol       Date:  1998       Impact factor: 1.479

7.  Avoidance of palindromic words in bacterial and archaeal genomes: a close connection with restriction enzymes.

Authors:  M S Gelfand; E V Koonin
Journal:  Nucleic Acids Res       Date:  1997-06-15       Impact factor: 16.971

8.  An efficient statistic to detect over- and under-represented words in DNA sequences.

Authors:  S Schbath
Journal:  J Comput Biol       Date:  1997       Impact factor: 1.479

9.  Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals.

Authors:  J van Helden; M del Olmo; J E Pérez-Ortín
Journal:  Nucleic Acids Res       Date:  2000-02-15       Impact factor: 16.971

10.  Statistical analyses of counts and distributions of restriction sites in DNA sequences.

Authors:  S Karlin; C Burge; A M Campbell
Journal:  Nucleic Acids Res       Date:  1992-03-25       Impact factor: 16.971

  10 in total
  1 in total

1.  On the representability of complete genomes by multiple competing finite-context (Markov) models.

Authors:  Armando J Pinho; Paulo J S G Ferreira; António J R Neves; Carlos A C Bastos
Journal:  PLoS One       Date:  2011-06-30       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.