Literature DB >> 2251125

Statistical analysis of nucleotide sequences.

E E Stückle1, C Emmrich, U Grob, P J Nielsen.   

Abstract

In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites.

Entities:  

Mesh:

Year:  1990        PMID: 2251125      PMCID: PMC332623          DOI: 10.1093/nar/18.22.6641

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  27 in total

1.  K-tuple frequency analysis: from intron/exon discrimination to T-cell epitope mapping.

Authors:  J M Claverie; I Sauvaget; L Bougueleret
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

2.  Prediction of the frequencies of restriction endonuclease recognition sequences using di- and mononucleotide frequencies.

Authors:  R C Peterson
Journal:  Biotechniques       Date:  1988-01       Impact factor: 1.993

3.  Restriction and modification enzymes and their recognition sequences.

Authors:  R J Roberts
Journal:  Nucleic Acids Res       Date:  1985       Impact factor: 16.971

4.  GenBank: current status and future directions.

Authors:  C Burks; M J Cinkosky; P Gilna; J E Hayden; Y Abe; E J Atencio; S Barnhouse; D Benton; C A Buenafe; K E Cumella
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

5.  Efficient algorithms for folding and comparing nucleic acid sequences.

Authors:  J P Dumas; J Ninio
Journal:  Nucleic Acids Res       Date:  1982-01-11       Impact factor: 16.971

6.  A comprehensive set of sequence analysis programs for the VAX.

Authors:  J Devereux; P Haeberli; O Smithies
Journal:  Nucleic Acids Res       Date:  1984-01-11       Impact factor: 16.971

7.  Statistical characterization of nucleic acid sequence functional domains.

Authors:  T F Smith; M S Waterman; J R Sadler
Journal:  Nucleic Acids Res       Date:  1983-04-11       Impact factor: 16.971

8.  Characterization of translational initiation sites in E. coli.

Authors:  G D Stormo; T D Schneider; L M Gold
Journal:  Nucleic Acids Res       Date:  1982-05-11       Impact factor: 16.971

9.  Doublet frequencies in evolutionary distinct groups.

Authors:  R Nussinov
Journal:  Nucleic Acids Res       Date:  1984-02-10       Impact factor: 16.971

10.  A Markov analysis of DNA sequences.

Authors:  H Almagor
Journal:  J Theor Biol       Date:  1983-10-21       Impact factor: 2.691

View more
  7 in total

1.  Measuring the coding potential of genomic sequences through a combination of triplet occurrence patterns and RNY preference.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Mol Evol       Date:  2004-09       Impact factor: 2.395

2.  WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences.

Authors:  G Pesole; N Prunella; S Liuni; M Attimonelli; C Saccone
Journal:  Nucleic Acids Res       Date:  1992-06-11       Impact factor: 16.971

3.  "Word" preference in the genomic text and genome evolution: different modes of n-tuplet usage in coding and noncoding sequences.

Authors:  Christoforos Nikolaou; Yannis Almirantis
Journal:  J Mol Evol       Date:  2005-07-19       Impact factor: 2.395

4.  A simple method for global sequence comparison.

Authors:  E Pizzi; M Attimonelli; S Liuni; C Frontali; C Saccone
Journal:  Nucleic Acids Res       Date:  1992-01-11       Impact factor: 16.971

5.  Assessment of inhomogeneities in an E. coli physical map.

Authors:  S Karlin; C Macken
Journal:  Nucleic Acids Res       Date:  1991-08-11       Impact factor: 16.971

6.  Statistical analysis of yeast genomic downstream sequences reveals putative polyadenylation signals.

Authors:  J van Helden; M del Olmo; J E Pérez-Ortín
Journal:  Nucleic Acids Res       Date:  2000-02-15       Impact factor: 16.971

7.  Statistical analyses of counts and distributions of restriction sites in DNA sequences.

Authors:  S Karlin; C Burge; A M Campbell
Journal:  Nucleic Acids Res       Date:  1992-03-25       Impact factor: 16.971

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.