An improved method for detection of words with unusual occurrence frequency in nucleotide sequences.

A Colosimo, S Morante, V Parisi, G C Rossi.

Abstract

A statistical analysis designed to deal with the problem of identifying rare or abundant "words" of arbitrary length in genomic fragments is presented. Our approach has the novelty of taking into account the statistical role of the presence of shorter words nested into longer ones and of introducing a Bayesian correction to minimize the effects of statistical fluctuations and of possible mistakes in genomic data. The method is successfully used in a thorough analysis of the abundance of short nucleotide sequences in the Escherichia coli genome.
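The idea of judging a word against the shorter words nested inside it can be illustrated with a minimal sketch. This is not the authors' method: it assumes the standard maximal-order Markov estimate, in which the expected count of a word of length k is derived from its two nested (k-1)-words and their shared (k-2)-word core, and it substitutes a plain additive pseudocount for the Bayesian correction described in the abstract. The function names `count_words` and `observed_vs_expected` and the `pseudocount` parameter are hypothetical.

```python
from collections import Counter


def count_words(seq, k):
    """Count all overlapping words of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def observed_vs_expected(seq, k, pseudocount=1.0):
    """Return {word: (observed, expected, ratio)} for words of length k >= 3.

    Expected counts use the nested shorter words (an assumption, not the
    paper's formula): E(w) = N(w[:-1]) * N(w[1:]) / N(w[1:-1]).
    The pseudocount is simple additive smoothing that damps the ratio for
    low counts; it only stands in for the paper's Bayesian correction.
    """
    if k < 3:
        raise ValueError("k must be at least 3")
    full = count_words(seq, k)
    sub = count_words(seq, k - 1)
    core = count_words(seq, k - 2)
    result = {}
    for word, obs in full.items():
        # The core occurs wherever the word occurs, so its count is nonzero.
        expected = sub[word[:-1]] * sub[word[1:]] / core[word[1:-1]]
        ratio = (obs + pseudocount) / (expected + pseudocount)
        result[word] = (obs, expected, ratio)
    return result
```

Under these assumptions, a ratio well above 1 marks a word as abundant and a ratio well below 1 marks it as rare, relative to what its nested subwords predict. Words that never occur in the sequence are not reported by this sketch.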

Year: 1993   PMID: 8114510   DOI: 10.1006/jtbi.1993.1212

Source DB: PubMed   Journal: J Theor Biol   ISSN: 0022-5193   Impact factor: 2.691


  2 in total

1.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

2.  n-Gram characterization of genomic islands in bacterial genomes.

Authors:  Gordana M Pavlović-Lazetić; Nenad S Mitić; Milos V Beljanski
Journal:  Comput Methods Programs Biomed       Date:  2008-12-19       Impact factor: 5.428
