| Literature DB >> 9228617 |
Abstract
In this note, we point out a very efficient statistic to detect over- and under-represented words in DNA sequences, when Markov chain models are used to represent the sequences. This statistic is missing from the recent review done on this important problem and appears to be a better measure of rarity and abundance of words in DNA sequences.Mesh:
Substances:
Year: 1997 PMID: 9228617 DOI: 10.1089/cmb.1997.4.189
Source DB: PubMed Journal: J Comput Biol ISSN: 1066-5277 Impact factor: 1.479