| Literature DB >> 15893551 |
Abstract
We introduce a novel method to analyse complete genomes and recognise some distinctive features by means of an adaptive compression algorithm, which is not DNA-oriented, based on the Lempel-Ziv scheme. We study the Information Content as a function of the number of symbols encoded by the algorithm and we analyse the dictionary created by the algorithm. Preliminary results are shown concerning regions showing a sublinear type of information growth, which is strictly connected to the presence of highly repetitive subregions that might be supposed to have a regulatory function within the genome.Mesh:
Substances:
Year: 2004 PMID: 15893551 DOI: 10.1016/j.bulm.2004.10.005
Source DB: PubMed Journal: Bull Math Biol ISSN: 0092-8240 Impact factor: 1.758