Genes, information and sense: complexity and knowledge retrieval.

Michael G Sadovsky, Julia A Putintseva, Alexander S Shchepanovsky.

Abstract

The information capacity of a nucleotide sequence measures the unexpectedness of a continuation of a given string of nucleotides, and thus has a sound relation to a variety of biological issues. A continuation is defined in a way that maximizes the entropy of the ensemble of such continuations. The capacity is defined as the mutual entropy of the real frequency dictionary of a sequence with respect to the one bearing the most expected continuations; it does not depend on the length of strings contained in a dictionary. Various genomes exhibit a multi-minima pattern in the dependence of information capacity on string length, reflecting an order within a sequence. Strings whose expected frequency deviates significantly from the real one are the words of increased information value. Such words are distributed non-randomly along a sequence, which makes it possible to retrieve the correlation between a structure and a function encoded within a sequence.
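
The quantities named in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything here is an assumption rather than the authors' exact procedure: the sequence is closed into a ring so that dictionaries of different lengths stay consistent, the most expected continuation is taken to be the maximum-entropy estimate f(w1..wq-1) * f(w2..wq) / f(w2..wq-1), and the capacity is computed as the relative entropy of the real dictionary with respect to the expected one. The function names and the threshold in informative_words are hypothetical.

```python
from collections import Counter
from math import log

def frequency_dictionary(seq, q):
    """Relative frequencies of all strings of length q in seq.
    Assumption: the sequence is treated as a ring (circular closure)."""
    if q == 0:
        return {"": 1.0}
    ring = seq + seq[:q - 1]
    counts = Counter(ring[i:i + q] for i in range(len(seq)))
    return {w: c / len(seq) for w, c in counts.items()}

def expected_dictionary(seq, q):
    """Expected frequencies of length-q strings (q >= 2), rebuilt from the
    dictionaries of lengths q-1 and q-2 (assumed maximum-entropy form)."""
    real = frequency_dictionary(seq, q)
    shorter = frequency_dictionary(seq, q - 1)
    core = frequency_dictionary(seq, q - 2)
    return {w: shorter[w[:-1]] * shorter[w[1:]] / core[w[1:-1]] for w in real}

def information_capacity(seq, q):
    """Relative entropy of the real dictionary w.r.t. the expected one."""
    real = frequency_dictionary(seq, q)
    expected = expected_dictionary(seq, q)
    return sum(f * log(f / expected[w]) for w, f in real.items())

def informative_words(seq, q, threshold):
    """Words whose real frequency deviates from the expected one by more
    than a (hypothetical) threshold on |ln(real / expected)|."""
    real = frequency_dictionary(seq, q)
    expected = expected_dictionary(seq, q)
    return [w for w, f in real.items() if abs(log(f / expected[w])) > threshold]

seq = "ACGT" * 5
for q in (2, 3):
    print(q, information_capacity(seq, q))
```

For this strictly periodic toy sequence, single-nucleotide frequencies say nothing about which nucleotide follows which, so the capacity at q = 2 is positive; once pairs are known, every triplet is fully predictable and the capacity at q = 3 vanishes. Scanning q over a real genome in the same way would trace the dependence on string length that the abstract refers to.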

Substances: DNA

Year: 2008; PMID: 18443840; DOI: 10.1007/s12064-008-0032-1

Source DB: PubMed; Journal: Theory Biosci; ISSN: 1431-7613; Impact factor: 1.919


1. Protein languages differ depending on microorganism lifestyle.

Authors: Joseph J Grzymski; Adam G Marsh
Journal: PLoS One; Date: 2014-05-14; Impact factor: 3.240

2. Extracting DNA words based on the sequence features: non-uniform distribution and integrity.

Authors: Zhi Li; Hongyan Cao; Yuehua Cui; Yanbo Zhang
Journal: Theor Biol Med Model; Date: 2016-01-25; Impact factor: 2.432


1. Codon adaptation index as a measure of dominating codon bias.

2. Information capacity of nucleotide sequences and its applications.

3. Comparison of Real Frequencies of Strings vs. the Expected Ones Reveals the Information Capacity of Macromoleculae.

4. [Redundancy of genetic sequences and mosaic structure of a genome].

5. Codon usage: mutational bias, translational selection, or both? (Review)

6. [Introns differ from exons by their redundancy].
