Literature DB >> 18443840

Genes, information and sense: complexity and knowledge retrieval.

Michael G Sadovsky1, Julia A Putintseva, Alexander S Shchepanovsky.   

Abstract

Information capacity of nucleotide sequences measures the unexpectedness of a continuation of a given string of nucleotides, thus having a sound relation to a variety of biological issues. A continuation is defined in a way maximizing the entropy of the ensemble of such continuations. The capacity is defined as a mutual entropy of real frequency dictionary of a sequence with respect to the one bearing the most expected continuations; it does not depend on the length of strings contained in a dictionary. Various genomes exhibit a multi-minima pattern of the dependence of information capacity on the string length, thus reflecting an order within a sequence. The strings with significant deviation of an expected frequency from the real one are the words of increased information value. Such words exhibit a non-random distribution alongside a sequence, thus making it possible to retrieve the correlation between a structure, and a function encoded within a sequence.

Mesh:

Substances:

Year:  2008        PMID: 18443840     DOI: 10.1007/s12064-008-0032-1

Source DB:  PubMed          Journal:  Theory Biosci        ISSN: 1431-7613            Impact factor:   1.919


  6 in total

1.  Codon adaptation index as a measure of dominating codon bias.

Authors:  A Carbone; A Zinovyev; F Képès
Journal:  Bioinformatics       Date:  2003-11-01       Impact factor: 6.937

2.  Information capacity of nucleotide sequences and its applications.

Authors:  M G Sadovsky
Journal:  Bull Math Biol       Date:  2006-04-07       Impact factor: 1.758

3.  Comparison of Real Frequencies of Strings vs. the Expected Ones Reveals the Information Capacity of Macromoleculae.

Authors:  Michael G Sadovsky
Journal:  J Biol Phys       Date:  2003-03       Impact factor: 1.365

4.  [Redundancy of genetic sequences and mosaic structure of a genome].

Authors:  A N Gorban'; T G Popova; M G Sadovskiĭ
Journal:  Mol Biol (Mosk)       Date:  1994 Mar-Apr

Review 5.  Codon usage: mutational bias, translational selection, or both?

Authors:  P M Sharp; M Stenico; J F Peden; A T Lloyd
Journal:  Biochem Soc Trans       Date:  1993-11       Impact factor: 5.407

6.  [Introns differ from exons by their redundancy].

Authors:  T G Popova; M G Asocakiĭ
Journal:  Genetika       Date:  1995-10
  6 in total
  2 in total

1.  Protein languages differ depending on microorganism lifestyle.

Authors:  Joseph J Grzymski; Adam G Marsh
Journal:  PLoS One       Date:  2014-05-14       Impact factor: 3.240

2.  Extracting DNA words based on the sequence features: non-uniform distribution and integrity.

Authors:  Zhi Li; Hongyan Cao; Yuehua Cui; Yanbo Zhang
Journal:  Theor Biol Med Model       Date:  2016-01-25       Impact factor: 2.432

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.