| Literature DB >> 3419187 |
Abstract
From the point of view of information theory, a statistical analysis of 2000 nucleic acid sequences (732 coding regions and 1177 non-coding regions) is given. The sequences are grouped into 20 categories. The probability-order-difference (POD) matrix is defined which is used to analyse the evolutionary distance of any two categories of sequences. The informational parameters D1, D2 and X = (1 + D1/D2)-1 and F are calculated for each sequence and averaged in each category. The statistical dependence of these parameters on molecular evolution is discussed. It is found that [X] is a good statistical quantity which describes the vocabulary compositions as well as the grammatical constructions of the genetic language. From the statistical analysis it is shown that [X] may play an important role in investigating the evolutionary level of nucleic acid molecules.Mesh:
Substances:
Year: 1988 PMID: 3419187 DOI: 10.1016/s0022-5193(88)80034-1
Source DB: PubMed Journal: J Theor Biol ISSN: 0022-5193 Impact factor: 2.691