| Literature DB >> 26857678 |
Abstract
Biological information encoded in genomes is fundamentally different from and effectively orthogonal to Shannon entropy. The biologically relevant concept of information has to do with 'meaning', i.e. encoding various biological functions with various degree of evolutionary conservation. Apart from direct experimentation, the meaning, or biological information content, can be extracted and quantified from alignments of homologous nucleotide or amino acid sequences but generally not from a single sequence, using appropriately modified information theoretical formulae. For short, information encoded in genomes is defined vertically but not horizontally. Informally but substantially, biological information density seems to be equivalent to 'meaning' of genomic sequences that spans the entire range from sharply defined, universal meaning to effective meaninglessness. Large fractions of genomes, up to 90% in some plants, belong within the domain of fuzzy meaning. The sequences with fuzzy meaning can be recruited for various functions, with the meaning subsequently fixed, and also could perform generic functional roles that do not require sequence conservation. Biological meaning is continuously transferred between the genomes of selfish elements and hosts in the process of their coevolution. Thus, in order to adequately describe genome function and evolution, the concepts of information theory have to be adapted to incorporate the notion of meaning that is central to biology.Entities:
Keywords: evolution; information; meaning; selfish elements
Mesh:
Year: 2016 PMID: 26857678 PMCID: PMC4760125 DOI: 10.1098/rsta.2015.0065
Source DB: PubMed Journal: Philos Trans A Math Phys Eng Sci ISSN: 1364-503X Impact factor: 4.226
Figure 1.Biological information and information density depending on genome size: viruses, prokaryotes and eukaryotes. The biological information and density values were calculated using equations (1.4) and (1.5), respectively, and the data on genomes were from Genbank. The plot is on a double logarithmic scale. 1, encephalomyocarditis virus (RNA virus); 2, lambda phage; 3, T4 phage; 4, Mycoplasma genitalium (parasitic bacterium); 5, acanthamoeba polyphaga mimivirus (giant virus); 6, Archaeoglobus fulgidus (free-living archaeon); 7, Escherichia coli (free-living bacterium); 8, Saccharomyces cerevisiae; 9, Arabidopsis thaliana; 10, Drosophila melanogaster; 11, Homo sapiens. (Online version in colour.)
Figure 2.The fuzzy meaning concept and gain and loss of meaning. The cartoon schematically shows a fragment of a genome of a complex multicellular organism (animal or plant) that consists mostly of sequences with fuzzy meaning, interspersed with ‘islands’ of defined meaning such as genes (exons) encoding structural RNAs and proteins as well as evolutionarily conserved regulatory elements. (Online version in colour.)
Figure 3.Flow of meaning between selfish elements and hosts. (Online version in colour.)