
Shannon information entropy in the canonical genetic code.

Louis R Nemzer.

Abstract

The Shannon entropy measures the expected information value of messages. As with thermodynamic entropy, the Shannon entropy is only defined within a system that identifies at the outset the collections of possible messages, analogous to microstates, that will be considered indistinguishable macrostates. This fundamental insight is applied here for the first time to amino acid alphabets, which group the twenty common amino acids into families based on chemical and physical similarities. To evaluate these schemas objectively, a novel quantitative method is introduced based on the inherent redundancy in the canonical genetic code. Each alphabet is taken as a separate system that partitions the 64 possible RNA codons (the microstates) into families (the macrostates). By calculating the normalized mutual information, which measures the reduction in Shannon entropy conveyed by single-nucleotide messages, the groupings that best leverage this aspect of fault tolerance in the code are identified. The relative importance of properties related to protein folding (such as hydropathy and size) and to function (such as side-chain acidity) can also be estimated. This approach allows the average information value of each nucleotide position to be quantified, which can shed light on the coevolution of the canonical genetic code with the tRNA-protein translation mechanism.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Keywords:  Amino acids; Genetic code; Information theory; RNA translation; Shannon entropy

Year:  2016        PMID: 28007553     DOI: 10.1016/j.jtbi.2016.12.010

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  6 in total

1.  Systems protobiology: origin of life in lipid catalytic networks. Review.

Authors:  Doron Lancet; Raphael Zidovetzki; Omer Markovitch
Journal:  J R Soc Interface       Date:  2018-07       Impact factor: 4.118

2.  Global importance of RNA secondary structures in protein-coding sequences.

Authors:  Markus Fricke; Ruman Gerst; Bashar Ibrahim; Michael Niepmann; Manja Marz
Journal:  Bioinformatics       Date:  2019-02-15       Impact factor: 6.937

3.  QuantumIS: A Qualia Consciousness Awareness and Information Theory Quale Approach to Reducing Strategic Decision-Making Entropy.

Authors:  James A Rodger
Journal:  Entropy (Basel)       Date:  2019-01-29       Impact factor: 2.524

4.  Feature-extraction and analysis based on spatial distribution of amino acids for SARS-CoV-2 Protein sequences.

Authors:  Ranjeet Kumar Rout; Sk Sarif Hassan; Sabha Sheikh; Saiyed Umer; Kshira Sagar Sahoo; Amir H Gandomi
Journal:  Comput Biol Med       Date:  2021-11-10       Impact factor: 6.698

5.  Association of the characteristics of B- and T-cell repertoires with papillary thyroid carcinoma.

Authors:  Guoping Sun; Lumei Qiu; Zhiqiang Cheng; Weibing Pan; Jingjun Qiu; Chang Zou; Ni Xie; Song Liu; Peng Zhu; Jun Zeng; Yong Dai
Journal:  Oncol Lett       Date:  2018-05-24       Impact factor: 2.967

6.  A model of k-mer surprisal to quantify local sequence information content surrounding splice regions.

Authors:  Sam Humphrey; Alastair Kerr; Magnus Rattray; Caroline Dive; Crispin J Miller
Journal:  PeerJ       Date:  2020-11-04       Impact factor: 2.984

