Literature DB >> 8833749

Linguistic complexity of protein sequences as compared to texts of human languages.

O Popov1, D M Segal, E N Trifonov.   

Abstract

A notion and a measure of linguistic complexity introduced earlier (Trifonov, 1990) were originally used for analysis of nucleotide sequences. This measure was shown to reflect multiplicity of codes (messages) of different natures superimposed in the sequences. Unlike human language texts, genetic texts are 'read' by cellular mechanisms in several different ways, each time using a different selection of the characters of the same text while skipping others (Trifonov, 1989). Human texts are read in one way only, sequentially and involving all characters (one code). The conceptual significance and essence of the idea on the multiplicity of overlapping codes in genetic sequences, as opposed to human languages, is discussed. The linguistic complexity technique allows a calculation to be made of the structural complexity of any linear sequence of characters irrespective of whether the text is cognized or presently undeciphered. The texts (sequences) are compared exclusively from the point of view of their structural complexity with no reference to the meaning of the texts which is beyond the scope of this article. Results of such a comparison of protein sequences with various texts, written in English, Italian and Welsh are presented. The human texts are found to be structurally simpler than genetic (protein) texts, reflecting, apparently, a difference in the reading modes: single code versus many codes.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8833749     DOI: 10.1016/0303-2647(95)01568-x

Source DB:  PubMed          Journal:  Biosystems        ISSN: 0303-2647            Impact factor:   1.973


  9 in total

1.  Bijective codon transformations show genetic code symmetries centered on cytosine's coding properties.

Authors:  Hervé Seligmann
Journal:  Theory Biosci       Date:  2017-11-16       Impact factor: 1.919

2.  RNAexinv: An extended inverse RNA folding from shape and physical attributes to sequences.

Authors:  Assaf Avihoo; Alexander Churkin; Danny Barash
Journal:  BMC Bioinformatics       Date:  2011-08-03       Impact factor: 3.307

Review 3.  Small Insertions and Deletions Drive Genomic Plasticity during Adaptive Evolution of Yersinia pestis.

Authors:  Yarong Wu; Tongyu Hao; Xiuwei Qian; Xianglilan Zhang; Yajun Song; Ruifu Yang; Yujun Cui
Journal:  Microbiol Spectr       Date:  2022-04-19

Review 4.  A frequency-based linguistic approach to protein decoding and design: Simple concepts, diverse applications, and the SCS Package.

Authors:  Kenta Motomura; Morikazu Nakamura; Joji M Otaki
Journal:  Comput Struct Biotechnol J       Date:  2013-03-29       Impact factor: 7.271

5.  Unbiased Mitoproteome Analyses Confirm Non-canonical RNA, Expanded Codon Translations.

Authors:  Hervé Seligmann
Journal:  Comput Struct Biotechnol J       Date:  2016-10-05       Impact factor: 7.271

6.  Evolution of Nucleotide Punctuation Marks: From Structural to Linear Signals.

Authors:  Nawal El Houmami; Hervé Seligmann
Journal:  Front Genet       Date:  2017-03-27       Impact factor: 4.599

7.  Reversible gene silencing through frameshift indels and frameshift scars provide adaptive plasticity for Mycobacterium tuberculosis.

Authors:  Aditi Gupta; David Alland
Journal:  Nat Commun       Date:  2021-08-04       Impact factor: 14.919

8.  Word decoding of protein amino Acid sequences with availability analysis: a linguistic approach.

Authors:  Kenta Motomura; Tomohiro Fujita; Motosuke Tsutsumi; Satsuki Kikuzato; Morikazu Nakamura; Joji M Otaki
Journal:  PLoS One       Date:  2012-11-21       Impact factor: 3.240

9.  Google matrix analysis of DNA sequences.

Authors:  Vivek Kandiah; Dima L Shepelyansky
Journal:  PLoS One       Date:  2013-05-09       Impact factor: 3.240

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.