Literature DB >> 8917736

A complementary circular code in the protein coding genes.

D G Arquès1, C J Michel.   

Abstract

Recently, shifted periodicities 1 modulo 3 and 2 modulo 3 have been identified in protein (coding) genes of both prokaryotes and eukaryotes with autocorrelation functions analysing eight of 64 trinucleotides (Arquès et al., 1995). This observation suggests that the trinucleotides are associated with frames in protein genes. In order to verify this hypothesis, a distribution of the 64 trinucleotides AAA,..., TTT is studied in both gene populations by using a simple method based on the trinucleotide frequencies per frame. In protein genes, the trinucleotides can be read in three frames: the reading frame 0 established by the ATG start trinucleotide and frame 1 (resp. 2) which is the frame 0 shifted by 1 (resp. 2) nucleotide in the 5'-3' direction. Then, the occurrence frequencies of the 64 trinucleotides are computed in the three frames. By classifying each of the 64 trinucleotides in its preferential occurrence frame, i.e. the frame associated with its highest frequency, three subsets of trinucleotides can be identified in the three frames. This approach is applied in the two gene populations. Unexpectedly, the same three subsets of trinucleotides are identified in these two gene populations: Tzero = Xzero [symbol: see text] {AAA,TTT} with Xzero = {AAC,AAT,ACC,ATC,ATT, CAG,CTC,CTG,GAA,GAC,GAG, GAT,GCC,GGC,GGT,GTA,GTC,GTT,TAC,TTC} in frame 0, T1 = X1 [symbol: see text] {CCC} in frame 1 and T2 = X2 [symbol: see text] {GGG} in frame 2, each subset Xzero, X1 and X2 having 20 trinucleotides. Surprisingly, these three subsets have five important properties: (i) the property of maximal circular code for Xzero (resp. X1, X2) allowing the automatical retrieval of frame 0 (resp. 1, 2) in any region of a protein gene model (formed by a series of trinucleotides of Xzero) without using a start codon; (ii) the DNA complementarity property C (e.g. C(AAC) = GTT): C(T0) = T0, C(T1) = T2 and C(T2) = T1 allowing the two paired reading frames of a DNA double helix simultaneously to code for amino acids; (iii) the circular permutation property P (e.g. P(AAC) = ACA): P(Xzero) = X1 and P(X1) = X2 implying that the two subsets X1 and X2 can be deduced from Xzero; (iv) the rarity property with an occurrence probability of Xzero equal to 6 x 10(-8); and (v) the concatenation property with: a high frequency (27.5%) of misplaced trinucleotides in the shifted frames, a maximum (13 nucleotides) length of the minimal window to automatically retrieve the frame and an occurrence of the four types of nucleotides in the three trinucleotides sites, in favour of an evolutionary code. In the Discussion, the identified subsets Tzero, T1 and T2 replaced in the three two-letter genetic alphabets purine/pyrimidine, amino/ceto and strong/weak interaction, allow us to deduce that the RNY model (R = purine = A or G, Y = pyrimidine = C or T, N = R or Y) (Eigen & Schuster, 1978) is the closest two-letter codon model to the trinucleotides of Tzero. Then, these three subsets are related to the genetic code. The trinucleotides of Tzero code for 13 amino acids: Ala, Asn, Asp, Gln, Glu, Gly, Ile, Leu, Lys, Phe, Thr, Tyr and Val. Finally, a strong correlation between the usage of the trinucleotides of Tzero in protein genes and the amino acid frequencies in proteins is observed as six among seven amino acids not coded by Tzero, have as expected the lowest frequencies in proteins of both prokaryotes and eukaryotes.

Entities:  

Mesh:

Substances:

Year:  1996        PMID: 8917736     DOI: 10.1006/jtbi.1996.0142

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  27 in total

1.  Gene algebra from a genetic code algebraic structure.

Authors:  R Sanchez; E Morgado; R Grau
Journal:  J Math Biol       Date:  2005-07-13       Impact factor: 2.259

2.  The Maximal C³ Self-Complementary Trinucleotide Circular Code X in Genes of Bacteria, Archaea, Eukaryotes, Plasmids and Viruses.

Authors:  Christian J Michel
Journal:  Life (Basel)       Date:  2017-04-18

3.  Circular codes, symmetries and transformations.

Authors:  Elena Fimmel; Simone Giannerini; Diego Luis Gonzalez; Lutz Strüngmann
Journal:  J Math Biol       Date:  2014-07-10       Impact factor: 2.259

4.  Self-complementary circular codes in coding theory.

Authors:  Elena Fimmel; Christian J Michel; Martin Starman; Lutz Strüngmann
Journal:  Theory Biosci       Date:  2018-03-12       Impact factor: 1.919

5.  Bijective codon transformations show genetic code symmetries centered on cytosine's coding properties.

Authors:  Hervé Seligmann
Journal:  Theory Biosci       Date:  2017-11-16       Impact factor: 1.919

6.  RNA Rings Strengthen Hairpin Accretion Hypotheses for tRNA Evolution: A Reply to Commentaries by Z.F. Burton and M. Di Giulio.

Authors:  Jacques Demongeot; Hervé Seligmann
Journal:  J Mol Evol       Date:  2020-02-05       Impact factor: 2.395

7.  Codon Distribution in Error-Detecting Circular Codes.

Authors:  Elena Fimmel; Lutz Strüngmann
Journal:  Life (Basel)       Date:  2016-03-15

8.  Pentamers with Non-redundant Frames: Bias for Natural Circular Code Codons.

Authors:  Jacques Demongeot; Hervé Seligmann
Journal:  J Mol Evol       Date:  2020-01-07       Impact factor: 2.395

9.  Fail-safe genetic codes designed to intrinsically contain engineered organisms.

Authors:  Jonathan Calles; Isaac Justice; Detravious Brinkley; Alexa Garcia; Drew Endy
Journal:  Nucleic Acids Res       Date:  2019-11-04       Impact factor: 16.971

10.  On the evolution of the standard genetic code: vestiges of critical scale invariance from the RNA world in current prokaryote genomes.

Authors:  Marco V José; Tzipe Govezensky; José A García; Juan R Bobadilla
Journal:  PLoS One       Date:  2009-02-02       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.