| Literature DB >> 32248310 |
Elena Fimmel1, Martin Starman2, Lutz Strüngmann2.
Abstract
The origin of the modern genetic code and the mechanisms that have contributed to its present form raise many questions. The main goal of this work is to test two hypotheses concerning the development of the genetic code for their compatibility and complementarity and see if they could benefit from each other. On the one hand, Gonzalez, Giannerini and Rosa developed a theory, based on four-based codons, which they called tesserae. This theory can explain the degeneracy of the modern vertebrate mitochondrial code. On the other hand, in the 1990s, so-called circular codes were discovered in nature, which seem to ensure the maintenance of a correct reading-frame during the translation process. It turns out that the two concepts not only do not contradict each other, but on the contrary complement and enrichen each other.Entities:
Keywords: Circular code; Degeneracy; Genetic code; Tessera
Mesh:
Substances:
Year: 2020 PMID: 32248310 PMCID: PMC7128014 DOI: 10.1007/s11538-020-00724-z
Source DB: PubMed Journal: Bull Math Biol ISSN: 0092-8240 Impact factor: 1.758
Fig. 1Graphical representation of the primeval base symmetries. KM is represented by red, YR by green and SW by blue colored lines (Color figure online)
Each column is one of the four equivalence classes of dinucleotides: , , , under the action of on
| AA | AU | AC | AG | |
| UU | UA | UG | UC | |
| CC | CG | CA | CU | |
| GG | GC | GU | GA |
The left most column shows the transformation that sends the first dinucleotide in the class to the second, third and fourth, respectively, e.g. . The column header are the equivalence classes names. The header index is the unique transformation used for mapping the first nucleotide of a dinucleotide to the second
The table of all tessera with the generating transformation
| Dinucleotide | ||||
|---|---|---|---|---|
Fig. 2Schematic representation of the mapping between the tessera onto the codon . (Color figure online)
Fig. 3Graphical representation of the dinucleotide code X = {UC, CG, GU, AC, AA} which has only one component . (Color figure online)
Fig. 4Graphical representation of the trinucleotide code X = {UCA, UAC, CAU, ACA, ACG} which has only one component that is not connected. (Color figure online)
Fig. 5Graphical representation of the tetranucleotide code X = {AAUC, ACUA, ACUU, CUCU, CUUU} which has two components and that are both not connected but have two components themselves. (Color figure online)
Fig. 6Graphical representation of the di-cut-graph of the Tessera code X = {UCUC, AUGC, CUAG, GCCG}. (Color figure online)
Numbers of self-complementary circular codes of different code lengths
| Code length | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number | 12 | 72 | 304 | 996 | 2580 | 5408 | 9264 | 12708 | 13696 | 11232 | 6144 | 1584 |
List of complete equivalence classes
| Tessera | Shift 1 | Shift 2 | Shift 3 | Class number |
|---|---|---|---|---|
Self-complementary tesserae are in bold
Fig. 7An acyclic tournament on four nodes. (Color figure online)
Numbers of circular, comma-free and -tessera codes of different code lengths
| Code length | # 1-circular codes | # Circular codes | # | # Comma free codes |
|---|---|---|---|---|
| 1 | 48 | 48 | 48 | 48 |
| 2 | 1056 | 1056 | 1056 | 1056 |
| 3 | 14080 | 14048 | 14016 | 13952 |
| 4 | 126720 | 125544 | 124368 | 122376 |
| 5 | 811008 | 791952 | 773088 | 745584 |
| 6 | 3784704 | 3606048 | 3433584 | 3214272 |
| 7 | 12976128 | 11908800 | 10922112 | 9816960 |
| 8 | 32440320 | 28230456 | 24577404 | 20952504 |
| 9 | 57671680 | 46720800 | 37987120 | 30297824 |
| 10 | 69206016 | 51111024 | 38129856 | 28015728 |
| 11 | 50331648 | 33113472 | 22240992 | 14790144 |
| 12 | 16777216 | 9592512 | 5685408 | 3351232 |
The list of all self-complementary comma-free tessera codes of maximal length
| UUAA | CCAA | AGGA | UCCU | UUGG | CCGG | UCGA | CAUG | ACGU | AGCU | ACUG | CAGU |
| AAUU | AACC | AGGA | UCCU | GGUU | GGCC | UCGA | CAUG | ACGU | AGCU | ACUG | CAGU |
| UUAA | CCAA | GAAG | CUUC | UUGG | CCGG | GAUC | CAUG | ACGU | CUAG | ACUG | CAGU |
| AAUU | AACC | GAAG | CUUC | GGUU | GGCC | GAUC | CAUG | ACGU | CUAG | ACUG | CAGU |
| UUAA | CCAA | AGGA | UCCU | UUGG | CCGG | UCGA | UGCA | GUAC | AGCU | UGAC | GUCA |
| AAUU | AACC | AGGA | UCCU | GGUU | GGCC | UCGA | UGCA | GUAC | AGCU | UGAC | GUCA |
| UUAA | CCAA | GAAG | CUUC | UUGG | CCGG | GAUC | UGCA | GUAC | CUAG | UGAC | GUCA |
| AAUU | AACC | GAAG | CUUC | GGUU | GGCC | GAUC | UGCA | GUAC | CUAG | UGAC | GUCA |
| AAUU | ACCA | AAGG | CCUU | UGGU | CCGG | GAUC | UGCA | ACGU | AGCU | GACU | AGUC |
| UUAA | ACCA | GGAA | UUCC | UGGU | GGCC | GAUC | UGCA | ACGU | AGCU | GACU | AGUC |
| AAUU | CAAC | AAGG | CCUU | GUUG | CCGG | GAUC | CAUG | GUAC | AGCU | GACU | AGUC |
| UUAA | CAAC | GGAA | UUCC | GUUG | GGCC | GAUC | CAUG | GUAC | AGCU | GACU | AGUC |
| AAUU | ACCA | AAGG | CCUU | UGGU | CCGG | UCGA | UGCA | ACGU | CUAG | CUGA | UCAG |
| UUAA | ACCA | GGAA | UUCC | UGGU | GGCC | UCGA | UGCA | ACGU | CUAG | CUGA | UCAG |
| AAUU | CAAC | AAGG | CCUU | GUUG | CCGG | UCGA | CAUG | GUAC | CUAG | CUGA | UCAG |
| UUAA | CAAC | GGAA | UUCC | GUUG | GGCC | UCGA | CAUG | GUAC | CUAG | CUGA | UCAG |