| Literature DB >> 30841864 |
Paweł BłaŻej1, Małgorzata Wnetrzak2, Dorota Mackiewicz2, Paweł Mackiewicz2.
Abstract
BACKGROUND: The standard genetic code is a recipe for assigning unambiguously 21 labels, i.e. amino acids and stop translation signal, to 64 codons. However, at early stages of the translational machinery development, the codons did not have to be read unambiguously and the early genetic codes could have contained some ambiguous assignments of codons to amino acids. Therefore, the goal of this work was to obtain the genetic code structures which could have evolved assuming different types of inaccuracy of the translational machinery starting from unambiguous assignments of codons to amino acids.Entities:
Keywords: Amino acid; Codon; Evolution; Evolutionary algorithm; Graph theory; Optimization; The standard genetic code
Mesh:
Substances:
Year: 2019 PMID: 30841864 PMCID: PMC6404327 DOI: 10.1186/s12859-019-2661-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Changes in the best approximation of the fitness function F with the number of generations (the black line). All approximations were done for 50 simulations using the Generalized Additive Models. The simulations were run under M1 scenario with different initial seeds. The independent simulations show a very narrow confidence interval depicted by the grey strip. The results were compared with the average fitness value calculated for the standard genetic code (the orange line)
Fig. 2Changes in the average genetic code entropy value during the simulation time calculated for three scenarios M1,M2,M3. The average genetic code entropy is the arithmetic mean of the genetic code entropy evaluated for all candidate solution
Fig. 3Box-plots of the average coding signal strength calculated at the end of the simulations under three scenarios M1,M2 and M3 for 50 independent simulation runs per scenario. The thick horizontal line indicates the median (IQR, the inter-quartile range), the box shows the range between the first and the third quartiles and the whiskers determine the range without outliers for the assumption 1.5×IQR
Fig. 4Box-plots of the average code conductance calculated at the end of the simulations under three scenarios M1,M2 and M3 for 50 independent simulation runs per scenario. The thick black horizontal line (inside each box) indicates the median (IQR, the inter-quartile range), the box shows the range between the first and the third quartiles and the whiskers determine the range without outliers for the assumption 1.5×IQR. The results were compared with the average code conductance Φ calculated for the standard genetic code (the orange horizontal line) and the minimum value of the average code conductance (the red horizontal line)
Fig. 5The frequencies of codon group sizes observed in the standard genetic code (a) as well as in the MLGP representations of genetic codes at the end of 50 independent simulation runs under the M1 (b), M2 (c) and M3 (d) scenarios
Fig. 6The matrix representation of a genetic code at the beginning of the simulations (a) as well as obtained at the end of the simulations under the M1 (b), M2 (c) and M3 (d) scenarios. Each row contains values of the probability function represented by a respective rectangle. The colour of the rectangles indicates high (light blue) or low (dark blue) probability that a given codon (row) encodes a given label (column). It is evident that codon blocks of the size 2 and 4 show high probabilities (light blue colour) and dominate in the code under the M1 scenario. In the case of other scenarios the codes show much greater ambiguity
Fig. 7The examples of graph representations of codon groups with the minimal 2, 4 and 6-size conductance: ϕ2(G), ϕ4(G) and ϕ6(G), respectively. The first two cases dominate in the best genetic code produced under the M1 scenario and the latter is observed in the best genetic code produced under the M2 scenario
The codon groups of the best genetic code in terms of the fitness function F extracted from 50 independent simulations under the M1 scenario
| Codon group ( |
| |||
|---|---|---|---|---|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 0.8648035 |
|
|
| { | 2 | 0.8648030 |
|
|
| { | 2 | 0.8344635 |
|
|
| { | 2 | 0.8344630 |
|
|
| { | 2 | 0.6446255 |
|
|
| { | 2 | 0.6446250 |
|
|
The groups S are characterized by: the size k, the coding strength ψ(S), the conductance ϕ(S) and the minimal conductance of the codon group with the size k ϕ(G)
The codon groups of the best genetic code in terms of the fitness function F extracted from 50 independent simulations under the M3 scenario
| Codon group ( |
| |||
|---|---|---|---|---|
| { | 4 | 0.7453878 |
|
|
| { | 4 | 0.7347630 |
|
|
| { | 4 | 0.5928058 |
|
|
| { | 4 | 0.6837612 |
|
|
| { | 4 | 0.5734470 |
|
|
| { | 3 | 0.9164170 |
|
|
| { | 3 | 0.8623860 |
|
|
| { | 3 | 0.8267687 |
|
|
| { | 3 | 0.8261313 |
|
|
| { | 3 | 0.8157710 |
|
|
| { | 3 | 0.7968670 |
|
|
| { | 3 | 0.7812357 |
|
|
| { | 3 | 0.7741430 |
|
|
| { | 3 | 0.7475287 |
|
|
| { | 3 | 0.7347630 |
|
|
| { | 3 | 0.7112670 |
|
|
| { | 3 | 0.7241587 | 1 |
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
| { | 2 | 1.0000000 |
|
|
The groups S are characterized by: the size k, the coding strength ψ(S), the conductance ϕ(S) and the minimal conductance of the codon group with the size k ϕ(G)
The codon groups of the best genetic code in terms of the fitness function F extracted from 50 independent simulations under the M2 scenario
| Codon group ( |
| |||
|---|---|---|---|---|
| { | 6 | 0.9867038 |
|
|
| { | 6 | 0.9866648 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 1.0000000 |
|
|
| { | 4 | 0.8262542 |
|
|
| { | 4 | 0.7837840 |
|
|
| { | 4 | 0.6426162 |
|
|
| { | 3 | 0.8457703 |
|
|
| { | 3 | 0.8136870 |
|
|
| { | 3 | 0.7359857 |
|
|
| { | 3 | 0.7170777 |
|
|
| { | 3 | 0.5927813 |
|
|
| { | 3 | 0.5598730 |
|
|
| { | 3 | 0.4932700 |
|
|
| { | 2 | 0.8650405 |
|
|
| { | 2 | 0.8027995 |
|
|
| { | 2 | 0.7850055 |
|
|
| { | 2 | 0.7838025 |
|
|
| { | 1 | 1.0000000 | 1 | 1 |
| { | 1 | 0.9967190 | 1 | 1 |
| { | 1 | 0.5734410 | 1 | 1 |
The groups S are characterized by: the size k, the coding strength ψ(S), the conductance ϕ(S) and the minimal conductance of the codon group with the size k ϕ(G)