| Literature DB >> 19325778 |
Mehmet Levent Kurnaz1, Isil Aksan Kurnaz2.
Abstract
The rules that specify how the information contained in DNA is translated into amino acid "language" during protein synthesis are called "the genetic code", commonly called the "Standard" or "Universal" Genetic Code Table. As a matter of fact, this coding table is not at all "universal": in addition to different genetic code tables used by different organisms, even within the same organism the nuclear and mitochondrial genes may be subject to two different coding tables. Results In an attempt to understand the advantages and disadvantages these coding tables may bring to an organism, we have decided to analyze various coding tables on genes subject to mutations, and have estimated how these genes "survive" over generations. We have used this as indicative of the "evolutionary" success of that particular coding table. We find that the "standard" genetic code is not actually the most robust of all coding tables, and interestingly, Flatworm Mitochondrial Code (FMC) appears to be the highest ranking coding table given our assumptions. Conclusions It is commonly hypothesized that the more robust a genetic code, the better suited it is for maintenance of the genome. Our study shows that, given the assumptions in our model, Standard Genetic Code is quite poor when compared to other alternate code tables in terms of robustness. This brings about the question of why Standard Code has been so widely accepted by a wider variety of organisms instead of FMC, which needs to be addressed for a thorough understanding of genetic code evolution.Entities:
Keywords: evolution; genetic code; robustness; statistical analysis
Year: 2008 PMID: 19325778 PMCID: PMC2635705 DOI: 10.3390/ijms9050679
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Figure 1.The “Universal Genetic Code Table”, adopted from Introduction to Biology, Campbell and Reece (6th Ed, 2002). START codon (AUG) encodes for Methionine (Met, M), and the three STOP codons are indicated (UAA, UAG, UGA).
Comparison of various genetic coding tables (information accessed from NCBI Entrez).
| Abbrev. Standard | Code Table Standard Code | Differences from Standard |
|---|---|---|
| VMC | The Vertebrate Mitochondrial Code | Code 2 Std |
| AGA stop * Arg R | ||
| AGG stop * Arg R | ||
| AUA Met M Ile I | ||
| UGA Trp W stop * | ||
| YMC | The Yeast Mitochondrial Code | Code 3 Std |
| AUA Met M Ile I | ||
| CUU Thr T Leu L | ||
| CUC Thr T Leu L | ||
| CUA Thr T Leu L | ||
| CUG Thr T Leu L | ||
| UGA Trp W stop * | ||
| CGA absent Arg R | ||
| CGC absent Arg R | ||
| MSC | The Mold, Protozoan, and Coelenterate Mitochondrial Code and the Mycoplasma/Spiroplasma Code | Code 4 Std |
| UGA Trp W stop * | ||
| IMC | The Invertebrate Mitochondrial Code | Code 5 Std |
| AGA Ser S Arg R | ||
| AGG Ser S Arg R | ||
| AUA Met M Ile I | ||
| UGA Trp W stop * | ||
| CDH | The Ciliate, Dasycladacean and Hexamita Nuclear Code | Code 6 Std |
| UAA Gln Q stop * | ||
| UAG Gln Q stop * | ||
| EMC | The Echinoderm Mitochondrial Code | Code 9 Std |
| AAA Asn N Lys K | ||
| AGA Ser S Arg R | ||
| AGG Ser S Arg R | ||
| UGA Trp W stop * | ||
| ENC | The Euplotid Nuclear Code | Code 10 Std |
| UGA Cys C stop * | ||
| AYNC | The Alternative Yeast Nuclear Code | Code 12 Std |
| CUG Ser S Leu L | ||
| AMC | The Ascidian Mitochondrial Code | Code 13 Std |
| AGA Gly G Arg R | ||
| AGG Gly G Arg R | ||
| AUA Met M Ile I | ||
| UGA Trp W stop * | ||
| FMC | The Flatworm Mitochondrial Code | Code 14 Std |
| AAA Asn N Lys K | ||
| AGA Ser S Arg R | ||
| AGG Ser S Arg R | ||
| UAA Tyr Y stop * | ||
| UGA Trp W stop * | ||
| BNC | Blepharisma Nuclear Code | Code 10 Std |
| UAG Gln Q stop * |
Representative genes that are encoded by different coding tables were used in this study.
| Coding Table | GenBank Accession Number | Gene Name / Explanation |
|---|---|---|
| YMC | X69431 | |
| YMC | X69430 | |
| YMC | X02439 | |
| YMC | AF442220 | |
| YMC | KLU75348 | |
| YMC | Mitochondrion | |
| YMC | Mitochondrion | |
| YMC | Mitochondrion | |
| IMC | AF329059;
| |
| IMC | DQ202128; | |
| CDS 32-520 | (NADH2) gene, partial cds; mitochondrial. | |
| IMC | AB275882 | |
| IMC | X99667 | |
| IMC | DROMTM2A | |
| IMC | AF164587 | |
| IMC | S76764 | |
| IMC | Caenorhabditis | |
| IMC | Drosophila | |
| VMC | NM_002488 | |
| VMC | BC128726 | |
| VMC | BC010318 | |
| VMC | X79547 | |
| VMC | NM_001079924 | |
| VMC | PTU12706 | |
| VMC | NM_008617 | |
| VMC | NM_029696 | |
| VMC | NM_008618 | |
| VMC | NM_010344 | |
| VMC | NM_001009329 | |
| VMC | ||
| VMC | ||
| VMC | ||
| FMC | AJ621238 | |
| FMC | AF188122 | |
| FMC | DQ402037 | |
| FMC | AY147416 | |
| FMC | ||
| ENC | AY124990 | |
| ENC | X71353 | |
| ENC | EF030059 | |
| ENC | DQ866998 | |
| ENC | Y09551 | |
| ENC | AF273753 | |
| ENC | AY295877 | |
| ENC | DQ864704 | |
| ENC | S72098 | |
| ENC | J04533 | |
| ENC | ||
| ENC | ||
| CDH | AY293806 | |
| CDH | HIU37081 | |
| CDH | Z11836 | |
| CDH | AY008386 | |
| CDH | X57926 | |
| CDH | AF194336 | |
| CDH | XM_001032213 | |
| CDH | XM_001031057 | |
| CDH | ||
| CDH | ||
| MSC | X65223 | |
| MSC | X65223 | |
| MSC | X65223 | |
| MSC | NEUMTCOIJ | |
| MSC | AY548157 | |
| MSC | ||
| MSC | ||
| Std | NM_001614 | Human actin, gamma1 |
| Std | AB062393 | Human tubulin-beta |
| Std | AF141347 | Human tubulin-alpha |
| Std | HUMACTA1 | Human actin-beta |
| Std | AB292109 | |
| Std | AB292108 | |
| Std | NM_001081838 | |
| Std | X69884 | |
| Std | NM_001009165 | |
| Std | NM_001009945 | |
| Std | NM_001034095 | |
| Std | NM_001045509 | |
| Std | NM_001098544 | |
| Std | NM_001098572 | |
| Std | AF091101 | |
| Std | MUSHSC70T | |
| Std | NM_007906 | |
| Std | NM_007393 | |
| Std | NM_009609 | |
| Std | NM_134024 | |
| Std | NM_009984 | |
| Std | NM_013486 | |
| Std | NM_001009326 | |
| Std | NM_001009841 | |
| Std | EF407948 | |
| Std | EF201934 | |
| Std | DQ256465 | |
| Std | EF199625 | |
| Std | Human average | |
| Std | ||
| Std | ||
| Std | ||
| Std |
(based on genome-based codon usage frequencies obtained from http://www.kazusa.or.jp/codon/)
Figure 2.Comparison of various coding tables with respect to robustness for selected human and primate genes that are normally subject to Standard Genetic Code. “Average” genes represent hypothetical and idealized genes constructed using average codon usage frequencies (see Methods); accession numbers of the genes are listed in Table 2. Coding table abbreviations are given in Table 1.
Figure 3.Comparison of various coding tables with respect to robustness for selected genes from cat, horse and mouse that are normally subject to Standard Genetic Code. “Average” genes represent hypothetical and idealized genes constructed using average codon usage frequencies (see Methods); accession numbers of the genes are listed in Table 2. Coding table abbreviations are given in Table 1.
Figure 4.Comparison of various coding tables with respect to robustness for selected genes that are normally subject to CDH Coding Table. “Average” genes represent hypothetical and idealized genes constructed using average codon usage frequencies (see Methods); accession numbers of the genes are listed in Table 2. Coding table abbreviations are given in Table 1.
Figure 5.Comparison of various coding tables with respect to robustness for selected genes that are normally subject to FMC Coding Table. “Average” genes represent hypothetical and idealized genes constructed using average codon usage frequencies (see Methods); accession numbers of the genes are listed in Table 2. Coding table abbreviations are given in Table 1.
Figure 6.Ranks of coding tables across organisms and genes in terms of robustness and survivability. (A) A representative list of statistical calculations of survivability for a given gene if it were subject to different coding tables. (B) For genes that are normally decoded by one given Code Table, the average rank of certain alternate tables in terms of performance in survivability (projection of robustness). (C) A summary of performance comparison of all the coding tables tested with respect to one another. The rows show averages of all the genes analyzed for that particular coding scheme (ie, AVERAGE(FMC) means average ranks for genes that are normally subject to FMC), while the columns indicate what would have happened if alternative coding schemes were adopted.