Literature DB >> 31922535

Evidence of multifaceted functions of codon usage in translation within the model beetle Tribolium castaneum.

Carrie A Whittle1, Arpita Kulkarni1, Cassandra G Extavour1,2.   

Abstract

Synonymous codon use is non-random. Codons most used in highly transcribed genes, often called optimal codons, typically have high gene counts of matching tRNA genes (tRNA abundance) and promote accurate and/or efficient translation. Non-optimal codons, those least used in highly expressed genes, may also affect translation. In multicellular organisms, codon optimality may vary among tissues. At present, however, tissue specificity of codon use remains poorly understood. Here, we studied codon usage of genes highly transcribed in germ line (testis and ovary) and somatic tissues (gonadectomized males and females) of the beetle Tribolium castaneum. The results demonstrate that: (i) the majority of optimal codons were organism-wide, the same in all tissues, and had numerous matching tRNA gene copies (Opt-codon↑tRNAs), consistent with translational selection; (ii) some optimal codons varied among tissues, suggesting tissue-specific tRNA populations; (iii) wobble tRNA were required for translation of certain optimal codons (Opt-codonwobble), possibly allowing precise translation and/or protein folding; and (iv) remarkably, some non-optimal codons had abundant tRNA genes (Nonopt-codon↑tRNAs), and genes using those codons were tightly linked to ribosomal and stress-response functions. Thus, Nonopt-codon↑tRNAs codons may regulate translation of specific genes. Together, the evidence suggests that codon use and tRNA genes regulate multiple translational processes in T. castaneum.
© The Author(s) 2020. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  non-optimal codons; optimal codons; tRNA genes; translation regulation; translational selection

Mesh:

Substances:

Year:  2019        PMID: 31922535      PMCID: PMC6993815          DOI: 10.1093/dnares/dsz025

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


1. Introduction

In protein coding genes, the synonymous codons of amino acids are not used randomly. Biases in codon usage are thought to result from selection for translational efficiency and/or accuracy. Mutational pressures can also shape codon usage., Translational selection in many organisms has been supported by findings that the highly transcribed genes preferentially use a subset of codons, often described as ‘optimal’ codons,,, and has been observed in bacteria,,, fungi,,, plants,,, and animals, including spiders and insects (e.g. Drosophila, Aedes, Anopheles, Gryllus, and Oncopeltus, and weakly observed in Bombyx,,). Whole genome data show that optimal codons typically have correspondingly high numbers of iso-accepting tRNA gene copies in the genome, reflecting an organism’s relative tRNA abundance,,,,,, and consistent with selection for translational optimization.,,,,,, The utility of tRNA gene number to quantify organismal tRNA abundance has been supported in vivo in bacteria and eukaryotes.,, For instance, the addition of tRNA genes for a codon of a specific amino acid to the Escherichia coli genome markedly improved translation rates of genes containing that amino acid. In this regard, the increased use of optimal codons in highly transcribed genes,,, and the correspondence of these codons to abundant tRNA genes,, suggest that selection may favour optimization for cost efficient and/or accurate translation. In contrast to unicellular systems, in multicellular organisms measuring codon usage can be complicated by the plurality of tissues, as optimal codons and tRNA populations may vary among tissue types. For instance, cellular tRNA abundances can vary among tissues or cell types for at least some codons,,, suggesting that translational selection may differ among tissues. This has also been supported by findings of some variation in codon use of genes transcribed in different tissues in the few organisms studied to date. For example, in the plant Arabidopsis, the use of specific codons in a gene depends on the tissue type in which the gene is maximally expressed, suggesting this species has localized tRNA populations, a pattern that has also been proposed for rice. Although similar studies in metazoans have been rare, a recent investigation in Drosophila melanogaster showed that codons associated with elevated expression were not universal across tissues. For example, AAT was more commonly used than AAC for Asn in some tissues (e.g. testis and hindgut), while TGT was favoured over TGC for Cys in the salivary glands. These observations were interpreted as providing evidence of tissue-specific tRNA populations. Additional studies are warranted to determine the universality of distinct optimal codon identities in various tissues of an organism. In particular, the germ line and somatic tissues comprise contrasts of significant interest, as the former directly determines an organism’s reproductive success and fitness, and experiences haploid selection in the meiotic and sex cells, such that translational optimization may be particularly relevant to those tissues. While much attention has been focussed on optimal codons in the literature, growing experimental research, largely from single-celled models or in vitro systems, suggests that non-optimal codons, those codons least used in highly transcribed genes (and/or codons defined as ‘rare’ in some studies), can also play significant regulatory roles in translation.,, In yeast, for example, it was shown that cells altered their tRNA populations under stress and had increased levels of tRNAs that matched the rare codons found in stress-response genes, thus allowing the preferential translation of those mRNAs under stressful conditions, without any change in mRNA abundance. Findings in cyanobacteria have indicated that circadian rhythms are regulated post-transcriptionally based on non-optimized codon use in genes of the kaiABC1 cluster. Further, non-optimal codons have been shown to slow rates of translational elongation and to control ribosome traffic on mRNA, which allows proper co-translational protein folding and/or functionality, based on in vitro cell-free translation systems from Neurospora and Drosophila. Non-optimal codons have also been found to facilitate co-translational protein folding in various yeast models. These data show that the use of one or a few types of rare codon(s) in a gene may markedly affect its translation, depending on the tRNA pool, suggesting that the supply–demand relationship between non-optimal codons and their matching tRNA abundances could comprise an adaptive mechanism of translational regulation., To further understand this phenomenon, genomics and molecular evolution research on codon usage patterns in animal systems should expand beyond the typical focus on optimal codons, and specifically include assessments of non-optimal codons, and their relationships to tRNA genes. In addition to non-optimal codons per se, some studies have indicated that the use of codons that have no matching tRNA, and obligately require wobble codon–anticodon tRNAs (wobbly at the third nucleotide of the codon) may also influence translation. For instance, an investigation in four divergent eukaryotes found that the relative translation levels of cell-cycling gene mRNAs during various stages of the cell cycle depend on the frequency of codons lacking corresponding tRNA gene copies in the genome, and thus requiring wobble tRNA. Further, experimental research in yeast, human cells, and nematodes has shown that obligatory use of wobble tRNA decelerates translational elongation by slowing ribosomal translocation on the mRNA.,, In this regard, the use of codons that require wobble tRNA could have a significant effect on translational dynamics, particularly in slowing translation, and thus should also be considered in studies of codon usage patterns in an organism. A metazoan species providing a promising pathway for the comprehensive study of codon usage in a multicellular system is the Coleopteran rust red flour beetle Tribolium castaneum. T. castaneum is a long standing model for genetics and developmental biology, has a well characterized genome,,, and is estimated to have diverged from the fellow insect Drosophila approximately 300 Mya. While a prior pioneering study had identified a putative list of optimal codons for T. castaneum, the approach used in that study involved correlation analyses between codon frequency and expression level. Given that this method has been thought to often be poorly suited to revealing optimal codons, defined as those most common in highly transcribed genes,,, analyses of codon use in this taxon would benefit from being revisited with alternative methods. Optimal codons can be most readily revealed via direct contrasts of codon usage in the highest versus lowest expressed genes in the genome, also known as the contrast method.,,,, At present, as is the case for most multicellular model organisms, a multifaceted integrative approach has not yet been applied to assessments of codon usage in this beetle taxon, including the identification of optimal and non-optimal codons in highly transcribed genes at an organism-wide level, and within the somatic versus germ line tissues, nor have assessments been available of the links between codon usage and tRNA gene counts, wobble tRNA, and gene functionality. In the present study, we address these outstanding issues on codon usage in T. castaneum using genome-wide protein-sequence datasets (CDS) and large-scale transcriptome datasets from the male and female germ lines and somatic tissues [testes, ovaries, gonadectomized (GT-) males, and GT-females]. From these data, we rigorously study optimal and non-optimal codons in this taxon, and their relationships to tRNA abundances and gene ontology. We report strong evidence for organism-wide optimal codons in all four tissue types and both sexes. The majority of these optimal codons have abundant matching tRNAs (Opt-codon↑tRNA status), consistent with pervasive translational selection for efficient and/or accurate protein synthesis in this species. A minority of optimal codons vary among the four tissues, suggesting small, but potentially meaningful, differences in tRNA populations between tissue types. Crucially, we report that a subset of the optimal codons did not have direct tRNA matches, and thus presumably obligately required wobble tRNA for translation (Opt-codonwobble), which we propose may comprise a mechanism for slowing translation for accuracy or protein-folding purposes. Finally, we find that a number of non-optimal codons unexpectedly have abundant perfectly matching tRNA gene copies (Nonopt-codon↑tRNAs) and that these rare codons are preferentially used in genes with specific functions, including ribosomal protein genes and stress-response genes. Thus, we hypothesize that the use of codons with Nonopt-codon↑tRNAs status may be a potential mechanism to ensure preferential translation of specific gene mRNAs. Collectively, our results reveal the multiple roles of codon usage in this beetle, suggesting not just pervasive selection for the use of specific codons in highly transcribed genes for efficient and/or accurate translation, but also translational regulatory roles of wobble codons and of non-optimal codons.

2. Materials and methods

2.1. Tribolium castaneum CDS

The annotated CDS of our main target species T. castaneum (v.5.2) were downloaded from Ensembl Metazoa (http://metazoa.ensembl.org) and are also available at BeetleBase., The full CDS per gene (longest CDS per gene, N = 16,434) was used for the study of codon usage. The full genome and its descriptive GFF file were also downloaded for assessments.

2.2. Biological samples and RNA-seq

We aimed to determine the expression level (FPKM) for each of 16,434 genes in T. castaneum for germ line and somatic tissues. For this we generated large-scale RNA-seq datasets for the ovaries, testes, GT-females, and GT-males, shown in Supplementary Table S1. The T. castaneum specimens were provided by the Brown lab at KSU (https://www.k-state.edu/biology/people/tenure/brown/). Samples were grown under standard conditions until adulthood and tissue dissections were then performed on unmated adults (a total of 150 animals per sex per biological replicate). RNA was extracted and processed for RNA-seq as described previously.

2.3. Gene expression

The RNA-seq reads (76 bp) per sample were trimmed of adapters and poor-quality bases using the program BBduk available from the Joint Genome Initiative (https://jgi.doe.gov/data-and-tools/bbtools/) set at default parameters. Gene expression level was determined for the 16,434 genes (CDS) as FPKM after mapping each RNA-seq dataset per tissue to the full CDS list for each species using Geneious Read Mapper, which yielded highly similar results as other common mappers such as BBmap (https://jgi.doe.gov/data-and-tools/bbtools/; data not shown). The average FPKM across samples per tissue type (Supplementary Table S1) was used to measure expression per tissue. FPKM values were highly correlated between replicates of each sample type (Spearman’s Ranked R > 0.9, P < 2×10−7).

2.4. Identification of optimal and non-optimal codons

For identification of the optimal codons, we measured the relative synonymous codon usage (RSCU) per codon per amino acid for each gene under study using CAICal. RSCU values indicate the relative usage of a codon in a synonymous codon family, and values >1 and <1 indicate favoured and unfavoured usage as compared with that expected under equal usage of all codons, respectively. Greater relative RSCU values among codons indicates elevated usage. For each of the 18 amino acids in the genetic code with synonymous codons (note that Trp and Met only have one codon each), we identified the optimal codon using the contrast method.,,,,, For this, we determined the difference in RSCU (ΔRSCU) per codon between genes with the highest 5% versus the lowest 5% expression. The primary optimal codon for each amino acid was defined as the codon with the highest and statistically significant positive ΔRSCU value, indicating preferred usage in highly transcribed genes.,,,,, The primary non-optimal codon per amino acid was defined as the codon with the largest negative and statistically significant ΔRSCU value, indicating low usage in highly transcribed genes. Statistical significance per codon was applied using a t-test between RSCU values across all genes for high versus low expressed genes. As the literature reflects some variation in codon use terminology among studies to date, we explicitly define the term ‘optimal codons’ herein as those codons most used in highly transcribed genes based on ΔRSCU, which infers an innate advantage of the codon under high transcription. Then, we secondarily assessed each optimal codon’s correspondence to the number of matching (codon–anticodon) tRNA genes in order to test their role in translational accuracy/efficiency,,,,,, or to infer possible other functions (e.g. wobble codons for translational slowing). For non-optimal codons, a similar approach was used wherein the non-optimal codon status was identified based solely on ΔRSCU, and their relationships to tRNA were then separately assessed. The frequency of optimal codons (Fop) is a measure of the degree of optimal codon usage per gene. Fop was determined in CodonW using the primary optimal codons identified herein. Fop was also determined using the primary optimal codons previously identified by Williford and Demuth 2012. As multiple codons per amino acid were classified as optimal in that assessment, we defined each primary optimal codon from the study as that with the strongest average positive correlation across tissues for measuring Fop. For an additional layer of stringency, we wished to exclude the possibility that expression-mediated mutational biases towards specific nucleotides, which have been observed to some extent in certain organisms to date (e.g. E. coli, humans,), contribute towards codon differences among high and low expressed genes herein. For this, we extracted all introns for every gene in the genome (those with introns) using the GFF file available (see section 2.1). Introns are thought to be mostly selectively neutral,, and thus their nucleotide content should reflect any underlying mutational pressures in the genome, and on the nucleotide composition of synonymous codons in an organism.,, If mutational pressures on introns are not associated with gene expression level, this factor may be excluded as causing optimal codons in the highly expressed genes, further suporting a role for selection in optimal codon distribution. All introns that were >50bp were extracted as the regions between exons and were concatenated per gene. The association between intronic GC content and expression level was assessed using a scatter plot and Spearman’s ranked R.

2.5. Identification of tRNA genes

To assess whether or how the optimal and non-optimal codons were related to the tRNA gene copy number, we determined the number of iso-accepting tRNA genes per codon in the genome (T. castaneum v. 5.2) using tRNA-scan SE.,, The list of tRNA gene numbers identified in the current genome version was identical to that reported previously and is shown in Table 1.
Table 1

The organism-wide ΔRSCU between high versus low expressed genes at the organism-wide level (averaged expression across tissues) and for each of the four tissue types: the ovaries, testes, GT-females, and GT-males)

Organism-wide RSCU and ΔRSCU (from average expression across all tissues)
ΔRSCU per tissue type (from expression within each tissue)
Amino acidCodonHigh RSCULow RSCUΔRSCU P Opt.Non opt.tRNA No.Standard/ WobbleΔRSCU ovaries P ΔRSCU testes P ΔRSCU GT- female P ΔRSCU GT- male P
AlaGCT1.1441.001 +0.143 ** X 14 +0.109 +0.136 * +0.179 ** +0.146 **
AlaGCC1.2381.203+0.0340GGC/AGC+0.023+0.020+0.115 * +0.198 **
AlaGCA0.8330.867−0.0332−0.065*a−0.008−0.110 * −0.175 **
AlaGCG0.7310.899−0.168 ** X 3−0.047−0.128 ** −0.163 ** −0.161 **
ArgCGT0.9190.830+0.089 * 5+0.082+0.087 * −0.043+0.116 *
ArgCGC0.9071.117−0.209 ** 0GCG/ACG−0.204 * −0.112 ** −0.231 ** −0.091 *
ArgCGA0.9461.212−0.265 ** X b 4−0.304 ** −0.143 ** −0.189 ** −0.282 **
ArgCGG0.6500.941−0.290 ** X 0CCG/UCG−0.235 ** −0.263 ** −0.195 ** −0.272 **
ArgAGA1.4010.990 +0.411 ** X 3 +0.393 ** +0.341 ** +0.415 ** +0.276 **
ArgAGG1.0960.801+0.295 ** 3+0.367 ** +0.247 ** +0.250 ** +0.287 **
AsnAAT1.0300.997+0.0330AUU/GUU+0.039+0.041*a+0.012−0.027
AsnAAC0.9550.864 +0.091 ** X 5 +0.071 * +0.067 * +0.066 * +0.115 **
AspGAT1.0020.938 +0.063 * X 0AUC/GUCc +0.115 ** +0.084 ** +0.070 * +0.031
AspGAC0.9420.964−0.021 X d 10−0.035*a−0.008−0.026+0.000
CysTGT0.9860.854 +0.131 ** X 0ACA/GCAc +0.204 ** +0.130 ** +0.155 ** +0.103 **
CysTGC0.8020.827−0.0253−0.026+0.028−0.029 * −0.006
GlnCAA1.1791.098 +0.081 * X 5 +0.043 +0.089 * +0.099 * +0.064 *a
GlnCAG0.7580.785−0.0263+0.026+0.002−0.053−0.015
GluGAA1.2361.110 +0.125 ** X 8 +0.093 * +0.101 ** +0.100 * +0.086 *
GluGAG0.7330.767−0.0345−0.006+0.002−0.036−0.024
GlyGGT0.9180.801 +0.116 ** X 0ACC/GCCc+0.138 * +0.133 ** +0.046 +0.063 *
GlyGGC1.0171.122−0.104 * X 8−0.109−0.077 * −0.070−0.001
GlyGGA1.1241.201−0.077*a X b 15−0.137 ** −0.071 ** −0.017 * −0.070 **
GlyGGG0.8590.792+0.0661 +0.171 ** +0.072 * +0.059 * +0.043
HisCAT0.8400.817+0.0230AUG/GUG+0.055+0.032+0.016−0.017
HisCAC1.0140.978+0.0367 +0.067 *a+0.054 +0.053 * +0.084 *
IleATT1.3591.278 +0.081 * X e 7 +0.121 ** +0.051 * +0.115 * +0.033*a
IleATC1.0240.941 +0.083 * X e 0GAU/AAU+0.005+0.078+0.012 +0.165 **
IleATA0.5780.661−0.083 * X 2−0.048−0.057−0.071−0.141 **
LeuTTA0.9991.127−0.128 * X 2−0.087 * −0.095 * −0.121 * −0.211 **
LeuTTG1.7941.336 +0.458 ** X 4 +0.409 ** +0.391 ** +0.339 ** +0.444 **
LeuCTT0.9010.998−0.096 * 5−0.139 ** −0.082 * −0.003−0.053*a
LeuCTC0.8770.926−0.0490GAG/AAG−0.081−0.033−0.042+0.036
LeuCTA0.4920.561−0.068 * 2+0.023−0.075*a−0.003−0.073 *
LeuCTG0.9001.008−0.107 * 2−0.093−0.092 * −0.158 ** −0.108 *
LysAAA1.2721.273−0.0006−0.019−0.005 +0.068 * −0.011
LysAAG0.7280.654 +0.074 * X 5 +0.075 * +0.067 * −0.020 +0.058 *
PheTTT1.0581.042+0.0151+0.073+0.003+0.016−0.092 **
PheTTC0.9160.850 +0.065 * X 5 +0.015 +0.076 * +0.034 +0.160 **
ProCCT0.9040.785+0.119 * 7 +0.126 * +0.092 * +0.097 * +0.076 *
ProCCC1.0900.917 +0.172 ** X 0GGG/AGGc+0.064 * +0.131 * +0.163 * +0.264 **
ProCCA1.0441.014+0.029 X d 13+0.166+0.021+0.063−0.021
ProCCG0.8891.102−0.213 ** X 1−0.220 ** −0.140 ** −0.249 ** −0.237 **
SerTCT0.8490.732+0.116 * 4+0.084+0.073+0.061+0.026
SerTCC0.8941.056−0.162 ** 0GGA/AGA−0.137−0.152 ** −0.051−0.010
SerTCA1.0590.977+0.082*a2+0.114+0.072 * −0.017+0.001
SerTCG1.1281.231−0.103 * 2−0.066−0.023−0.101 * −0.012
SerAGT1.1490.922 +0.226 ** X 0ACU/GCUc +0.218 ** +0.197 ** +0.268 ** +0.137 **
SerAGC0.9001.039−0.138 * X 3−0.156 * −0.156 ** −0.125 ** −0.105 **
ThrACT1.1070.884 +0.222 ** X 5 +0.199 ** +0.266 ** +0.188 ** +0.207 **
ThrACC1.0321.003+0.0290GGU/AGU+0.026−0.026*a+0.148 * +0.178 **
ThrACA1.0011.013−0.0123−0.027−0.059−0.076−0.136 **
ThrACG0.8121.006−0.194 ** X 2−0.129 * −0.113 * −0.213 ** −0.211 **
TyrTAT0.8190.881−0.062 * 0AUA/GUA+0.040−0.018 * −0.041 * −0.080 **
TyrTAC1.0960.898 +0.197 ** X 13 +0.123 * +0.162 ** +0.156 ** +0.210 **
ValGTT1.2621.133 +0.129 * X 7 +0.124 * +0.119 * +0.157 * +0.101 *
ValGTC0.9760.999−0.0220GAC/AAC−0.017−0.034*a−0.027+0.053
ValGTA0.5520.625−0.073 * X 5−0.047 * −0.064 * −0.019−0.082 *
ValGTG1.1561.140+0.0153+0.020+0.032−0.055−0.041

The ΔRSCU are shown when high and low expressed genes were determined for each of the four individual tissue types. The primary optimal (Opt.) codons are in bold and have the largest positive and statistically significant ΔRSCU (t-test P < 0.05) per amino acid. For the combined four tissue assessment (organism-wide), the primary optimal (Opt.) and non-optimal codons (non opt.) are shown with X. Cases where relatively plentiful tRNA genes match the optimal codon per amino acid are underlined and bold. The wobble anticodons for codons with zero matching tRNA copies are shown (standard anticodon/wobble anticodon shown according to classical wobble rules; see also,).

P < 0.05 and ≥0.001.

P < 0.001.

P-values are between 0.05 and 0.1 and thus is considered a putative optimal or non-optimal codon.

Secondary non-optimal codon with relatively high matching tRNA count.

Codon has Opt-codonwobble status.

While not having a statistically significant negative ΔRSCU, the codon is not optimal and is notable by its high tRNA count.

Both codons are optimal codons at nearly the same level.

The organism-wide ΔRSCU between high versus low expressed genes at the organism-wide level (averaged expression across tissues) and for each of the four tissue types: the ovaries, testes, GT-females, and GT-males) The ΔRSCU are shown when high and low expressed genes were determined for each of the four individual tissue types. The primary optimal (Opt.) codons are in bold and have the largest positive and statistically significant ΔRSCU (t-test P < 0.05) per amino acid. For the combined four tissue assessment (organism-wide), the primary optimal (Opt.) and non-optimal codons (non opt.) are shown with X. Cases where relatively plentiful tRNA genes match the optimal codon per amino acid are underlined and bold. The wobble anticodons for codons with zero matching tRNA copies are shown (standard anticodon/wobble anticodon shown according to classical wobble rules; see also,). P < 0.05 and ≥0.001. P < 0.001. P-values are between 0.05 and 0.1 and thus is considered a putative optimal or non-optimal codon. Secondary non-optimal codon with relatively high matching tRNA count. Codon has Opt-codonwobble status. While not having a statistically significant negative ΔRSCU, the codon is not optimal and is notable by its high tRNA count. Both codons are optimal codons at nearly the same level.

2.6. GO functions

The predicted GO functions were determined using Panther using the option for T. castaneum as species.

2.7. Data availability

The CDS and genome v. 5.2 for T. castaneum are available at Ensembl Metazoa (http://metazoa.ensembl.org). RNA-seq data for all samples from T. castaneum described in Supplementary Table S1 are available at the SRA database under Bio-project number PRJNA564136.

3. Results and discussion

3.1. Optimal codons in T. castaneum

We first report the organism-wide, or global, optimal codon per amino acid for T. castaneum using ΔRSCU and the average expression levels of all annotated genes across all four studied tissue types (testis, ovary, GT-male, and GT-female) in Table 1. The primary optimal codon, defined as the codon with the largest positive ΔRSCU between highly and lowly transcribed genes and with P < 0.05), was identified for 17 of the 18 amino acids with synonymous codons. Seven primary optimal codons ended in T, three in A, five in C, and two in G. We noted that Ile had two codons with nearly identical ΔRSCU values. Further, CAC for His showed signs of optimal codon usage in several individual tissues (see following section), and including this codon yields a study-wide total of 18 optimal codons (Table 1). The range of ΔRSCU values was similar to or larger than that observed in other multicellular eukaryotes, including nematode species, Drosophila, Populus, and Neurospora., Thus, the patterns in Table 1 are consistent with the hypothesis that selection pressures have favoured the use of a specific subset of codons in highly expressed genes (for results on non-optimal codons see Section 3.4). While the striking use of specific optimal codons in genes under high expression levels in Table 1 in itself provides evidence of selection on codon usage, we wished to include additional layers of stringency to affirm the role of selection in favouring these codons. First, we determined the frequency of optimal codons (Fop), a measure of the degree of optimal codon usage per gene, for all studied genes in the genome (N = 16,434). As shown in Fig. 1A, we found that the Fop increased from genes with low (lowest 5% in the genome), to moderate (5–95%), to high (highest 5%) expression levels (Ranked ANOVA and Dunn’s paired test P < 0.05). As low and high expressed genes were used to identify the optimal codons, the Fop was, as expected, lowest and highest in those categories of genes, respectively. Importantly, however, moderately expressed genes, which were not used to identify the optimal codons, showed intermediate Fop values, suggesting a genome-wide tendency for greater use of optimal codons in CDS with elevated expression. Second, as codon usage can vary with protein length in some eukaryotes,,, we repeated the assessment in Fig. 1A using genes with similar CDS lengths, which we binned into short (<150 codons), medium (≥150 to <300), and long CDS (≥300). For each of these three length categories, we found the same stepwise increase of Fop values with expression level (Ranked-ANOVAs P < 0.001). Thus, the link between expression and optimal codons cannot be explained by protein length. Third, from examination of introns, wherein nucleotide content is mostly shaped by mutational pressures,,, we found that the GC (and thus AT) content of introns was uncorrelated to gene expression level (Spearman’s correlation R= −0.09, Fig. 1B),, and thus indicates an absence of expression-mediated mutational biases,,, in this species. Further to this point, unlike some organisms wherein optimal codons typically end in only two or three types of nucleotides,,,, all four nucleotides are represented at the terminal position of optimal codons of this species (Table 1); this also excludes mutational biases in shaping the optimal codons in highly transcribed genes in this taxon. Taken together, while we do not exclude the possibility that non-selective (mutational) mechanisms may contribute towards codon use of genes, particularly those under low or even moderate expression, our observations indicate that a history of selection pressures likely plays a significant role in shaping the codon use of the most highly transcribed genes (top 5% expression) in this organism (Table 1).
Figure 1

(A) Box plots of the frequency of optimal codons (Fop) across all 16,434 genes studied in T. castaneum. Genes are categorized into low (lowest 5%), moderate (5–95%), and high (top 5%) transcription groups based on average expression across all four tissue types (testes, ovaries, GT-males, and GT-females). Different letters below bars indicate a statistically significant difference using Ranked ANOVA and Dunn’s paired contrasts (P < 0.05). (B) The GC content of introns with respect to the expression level per gene (Spearman’s Ranked R is shown). Values are shown for all genes with introns >50 bp.

(A) Box plots of the frequency of optimal codons (Fop) across all 16,434 genes studied in T. castaneum. Genes are categorized into low (lowest 5%), moderate (5–95%), and high (top 5%) transcription groups based on average expression across all four tissue types (testes, ovaries, GT-males, and GT-females). Different letters below bars indicate a statistically significant difference using Ranked ANOVA and Dunn’s paired contrasts (P < 0.05). (B) The GC content of introns with respect to the expression level per gene (Spearman’s Ranked R is shown). Values are shown for all genes with introns >50 bp.

3.2. Most, but not all, optimal codons are the same across germ line and somatic tissues

To compare optimal codon usage among the tissues under study, we next determined the optimal codons (using ΔRSCU) using genes with high versus low expression (top and lowest 5%) separately for each of the four individual tissue types, ovaries, testes, GT-females, and GT-males. For rigour in this assessment, we identified the subset of genes in the top 5% expression class that were only in the top category for one tissue type (and were not in the top 5% expression class in any of the other three tissues), to discern whether or not there was a tissue effect on optimal codons. Under these criteria, we identified 372, 450, 444, and 272 genes for analysis, for ovaries, testes, GT-females, and GT-males, respectively. This allowed us to specifically assess the codon usage of genes that were maximally transcribed only in one individual tissue, as it has been found that if tissue type has an effect on codon use, this effect is most apt to be evident in its highly transcribed genes. The results for ΔRSCU per tissue type are shown in Table 1. We report that 15 of the 18 primary optimal codons (including His) from the organism-wide assessment were identified as having the same optimal codon in three, or all four, of the individual tissue types (Table 1). Thus, the vast majority of primary optimal codons were the same in these divergent tissues, including male and female germ lines and somatic tissue types. However, several significant differences were also observed among tissues. For example, a male-specific primary optimal codon was identified for the amino acid Phe (with two synonymous codons), as the codon TTC was optimal in the testes and GT-males, but not in the ovaries or GT-females (Table 1). Similarly, a GT-male-specific primary optimal codon ATC was identified for Ile (with three synonymous codons), whereas ATT was optimal for the other three tissues. In turn, an ovary-specific optimal codon was evident for Pro (with four synonymous codons), as the primary optimal codon was CCC in all tissues except for the ovaries, where it was CCT. In addition, a GT-female optimal codon was identified for Lys (two synonymous codons), where AAG was optimal in the ovaries, testes, and GT-males, but its alternate codon AAA was optimal for GT-females. These examples show that the primary optimal codon varies among tissue types in this beetle. This pattern therefore suggests that translational selection regimes, and thus corresponding tRNA populations may also vary among tissues. Further, it is worth noting that in some cases there may be tissue-specific preferences for codons using wobble tRNA (e.g. ATC for Ile in GT-males, see Section 3.3). These present results are consistent with the few available studies of tissue-specific codon usages and translational selection from the fellow insect D. melanogaster and in studied plants, (note that although some evidence suggests humans have tissue-specific optimal codons, this has been debated, and may largely be an effect of the GC content of isochores,). Together, while the vast majority of optimal codons are shared across tissues in these beetles, non-negligible differences are observed between tissues and sexes. Direct quantification of tRNAs in cells or tissues has been mostly restricted to date to lab models of bacteria, yeast, or in vitro human cell lines,,,,, and the accuracy and limitations of the various approaches used for tRNA quantification (microarrays, Northern blot, quantitative PCR, and RNA-seq) remains debated.,,, Nevertheless, the development of robust methods to sequence tRNAs that are applicable to non-traditional model organisms will allow further tests of whether or how tRNA expression levels vary with tissues in T. castaneum, as is strongly suggested by these results.

3.3. A majority of organism-wide optimal codons have high tRNA gene copy numbers

Given the minimal differences among tissues, for our remaining analyses, we focus on the organism-wide optimal codon usages (Table 1). The number of tRNA gene copies in the genome has commonly been used as a measure of the relative abundance of each tRNA species.,,,,,, If optimal codon usage were consistently a result of selection in response to abundant tRNAs, then the primary optimal codon per amino acid should also have high relative tRNA gene frequency (Opt-codon↑tRNAs status). When using the organism-wide optimal codon list (Table 1), we found that 12 of the primary optimal codons also had the highest, or near the highest, tRNA gene counts of all codons per amino acid: GCT (Ala), AGA (Arg), AAC (Asn), CAA (Gln), GAA (Glu), ATT (Ile), TTG (Leu), AAG (Lys), TTC (Phe), ACT (Thr), TAC (Tyr), and GTT (Val). Further, while the positive ΔRSCU of CAC for His was not statistically significant using the organism-wide assessment (P = 0.26), this codon was optimal when individually considered in the ovaries, GT-females, and GT-males (P < 0.05), and had seven matching tRNA genes. Thus, including CAC for His as a codon with optimal status yields a study-wide total of 13 of the 18 primary optimal codons that have plentiful matching tRNA genes. In other words, a majority of optimal codons have Opt↑tRNA status. These results strongly suggest translational selection for accuracy and/or efficiency, across the majority of amino acids in this beetle.

Hypothesis 1: Optimal codons use wobble tRNA to resolve conflict of high translation with sequence fidelity and/or for protein folding

While 13 optimal codons had a high number of direct tRNA matches as expected under selection for optimization of efficient and accurate translation, for the remaining 5 amino acids, a much different pattern was observed. Specifically, the primary optimal codon (highly used in abundant transcripts) had no direct matching tRNA genes, and a wobble tRNA (shown in Table 1) must thus be employed for translation of these codons (denoted as Opt-codonwobble). For instance, Opt-codonwobble status was observed for the amino acids Asp (GAT), Cys (TGT), Gly (GGT), Pro (CCC), and Ser (AGT). Thus, this result shows that while these identified optimal codons are preferred in highly transcribed genes, their innate benefit cannot be due to having abundant direct matching tRNA, and thus another mechanism must explain their high usage. Further, as shown in Supplementary Text File S1 and Supplementary Fig. S1, within the group of highly transcribed genes, each of these five individual codons with Opt-codonwobble status showed strong associations with protein length, suggesting putatively significant roles of the use of these types of codons in the translation of abundant mRNAs, which may vary with the length of the translated sequence. Experimental studies in bacteria and eukaryotic models have shown that codons using wobble tRNA act to slow translation by decelerating the translocation of ribosomes on mRNA.,, In addition, a study of the genomes of various eukaryotes (humans, yeast, and Arabidopsis) indicated that cell-cycle genes had high usage of codons that had no matching tRNA genes in the genome, and thus must employ wobble tRNA, which inherently have lower codon–anticodon binding affinity than those codons with perfect matches. The differential use of codons using wobble tRNA in cell-cycle genes, combined with potential oscillations in tRNA abundances, were proposed to differentially regulate the translation rates of these mRNAs during various stages of the cell cycle. Further, this was speculated to possibly comprise a broader evolutionarily conserved phenomenon for translational regulation in eukaryotes. In addition, the usage of wobble tRNAs in a gene could have some parallel functions to the use of non-optimal codons with low tRNA abundance (Nonopt-codon↓tRNAs; see Table 1 for non-optimal codons with few tRNAs) which can prevent jamming of multiple ribosomes during the initiation of translation, and/or slow or pause translation during elongation, which would facilitate accurate protein folding.,,, In this regard, the results from these various studies suggest that the slowing of translation that is induced by wobble tRNA,, could comprise an evolutionarily conserved mechanism shaping various aspects of translation. Significantly, a key modification that mediates wobbling at the first anticodon position (position 34 of the anticodon loop) involves A34, which may be engnzymatically deaminated by adenosine deaminase tRNA (ADATs) to form inosine (I34). The resulting I34 can pair with mRNA 3′codon bases A, C, or U in Eukarya, [see also for an A37 ADAT (Adat1) in D. melanogaster]. For A34 modifications in eukaryotes, available research to date suggests that deamination requires the ADAT2/ADAT3 (hetADAT) enzymes, which are thought to allow A34 modifications across diverse eukaryotic systems., This modification would be essential for some codons obligately requiring wobble tRNA (those with no matching tRNAs and no matching unmodified wobble tRNAs) in the highly transcribed genes studied here, including codons with Opt-codonwobble status (e.g. Pro, CCC, Table 1). Thus, in addition to wobble codons using unmodified tRNAs, further functional study of ADATs is warranted in model insects, such as T. castaneum, including possible variation in expression and activity among tissues. This will help to further ascertain the potential consequences of use of wobble codons requiring tRNA modification at A34 on translation rates and protein folding.,, Taken together, we hypothesize here that for this beetle, the use of codons with Opt-codonwobble status in highly expressed genes comprises a mechanism to slow or pause translation at various sites, which may lead to increased accuracy of translation or allow co-translational protein folding. In addition, the high frequency of five specific codons with Opt-codonwobble status in genes with abundant mRNAs (Table 1) suggests that these codons might also play a significant role in post-transcriptional differential regulation of protein levels in these beetles. Additional studies of protein levels of genes with high usage of codons with Opt-codonwobble status will be needed to further test this aspect of the hypothesis.

3.4. Certain non-optimal codons have abundant tRNA genes

Herein, we defined the primary non-optimal codon per amino acid stringently as the codon with the largest negative ΔRSCU per amino acid, rather than simply all codons that were not optimal. Using these data, we assessed whether those codons with low usage in highly transcribed genes also exhibit few tRNA gene copies, as might be expected if codon usage is mostly shaped by translational selection for efficient and accurate translation (i.e. for adaptation of optimal codons and tRNA abundance). The organism-wide primary non-optimal codons (per amino acid) are shown in Table 1. The results showed that some non-optimal codons, as expected, had low numbers of matching tRNA genes [Nonopt-codon↓tRNAs status, e.g. two tRNA genes for ACG (Thr), ATA (Ile), and TTA (Leu), one for CCG (Pro)]. Unexpectedly, however, certain non-optimal codons had relatively moderate to high tRNA gene abundance (denoted as Nonopt-codon↑tRNAs). For instance, for Arg, while the codon CGG had no tRNA gene copies, its sister non-optimal codon CGA (ΔRSCU= −0.290 and −0.265, respectively) had four tRNA gene matches. For Gly, both the primary and secondary non-optimal codons GGC and GGA (−0.104 and −0.077, respectively) had 8 and 15 matching tRNA gene copies, respectively. For Val, the primary non-optimal codon GTA had five tRNA genes, only slightly lower than the seven observed for its optimal codon GTT. We noted that if we relaxed our definition of a non-optimal codon to consider any codon that is not optimal, we found that some of those codons also had many corresponding tRNA genes. For example, for Pro, the non-optimal codon CCA (which had a weak and nonsignificant positive ΔRSCU value, +0.029, and thus would not have satisfied our strict definition of having the largest negative ΔRSCU for this amino acid) had 13 tRNA genes, an extraordinarily high value compared with other codons. Moreover, for Asp, the (less stringently) defined non-optimal codon GAC had 10 matching tRNA copies. Collectively, it is evident that codons that are not the optimal codons in this taxon are not inevitably linked to a low abundance of matching tRNA genes, and rather in some cases exhibit high matching tRNA gene counts. Thus, these patterns suggest it is possible that non-optimal codons with elevated tRNAs play a specific regulatory role for highly transcribed genes. A recent study in yeast has indicated that stress genes may preferentially use non-optimal codons that have abundant iso-accepting tRNA genes, to increase effective gene expression by promoting their translation over other proteins rather than affecting mRNA levels. Based on this notion, we hypothesize here that codons with Nonopt-codon↑tRNAs status in T. castaneum may regulate the translation of abundant mRNAs of proteins with specific functions in this beetle. To further evaluate this possibility, we examined the predicted gene ontology functions of the highly transcribed genes that had relatively elevated usage of non-optimal codons with abundant matching tRNAs.

Hypothesis 2: Non-optimal codons post-transcriptionally regulate translation based on protein functions

We assessed the GO functions of highly transcribed genes (top 5% in the genome from the organism-wide analyses across all four tissues, N = 822; and a cut-off of 103.3 FPKM) that had relatively elevated use of codons with Nonopt-codon↑tRNAs status (Table 1). For this assessment, rather than assess all strictly defined non-optimal codons, we chose as examples the codons GGC for Gly, GTA for Val, and CGA for Arg. These three codons were defined as non-optimal by our strict definition (having a large negative and statistically significant ΔRSCU, Table 1) and had substantial matching tRNA gene copy counts (four to eight tRNA genes each). These codons also had negative ΔRSCU values in all four of the tissue types studied (Table 1), indicating they consistently have non-favoured status in this organism. For the amino acid Gly, we identified those highly transcribed genes that had RSCU values for GGC of >1.5. An RSCU value of 1 is expected for each of the four Gly codons under equal usage, and thus values of 1.5–4 for GGC are relatively high. Thus, while Nonopt-codon↑tRNAs are by definition rare in highly expressed genes, this approach allowed us to specifically examine the functions of this group (of highly expressed genes) that had unusually elevated use (RSCU) of this codon with Nonopt-codon↑tRNAs status. A total of 20.4% of the highly transcribed gene set was in this class. As shown in Table 2, these genes included those involved in oxidative stress response, such as peroxiredoxin, and those involved in olfactory activity. Thus, we speculate that these types of genes, which use codons with Nonopt-codon↑tRNAs status, will exhibit less tRNA competition during translation elongation than those genes that use codons with few or no matching tRNA genes, such as the fellow Gly codon GGG (with only one tRNA match), or even those genes using non-optimal codons for other amino acids, such as CCG for Pro (with one tRNA match) (Table 1). In addition, we found that genes with elevated GGC frequency encoded numerous (N = 15) ribosomal proteins. Thus, this finding suggests that usage of the non-optimal codon GGC may shape translation via a second mechanism: namely, by shaping the cellular abundance of specific ribosomal proteins per se, which are needed for translation. In this regard, the non-optimal codon usage profiles in Gly appear consistent with a hypothesis wherein the usage of GGC regulates the translation of a subset of genes in this taxon, and may even regulate translation rates per se via effects on certain ribosomal proteins.
Table 2

Examples of functions of the highly transcribed genes in T. castaneum that have elevated use of codons with Nonopt-codon↑tRNAs status [non-optimal codons with abundant matching tRNA genes (≥4)]

Gene functions
High GGC usage for Gly (with RSCU > 1.5)
 Ribosomal protein genes
  TC00610914-3-3 protein epsilon-like protein
  TC01112340S ribosomal protein S13-like protein
  TC00866740S ribosomal protein S20-like protein
  TC00598440S ribosomal protein S26
  TC01083040S ribosomal protein S6
  TC00921440S ribosomal protein S7
  TC01475740S ribosomal protein S8
  TC01630660S acidic ribosomal protein P0
  TC01041360S acidic ribosomal protein P1-like protein
  TC01501360S acidic ribosomal protein P2-like protein
  TC01353660S ribosomal protein L17-like protein
  TC00793260S ribosomal protein L21-like protein
  TC01316860S ribosomal protein L4-like protein
  TC03066660S ribosomal protein L6-like protein
  TC01118260S ribosomal protein L7a-like protein
 Olfactory
  TC007741Odorant binding protein 12
  TC010070Odorant binding protein C06
  TC008681Chemosensory protein 1
 Stress-response
  TC004948Peroxiredoxin 1-like protein
  TC014929Peroxiredoxin 1-like protein
 Uncharacterized proteins (N=50)
High GTA usage for Val (with RSCU>1.5)
 Cytoskeletal
  TC001574Cofilin/actin-depolymerizing factor homologue-like protein
  TC033072Profilin
 p53 related
  TC034594Cell death-inducing p53-target protein 1-like protein
 Ribosomal protein genes (N=0)
 Uncharacterized proteins (N=15)
High CGA usage for Arg (with RSCU>1.5)
 Olfactory
  TC010070Odorant binding protein C06; TcOBP7M orthologue
  TC030421Odorant receptor 305; Or305; orthologue
  TC008681Chemosensory protein 1; TcCSP7K; orthologue
 p53 related
  TC034594Cell death-inducing p53-target protein 1-like protein
 Cytoskeletal
  TC007700Tubulin-specific chaperone cofactor E-like protein
  TC009721Microtubule-protein RP/EB family member 1
  TC012270Troponin C, isoform 1-like protein
  TC033072Profilin
  TC001942Putative dynactin subunit 2-like protein (fragment)
 Ribosomal protein genes (N=0)

While these codons are by definition typically uncommon in highly transcribed genes (Table 1), the subset of genes with elevated use of these codons, RSCU >1.5, were identified and are shown above. These genes are candidates for translational upregulation due to the elevated use of codons with Nonopt-codon↑tRNAs status.

Examples of functions of the highly transcribed genes in T. castaneum that have elevated use of codons with Nonopt-codon↑tRNAs status [non-optimal codons with abundant matching tRNA genes (≥4)] While these codons are by definition typically uncommon in highly transcribed genes (Table 1), the subset of genes with elevated use of these codons, RSCU >1.5, were identified and are shown above. These genes are candidates for translational upregulation due to the elevated use of codons with Nonopt-codon↑tRNAs status. In terms of Val, those genes with high expression (top 5% in the genome) very rarely used the identified primary non-optimal codon GTA. In fact, only 5.1% of the 822 highly transcribed genes had GTA RSCU values >1.5, an extraordinarily low frequency. Those that did exhibit RSCU values >1.5 included genes involved in cytoskeleton functions and actin synthesis, such as Cofilin/actin-depolymerizing factor homologue-like protein and profilin, as well as a p53-related cell death protein, and a number of uncharacterized proteins (Table 2). For Arg, which has six synonymous codons, genes with RSCU values >1.5 for the non-optimal codon CGA included genes involved in olfactory signalling and with cytoskeleton roles (Table 2). It is particularly noteworthy that unlike the genes with elevated RSCU for GGC (Gly), which included abundant ribosomal protein genes, no ribosomal protein genes were among those with elevated frequency of GTA in Val or CGA for Arg. Thus, the ribosomal proteins in particular appear to be strongly connected to the usage of the non-optimal GGC Gly codon, and thus we speculate that this codon may be particularly essential to their regulation. As mentioned above, prior data have suggested that non-optimal codons, when combined with low tRNA abundance, can play important regulatory roles by preventing the jamming of multiple ribosomes during initiation of translation, or slowing translation elongation and facilitating precise protein folding.,,,, The present study, however, shows an additional, and much different, plausible effect of non-optimal codons in T. castaneum. Specifically, we show that the use of non-optimal codons with abundant tRNA genes (Nonopt-codon↑tRNAs) is tightly linked to predicted gene functionality (Table 2), and thus these codons may be likely to contribute to the preferential translation of mRNAs of specific types of genes. This notion agrees with recent experimental data in yeast suggesting that non-optimal or rare codons in stress genes promote the preferential translation of their mRNA in cells in response to stress-induced changes in tRNA pools. Thus, this comprises a potential mechanism for preferential translation of specific mRNAs. Herein, however, given that abundant tRNA gene copies are available in the genome for codons with Nonopt-codon↑tRNAs status (and thus tRNAs should be consistently abundant in cells), we speculate that the use of these non-optimal codons in certain ribosomal protein and stress genes (Table 2) likely acts as a mechanism to ensure their preferential translation among the various mRNAs within cells at an organism-wide level, perhaps independent of environmental or tissue-specific fluctuations in tRNA levels. Collectively, our data on codons with Nonopt-codon↑tRNAs status add to the growing support for a mechanism wherein non-optimal or rare codons, combined with elevated tRNA abundances, significantly shape translational regulation in eukaryotes.,,, Further study in T. castaneum, possibly including assessments of protein abundance of genes with elevated usage of codons with Nonopt-codon↑tRNAs status and with high usage of non-optimal codons with rare tRNA genes, will help unravel the relationships between non-optimal codon usage and translation. In addition, in vivo quantification of the tRNA populations in diverse tissue types in this beetle species,,,,, will help affirm whether these codons consistently exhibit high tRNA abundances, which could promote their preferential translation at an organism-wide level.

3.5. T. castaneum codon usage bias in context

Selection(s) on optimal codons with abundant tRNAs (defined here for those codons with Opt-codon↑tRNA status, and typically denoted as under translational selection), may be influenced by factors such as effective population size (Ne) and genome size. In previous studies, smaller Ne (or NeS∼1; and/or shorter generation times), or larger genomes in eukaryotes have been linked to reduced selection pressures on codon use. For example, using a statistic aimed to quantify an organism’s genome-wide selection on codon usage (using predicted selection pressures per codon and tRNAs; see also) to compare among species, it was reported that strong selection pressures on codon use occurs for some bacteria such as E. coli [which also have highly skewed RSCU values (>2) for some of its codons], with intermediate pressure in D. melanogaster and weak or absent pressure in humans. The authors of that study suggested that this pattern was related to their (respectively increasing) genome sizes. In this context, the overall translational selection pressures on optimal codon in T. castaneum (Table 1) may be expected to be moderate, and similar to those of its fellow insect D. melanogaster (genome sizes of 160 and 175 MB, respectively)., However, such between-taxon differences on selected codon bias could also reflect weaker pressure in taxa with smaller effective population sizes, which decrease, respectively, in bacteria, insects, and humans. Nonetheless, our present study is largely focussed on the dynamics of the most highly transcribed (top 5%) genes in the genome in T. castaneum (rather than all genes; Table 1), and includes analyses not only of the translational selection on optimal codons per se (Opt-codon↑tRNAs status), but also putative selection favouring roles of wobble and non-optimal codons (Opt-codonwobble and Nonopt-codon↑tRNAs status) and their relationships to tRNAs in shaping translational processes in this taxon. While our data suggest selection has been a factor in shaping the frequency of each of these types of codons in highly transcribed genes in T. castaneum, further similar studies in more multicellular organisms, including additional Tribolium species, will ascertain the breadth of such patterns across diverse metazoans.

3.6. Comparison of present optimal codon list to a prior report

On a final note, it is worthwhile mentioning here that the optimal codon list we present in Table 1 differs from that previously reported in T. castaneum. The previous report used a correlation method to determine optimal codons, and a comparison of the present primary optimal codon list in Table 1 (for the whole organism analyses) to those earlier findings is shown in Supplementary Table S2. We found that only 9 of the 18 primary optimal codons identified herein, were also identified as optimal by the previous study under the correlation method, even when we used very loose criteria for defining a match to that prior assessment (i.e. considering all optimal codons that were defined at any level under the correlation method, regardless of whether they were the primary, secondary, or tertiary optimal codon, as a match to our primary optimal codon). It has been previously argued that the use of a correlation approach can often yield a misleading list of optimal codons. Further, the R values observed for the codons defined as optimal using the prior correlation method were typically <0.1 (the highest value was 0.237, Supplementary Table S2). A range of such low values, even when statistically significant, is sometimes considered a very weak or absent correlation (R < 0.3),, and thus may not be conducive to revealing codons most often used in highly transcribed genes, as was the goal here. Moreover, we found that increased gene expression level (organism-wide expression) was not positively connected to the Fop when using the optimal codons (primary optimal codon defined as strongest correlation) identified under the prior correlation method. Rather, as shown in Supplementary Fig. S2, we found only mild variation in Fop among expression classes, and Fop was reduced in low and high expressed genes as compared with moderately expressed (Ranked ANOVA and Dunn’s P < 0.05), trends inconsistent with a persistent connection between Fop and expression level. However, we did find a strong connection between expression level and Fop using the optimal codons identified herein (Table 1, Fig. 1A). The method of employing ΔRSCU between high and low expressed genes has repeatedly been shown effective for specifically revealing the optimal codons, defined as those preferentially used in the most highly transcribed genes in the genome,,,,,, as was the present objective. Thus, the optimal codons defined herein are those most often used in highly transcribed genes, and were used for all our analyses (Table 1).

4. Conclusions

The present study has revealed the complex dynamics of codon usage in the multicellular beetle model system T. castaneum. We found that the majority of optimal codons in this animal model are shared at the organism-wide level and match tRNA with abundant gene copies, supporting the presence of species-wide translational selection for efficient and/or accurate translation. However, we also showed that a non-negligible subset of optimal codons varied among the four tissue types, suggesting a likelihood of tissue- and sex-specific tRNA populations, and thus localized translational selection. Based on codon optimality status and tRNA gene copies, we propose two hypotheses. The first hypothesis suggests that the usage of codons with Opt-codonwobble status in highly transcribed genes in this beetle has evolved as a mechanism that slows translation, which could increase precision of translation and/or protein folding. The second hypothesis proposes that usage of codons with Nonopt-codon↑tRNAs status is as a mechanism that promotes high translation of mRNA of genes with specific cellular functions, which we show here to include stress-response and ribosomal protein genes. Further study in T. castaneum, including assessments of cellular protein levels of genes using codons with Opt-codonwobble and Nonopt-codon↑tRNAs status in germ line and somatic tissues, will help further unravel their potential roles in translation regulation. In addition, in vivo quantification of the tRNA populations in various tissue types and under stressful conditions in this beetle, as this methodology improves,,,,,, will provide additional valuable insights into tRNA population stability and variation between tissues. Our data suggest that the frequency of specific codons in T. castaneum obligately requiring wobble tRNA, similar to those non-optimal codons with few tRNAs, may be linked to translational slowing or protein-folding functions in highly transcribed genes. Future follow-up studies should assess whether such codons cluster or show original use patterns at or near protein (folding) structural elements, which some research suggests may occur in certain organisms,,,, and/or whether those codons may effectively slow or pause translation.,, In an understudied metazoan model such as T. castaneum, the former may be achieved via comprehensive bioinformatics analysis of protein structural properties and codon use,, and/or the development of a cell-free translation system allowing manipulation of codon use in mRNAs such as those from Neurospora and Drosophila,, while the latter may be informed by ribosomal profiling analyses during translation.,, Population-level approaches will also be valuable to further ascertaining the selection pressures acting on codon use,,, particularly research on the mutational spectra of codons with Opt-codon↑tRNAs, Opt-codonwobble, and Nonopt-codon↑tRNAs status, to ascertain whether such codon mutations show signals of selection favouring their fixation in highly transcribed genes of T. castaneum. At present, most non-traditional multicellular organisms have not had as many protocols optimized for lab-based experimental or transgenic research of codon optimization, including rates of translation elongation, protein folding, tRNA-charging, or codon–anticodon tRNA binding, as compared with the established widely studied single-celled models or in vitro cell lines.,,, We have shown here, however, using the species T. castaneum, that a multifaceted approach using analyses of gene expression, tRNA genes, tissue type, and gene functionality can be used to suggest how codon usage shapes translational optimization and regulation in a metazoan system. Click here for additional data file.
  92 in total

1.  A segmentation gene in tribolium produces a polycistronic mRNA that codes for multiple conserved peptides.

Authors:  Joël Savard; Henrique Marques-Souza; Manuel Aranda; Diethard Tautz
Journal:  Cell       Date:  2006-08-11       Impact factor: 41.582

2.  Evolution of codon usage bias in Drosophila.

Authors:  J R Powell; E N Moriyama
Journal:  Proc Natl Acad Sci U S A       Date:  1997-07-22       Impact factor: 11.205

Review 3.  A-to-I editing on tRNAs: biochemical, biological and evolutionary implications.

Authors:  Adrian Gabriel Torres; David Piñeyro; Liudmila Filonava; Travis H Stracker; Eduard Batlle; Lluis Ribas de Pouplana
Journal:  FEBS Lett       Date:  2014-09-27       Impact factor: 4.124

Review 4.  Codon bias evolution in Drosophila. Population genetics of mutation-selection drift.

Authors:  H Akashi
Journal:  Gene       Date:  1997-12-31       Impact factor: 3.688

5.  Nonoptimal codon usage influences protein structure in intrinsically disordered regions.

Authors:  Mian Zhou; Tao Wang; Jingjing Fu; Guanghua Xiao; Yi Liu
Journal:  Mol Microbiol       Date:  2015-06-25       Impact factor: 3.501

6.  Codon optimality is a major determinant of mRNA stability.

Authors:  Vladimir Presnyak; Najwa Alhusaini; Ying-Hsin Chen; Sophie Martin; Nathan Morris; Nicholas Kline; Sara Olson; David Weinberg; Kristian E Baker; Brenton R Graveley; Jeff Coller
Journal:  Cell       Date:  2015-03-12       Impact factor: 41.582

Review 7.  Quality over quantity: optimizing co-translational protein folding with non-'optimal' synonymous codons.

Authors:  Giselle N Jacobson; Patricia L Clark
Journal:  Curr Opin Struct Biol       Date:  2016-06-16       Impact factor: 6.809

Review 8.  Codon usage and tRNA content in unicellular and multicellular organisms.

Authors:  T Ikemura
Journal:  Mol Biol Evol       Date:  1985-01       Impact factor: 16.240

9.  Mutational bias and translational selection shaping the codon usage pattern of tissue-specific genes in rice.

Authors:  Qingpo Liu
Journal:  PLoS One       Date:  2012-10-29       Impact factor: 3.240

10.  Discrepancy among the synonymous codons with respect to their selection as optimal codon in bacteria.

Authors:  Siddhartha Sankar Satapathy; Bhesh Raj Powdel; Alak Kumar Buragohain; Suvendra Kumar Ray
Journal:  DNA Res       Date:  2016-10-01       Impact factor: 4.458

View more
  2 in total

1.  Absence of a Faster-X Effect in Beetles (Tribolium, Coleoptera).

Authors:  Carrie A Whittle; Arpita Kulkarni; Cassandra G Extavour
Journal:  G3 (Bethesda)       Date:  2020-03-05       Impact factor: 3.154

2.  Adaptation of codon and amino acid use for translational functions in highly expressed cricket genes.

Authors:  Carrie A Whittle; Arpita Kulkarni; Nina Chung; Cassandra G Extavour
Journal:  BMC Genomics       Date:  2021-04-06       Impact factor: 3.969

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.