| Literature DB >> 21272306 |
Kazuharu Misawa1, Reiko F Kikuno.
Abstract
BACKGROUND: Codon bias is a phenomenon that refers to the differences in the frequencies of synonymous codons among different genes. In many organisms, natural selection is considered to be a cause of codon bias because codon usage in highly expressed genes is biased toward optimal codons. Methods have previously been developed to predict the expression level of genes from their nucleotide sequences, which is based on the observation that synonymous codon usage shows an overall bias toward a few codons called major codons. However, the relationship between codon bias and gene expression level, as proposed by the translation-selection model, is less evident in mammals.Entities:
Year: 2011 PMID: 21272306 PMCID: PMC3038927 DOI: 10.1186/1756-0500-4-20
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1Correlation between amino acid composition and gene expression level. For preparation of this plot, we sorted the genes by their expression levels. The genes were sorted into bins of 50. Subsequently, the 50 genes were concatenated as a single large gene for analysis. Thereafter, the amino acid contents of the proteins were calculated. Each point on the plot represents a bin.
Correlation between amino acid abundance and gene expression level
| Overall | After Elimination of Outliers | |||||
|---|---|---|---|---|---|---|
| Amino Acid | Correlation coefficient | P-value | Correlation coefficient | P-value | ||
| Phe | 0.091 | 0.002 | * | 0.067 | 0.030 | |
| Leu | -0.060 | 0.040 | -0.047 | 0.124 | ||
| Ser | -0.123 | 0.000 | ** | -0.055 | 0.071 | |
| Tyr | 0.046 | 0.114 | 0.059 | 0.055 | ||
| Cys | -0.090 | 0.002 | * | -0.118 | 0.000 | ** |
| Trp | -0.052 | 0.074 | -0.031 | 0.319 | ||
| Pro | -0.080 | 0.006 | * | -0.062 | 0.045 | |
| His | -0.125 | 0.000 | ** | -0.169 | 0.000 | ** |
| Gln | -0.053 | 0.067 | -0.010 | 0.757 | ||
| Arg | -0.061 | 0.036 | -0.033 | 0.281 | ||
| Ile | 0.135 | 0.000 | ** | 0.084 | 0.006 | * |
| Met | 0.060 | 0.040 | 0.054 | 0.081 | ||
| Thr | -0.040 | 0.174 | -0.045 | 0.141 | ||
| Asn | 0.065 | 0.025 | 0.030 | 0.328 | ||
| Lys | 0.091 | 0.002 | * | 0.031 | 0.308 | |
| Val | 0.091 | 0.002 | * | 0.069 | 0.024 | |
| Ala | 0.037 | 0.199 | 0.072 | 0.018 | ||
| Asp | 0.101 | 0.000 | ** | 0.054 | 0.079 | |
| Glu | 0.084 | 0.004 | * | 0.066 | 0.032 | |
| Gly | 0.004 | 0.884 | 0.018 | 0.567 | ||
* Significant at 1% level
** Significant at 0.1% level
Correlation between the nucleotide composition at the 3rd position of codons and gene expression level
| 3' adjacent nucleotide | Overall | After elimination of outliers |
|---|---|---|
| All amino acids | ||
| T | -0.063 | -0.064 |
| C | -0.061 | -0.054 |
| A | -0.068 | -0.048 |
| G | 0.020 | 0.019 |
| T-adapted amino acids | ||
| T | -0.069 | -0.068 |
| C | -0.051 | -0.033 |
| A | -0.065 | -0.050 |
| G | 0.012 | 0.000 |
| C-adapted amino acids | ||
| T | -0.049 | -0.061 |
| C | -0.061 | -0.061 |
| A | -0.054 | -0.032 |
| G | 0.023 | 0.034 |
Observed number of combinations of nucleotide at the third position and their 3' adjacent nucleotide
| The 3' adjacent nucleotide | |||||||
|---|---|---|---|---|---|---|---|
| Codon type | Third nucleotide | T | C | A | G | H | |
| All | T | 45922 | 71142 | 48193 | 135728 | 165257 | |
| C | 88111 | 113015 | 141307 | 45671 | ** | 342433 | |
| T-adapted | T | 24963 | 42070 | 22892 | 64855 | 89925 | |
| C | 44589 | 54213 | 72472 | 21423 | ** | 171274 | |
| C-adapted | T | 20959 | 29072 | 25301 | 70873 | 75332 | |
| C | 43522 | 58802 | 68835 | 24248 | ** | 171159 | |
H stands for A, C, or T
** Significantly lower than the number of Cs when the 3' adjacent nucleotide is H (p < 0.1%) after Bonferroni correction when a chi-square test was used.