| Literature DB >> 18763298 |
Xiao-zhong Xu1, Qing-po Liu, Long-jiang Fan, Xiao-feng Cui, Xue-ping Zhou.
Abstract
Begomoviruses are single-stranded DNA viruses and cause severe diseases in major crop plants worldwide. Based on current genome sequence analyses, we found that synonymous codon usage variations in the protein-coding genes of begomoviruses are mainly influenced by mutation bias. Base composition analysis suggested that the codon usage bias of AV1 and BV1 genes is significant and their expressions are high. Fourteen codons were determined as translational optimal ones according to the comparison of codon usage patterns between highly and lowly expressed genes. Interestingly the codon usages between begomoviruses from the Old and the New Worlds are apparently different, which supports the idea that the bipartite begomoviruses of the New World might originate from bipartite ones of the Old World, whereas the latter evolve from the Old World monopartite begomoviruses.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18763298 PMCID: PMC2528880 DOI: 10.1631/jzus.B0820005
Source DB: PubMed Journal: J Zhejiang Univ Sci B ISSN: 1673-1581 Impact factor: 3.066
The GC content at the different codon positions and the A, T, G and C contents at the third position of begomovirus genes
| Genes | No. | GC | GC1 | GC2 | GC3 | A3 | T3 | G3 | C3 |
| 147 | 0.452 (0.021) | 0.465 (0.015) | 0.408 (0.013) | 0.485 (0.059) | 0.159 (0.030) | 0.357 (0.044) | 0.266 (0.028) | 0.219 (0.041) | |
| 92 | 0.484 (0.018) | 0.563 (0.040) | 0.404 (0.020) | 0.487 (0.042) | 0.220 (0.030) | 0.294 (0.033) | 0.245 (0.030) | 0.241 (0.028) | |
| 147 | 0.448 (0.018) | 0.497 (0.018) | 0.395 (0.012) | 0.452 (0.040) | 0.275 (0.027) | 0.273 (0.028) | 0.190 (0.027) | 0.262 (0.023) | |
| 143 | 0.471 (0.022) | 0.521 (0.031) | 0.466 (0.026) | 0.427 (0.037) | 0.293 (0.029) | 0.280 (0.028) | 0.185 (0.029) | 0.241 (0.032) | |
| 145 | 0.391 (0.021) | 0.421 (0.028) | 0.308 (0.027) | 0.442 (0.040) | 0.283 (0.034) | 0.275 (0.026) | 0.200 (0.030) | 0.242 (0.028) | |
| 117 | 0.483 (0.023) | 0.412 (0.026) | 0.502 (0.055) | 0.534 (0.030) | 0.265 (0.031) | 0.201 (0.039) | 0.331 (0.038) | 0.203 (0.025) | |
| 53 | 0.423 (0.028) | 0.459 (0.026) | 0.411 (0.025) | 0.399 (0.059) | 0.218 (0.030) | 0.383 (0.050) | 0.218 (0.036) | 0.181 (0.043) | |
| 53 | 0.447 (0.028) | 0.457 (0.017) | 0.412 (0.017) | 0.473 (0.068) | 0.300 (0.046) | 0.227 (0.041) | 0.230 (0.047) | 0.243 (0.053) |
Values in parenthesis indicate standard deviation (SD)
Fig. 1Relationship between effective number of codons (N c) and GC3 content of the begomovirus genes. The continuous curve represents the expected curve between GC3 content and N c under random codon usage
Fig. 2(a) Relationship between the effective number of codons (N c) and GC content of the third codon position (GC3) of three different genes (≥80 codons long) in 147 begomoviruses. The continuous curves represent the expected curve between GC3 content and N c under random codon usage; (b) Relationship between the GC content at the first (GC1)/the second (GC2) codon position and that at the third codon position (GC3) of AV1, AC1 and AC4 genes (≥80 codons long) in 147 begomoviruses
Comparison of codon usage and relative synonymous codon usage (RSCU) between the high and low expressed genes in begomoviruses
| Amino acid | Codon | High | Low | Δ |
| Ala | GCA | 1.107 | 0.885 | 0.222 |
| GCC | 0.734 | 1.401 | −0.667 | |
| GCG | 0.438 | 0.977 | −0.539 | |
| GCT | 1.722 | 0.737 | 0.985 | |
| Cys | TGC | 0.060 | 1.556 | −1.495 |
| TGT | 1.940 | 0.444 | 1.495 | |
| Asp | GAC | 0.387 | 1.043 | −0.657 |
| GAT | 1.613 | 0.957 | 0.657 | |
| Glu | GAA | 0.859 | 1.087 | −0.228 |
| GAG | 1.141 | 0.913 | 0.228 | |
| Phe | TTC | 0.748 | 1.106 | −0.358 |
| TTT | 1.252 | 0.894 | 0.358 | |
| Gly | GGA | 0.647 | 1.485 | −0.838 |
| GGC | 0.849 | 0.782 | 0.067 | |
| GGG | 0.696 | 1.042 | −0.347 | |
| GGT | 1.809 | 0.691 | 1.118 | |
| His | CAC | 0.376 | 1.075 | −0.698 |
| CAT | 1.624 | 0.925 | 0.698 | |
| Ile | ATA | 0.907 | 0.692 | 0.215 |
| ATC | 0.804 | 1.286 | −0.481 | |
| ATT | 1.288 | 1.022 | 0.266 | |
| Lys | AAA | 0.451 | 1.034 | −0.583 |
| AAG | 1.549 | 0.966 | 0.583 | |
| Leu | CTA | 0.186 | 1.330 | −1.145 |
| CTC | 0.594 | 0.923 | −0.329 | |
| CTG | 0.755 | 1.045 | −0.291 | |
| CTT | 0.829 | 1.412 | −0.583 | |
| TTA | 1.113 | 0.937 | 0.177 | |
| TTG | 2.524 | 0.353 | 2.171 | |
| Asn | AAC | 0.866 | 0.835 | 0.031 |
| AAT | 1.134 | 1.165 | −0.031 | |
| Pro | CCA | 1.096 | 1.383 | −0.287 |
| CCC | 1.773 | 0.697 | 1.077 | |
| CCG | 0.162 | 0.886 | −0.723 | |
| CCT | 0.968 | 1.035 | −0.067 | |
| Gln | CAA | 0.780 | 1.277 | −0.497 |
| CAG | 1.220 | 0.723 | 0.497 | |
| Arg | AGA | 1.199 | 1.192 | 0.007 |
| AGG | 1.438 | 1.611 | −0.174 | |
| CGA | 0.570 | 1.336 | −0.767 | |
| CGC | 0.526 | 0.537 | −0.011 | |
| CGG | 0.461 | 0.511 | −0.050 | |
| CGT | 1.807 | 0.812 | 0.994 | |
| Ser | AGC | 1.142 | 0.748 | 0.394 |
| AGT | 1.042 | 0.830 | 0.212 | |
| TCA | 0.592 | 1.477 | −0.886 | |
| TCC | 0.917 | 0.985 | −0.068 | |
| TCG | 0.867 | 1.094 | −0.228 | |
| TCT | 1.442 | 0.866 | 0.575 | |
| Thr | ACA | 0.661 | 1.237 | −0.576 |
| ACC | 0.602 | 0.728 | −0.127 | |
| ACG | 0.558 | 1.546 | −0.988 | |
| ACT | 2.179 | 0.489 | 1.690 | |
| Val | GTA | 0.448 | 1.208 | −0.760 |
| GTC | 0.998 | 0.849 | 0.149 | |
| GTG | 1.034 | 0.784 | 0.250 | |
| GTT | 1.520 | 1.159 | 0.361 | |
| Tyr | TAC | 0.540 | 0.938 | −0.399 |
| TAT | 1.460 | 1.062 | 0.399 |
Major codons, defined by a higher RSCU value in highly expressed genes, are indicated as (ΔRSCU≥0.30)
Major codons, defined by a higher RSCU value in highly expressed genes, are indicated as (ΔRSCU≥0.20)
Major codons, defined by a higher RSCU value in highly expressed genes, are indicated as (ΔRSCU≥0.08)
Fig. 3Positions of the 147 begomoviruses in the plot of the first two major axes by correspondence analysis of relative synonymous codon usage (RSCU) values. The first and second major axes can account for 15.2% and 9.9% of the total variations, respectively. The begomoviruses are divided into three groups, the Old World begomoviruses with monopartite genomes (OM), the Old World viruses with bipartite genomes (OB) and the New World viruses with bipartite genomes (NB)