| Literature DB >> 30965565 |
Kajal Kumar Biswas1, Supratik Palchoudhury2, Prosenjit Chakraborty3, Utpal K Bhattacharyya4, Dilip K Ghosh5, Palash Debnath6, Chandrika Ramadugu7, Manjunath L Keremane8, Ravi K Khetarpal9, Richard F Lee10.
Abstract
Citrus tristeza virus (CTV), a member of the aphid-transmitted closterovirus group, is the causal agent of the notorious tristeza disease in several citrus species worldwide. The codon usage patterns of viruses reflect the evolutionary changes for optimization of their survival and adaptation in their fitness to the external environment and the hosts. The codon usage adaptation of CTV to specific citrus hosts remains to be studied; thus, its role in CTV evolution is not clearly comprehended. Therefore, to better explain the host⁻virus interaction and evolutionary history of CTV, the codon usage patterns of the coat protein (CP) genes of 122 CTV isolates originating from three economically important citrus hosts (55 isolate from Citrus sinensis, 38 from C. reticulata, and 29 from C. aurantifolia) were studied using several codon usage indices and multivariate statistical methods. The present study shows that CTV displays low codon usage bias (CUB) and higher genomic stability. Neutrality plot and relative synonymous codon usage analyses revealed that the overall influence of natural selection was more profound than that of mutation pressure in shaping the CUB of CTV. The contribution of high-frequency codon analysis and codon adaptation index value show that CTV has host-specific codon usage patterns, resulting in higheradaptability of CTV isolates originating from C. reticulata (Cr-CTV), and low adaptability in the isolates originating from C. aurantifolia (Ca-CTV) and C. sinensis (Cs-CTV). The combination of codon analysis of CTV with citrus genealogy suggests that CTV evolved in C. reticulata or other Citrus progenitors. The outcome of the study enhances the understanding of the factors involved in viral adaptation, evolution, and fitness toward their hosts. This information will definitely help devise better management strategies of CTV.Entities:
Keywords: Citrus tristeza virus; citrus host; codon usage adaptation; codon usage bias; high-frequency codons; natural selection
Year: 2019 PMID: 30965565 PMCID: PMC6521185 DOI: 10.3390/v11040331
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Nucleotide (Nt) contents of coat protein (CP) genes of Citrus tristeza virus (CTV)isolates.
| Nt Composition | Ca-CTV | Cr-CTV | Cs-CTV | Overall CTV |
|---|---|---|---|---|
|
| 28.78 (0.10) | 28.72 (0.26) | 28.71 (0.48) | 28.73 (0.05) |
|
| 18.41 (0.36) | 18.30 (0.51) | 18.42 (0.66) | 18.38 (0.04) |
|
| 26.91 (0.09) | 27.06 (0.29) | 26.98 (0.51) | 26.99 (0.03) |
|
| 25.90 (0.15) | 25.92 (0.33) | 25.88 (0.53) | 25.90 (0.05) |
|
| 44.31 (0.53) | 44.23 (0.18) | 44.30 (0.20) | 44.28 (0.05) |
|
| 55.69 (0.91) | 55.77 (0.47) | 55.70 (0.07) | 55.72 (0.05) |
|
| 21.09 (0.40) | 20.64 (0.53) | 20.69 (0.65) | 20.77 (0.14) |
|
| 20.26 (0.33) | 19.92 (0.49) | 20.40 (0.64) | 20.22 (0.09) |
|
| 33.84 (0.24) | 34.26 (0.17) | 33.86 (0.40) | 33.98 (0.09) |
|
| 24.82 (0.34) | 25.18 (0.43) | 25.06 (0.59) | 25.04 (0.15) |
|
| 45.07 (0.61) | 45.10 (0.32) | 45.45 (0.25) | 45.25 (0.13) |
|
| 45.91 (0.58) | 45.82 (0.23) | 45.75 (0.19) | 45.81 (0.06) |
|
| 58.65 (1.08) | 59.43 (0.65) | 58.91(0.32) | 59.01 (0.20) |
|
| 54.93 (0.92) | 54.90 (0.51) | 54.55(0.19) | 54.75 (0.13) |
|
| 54.09 (0.86) | 54.18 (0.44) | 54.25 (0.08) | 54.19 (0.06) |
|
| 0.50 (0.00) | 0.50 (0.00) | 0.50 (0.00) | 0.50 (0.00) |
|
| 0.38 (0.00) | 0.38 (0.00) | 0.38 (0.00) | 0.38 (0.00) |
|
| 0.44 (0.00) | 0.44 (0.00) | 0.44 (0.00) | 0.44 (0.00) |
|
| 0.44 (0.00) | 0.44 (0.01) | 0.44 (0.01) | 0.44(0.01) |
|
| 0.05 (0.01) | 0.04 (0.00) | 0.04 (0.00) | 0.05 (0.00) |
|
| 0.45 (0.00) | 0.45 (0.00) | 0.45 (0.00) | 0.45 (0.00) |
|
| 54.63 (0.46) | 53.51 (0.39) | 53.74 (0.30) | 53.88 (0.22) |
|
| 216.69 (0.08) | 216.76 (0.08) | 216.78 (0.07) | 216.75 (0.04) |
|
| 223.00 (0.00) | 223.00 (0.00) | 223.00 (0.00) | 223.00 (0.00) |
|
| −0.48 (0.00) | −0.49 (0.00) | −0.48 (0.00) | −0.48 (0.00) |
|
| 0.08 (0.00) | 0.08 (0.00) | 0.08 (0.00) | 0.08 (0.00) |
Note: Ca-CTV, Cr-CTV, and Cs-CTV denote the CTV isolates originated from Citrusaurantifolia, C. reticulata, and C. sinensis, respectively. Values within parentheses indicate ± standard errors. A%, U%, C%, and G% represent the overall frequencies of adenine (A), cytosine (C), uracil (U), and guanine (G) nucleotides of the CTV CP gene; A3%,U3%, C3%, and G3% represent the nucleotide frequency at the third codon position; GC and AU denote the percentage of G+C and A+U; GC1, GC2, and GC3 denote the G+C at the first, second, and third synonymous codon positions; GA3, GU3, AU3, and CU3 represent the G+A, G+U, A+U, and C+U percentages at the third codon position; CBI, codon bias index, measures the extent to which a gene uses a subset of optimal codons; Fop, frequency of optimum codons, is the ratio of optimal codons to synonymous codons; ENc, effective number of codons; L_sym, number of synonymous codons; L_aa, number of translatable codons; Gravy, general average hydropathicity; Aromo, frequency of aromatic amino acids in hypothetically translated gene product.
Figure 1ENc-GC3 plot analysis of the coat protein (CP) genes of Citrus tristeza virus (CTV) isolates. ENc denotes the effective number of codons, and GC3 denotes the GC content at the third synonymous codon position. The black dotted line represents the expected curve derived from the positions of strains when the codon usage is only determined by the GC3 composition. Different CTV subgroups are indicated as Ca-CTV, Cr-CTV, and Cs-CTV (from Citrus aurantifolia, C. reticulata, and C. sinensis, respectively) with different color markers as shown to the right of the figure.
Figure 2Neutrality plot analysis (GC12 vs. GC3) for the coat protein genes of CTV isolates. GC12 stands for the average value of GC contents at the first and second positions of the codons (GC1 and GC2), while GC3 refers to the GC contents at the thirdposition of the codons. The black line is the linear regression of GC12 against GC3; the regression curve can be described as y = −0.058x + 0.463, R² = 0.045. Different CTV subgroups are indicated with different color markers as shown to the right of the figure.
Correlation analysis among different nucleotides compositions of CTV.
| CAI | CBI | Fop | ENc | GC1 | GC2 | GC12 | GC3 | A | C | U | G | GU | AC | GU3 | AU3 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CAI | ||||||||||||||||
| CBI | 0.820 ** | |||||||||||||||
| Fop | 0.826 ** | 0.998 ** | ||||||||||||||
| ENc | 0.487 ** | 0.454 ** | 0.435 ** | |||||||||||||
| GC1 | 0.387 ** | 0.247 ** | 0.233 ** | 0.610 ** | ||||||||||||
| GC2 | −0.278 ** | −0.178 * | −0.172 | 0.047 | 0.056 | |||||||||||
| GC12 | 0.151 | 0.096 | 0.088 | 0.512 ** | 0.827 ** | 0.608 ** | ||||||||||
| GC3 | −0.563 ** | −0.412 ** | −0.402 ** | −0.373 ** | −0.329 ** | 0.093 | −0.210 * | |||||||||
| A | 0.733 ** | 0.704 ** | 0.696 ** | 0.500 ** | 0.191 * | −0.268 ** | 0.001 | −0.795 ** | ||||||||
| C | 0.425 ** | 0.665 ** | 0.653 ** | 0.614 ** | 0.382 ** | 0.090 | 0.355 ** | 0.005 | 0.388 ** | −0.845 ** | ||||||
| U | −0.393 ** | −0.506 ** | −0.504 ** | −0.595 ** | −0.426 ** | −0.175 | −0.438 ** | −0.087 | −0.358 ** | −0.537 ** | 0.313 ** | |||||
| G | −0.704 ** | −0.778 ** | −0.764 ** | −0.514 ** | −0.182 * | 0.273 ** | 0.009 | 0.741 ** | −0.931 ** | −0.788 ** | 0.686 ** | 0.906 ** | ||||
| GU | −0.715 ** | −0.822 ** | −0.810 ** | −0.659 ** | −0.329 ** | 0.131 | −0.188 * | 0.529 ** | −0.874 ** | 0.788 ** | −0.686 ** | −0.906 ** | −0.471 ** | |||
| AC | 0.715 ** | 0.822 ** | 0.810 ** | 0.659 ** | 0.329 ** | −0.131 | 0.188 * | −0.529 ** | 0.874 ** | −0.725 ** | 0.605 ** | 0.873 ** | 0.938 ** | −0.938 ** | ||
| GU3 | −0.749 ** | −0.836 ** | −0.821 ** | −0.646 ** | −0.282 ** | 0.118 | −0.158 | 0.521 ** | −0.831 ** | 0.000 | 0.097 | −0.748 ** | −0.530 ** | 0.530 ** | −1.000 ** | |
| AU3 | 0.568 ** | 0.431 ** | 0.420 ** | 0.370 ** | 0.325 ** | −0.090 | 0.208 * | −0.996 ** | 0.793 ** | 0.000 | −0.097 | 0.748 ** | 0.530 ** | −0.530 ** | 0.526 ** | −1.000 ** |
** Correlation is significant at the 0.01 level (two-tailed), highlighted with orange color; * correlation is significant at the 0.05 level (two-tailed).
Figure 3Correlations between effective number of codons (ENc) and the nucleotide composition of CTV coat protein gene. The relationship of ENc with C nucleotide content (a), U nucleotide content (b), GC1 content (c), GU content (d), AC content (e), and GU3 content (f). The regression curves can be described as y = 0.137x + 11.04, R2 = 0.404 (a); y = −0.100x + 32.36, R2 = 0.336 (b); y = 0.001x + 0.408, R2 = 0.426 (c); y = −0.236x + 67.01, R2 = 0.503 (d); y = 0.263x + 32.98, R2 = 0.503 (e); y = −0.707x + 96.92, R2 = 0.465 (f). Different CTV subgroups are indicated with different color markers as shown to the right of the figure.
Figure 4(a) Contribution axes based on the correspondence analysis (COA). The relative inertia of 34 axes generated by COA of the relative synonymous codon usage (RSCU) values of CTV subgroup isolates. (b) COA analysis of synonymous codon usage patterns: RSCU values of the 59 synonymous codons of three CTV subgroup isolates were plotted in the first two main axes (Axis 1 and Axis 2). Different nucleotide-ended codons of each CTV subgroup isolates are marked in the figure by different colors and shapes.
The relative synonymous codon usage (RSCU) patterns of CTV isolates and their respective citrus hosts.
| AA | Codon | Ca-CTV | Cr-CTV | Cs-CTV |
|
|
|
|---|---|---|---|---|---|---|---|
| Phe | UUU |
|
| 1.01 |
|
|
|
| UUC | 0.89 | 0.86 | 0.99 | 0.80 | 0.92 | 0.76 | |
| Leu | UUA |
|
|
| 0.95 | 0.88 | 0.87 |
| UUG |
|
|
|
|
|
| |
| CUU |
|
|
|
|
|
| |
| CUC | 0.01 | 0.01 | 0.01 |
| 0.91 | 0.71 | |
| CUA | 0.24 | 0.26 | 0.31 | 0.48 | 0.43 | 0.57 | |
| CUG | 0.54 | 0.50 | 0.46 | 0.52 | 0.98 | 0.82 | |
| Ile | AUU | 0.91 | 0.81 | 0.80 |
|
|
|
| AUC | 0.50 | 0.59 | 0.55 | 0.93 | 0.90 | 0.71 | |
| AUA |
|
|
| 0.68 | 0.63 | 0.77 | |
| Val | GUU |
|
|
|
|
|
|
| GUC | 1.00 | 1.01 |
| 0.90 | 0.84 | 0.63 | |
| GUA | 0.51 | 0.48 | 0.37 | 0.55 | 0.54 | 0.66 | |
| GUG |
|
|
| 0.98 |
|
| |
| Pro | CCU |
|
|
|
|
|
|
| CCC | 0.03 | 0.04 | 0.06 | 0.66 | 0.69 | 0.58 | |
| CCA | 0.37 | 0.22 | 0.28 |
|
|
| |
| CCG |
|
|
| 0.32 | 0.49 | 0.45 | |
| Thr | ACU |
|
|
|
|
|
|
| ACC | 0.89 | 0.78 | 0.77 | 0.99 | 0.84 | 0.75 | |
| ACA | 0.66 | 0.64 | 0.65 |
|
|
| |
| ACG | 0.73 | 0.72 | 0.73 | 0.47 | 0.45 | 0.42 | |
| Ala | GCU |
|
|
|
|
|
|
| GCC | 0.76 | 0.82 | 0.87 |
| 1.01 | 0.69 | |
| GCA | 0.88 | 0.84 | 0.83 | 0.78 |
|
| |
| GCG | 0.47 | 0.48 | 0.44 | 0.25 | 0.46 | 0.32 | |
| Tyr | UAU |
|
|
| 0.94 | 1.03 |
|
| UAC | 0.71 | 0.72 | 0.71 |
| 0.97 | 0.77 | |
| Ser | UCU |
|
|
|
|
|
|
| UCC | 0.71 | 0.56 | 0.53 | 0.96 | 0.84 | 0.72 | |
| UCA | 0.75 | 0.84 | 0.81 |
|
|
| |
| UCG | 0.38 | 0.32 | 0.30 | 0.48 | 0.69 | 0.42 | |
| AGU |
|
|
| 0.80 | 0.87 |
| |
| AGC | 0.44 | 0.29 | 0.41 | 0.89 | 0.85 | 0.77 | |
| Arg | AGA |
|
|
|
|
|
|
| AGG | 0.69 | 0.75 | 0.76 | 0.83 |
|
| |
| CGU |
|
|
| 0.89 | 0.91 | 0.79 | |
| CGC | 0.55 | 0.49 | 0.53 | 0.49 | 0.70 | 0.48 | |
| CGA | 0.95 | 1.01 | 0.92 | 0.89 | 0.69 | 0.70 | |
| CGG | 0.45 | 0.44 | 0.47 | 0.52 | 0.56 | 0.55 | |
| Cys | UGU |
|
|
| 0.97 | 0.76 |
|
| UGC | 0.00 | 0.05 | 0.05 | 1.03 |
| 0.87 | |
| His | CAU | 0.59 | 0.62 | 0.61 |
|
|
|
| CAC |
|
|
| 0.85 | 0.95 | 0.68 | |
| Gln | CAA | 1.01 | 1.04 | 1.02 | 1.02 | 1.00 | 1.10 |
| CAG | 0.99 | 0.96 | 0.98 | 0.98 | 1.00 | 0.90 | |
| Asn | AAU | 0.52 | 0.59 | 0.59 | 0.83 |
|
|
| AAC |
|
|
|
| 0.89 | 0.70 | |
| Lys | AAA | 0.84 | 0.84 | 0.79 |
| 0.93 | 0.99 |
| AAG |
|
|
| 0.95 |
| 1.01 | |
| Asp | GAU |
|
|
|
|
|
|
| GAC | 0.84 | 0.83 | 0.84 | 0.71 | 0.67 | 0.58 | |
| Glu | GAA |
|
|
| 1.04 | 0.99 |
|
| GAG | 0.68 | 0.74 | 0.65 | 0.96 | 1.01 | 0.90 | |
| Gly | GGU |
|
|
|
|
|
|
| GGC | 0.65 | 0.73 | 0.75 |
| 0.98 | 0.73 | |
| GGA | 0.76 | 0.74 | 0.77 |
|
|
| |
| GGG | 0.97 | 1.00 | 0.99 | 0.50 | 0.71 | 0.77 | |
| Trp | UGG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
| Met | AUG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 |
AA represents the three-letter abbreviation code of 20amino acids. Ca-CTV, Cr-CTV, and Cs-CTV denote the CTV isolates originated from C. aurantifolia, C. reticulata, and C. sinensis, respectively. RSCU values of high-frequency codons/abundantly used codons of the virus and its host are marked in bold. Host-preferred high-frequency codons (HFCH) of CTV showing a coincident relationship with host codons are highlighted with gray color, and the virus-preferred high-frequency codons (HFCV) of CTV showing an antagonist relationship with host codons are highlighted with green color.
Figure 5Contribution of high-frequency codons. Bar diagram showing the percentage of 20 amino acids in the coat protein of the three CTV subgroup isolates. Different color stacked bars indicate the percentages of amino acid encoded by the host-preferred high-frequency codon (HFCH), virus-preferred high-frequency codon (HFCV) and the other synonymous codon (OSC). CHFCH, CHFCV, and COSC denote contributions of host-preferred high-frequency codon, virus-preferred high-frequency codon, and other synonymous codon. Pie diagram represents the percentages of CHFCH, CHFCV, and COSC encoded amino acids in coat protein of different CTV subgroups. Different CTV subgroups are indicated with different color markers as shown to the right of the figure.