| Literature DB >> 24091362 |
Abstract
To assess the codon evolution in virus-host systems, Avian coronavirus and its natural host Gallus gallus were used as a model. Codon usage (CU) was measured for the viral spike (S), nucleocapsid (N), nonstructural protein 2 (NSP2) and papain-like protease (PL(pro)) genes from a diverse set of A. coronavirus lineages and for G. gallus genes (lung surfactant protein A, intestinal cholecystokinin, oviduct ovomucin alpha subunit, kidney vitamin D receptor and the ubiquitary beta-actin) for different A. coronavirus replicating sites. Relative synonymous codon usage (RSCU) trees accommodating all virus and host genes in a single topology showed a higher proximity of A. coronavirus CU to the respiratory tract for all genes. The codon adaptation index (CAI) showed a lower adaptation of S to G. gallus compared to NSP2, PL(pro) and N. The effective number of codons (Nc) and GC3% revealed that natural selection and genetic drift are the evolutionary forces driving the codon usage evolution of both A. coronavirus and G. gallus regardless of the gene being considered. The spike gene showed only one 100% conserved amino acid position coded by an A. coronavirus preferred codon, a significantly low number when compared to the three other genes (p<0.0001). Virus CU evolves independently for each gene in a manner predicted by the protein function, with a balance between natural selection and mutation pressure, giving further molecular basis for the viruses' ability to exploit the host's cellular environment in a concerted virus-host molecular evolution.Entities:
Keywords: Avian coronavirus; Codon usage; Gallus gallus; Virus–host
Mesh:
Substances:
Year: 2013 PMID: 24091362 PMCID: PMC7114390 DOI: 10.1016/j.virusres.2013.09.033
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 3.303
Fig. 1Neighbor-joining distance tree for the relative synonymous codon usage (RSCU) for the Avian coronavirus spike (S), nucleocapsid (N), non-structural protein 2 (NSP2) and papain-like protease (PLpro) genes and the Gallus gallus beta-actin, lung surfactant protein A (SFTPA1, gray background), intestinal cholecystokinin (CCK), oviduct ovomucin alpha subunit (OSA) and kidney vitamin D receptor genes. The tree was based on binary data using the value 1 for RSCUs > 1 (codon is preferred) or 0 for RSCUs ≤ 1 when the codon is not preferred (RSCU < 1) or is neutral (RSCU = 1). ENC (effective number of codons) values <40 and >45 are marked with an asterisk and a hash, respectively; sequences with ENC values between 40 and 45 have no marks. The arrow indicates the separation between G. gallus and Avian coronavirus clusters. Numbers at each node are bootstrap values (1000 replicates, only values >50 are shown). The bar represents the codon usage preferences distance.
Fig. 2Four graphs showing the expected (seen in the curves of each graph) and observed (seen in the points of each graph) effective number of codons (ENC and Nc, respectively) (Y axis) and the expected and observed GC3% (X axis) for (a) Avian coronavirus spike (S); (b) nucleocapsid (N); (c) non-structural protein 2 (NSP2) and (d) papain-like protease (PLpro) (dots) and Gallus gallus beta-actin, lung surfactant protein A, intestinal cholecystokinin, oviduct ovomucin alpha subunit and kidney vitamin D receptor (asterisks).
Fig. 3Boxplot distribution for the codon adaptation index (CAI) for Avian coronavirus spike (S), nucleocapsid (N), non-structural protein 2 (NSP2) and papain-like protease (PLpro) and Gallus gallus beta-actin, lung surfactant protein A, intestinal cholecystokinin, oviduct ovomucin alpha subunit and kidney vitamin D receptor (represented together in a single boxplot).
Conserved amino acid (aa) positions in the Avian coronavirus spike (S), nucleocapsid (N), non-structural protein 2 (NSP2) and papain-like protease (PLpro) genes coded by a preferred codon and the preferred codon for each aa in the Gallus gallus beta-actin (B-act), lung surfactant protein A (SFTPA1), intestinal cholecystokinin (CCK), oviduct ovomucin alpha subunit (Ovo) and kidney vitamin D receptor (ViTD rec) genes. Tryptophan and methionine, coded by a single codon, were excluded. Codon preference was indicated by relative synonymous codon usage (RSCU) >1. Positions are provided only for Avian coronavirus genes as G. gallus genes were used as the reference for comparison.
| aa | B-act | CCK | Ovo | SFTPA1 | VitD rec | S | Position | N | Position | Nsp2 | Position | PLPro | Position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| F | UUC | UUC | UUU/UUC | UUU | UUC | UUU | 155 | UUU | 313, 390 | UUU | 52, 64, 101, 138, 200, 217 | UUU | 28, 54, 57, 102, 144, 211, 220, 270, 326, 369, 393 |
| L | CUC/CUG | CUC/CUG | CUG | UUG/CUU/CUA/CUG | CUC/CUG | NC | NC | NC | NC | NC | NC | CUU | 16, 47, 58, 94, 111, 129, 273, 288, 315, 413 |
| I | AUU/AUC | AUC | AUU/AUC | AUU | AUC | NC | NC | AUU | 319, 397 | AUU | 23, 199, 210 | AUU | 52, 133, 194, 226, 383, 429 |
| V | GUG | GUG | GUU/GUG | GUU/GUG | GUC/GUG | NC | NC | NC | NC | GUU | 154, 163, 226, 228, 235 | GUU | 191, 192, 317, 323, 324, 371, 394, 430, 435 |
| S | UCU/UCC/AGC | UCC/AGC | UCU/UCC/UCA/AGU/AGC | UCU/AGU/AGC | UCC/AGC | NC | NC | UCA | 340, 344 | NC | NC | NC | NC |
| P | CCU/CCC | CCC | CCU/CCC/CCA | CCU | CCC | NC | NC | CCA | 338 | NC | NC | CCU | 178, 294, 338 |
| T | ACC/ACA | ACA | ACU/ACC/ACA | ACU/ACA | ACC/ACG | NC | NC | NC | NC | ACU | 123, 167, 241 | NC | NC |
| A | GCC | GCU/GCG | GCU/GCC/GCA | GCU/GCA | GCC | NC | NC | GCA | 376 | NC | NC | NC | NC |
| Y | UAC | UAC | UAU/UAC | UAU/UAC | UAC | NC | NC | UAU | 316 | NC | NC | NC | NC |
| H | CAC | CAC | CAU/CAC | CAU | CAC | NC | NC | NF | NF | NC | NC | CAU | 143, 201 |
| Q | CAG | CAG | CAA/CAG | CAA | CAG | NC | NC | CAG | 312, 369, 387 | NC | NC | NC | NC |
| N | AAC | All RSCUs = 1 | AAC | AAU | AAC | NC | NC | AAU | 315, 385, 407 | NC | NC | AAU | 27, 82, 97, 140, 186, 296, 343 |
| K | AAG | AAG | AAA | AAA | AAG | NC | NC | NC | NC | AAA | 6, 21, 86 | NC | NC |
| D | GAU | GAU | GAU/GAC | GAC | GAC | NC | NC | GAU | 314, 374 | NC | NC | GAU | 5, 105, 160, 176, 182, 184, 217, 258, 281 |
| E | GAG | All RSCUs = 1 | GAA | GAG | GAG | NC | NC | NC | NC | GAA | 98, 136, 142, 165 | GAA | 130, 164, 185, 342 |
| C | UGC | UGC | UGU | UGU | UGC | NC | NC | UGU | 320, 323 | UGU | 68, 242 | UGU | 132, 154, 183, 202, 439 |
| R | CGU/AGA | CGC/CGG/AGG | AGA/AGG | CGA/AGA | CGC/CGG/AGG | NC | NC | AGA | 349 | CGU | 54, 111 | NC | NC |
| G | GGU/GGC | GGC | GGA | GGA | GGC | NC | NC | NC | NC | NC | NC | GGU | 56, 86, 177, 319, 402 |
NC: no 100% conserved amino acids positions coded by the preferred codon; NF: amino acid not found in the sequence.
The mean number of amino acid residues in the sequences used for this study from the Avian coronavirus spike (S), nucleocapsid (N), non-structural protein 2 (NSP2) and papain-like protease (PLpro) genes coded by a preferred codon and the preferred codon for each aa in the Gallus gallus beta-actin (B-act), lung surfactant protein A (SFTPA1), intestinal cholecystokinin (CCK), oviduct ovomucin alpha subunit (Ovo) and kidney vitamin D receptor (ViTD rec) genes.
| Amino acid | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| S | N | Nsp2 | PLpro | B-act | CCK | Ovo | SFTPA1 | VitD rec | |
| F | 9.2 | 3 | 15.9 | 21.8 | 13 | 3 | 87 | 5 | 24 |
| L | 14.6 | 4.28 | 26.7 | 37.7 | 27 | 12 | 111 | 27 | 43 |
| I | 5.8 | 2.04 | 13.1 | 19.3 | 28 | 5 | 112 | 8 | 20 |
| V | 14.3 | 7.08 | 23.1 | 38.3 | 22 | 5 | 137 | 10 | 22 |
| S | 19.8 | 7.24 | 15.2 | 33.3 | 25 | 16 | 163.5 | 14 | 45 |
| P | 7.0 | 9.96 | 8.1 | 14.7 | 19 | 8 | 116 | 9 | 24 |
| T | 13.1 | 5.52 | 13.6 | 26.6 | 26 | 4 | 147 | 11 | 19 |
| A | 14.2 | 5.6 | 28.0 | 36.3 | 29 | 13 | 85.5 | 15 | 24.5 |
| Y | 11.0 | 1.2 | 3.0 | 19.1 | 15 | 5 | 76 | 12 | 7 |
| H | 5.3 | 0 | 1.0 | 5.3 | 9 | 4 | 46.5 | 1 | 13 |
| Q | 7.1 | 6.12 | 13.9 | 11.9 | 12 | 8 | 77.5 | 12 | 21 |
| N | 12.6 | 5.8 | 3.9 | 28.1 | 9 | 2 | 103.5 | 14 | 13 |
| K | 6.0 | 11.48 | 21.2 | 33.7 | 19 | 3 | 133.5 | 14 | 28 |
| D | 2.6 | 13.6 | 12.2 | 28.5 | 23 | 6 | 118 | 7 | 33 |
| E | 1.8 | 9.96 | 12.9 | 22.3 | 26 | 6 | 131 | 19 | 33.5 |
| C | 7.7 | 2.16 | 5.0 | 11.9 | 6 | 2 | 201 | 8 | 13 |
| R | 2.9 | 8.24 | 11.8 | 13.1 | 18 | 10.5 | 60 | 7 | 26 |
| G | 11.4 | 4.56 | 9.1 | 22.3 | 28 | 13 | 142 | 20 | 18 |
| M | 4.3 | 0 | 5.4 | 2.3 | 17 | 3 | 37 | 5 | 22 |
| W | 2.6 | 1 | 2.0 | 10.0 | 4 | 1.5 | 23 | 4 | 2 |