| Literature DB >> 27729643 |
Azeem Mehmood Butt1, Izza Nasrullah1, Raheel Qamar1, Yigang Tong2.
Abstract
The codon usage patterns of viruses reflect the evolutionary changes that allow them to optimize their survival and adapt their fitness to the external environment and, most importantly, their hosts. Here we report the genotype-specific codon usage patterns of Zika virus (ZIKV) strains from the current and previous outbreaks. Several genotype-specific and common codon usage traits were noted in the ZIKV coding sequences, indicating their independent evolutionary origins from a common ancestor. The overall influence of natural selection was more profound than that of mutation pressure, acting on a specific set of viral genes in the Asian-genotype ZIKV strains from the recent outbreak. An interplay between codon adaptation and deoptimization may have allowed the virus to adapt to multiple host and vectors and is reported for the first time in ZIKV genomes. Combining our codon analysis with geographical data on Aedes populations in the Americas suggested that ZIKV has evolved host- and vector-specific codon usage patterns to maintain successful replication and transmission chains within multiple hosts and vectors.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27729643 PMCID: PMC5117728 DOI: 10.1038/emi.2016.106
Source DB: PubMed Journal: Emerg Microbes Infect ISSN: 2222-1751 Impact factor: 7.163
Figure 1Genotype-specific comparative analysis of ENC values of ZIKV coding sequences.
The relative synonymous codon usage (RSCU) patterns of ZIKV, its host and transmission vectors
| Phe | UUU | 0.96 | 0.92 | 0.93 | 0.92 | 0.56 | 0.48 | Ser | UCU | 0.66 | 0.64 | 0.79 | 0.57 | 1.14 | 0.66 | 0.54 | |
| 0.81 | UCC | 0.90 | 0.89 | 0.89 | 1.12 | 1.32 | 1.20 | 1.38 | |||||||||
| Leu | UUA | 0.32 | 0.31 | 0.40 | 0.30 | 0.48 | 0.36 | 0.24 | 1.11 | 0.90 | 0.66 | 0.48 | |||||
| UUG | 1.32 | 1.31 | 1.37 | 1.36 | 0.78 | 1.32 | 1.14 | UCG | 0.60 | 0.58 | 0.61 | 0.74 | 0.30 | ||||
| CUU | 0.68 | 0.65 | 0.80 | 0.78 | 0.78 | 0.66 | 0.48 | AGU | 1.13 | 1.07 | 1.31 | 0.90 | 0.96 | 0.78 | |||
| CUC | 1.00 | 1.01 | 0.99 | 0.92 | 1.20 | 0.84 | 0.84 | AGC | 1.32 | 1.39 | 1.05 | 1.12 | 1.08 | 1.08 | |||
| CUA | 0.67 | 0.67 | 0.60 | 0.77 | 0.42 | 0.54 | 0.54 | Arg | 0.66 | 0.60 | |||||||
| CGU | 0.41 | 0.41 | 0.39 | 0.41 | 0.48 | ||||||||||||
| Ile | AUU | 0.88 | 0.89 | 0.93 | 0.78 | 1.08 | 0.99 | 0.75 | CGC | 0.33 | 0.32 | 0.37 | 0.38 | 1.08 | 1.26 | 1.32 | |
| CGA | 0.22 | 0.17 | 0.39 | 0.40 | 0.66 | 1.20 | 0.96 | ||||||||||
| AUA | 1.00 | 1.02 | 0.86 | 0.96 | 0.51 | 0.39 | 0.39 | CGG | 0.78 | 0.81 | 0.70 | 0.66 | 1.20 | 1.02 | 1.20 | ||
| Val | GUU | 0.71 | 0.68 | 0.82 | 0.73 | 0.72 | 1.04 | 0.88 | AGG | 1.63 | 1.64 | 1.55 | 1.68 | 0.54 | 0.42 | ||
| GUC | 1.20 | 1.27 | 0.95 | 1.04 | 0.96 | 1.08 | Cys | 0.92 | 0.84 | 0.70 | |||||||
| GUA | 0.43 | 0.39 | 0.60 | 0.58 | 0.48 | 0.60 | 0.52 | UGC | 0.83 | 0.86 | 0.69 | 0.73 | |||||
| His | CAU | 0.81 | 0.80 | 0.80 | 0.84 | 0.84 | 0.76 | ||||||||||
| Pro | CCU | 0.62 | 0.63 | 0.67 | 0.52 | 1.16 | 0.68 | 0.36 | 0.89 | ||||||||
| CCC | 1.16 | 1.16 | 1.12 | 1.26 | 0.84 | 1.12 | Gln | 0.74 | 0.54 | 0.82 | 0.60 | ||||||
| 1.12 | 1.20 | 1.08 | CAG | 0.67 | 0.64 | 0.71 | |||||||||||
| CCG | 0.39 | 0.40 | 0.31 | 0.42 | 0.44 | Asn | AAU | 0.66 | 0.63 | 0.79 | 0.76 | 0.94 | 0.80 | 0.64 | |||
| Thr | ACU | 1.05 | 1.06 | 1.01 | 0.95 | 1.00 | 0.80 | 0.64 | |||||||||
| ACC | 1.04 | 1.04 | 1.03 | 1.00 | Lys | AAA | 0.75 | 0.76 | 0.70 | 0.76 | 0.86 | 0.80 | 0.58 | ||||
| 1.12 | 0.72 | 0.60 | |||||||||||||||
| ACG | 0.43 | 0.46 | 0.31 | 0.32 | 0.44 | 1.00 | 1.00 | Asp | GAU | 0.95 | 0.97 | 0.76 | 0.92 | 0.92 | 0.96 | ||
| Ala | GCU | 1.10 | 1.10 | 1.08 | 1.14 | 1.08 | 1.08 | 1.00 | 0.88 | ||||||||
| Glu | GAA | 0.99 | 0.99 | 0.88 | 0.84 | ||||||||||||
| GCA | 1.15 | 1.12 | 1.27 | 1.23 | 0.92 | 0.76 | 0.60 | 0.99 | 0.84 | 0.90 | |||||||
| GCG | 0.49 | 0.54 | 0.35 | 0.34 | 0.44 | 0.68 | 0.60 | Gly | GGU | 0.56 | 0.54 | 0.62 | 0.56 | 0.64 | 1.12 | ||
| Tyr | UAU | 0.48 | 0.44 | 0.61 | 0.58 | 0.88 | 0.64 | 0.56 | GGC | 0.67 | 0.69 | 0.61 | 0.66 | 1.04 | 1.08 | ||
| 1.00 | 1.20 | ||||||||||||||||
| GGG | 1.03 | 1.07 | 0.83 | 0.97 | 1.00 | 0.36 | 0.48 | ||||||||||
Abbreviations: amino acid, AA; H. sapiens, HS; Ae. aegypti, AG; Ae. Albopictus, AB.
Preferred codons of ZIKV, H. sapiens, Ae. aegypti and Ae. albopictus are shown in bold.
Figure 2Correspondence analysis (CA). Genotype-specific CA plots were constructed for whole genome and individual ZIKV coding sequences. (A) Whole genome. (B) C. (C) prM. (D) E. (E) NS1. (F) NS2A. (G) NS2B. (H) NS3. (I) NS4A. (J) NS4B. (K) NS5. The description of color coding is same as that of A. The ZIKV strain isolated from Ae. aegypti is represented as a black color diamond.
Figure 3ENC–GC3 plots. The curve indicates the expected codon usage if GC compositional constraints alone account for the codon usage bias. (A) Whole genome. (B) C. (C) prM. (D) E. (E) NS1. (F) NS2A. (G) NS2B. (H) NS3. (I) NS4A. (J) NS4B. (K) NS5. The description of color coding is same as that of A.
Figure 4CAI analysis of the ZIKV coding sequences in relation to its host and vectors. East African, EA; west African, WA.
Figure 5RCDI analysis of the ZIKV coding sequences in relation to its host and vectors. East African, EA; west African, WA.
Figure 6SiD analysis of ZIKV strains in relation to its host and vectors. (A) Whole genome. (B) C. (C) prM. (D) E. (E) NS1. (F) NS2A. (G) NS2B. (H) NS3. (I) NS4A. (J) NS4B. (K) NS5. The description of color coding is same as that of A.