| Literature DB >> 27893824 |
Hongju Wang1, Siqing Liu2, Bo Zhang2, Wenqiang Wei1.
Abstract
Zika virus (ZIKV) is a mosquito-borne virus (arbovirus) in the family Flaviviridae, and the symptoms caused by ZIKV infection in humans include rash, fever, arthralgia, myalgia, asthenia and conjunctivitis. Codon usage bias analysis can reveal much about the molecular evolution and host adaption of ZIKV. To gain insight into the evolutionary characteristics of ZIKV, we performed a comprehensive analysis on the codon usage pattern in 46 ZIKV strains by calculating the effective number of codons (ENc), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and other indicators. The results indicate that the codon usage bias of ZIKV is relatively low. Several lines of evidence support the hypothesis that translational selection plays a role in shaping the codon usage pattern of ZIKV. The results from a correspondence analysis (CA) indicate that other factors, such as base composition, aromaticity, and hydrophobicity may also be involved in shaping the codon usage pattern of ZIKV. Additionally, the results from a comparative analysis of RSCU between ZIKV and its hosts suggest that ZIKV tends to evolve codon usage patterns that are comparable to those of its hosts. Moreover, selection pressure from Homo sapiens on the ZIKV RSCU patterns was found to be dominant compared with that from Aedes aegypti and Aedes albopictus. Taken together, both natural translational selection and mutation pressure are important for shaping the codon usage pattern of ZIKV. Our findings contribute to understanding the evolution of ZIKV and its adaption to its hosts.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27893824 PMCID: PMC5125587 DOI: 10.1371/journal.pone.0166260
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The accession number, origin and isolation year, and indices of polyprotein-coding region of each ZIKV isolate.
| Accession | GC3s | GC | Mononucleotide frequency [%] | GC12 | GC3 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No | number | Species | CAI | ENc | [%] | [%] | U3s | C3s | A3s | G3s | Gravy | Aromo | [%] | [%] |
| 1 | LC002520 | Uganda(1947) | 0.737 | 52.94 | 50.80 | 50.70 | 24.97 | 31.64 | 34.21 | 31.23 | -0.140 | 0.080 | 49.07 | 53.94 |
| 2 | KF383116 | Senegal(1968) | 0.739 | 52.58 | 51.20 | 51.00 | 24.78 | 31.71 | 33.85 | 31.73 | -0.142 | 0.079 | 49.22 | 54.38 |
| 3 | EU545988 | Micronesia(1968) | 0.735 | 53.64 | 51.80 | 51.00 | 25.28 | 31.93 | 32.65 | 32.28 | -0.148 | 0.081 | 49.11 | 54.88 |
| 4 | KJ776791 | Polynesia(2013) | 0.735 | 53.36 | 52.00 | 51.10 | 24.84 | 32.19 | 32.85 | 32.27 | -0.147 | 0.081 | 49.11 | 55.08 |
| 5 | KU312312 | Suriname(2015) | 0.735 | 53.40 | 52.00 | 51.10 | 24.79 | 32.24 | 32.86 | 32.24 | -0.147 | 0.081 | 49.15 | 55.08 |
| 6 | KU321639 | Brazil(2015) | 0.736 | 53.28 | 52.00 | 51.10 | 24.85 | 32.24 | 32.82 | 32.23 | -0.148 | 0.081 | 49.11 | 55.11 |
| 7 | KF383119 | Senegal(2001) | 0.738 | 52.92 | 50.80 | 50.90 | 25.00 | 31.70 | 34.07 | 31.19 | -0.139 | 0.079 | 49.24 | 54.03 |
| 8 | KF383118 | Senegal(2001) | 0.741 | 52.67 | 50.80 | 50.80 | 24.99 | 31.52 | 34.12 | 31.32 | -0.151 | 0.078 | 49.14 | 54.03 |
| 9 | KF383115 | CAF | 0.738 | 52.56 | 50.00 | 50.50 | 25.68 | 30.97 | 34.42 | 31.03 | -0.140 | 0.080 | 48.98 | 53.13 |
| 10 | KF268949 | CAF(1980) | 0.737 | 52.69 | 49.60 | 50.40 | 25.74 | 31.09 | 34.90 | 30.34 | -0.142 | 0.080 | 49.08 | 52.87 |
| 11 | KF268948 | CAF(1979) | 0.739 | 52.64 | 49.90 | 50.40 | 25.67 | 31.24 | 34.59 | 30.57 | -0.141 | 0.080 | 49.07 | 53.17 |
| 12 | KF268950 | CAF(1976) | 0.739 | 52.67 | 49.90 | 50.40 | 25.67 | 31.24 | 34.61 | 30.55 | -0.140 | 0.080 | 49.07 | 53.14 |
| 13 | KF383117 | Senegal(1997) | 0.735 | 52.77 | 50.30 | 50.60 | 25.66 | 30.72 | 34.03 | 31.68 | -0.141 | 0.080 | 49.04 | 53.42 |
| 14 | KU509998 | Haiti(2014) | 0.736 | 53.30 | 52.00 | 51.10 | 24.85 | 32.24 | 32.82 | 32.22 | -0.148 | 0.081 | 49.12 | 55.11 |
| 15 | KU501217 | Guatemala(2015) | 0.735 | 53.15 | 51.90 | 51.10 | 24.82 | 32.23 | 33.07 | 32.03 | -0.144 | 0.081 | 49.11 | 54.91 |
| 16 | KU365777 | Brazil(2015) | 0.735 | 53.27 | 51.90 | 51.10 | 24.83 | 32.22 | 32.95 | 32.14 | -0.147 | 0.081 | 49.14 | 54.99 |
| 17 | KU501216 | Guatemala(2015) | 0.735 | 53.18 | 51.80 | 51.00 | 24.85 | 32.20 | 33.07 | 32.03 | -0.144 | 0.081 | 49.11 | 54.88 |
| 18 | KU501215 | PR | 0.735 | 53.25 | 52.10 | 51.10 | 24.65 | 32.40 | 32.95 | 32.12 | -0.147 | 0.081 | 49.12 | 55.14 |
| 19 | KU365780 | Brazil(2015) | 0.735 | 53.27 | 51.90 | 51.10 | 24.87 | 32.18 | 32.95 | 32.14 | -0.147 | 0.081 | 49.14 | 54.96 |
| 20 | KU365779 | Brazil(2015) | 0.735 | 53.29 | 51.90 | 51.10 | 24.87 | 32.18 | 32.95 | 32.14 | -0.147 | 0.081 | 49.12 | 54.96 |
| 21 | KU365778 | Brazil(2015) | 0.735 | 53.34 | 51.90 | 51.10 | 24.83 | 32.22 | 33.03 | 32.05 | -0.147 | 0.081 | 49.14 | 54.94 |
| 22 | KU647676 | Martinique(2015) | 0.735 | 53.21 | 51.90 | 51.10 | 24.82 | 32.24 | 33.07 | 32.00 | -0.146 | 0.081 | 49.12 | 54.94 |
| 23 | KU866423 | China(2016) | 0.735 | 53.32 | 52.00 | 51.10 | 24.70 | 32.27 | 32.96 | 32.24 | -0.147 | 0.081 | 49.14 | 55.11 |
| 24 | KU870645 | USA(2016) | 0.734 | 53.29 | 51.90 | 51.10 | 24.89 | 32.12 | 32.96 | 32.16 | -0.142 | 0.080 | 49.14 | 54.91 |
| 25 | KU926310 | Brazil(2016) | 0.735 | 53.28 | 52.00 | 51.10 | 24.72 | 32.32 | 32.98 | 32.12 | -0.147 | 0.081 | 49.12 | 55.05 |
| 26 | KU926309 | Brazil(2016) | 0.736 | 53.22 | 52.10 | 51.20 | 24.67 | 32.31 | 32.86 | 32.30 | -0.146 | 0.081 | 49.12 | 55.17 |
| 27 | KU922960 | Mexico(2016) | 0.736 | 53.21 | 51.90 | 51.10 | 24.83 | 32.23 | 33.05 | 32.03 | -0.147 | 0.081 | 49.10 | 54.94 |
| 28 | KU820898 | China(2016) | 0.736 | 53.27 | 51.90 | 51.10 | 24.73 | 32.27 | 33.09 | 32.04 | -0.146 | 0.081 | 49.14 | 54.96 |
| 29 | KU853013 | Italy(2016) | 0.736 | 53.23 | 52.00 | 51.10 | 24.72 | 32.33 | 33.04 | 32.01 | -0.146 | 0.081 | 49.17 | 54.99 |
| 30 | KU853012 | Italy(2016) | 0.736 | 53.27 | 52.00 | 51.10 | 24.74 | 32.35 | 32.99 | 32.04 | -0.147 | 0.081 | 49.15 | 55.02 |
| 31 | KU955591 | Senegal(1984) | 0.740 | 52.67 | 51.20 | 51.00 | 24.91 | 31.58 | 33.73 | 31.91 | -0.143 | 0.080 | 49.21 | 54.41 |
| 32 | KU955592 | Senegal(1984) | 0.740 | 52.67 | 51.20 | 50.90 | 24.94 | 31.54 | 33.69 | 31.96 | -0.143 | 0.080 | 49.19 | 54.41 |
| 33 | KU955593 | Cambodia(2010) | 0.735 | 53.53 | 51.60 | 51.00 | 25.40 | 31.72 | 32.74 | 32.31 | -0.148 | 0.081 | 49.17 | 54.73 |
| 34 | KU955594 | Uganda(1947) | 0.738 | 52.89 | 50.90 | 50.80 | 24.79 | 31.80 | 34.20 | 31.25 | -0.139 | 0.080 | 49.09 | 54.09 |
| 35 | KU955595 | Senegal(1984) | 0.740 | 52.68 | 51.30 | 51.00 | 24.87 | 31.61 | 33.69 | 31.96 | -0.143 | 0.079 | 49.19 | 54.47 |
| 36 | KX056898 | China(2016) | 0.735 | 53.27 | 51.70 | 51.10 | 24.94 | 32.06 | 33.08 | 32.05 | -0.147 | 0.081 | 49.17 | 54.82 |
| 37 | KU681082 | Philippines(2012) | 0.735 | 53.55 | 51.50 | 51.00 | 25.29 | 31.69 | 33.04 | 32.07 | -0.149 | 0.081 | 49.14 | 54.59 |
| 38 | KU681081 | Thailand(2014) | 0.735 | 53.56 | 51.90 | 51.10 | 24.77 | 32.34 | 33.00 | 31.96 | -0.146 | 0.081 | 49.17 | 55.02 |
| 39 | KU761564 | China(2016) | 0.736 | 53.25 | 51.90 | 51.10 | 24.76 | 32.26 | 33.03 | 32.09 | -0.146 | 0.081 | 49.14 | 54.99 |
| 40 | KU729218 | Brazil(2015) | 0.735 | 53.33 | 52.00 | 51.10 | 24.93 | 32.09 | 32.81 | 32.31 | -0.148 | 0.081 | 49.14 | 54.99 |
| 41 | KU744693 | China(2016) | 0.737 | 53.29 | 52.10 | 51.20 | 24.77 | 32.43 | 32.75 | 32.07 | -0.147 | 0.081 | 49.24 | 55.20 |
| 42 | KU729217 | Brazil(2015) | 0.736 | 53.21 | 52.00 | 51.10 | 24.71 | 32.31 | 32.99 | 32.16 | -0.146 | 0.081 | 49.15 | 55.02 |
| 43 | KU720415 | Uganda(1947) | 0.738 | 52.91 | 50.90 | 50.70 | 24.86 | 31.75 | 34.21 | 31.23 | -0.140 | 0.080 | 49.07 | 54.03 |
| 44 | KU497555 | Brazil(2015) | 0.736 | 53.26 | 52.00 | 51.10 | 24.68 | 32.32 | 33.04 | 32.11 | -0.145 | 0.081 | 49.11 | 55.05 |
| 45 | KU707826 | Brazil(2015) | 0.735 | 53.29 | 51.90 | 51.10 | 24.87 | 32.18 | 32.95 | 32.14 | -0.147 | 0.081 | 49.12 | 54.96 |
| 46 | KU527068 | Brazil(2015) | 0.735 | 53.38 | 52.00 | 51.10 | 24.75 | 32.28 | 32.94 | 32.18 | -0.144 | 0.081 | 49.13 | 55.05 |
a CAI represents codon adaptation index.
b ENc represents the effective number of codons.
c GC3s represents the frequency of the nucleotides G+C at the third positions of synonymous codons.
d GC represents the G+C content.
e U3s represents the frequency of the nucleotide U at the third positions of codons.
f C3s represent the frequency of the nucleotide C at the third positions of codons.
g A3s represents the frequency of the nucleotide A at the third positions of codons.
h G3s represents the frequency of the nucleotide G at the third positions of codons.
i Gravy represents the hydrophobicity of protein.
j Aromo represents the aromaticity of protein.
k GC12 represents the G+C content at the first and second positions of codons.
l GC3 represents the G+C content at the third positions of codons.
m represents Central African Republic.
n represents Puerto Rico.
Fig 1Phylogenetic tree based on the polyprotein-coding regions of 46 ZIKV strains.
The tree was generated by the neighbor-joining (NJ) method using MEGA6 software. The bootstraps values were calculated with 1000 replicates.
The relative synonymous codon usage frequency (RSCU) of ZIKV and its natural hosts.
| AA | Codon | RSCU | |||
|---|---|---|---|---|---|
| ZIKV | |||||
| Phe | UUU | 1.008 | 0.87 | 0.56 | 0.48 |
| UUC | 0.992 | 1.13 | 1.44 | 1.52 | |
| Leu | UUA | 0.330 | 0.39 | 0.35 | 0.23 |
| UUG | 1.323 | 0.73 | 1.34 | 1.11 | |
| CUU | 0.801 | 0.73 | 0.67 | 0.49 | |
| CUC | 1.000 | 1.21 | 0.81 | 0.87 | |
| CUA | 0.659 | 0.40 | 0.54 | 0.57 | |
| CUG | 1.892 | 2.53 | 2.28 | 2.73 | |
| Ile | AUU | 0.864 | 1.03 | 1.00 | 0.74 |
| AUC | 1.167 | 1.52 | 1.59 | 1.86 | |
| AUA | 0.967 | 0.44 | 0.40 | 0.40 | |
| Val | GUU | 0.843 | 0.69 | 1.05 | 0.88 |
| GUC | 1.099 | 1.00 | 1.09 | 1.30 | |
| GUA | 0.387 | 0.42 | 0.60 | 0.51 | |
| GUG | 1.668 | 1.90 | 1.26 | 1.31 | |
| Ser | UCU | 0.854 | 1.11 | 0.67 | 0.54 |
| UCC | 0.985 | 1.39 | 1.20 | 1.40 | |
| UCA | 1.515 | 0.84 | 0.68 | 0.48 | |
| UCG | 0.420 | 0.33 | 1.41 | 1.70 | |
| AGU | 0.997 | 0.84 | 0.93 | 0.79 | |
| AGC | 1.228 | 1.50 | 1.11 | 1.08 | |
| Pro | CCU | 0.671 | 1.12 | 0.67 | 0.35 |
| CCC | 1.129 | 1.35 | 0.83 | 1.13 | |
| CCA | 1.809 | 1.07 | 1.20 | 1.07 | |
| CCG | 0.391 | 0.46 | 1.30 | 1.44 | |
| Thr | ACU | 0.980 | 0.94 | 0.80 | 0.64 |
| ACC | 1.133 | 1.52 | 1.48 | 1.79 | |
| ACA | 1.478 | 1.07 | 0.70 | 0.58 | |
| ACG | 0.409 | 0.46 | 1.01 | 0.99 | |
| Ala | GCU | 1.132 | 1.09 | 1.09 | 0.99 |
| GCC | 1.290 | 1.64 | 1.48 | 1.81 | |
| GCA | 1.125 | 0.85 | 0.75 | 0.59 | |
| GCG | 0.453 | 0.42 | 0.69 | 0.62 | |
| Tyr | UAU | 0.761 | 0.84 | 0.64 | 0.55 |
| UAC | 1.239 | 1.16 | 1.36 | 1.45 | |
| His | CAU | 0.863 | 0.81 | 0.84 | 0.75 |
| CAC | 1.137 | 1.19 | 1.16 | 1.25 | |
| Gln | CAA | 1.112 | 0.51 | 0.81 | 0.60 |
| CAG | 0.888 | 1.49 | 1.19 | 1.40 | |
| Asn | AAU | 0.692 | 0.89 | 0.79 | 0.64 |
| AAC | 1.308 | 1.11 | 1.21 | 1.36 | |
| Lys | AAA | 0.854 | 0.82 | 0.79 | 0.58 |
| AAG | 1.146 | 1.18 | 1.21 | 1.42 | |
| Asp | GAU | 0.903 | 0.89 | 1.12 | 0.96 |
| GAC | 1.097 | 1.11 | 0.88 | 1.04 | |
| Glu | GAA | 0.930 | 0.81 | 1.15 | 1.11 |
| GAG | 1.070 | 1.19 | 0.85 | 0.89 | |
| Cys | UGU | 0.941 | 0.86 | 0.83 | 0.69 |
| UGC | 1.060 | 1.14 | 1.17 | 1.31 | |
| Arg | CGU | 0.450 | 0.51 | 1.36 | 1.5 |
| CGC | 0.600 | 1.20 | 1.25 | 1.32 | |
| CGA | 0.253 | 0.63 | 1.17 | 0.97 | |
| CGG | 0.559 | 1.20 | 1.05 | 1.22 | |
| AGA | 2.443 | 1.20 | 0.64 | 0.58 | |
| AGG | 1.692 | 1.26 | 0.53 | 0.41 | |
| Gly | GGU | 0.529 | 0.64 | 1.10 | 1.24 |
| GGC | 0.672 | 1.40 | 1.04 | 1.07 | |
| GGA | 1.793 | 0.98 | 1.49 | 1.21 | |
| GGG | 1.006 | 0.98 | 0.37 | 0.47 | |
a AA represents amino acid.
b The “RSCU” value represents the pattern of relative synonymous codon usage.
Relative abundance of the 16 dinucleotides in polyprotein-coding region of 46 ZIKV strains.
| Dinucleotides | Range | Mean + SD |
|---|---|---|
| AA | 0.944–1.004 | 0.987 ± 0.020 |
| AU | 0.969–1.000 | 0.981 ± 0.010 |
| AC | 0.904–0.984 | 0.967 ± 0.012 |
| AG | 1.032–1.077 | 1.049 ± 0.013 |
| UA | 0.514–0.553 | 0.535 ± 0.009 |
| UU | 0.965–1.064 | 0.999 ± 0.016 |
| UC | 0.985–1.036 | 1.015 ± 0.012 |
| UG | 1.414–1.492 | 1.434 ± 0.023 |
| CA | 1.286–1.357 | 1.308 ± 0.019 |
| CU | 1.213–1.277 | 1.256 ± 0.014 |
| CC | 1.099–1.148 | 1.112 ± 0.010 |
| CG | 0.373–0.454 | 0.431 ± 0.025 |
| GA | 1.049–1.159 | 1.123 ± 0.018 |
| GU | 0.818–0.857 | 0.831 ± 0.008 |
| GC | 0.990–0.992 | 0.931 ± 0.013 |
| GG | 0.445–1.079 | 1.046 ± 0.091 |
The correlation analysis between the A, U, C, G contents and A3s, U3s, C3s, G3s contents in all selected ZIKV strains.
| A3s | U3s | C3s | G3s | GC3s | |
|---|---|---|---|---|---|
| -0.258 | |||||
a Value in this table is the r value of correlation analysis.
** represents P-value < 0.01.
* represents 0.01 < P-value < 0.05.
Fig 2A plot of values of the first axis (Axis 1) and the second axis (Axis 2) of each polyprotein-coding region of ZIKV in correspondence analysis.
The first axis accounts for 72.93% of total variation, and the second axis accounts for 8.99% of total variation.
Correlation analysis between the first two axes and different nucleotide-relevant indices.
| GC3s | GC | ENc | A | U | C | G | A3s | U3s | C3s | G3s | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| -0.106 | |||||||||||
| 0.483 | |||||||||||
| -0.072 | -0.027 | -0.105 | 0.123 | -0.043 | -0.053 | 0.145 | 0.017 | 0.047 | -0.197 | ||
| 0.633 | 0.859 | 0.487 | 0.417 | 0.777 | 0.726 | 0.337 | 0.913 | 0.756 | 0.191 |
** represents P-value < 0.01.
* represents 0.01 < P-value < 0.05.
Fig 3Graph showing the relationship between GC3s and effective number of codon (ENc) for the polyprotein-coding regions of ZIKV.
The solid curve shows the expected ENc value if the codon usage is only determined by the variation in the GC3s.
Fig 4Correlation between codon adaptation index (CAI) and effective number of codons (ENc).
The line represents the correlation curve produced by correlation analysis.
Fig 5Correlation between the GC content at the first and second codon positions (GC12) and that at the third codon position (GC3).
The line represents the correlation curve produced by correlation analysis.
Fig 6Parity rule 2 (PR2) plot [A3/(A3+U3) against G3/(G3+C3)].
PR2 bias plot was calculated for each polyprotein-coding region of ZIKV.
Correlation analysis among AROMO, GRAVY, the first two axes, GC3s, ENc and GC in the polyprotein-coding region of ZIKV isolates.
| Axis1 | Axis2 | GC3s | GC | ENc | |
|---|---|---|---|---|---|
| -0.038 | |||||
| 0.803 | |||||
| 0.101 | |||||
| 0.506 | |||||
** represents P-value < 0.01.
Fig 7A plot of the values of the first axis (Axis 1) and the second axis (Axis 2) of ZIKV strains isolated from different continents in correspondence analysis.
Fig 8A plot of the values of the first axis (Axis 1) and the second axis (Axis 2) of ZIKV strains isolated in different years in correspondence analysis.