| Literature DB >> 29299151 |
Ye Chen1, Quanming Xu1,2, Xiaomin Yuan1,2, Xinxin Li1, Ting Zhu1, Yanmei Ma1, Ji-Long Chen1,3.
Abstract
Middle East Respiratory Syndrome Coronavirus (MERS-CoV), which first broken out in Jeddah in 2012, causes a severe acute respiratory illness with a high mortality rate. To better understand the molecular characteristics of isolated MERS-CoV genomes, we first analysed the codon usage pattern of the zoonotic MERS-CoV strains comprehensively to gain an insight into the mechanism of cross-species transmission. We found that MERS human/camel isolates showed a low codon usage bias. Both mutation and nature selection pressure have contributed to this low codon usage bias, with the former being the main determining factor. We also observed that gene function, evolution time and the different host species of the virus all contributed to the bias of MERS-CoV, to some extent. Additionally, the codon usage pattern of MERS-CoV isolates is different from other related Nidovirales viruses isolated from bats and hedgehogs. In the future, more epidemiological surveys are required to examine the factors that resulted in the emergence and outbreak of this virus.Entities:
Keywords: MERS-CoV; codon usage pattern; mutation bias; natural selection
Year: 2017 PMID: 29299151 PMCID: PMC5746386 DOI: 10.18632/oncotarget.22738
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
The nucleotide contents of MERS and MERS related isolates
| Accession | Nucleotide content | Accession | Nucleotide content | ||||||
|---|---|---|---|---|---|---|---|---|---|
| A% | C% | G% | T% | A% | C% | G% | T% | ||
| 26.13 | 20.15 | 21.25 | 32.47 | 26.15 | 20.14 | 21.23 | 32.48 | ||
| 26.12 | 20.10 | 21.23 | 32.54 | 26.13 | 20.11 | 21.24 | 32.52 | ||
| 26.13 | 20.14 | 21.29 | 32.44 | 26.14 | 20.12 | 21.24 | 32.50 | ||
| 26.13 | 20.16 | 21.25 | 32.46 | 26.14 | 20.14 | 21.24 | 32.48 | ||
| 26.13 | 20.13 | 21.29 | 32.44 | 26.30 | 19.92 | 21.21 | 32.56 | ||
| 26.14 | 20.13 | 21.25 | 32.48 | 26.30 | 19.92 | 21.21 | 32.56 | ||
| 26.14 | 20.14 | 21.25 | 32.47 | 26.15 | 20.10 | 21.24 | 32.51 | ||
| 26.14 | 20.14 | 21.25 | 32.48 | 26.15 | 20.10 | 21.24 | 32.50 | ||
| 26.13 | 20.16 | 21.24 | 32.47 | 26.16 | 20.12 | 21.22 | 32.50 | ||
| 26.13 | 20.16 | 21.24 | 32.47 | 26.16 | 20.12 | 21.23 | 32.49 | ||
| 26.13 | 20.16 | 21.24 | 32.47 | 26.15 | 20.11 | 21.23 | 32.50 | ||
| 26.13 | 20.15 | 21.26 | 32.46 | 26.15 | 20.12 | 21.23 | 32.49 | ||
| 26.13 | 20.15 | 21.26 | 32.45 | 26.16 | 20.12 | 21.23 | 32.50 | ||
| 26.13 | 20.11 | 21.26 | 32.50 | 26.15 | 20.12 | 21.24 | 32.49 | ||
| 26.13 | 20.15 | 21.25 | 32.48 | 26.15 | 20.12 | 21.24 | 32.49 | ||
| 26.13 | 20.16 | 21.24 | 32.47 | 26.14 | 20.12 | 21.25 | 32.49 | ||
| 26.12 | 20.09 | 21.23 | 32.56 | 26.15 | 20.09 | 21.25 | 32.51 | ||
| 26.12 | 20.08 | 21.24 | 32.55 | 26.17 | 20.11 | 21.21 | 32.51 | ||
| 26.19 | 20.27 | 20.96 | 32.58 | 26.17 | 20.11 | 21.21 | 32.51 | ||
| 26.19 | 20.26 | 20.96 | 32.59 | 26.17 | 20.11 | 21.21 | 32.51 | ||
| 26.19 | 20.26 | 20.96 | 32.59 | 26.50 | 21.55 | 21.71 | 30.24 | ||
| 26.19 | 20.26 | 20.96 | 32.59 | 26.57 | 21.44 | 21.62 | 30.37 | ||
| 26.11 | 20.10 | 21.24 | 32.55 | 26.88 | 21.44 | 21.62 | 30.36 | ||
| 26.11 | 20.10 | 21.24 | 32.55 | 26.58 | 21.44 | 21.62 | 30.36 | ||
| 26.11 | 20.12 | 21.23 | 32.54 | 26.50 | 21.55 | 21.71 | 30.24 | ||
| 26.14 | 20.11 | 21.24 | 32.51 | 27.54 | 17.16 | 20.72 | 34.58 | ||
| 26.14 | 20.15 | 21.22 | 32.49 | 27.54 | 17.16 | 20.73 | 34.57 | ||
| 26.12 | 20.08 | 21.22 | 32.58 | 27.54 | 17.16 | 20.73 | 34.58 | ||
| 26.14 | 20.11 | 21.25 | 32.50 | 27.53 | 17.14 | 20.75 | 34.58 | ||
| 26.12 | 20.09 | 21.23 | 32.56 | 27.56 | 17.06 | 20.83 | 34.54 | ||
| 26.14 | 20.11 | 21.25 | 32.50 | 26.45 | 19.05 | 21.28 | 33.21 | ||
| 26.15 | 20.11 | 21.24 | 32.51 | 25.14 | 20.77 | 22.55 | 31.53 | ||
| 26.21 | 20.31 | 20.99 | 32.49 | NC-022643.1 | 29.17 | 16.16 | 21.43 | 33.24 | |
| 26.19 | 20.24 | 20.96 | 32.59 | KC545386.1 | 29.17 | 16.16 | 21.43 | 33.24 | |
| 26.09 | 20.11 | 21.23 | 32.58 | KC545383.1 | 29.16 | 16.13 | 21.43 | 33.28 | |
| 26.13 | 20.10 | 21.25 | 32.51 | ||||||
The accession numbers marked in red, green, blue and black represent the MERS-CoV human isolates, the MERS-CoV camel isolates, the MERS-CoV related CoV bat isolates and the MERS-CoV related CoV hedgehog isolates, respectively.
The synonymous codon usage pattern presented in the MERS strains
| AA | Codon | RSCU/number | AA | Codon | RSCU/number |
|---|---|---|---|---|---|
| A (Ala) | GCA | 0.988/7894 | P (Pro) | CCA | 1.216/5631 |
| GCC | 0.632/5061 | CCC | 0.656/3032 | ||
| GCG | 0.308/2476 | CCG | 0.188/868 | ||
| C (Cys) | UGC | 0.806/5335 | Q (Glu) | ||
| CAG | 0.86/6620 | ||||
| D (Asp) | GAC | 0.72/8395 | R (Arg) | ||
| AGG | 0.84/2242 | ||||
| E (Glu) | CGA | 0.456/1210 | |||
| GAG | 0.95/8073 | CGC | 1.104/2948 | ||
| F (Phe) | UUC | 0.718/8016 | CGG | 0.432/1160 | |
| G (Gly) | GGA | 0.644/4116 | S (Ser) | AGC | 0.438/2457 |
| GGC | 1.008/6467 | AGU | 1.332/7527 | ||
| GGG | 0.292/1883 | UCA | 1.212/6838 | ||
| UCC | 0.714/4035 | ||||
| H (His) | CAC | 0.682/3078 | UCG | 0.186/1065 | |
| I (Ile) | AUA | 0.714/5048 | T (Thr) | ACA | 1.180/8958 |
| AUC | 0.573/4060 | ACC | 0.688/5243 | ||
| ACG | 0.176/1336 | ||||
| K (Lys) | |||||
| AAG | 0.996/11513 | V (Val) | GUA | 0.724/7162 | |
| L (Leu) | CUA | 0.456/3141 | GUC | 0.764/7536 | |
| CUC | 0.708/4859 | GUG | 0.736/7262 | ||
| CUG | 0.486/3359 | ||||
| TAC | 0.732/7604 | ||||
| UUA | 1.218/8373 | ||||
| UUG | 1.434/9867 | ||||
| N (Asn) | AAC | 0.604/6647 | |||
The bold and italic text indicates the preferentially used codons and RSCU values for the MERS strains.
The preferentially used codons for each amino acid are displayed in bold and italics.
Figure 1The relative abundance values of the 16 dinucleotides
The different colours represent the different dinucleotides. The red dotted line indicates that the relative abundance value of a dinucleotide is 1.
Figure 2The plots of ENC values against GC3s values for MERS-CoV and MERS-CoV related strains
All the points corresponding to human, camel isolated MERS-CoV strains and bat and hedgehog(erinaceus) isolated CoV were labelled in circle, square, triangle, and rhombus, respectively.
The correlations between the codon compositions (A3s, U3s, G3s, C3s, and GC3s), the ENC values, nucleotide compositions (A%, U%, G%, C%, and GC%), the first axis values, the second axis values, the Gravy values, and the Aroma values of the MERS human isolates
| A% | C% | G% | U% | GC% | 1st axis | 2nd axis | Gravy | Aroma | |
|---|---|---|---|---|---|---|---|---|---|
| U3s | 0.687** | 0.665** | -0.913** | 0.904** | -0.831** | 0.924 ** | -0.224 | -0.882** | 0.983** |
| C3s | 0.012 | -0.177 | 0.172 | -0.185 | 0.240 | -0.209 | 0.091 | 0.218 | -0.251 |
| A3s | -0.276 | -0.406 | 0.634 ** | -0.803** | 0.775 ** | -0.670** | 0.436* | 0.636 ** | -0.790** |
| G3s | -0.772 ** | -0.784 ** | 0.959** | -0.819** | 0.718** | -0.958 ** | 0.050 | 0.933** | -0.966** |
| ENC | 0.002 | 0.067 | 0.337 | -0.895** | 0.736** | -0.359 | 0.812 ** | 0.270 | -0.576 ** |
| GC3s | -0.716 ** | -0.620 ** | 0.898** | -0.920** | 0.817 ** | -0.886** | 0.232 | 0.835** | -0.945 ** |
* Signifies 0.01
Figure 3(A) The neutral analysis of GC3s against GC12s. (B) The evolutionary analysis of the GC3s and GC12s values. The solid line represents the regression line.
Figure 4CA of MERS-CoV human/camel isolates
Each viral gene is displayed in a 3-dimensional representation. The X, Y and Z-axes have arbitrary scales generated by the CA and the weight of each codon in these axes varies in different segments. The codon usage trends with time of the viral isolates are indicated by different colours. The different hosts of the MRES-CoV isolates are indicated by different shapes. (A), (B), (C), (D) and (E) represents the 3D graph of the M, N, ORF1ab, S and the complete genome, respectively, using the CA data.