| Literature DB >> 35404024 |
Zoya Shafat1, Anwar Ahmed2, Mohammad K Parvez3, Shama Parveen4.
Abstract
BACKGROUND: Hepatitis E virus (HEV) is a positive-sense RNA virus belonging to the family Hepeviridae. The genome of HEV is organized into three open-reading frames (ORFs): ORF1, ORF2, and ORF3. The ORF1 non-structural Y-domain region (YDR) has been demonstrated to play an important role in the HEV pathogenesis. The nucleotide composition, synonymous codon usage bias in conjunction with other factors influencing the viral YDR genes of HEV have not been studied. Codon usage represents a significant mechanism in establishing the host-pathogen relationship. The present study for the first time elucidates the detailed codon usage patterns of YDR among HEV and HEV-hosts (Human, Rabbit, Mongoose, Pig, Wild boar, Camel, Monkey).Entities:
Keywords: Codon usage bias; Mutation pressure; Natural selection; Nucleotide composition; YDR
Year: 2022 PMID: 35404024 PMCID: PMC9001762 DOI: 10.1186/s43141-022-00319-2
Source DB: PubMed Journal: J Genet Eng Biotechnol ISSN: 1687-157X
Nucleotide composition analysis of YDR of hepatitis E viruses (%)
| Nucleotide | HEV | Human | Rabbit | Mongoose | Pig | Wild boar | Camel | Monkey |
|---|---|---|---|---|---|---|---|---|
| 18.841 | 19.319 | 18.127 | 18.705 | 19.532 | 19.108 | 18.910 | 18.960 | |
| 30.169 | 28.022 | 29.816 | 28.287 | 28.048 | 28.391 | 27.662 | 28.287 | |
| 26.631 | 27.654 | 27.777 | 27.777 | 27.485 | 27.014 | 28.671 | 29.510 | |
| 24.357 | 25.003 | 24.277 | 25.229 | 24.933 | 25.485 | 24.755 | 23.241 | |
| 22.698 | 22.057 | 21.844 | 21.253 | 21.997 | 21.990 | 22.660 | 22.477 | |
| 26.665 | 25.688 | 27.013 | 25.993 | 25.792 | 25.908 | 21.100 | 24.770 | |
| 21.505 | 21.403 | 20.387 | 21.559 | 21.246 | 21.037 | 21.100 | 21.100 | |
| 30.131 | 30.851 | 30.784 | 31.192 | 30.963 | 31.062 | 31.467 | 31.651 | |
| 24.349 | 24.077 | 23.037 | 23.853 | 24.045 | 23.888 | 23.713 | 23.853 | |
| 28.387 | 28.186 | 28.134 | 27.981 | 28.175 | 28.158 | 27.941 | 27.522 | |
| 27.408 | 28.147 | 29.001 | 28.440 | 28.216 | 28.228 | 28.033 | 27.981 | |
| 19.853 | 19.588 | 19.826 | 19.724 | 19.562 | 19.724 | 20.312 | 20.642 | |
| 9.480 | 11.821 | 9.531 | 11.009 | 12.554 | 11.440 | 10.366 | 10.550 | |
| 36.452 | 30.193 | 34.301 | 30.886 | 30.177 | 31.108 | 30.275 | 32.568 | |
| 30.978 | 33.414 | 33.944 | 33.333 | 32.992 | 31.779 | 36.880 | 39.449 | |
| 23.088 | 24.570 | 22.222 | 24.770 | 24.275 | 25.670 | 22.477 | 17.431 | |
| 45.472 | 46.973 | 45.904 | 46.482 | 47.617 | 46.122 | 47.581 | 48.47 | |
| 54.526 | 53.025 | 54.093 | 53.516 | 52.981 | 53.876 | 54.417 | 51.528 | |
| 59.54 | 54.763 | 56.523 | 55.656 | 54.452 | 56.778 | 52.752 | 49.999 |
The values are represented as percentage
Fig. 1Comparative analysis of nucleotide composition patterns between HEV and its hosts (human, rabbit, mongoose, pig, wild boar, camel, and monkey)
Average RSCU values of the codons of the HEV YDR and comparison with the RSCU values of its natural hosts
| AA | Codon | HEV | Human | Rabbit | Mongoose | Pig | Wild boar | Camel | Monkey |
|---|---|---|---|---|---|---|---|---|---|
| UUU | 1.33 | ||||||||
| UUC | 0.52 | 0.44 | 0.56 | 0.72 | 0.67 | ||||
| UUA | 0.39 | 0.28 | 0.17 | 0.42 | 0.28 | 0.07 | 0.57 | 0.33 | |
| UUG | 0.81 | 0.77 | 0.24 | 0.84 | 0.65 | 0.79 | 0.7 | 1 | |
| CUU | |||||||||
| CUC | |||||||||
| CUA | 0.37 | 0.29 | 0.07 | 0.11 | 0.42 | 0.34 | 0 | 0.33 | |
| CUG | 0.81 | 0.92 | 1.09 | 0.67 | |||||
| AUU | 0.62 | ||||||||
| AUC | |||||||||
| AUA | 0.57 | 0.79 | 0.5 | 0.81 | 0.48 | 0.23 | |||
| GUU | |||||||||
| GUC | |||||||||
| GUA | 0.1 | 0.25 | 0.22 | 0 | 0.22 | 0.13 | 0.05 | 0 | |
| GUG | 0.24 | 0 | |||||||
| UCU | 1.19 | ||||||||
| UCC | 0.97 | 1.13 | 1.3 | 0.96 | |||||
| UCA | 0.81 | 0.89 | 0.92 | 1 | 1.07 | 0.57 | 0.59 | 0.71 | |
| UCG | 0.74 | 0.79 | 1.26 | 0.63 | 0.85 | 1.14 | 0.96 | 0.35 | |
| AGU | 0.59 | 0.51 | 0.38 | 0.5 | 0.58 | 0.88 | 0.89 | ||
| AGC | 0.52 | 0.6 | 0.76 | 0.63 | 0.53 | 0.28 | 0.74 | 0.67 | |
| CCU | 0.92 | 1.21 | 0.67 | ||||||
| CCC | 0.8 | 0.78 | 0.67 | 0.65 | 0.77 | 1.02 | 0.33 | ||
| CCA | 0.86 | 0.88 | 0.55 | 0.44 | 0.97 | 0.98 | 0.68 | ||
| CCG | 0.69 | 0.9 | 0.99 | 1 | 0.94 | 1.03 | 0.81 | ||
| ACU | 0.75 | ||||||||
| ACC | 0 | ||||||||
| ACA | 0.54 | 0.85 | 0.77 | 0.42 | 0.84 | 0.88 | 0.76 | ||
| ACG | 0.47 | 0.38 | 0.42 | 0.58 | 0.42 | 0.34 | 0.38 | ||
| GCU | |||||||||
| GCC | 0.2 | ||||||||
| GCA | 0.43 | 0.68 | 0.32 | 0.6 | 0.71 | ||||
| GCG | 0.21 | 0.64 | 0.66 | 0.67 | |||||
| UAU | |||||||||
| UAC | 0.53 | ||||||||
| CAU | |||||||||
| CAC | 0.8 | 0.66 | 0.83 | 0.76 | 0.66 | 0.52 | 0.74 | 0.86 | |
| CAA | 0.36 | 0.27 | 0.07 | 0.19 | 0.31 | 0.43 | 0.27 | 0.33 | |
| CAG | 1.73 | ||||||||
| AAU | 0.87 | 0.99 | 0.96 | 0.89 | 0.89 | 1.1 | 0.55 | 0 | |
| AAC | 1.13 | 1.01 | 1.04 | 1.11 | 1.11 | 0.9 | 1.45 | 2 | |
| AAA | 0.23 | 0.8 | 0.78 | 0.72 | 0.63 | 0.48 | |||
| AAG | 0.7 | 0.86 | |||||||
| GAU | 0.89 | ||||||||
| GAC | 0.66 | 0.7 | 0.83 | 0.87 | |||||
| GAA | 0.37 | 0.38 | 0.67 | 0.36 | 0.36 | 0.38 | 0.52 | 0.4 | |
| GAG | |||||||||
| UGU | 0.29 | 0.67 | 0.43 | 0.53 | 0.59 | 0.4 | 0.64 | 1.2 | |
| UGC | 1.33 | 1.41 | 1.36 | 0.8 | |||||
| CGU | |||||||||
| CGC | 1.7 | 1.27 | |||||||
| CGA | 0.3 | 0.34 | 0.24 | 0.35 | 0.26 | 0.3 | 0.42 | 0.35 | |
| CGG | 1.22 | 0.67 | 1.1 | 1.18 | 1.13 | 0.71 | |||
| AGA | 0.34 | 0.3 | 0 | 0.35 | 0.31 | 0.3 | 0.21 | 0 | |
| AGG | 0.48 | 0.64 | 0.71 | 0.12 | 0.7 | 0.68 | 0.78 | 0.71 | |
| GGU | 0.92 | 0.86 | |||||||
| GGC | |||||||||
| GGA | 0.23 | 0.19 | 0.03 | 0.57 | 0.26 | 0.18 | 0.35 | 0.29 | |
| GGG | 0.74 | 0.62 | 0 | 0.1 | 0.7 | 0.79 | 0.41 | 0.57 |
The preferred codons are indicated in bold
Fig. 2Comparative analysis of relative synonymous codon usage (RSCU) patterns between HEV and its hosts (human, rabbit, mongoose, pig, wild boar, camel, and monkey)
Preferred codons for each amino acid in the YDR of HEV and its hosts
Comparison of codon usage frequency of preferred codons among HEV and its hosts. All the preferred codons are highlighted indicating the highest codon frequency. Thus, codon usage pattern of YDR was a mix of coincidence and antagonism with respect to its host
Fig. 3Heat map showing the relative synonymous codon usage (RSCU) values accompanying different hosts (H: human, R: rabbit, M: mongoose, P: pig, W: wild boar, C: camel, and M: monkey). The host species are mentioned on the horizontal axis and codons are represented on the vertical axis. Heatmap confirms the occurrence of resemblance as well as discrepancies in RSCU pattern among different hosts
Most frequent and least used codons among HEV and its natural hosts
| | ACC (8.8), GAG (7.9), UAC (6.5), GUC (6.4), GGC (6.3), |
| | GCC (7.9), UUU (6.6), GGC (6.3), ACU (6.2), CGU (6.0) |
| | GGC (8.1), CUU (8.0), CGC/UUU (7.0), GCU/GCC (6.8), GUC (6.7) |
| | GAG (7.7), ACU (7.3), UUU/GUC (7.0), GCC/UAC (6.7), GGC/CAG/CUC (6.3) |
| | GCC (8.1), GAG (7.5), CUU (6.9), GGC (6.7), UUU (6.5) |
| | GGC (7.4), GAG (7.3), GCU (6.7), GUC/GCC/UAU (6.4), UUU/ACU (5.8) |
| | UAU (8.8), GUC (7.8), GUU (7.6), GAG (7.4), GCC (7.0) |
| | GUU (11), AUU/ACU/GCU/GAG/CGC/GGC (8), CCU/GCC/UAU (7), UUU/CUU/GUC (6), CUC/UCC/ACC/UAC/CAG (5) |
| | GUA (0.4), UGU/AAA (0.7), GGA (0.8), CGA/GCG (0.9), AGA (1) |
| | GGA (0.7), AGA (0.8), CAA/UUA (0.9), CUA/GUA/CGA (1.0), AGU (1.4) |
| | AGA/GGG (0), GGA (0.1), CUA/CAA (0.2), UUA (0.6), CGA (0.7), UUG (0.8) |
| | GUA (0), CUA/GGG/AGG (0.3), CAA (0.7), AGA/CGA (1.0), UUA/CCA/AGU/UGU/AAU (1.3), UCG/ACA/AAC/GAA/AGC (1.7), |
| | CGA (0.7), UUA/GUA/AGA/GGA (0.9), CAA (1.1), AAU (1.3), CUA (1.4) |
| | UUA (0.2), GUA (0.5), GGA (0.6), CGA/AGC/AGA (0.8), UGU (1.0) |
| | CUA (0), GUA (0.2), AAU/AGA (0.6), CAA (0.8), GUG (1.0), CGA/GGA (1.2) |
| | GUA/GUG/ACG/AAU/AGA (0), UUA/CUA/AUAUCG/CCG/GCG/CAA/CGA/GGA (1), CUG/UCA/CCC/CCA/AAC/GAA/UGC/CGG/AGC/AGG/GGG (2), UUC/UUG/ACA/CAC/AAG/UGU/AGU/GGU (3), AUC/UCU/GCA/CAU/AAA/GAU/GAC/CGU (4), CUC/UCC/ACC/UAC/CAG (5) |
Codon frequency is given in parentheses following the relative synonymous codon usage