| Literature DB >> 35842571 |
Han Wu1, Bingzhe Li1, Ziping Miao2, Linjie Hu1, Lu Zhou1, Yihan Lu3.
Abstract
BACKGROUND: Group A rotavirus (RVA) is a common causative agent of acute gastroenteritis in infants and young children worldwide. RVA P genotypes, determined by VP4 sequences, have been confirmed to infect humans and animals. However, their codon usage patterns that are essential to obtain insights into the viral evolution, host adaptability, and genetic characterization remained unclear, especially across animal hosts.Entities:
Keywords: Animal host; Codon usage bias; Complete coding sequence; Evolution; Group A rotavirus; P genotype; VP4; Zoonosis
Mesh:
Substances:
Year: 2022 PMID: 35842571 PMCID: PMC9288207 DOI: 10.1186/s12864-022-08730-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Nucleotide compositions and properties of VP4 coding sequences for RVA P genotypes
| Categories | Human genotypes | Porcine genotypes | Zoonotic genotypes | |||||
|---|---|---|---|---|---|---|---|---|
| P[4] | P[8] | P[13] | P[23] | P[1] | P[6] | P[7] | P[19] | |
| A | 0.374(0.002) | 0.377(0.005) | 0.361(0.001) | 0.359(0.003) | 0.366(0.006) | 0.354(0.003) | 0.360(0.004) | 0.357(0.002) |
| U | 0.314(0.001) | 0.309(0.002) | 0.279(0.002) | 0.284(0.001) | 0.267(0.001) | 0.299(0.003) | 0.278(0.005) | 0.305(0.002) |
| G | 0.164(0.001) | 0.160(0.004) | 0.182(0.001) | 0.179(0.002) | 0.190(0.000) | 0.178(0.003) | 0.175(0.002) | 0.172(0.003) |
| C | 0.148(0.001) | 0.154(0.002) | 0.178(0.001) | 0.179(0.001) | 0.177(0.005) | 0.170(0.002) | 0.187(0.003) | 0.167(0.002) |
| AU | 0.688(0.002) | 0.686(0.005) | 0.641(0.002) | 0.643(0.003) | 0.633(0.005) | 0.653(0.004) | 0.638(0.002) | 0.662(0.004) |
| GC | 0.312(0.002) | 0.314(0.005) | 0.359(0.002) | 0.357(0.003) | 0.367(0.005) | 0.348(0.004) | 0.362(0.002) | 0.338(0.004) |
| GC1 | 0.396(0.003) | 0.395(0.004) | 0.435(0.005) | 0.433(0.004) | 0.441(0.003) | 0.425(0.005) | 0.431(0.004) | 0.410(0.001) |
| GC2 | 0.384(0.002) | 0.385(0.003) | 0.414(0.002) | 0.419(0.002) | 0.410(0.003) | 0.390(0.006) | 0.424(0.002) | 0.394(0.003) |
| GC3 | 0.157(0.005) | 0.162(0.014) | 0.229(0.003) | 0.220(0.005) | 0.251(0.009) | 0.228(0.007) | 0.231(0.005) | 0.211(0.011) |
| A3 | 0.558(0.004) | 0.575(0.016) | 0.567(0.002) | 0.571(0.008) | 0.574(0.021) | 0.525(0.009) | 0.558(0.018) | 0.515(0.007) |
| U3 | 0.497(0.006) | 0.475(0.006) | 0.387(0.005) | 0.393(0.004) | 0.362(0.008) | 0.436(0.008) | 0.393(0.013) | 0.461(0.008) |
| G3 | 0.137(0.004) | 0.126(0.018) | 0.178(0.005) | 0.182(0.008) | 0.210(0.005) | 0.177(0.008) | 0.166(0.010) | 0.162(0.010) |
| C3 | 0.076(0.007) | 0.091(0.006) | 0.124(0.003) | 0.108(0.003) | 0.121(0.007) | 0.124(0.007) | 0.134(0.006) | 0.117(0.008) |
All values were displayed in mean (std)
Fig. 1Frequency of ending nucleotides across the 18 preferred synonymous codons for eight RVA P genotypes
Number of most commonly used codons with RSCU values > 1.6 for RVA P genotypes
| Group | Genotypes | RSCU value > 1.6 |
|---|---|---|
| Human | P[4] | 15 |
| P[8] | 16 | |
| Porcine | P[13] | 8 |
| P[23] | 9 | |
| Zoonotic | P[1] | 11 |
| P[6] | 11 | |
| P[7] | 8 | |
| P[19] | 12 |
RSCU relative synonymous codon usage
Fig. 2Principal component analysis (PCA) of VP4 coding sequences for eight RVA P genotypes. The eight P genotypes were classified into three groups by hosts: human, porcine and zoonotic groups. X and Y axis represented principal component 1 and principal component 2, respectively. The dots represented VP4 sequences. The ellipses in the figure predicted new observations with a probability of 0.95. New observations from the same group were expected to fall inside the ellipses
ENC values of VP4 complete coding sequences for RVA P genotypes
| Group | Genotypes | Mean ± std | Range |
|---|---|---|---|
| Human | P[4] | 38.18 ± 0.54 | 36.79-39.17 |
| P[8] | 37.86 ± 1.08 | 35.99-39.97 | |
| Porcine | P[13] | 42.93 ± 0.73 | 42.14-43.57 |
| P[23] | 42.23 ± 0.80 | 40.38-43.26 | |
| Zoonotic | P[1] | 41.26 ± 0.40 | 40.98-41.54 |
| P[6] | 42.24 ± 1.01 | 40.24-43.99 | |
| P[7] | 43.02 ± 0.37 | 42.40-43.48 | |
| P[19] | 41.75 ± 0.91 | 40.71-42.33 |
ENC effective number of codons
Fig. 3ENC-plot analysis of VP4 coding sequences for the eight RVA P genotypes. The ENC values were plotted against the GC contents at the third codon position (GC3). The expected curve represented the expected ENC values according to corresponding GC3 contents. The dots represented VP4 sequences
Fig. 4Parity rule 2 (PR2) analysis of VP4 coding sequences for eight RVA P genotypes. A Human genotypes. B Porcine genotypes. C Zoonotic genotypes. A3/(A3 + U3) and G3/(G3 + C3) of fourfold degenerate codon families represented the ordinate and abscissa, respectively. In the centre of the graph, both coordinates had a value of 0.5, i.e. A = U and G = C. The dots represented VP4 sequences
Fig. 5Neutrality plot analysis of VP4 coding sequences for eight RVA P genotypes. The GC12 values were plotted against the GC3 values. The dots represented VP4 sequences. * Represented correlation significant at P < 0.05
Basic information of VP4 coding sequences for RVA P genotypes
| Group | Genotypes | Host | Number of sequences |
|---|---|---|---|
| Human | P[4] | Human | 50 |
| P[8] | 122 | ||
| Porcine | P[13] | Swine | 3 |
| P[23] | 17 | ||
| Zoonotic | P[1] | Human, swine and other animals | 2 |
| P[6] | Human and swine | 30 | |
| P[7] | Human, swine and bovine | 6 | |
| P[19] | Human and swine | 3 |