| Literature DB >> 21255687 |
Mariela Martínez Gómez1, Luis Fernando Lopez Tort, Eduardo de Mello Volotao, Ricardo Recarey, Gonzalo Moratorio, Héctor Musto, José Paulo G Leite, Juan Cristina.
Abstract
The Rotavirus genus belongs to the family Reoviridae and its genome consist of 11 segments of double-stranded RNA. Group A rotaviruses (RV-A) are the main etiological agent of acute viral gastroenteritis in infants and young children worldwide. Understanding the extent and causes of biases in codon usage is essential to the understanding of viral evolution. However, the factors shaping synonymous codon usage bias and nucleotide composition in human RV-A are currently unknown. In order to gain insight into these matters, we analyzed the codon usage and base composition constraints on the two genes that codify the two outer capsid proteins (VP4 [VP8*] and VP7) of 58 P[4]G2 RV-A strains isolated in Brazil and investigated the possible key evolutionary determinants of codon usage bias. The results of these studies revealed that the frequencies of codon usage in both RV-A proteins studied are significantly different than the ones used by human cells. In order to observe if similar trends of codon usage are found when RV-A complete genomes are considered, we compare these results with results found using a dataset of 10 reference strains for whom the complete codes of the 11 segments are known. Similar results were obtained using capsid proteins or complete genomes. The general correlations found between the position of each sequence on the first axis generated by correspondence analysis and the relative dinucleotide abundances indicate that codon usage in RV-A can also be strongly influenced by underlying biases in dinucleotide frequencies. CpG and GpC containing codons are markedly suppressed. Thus, the results of this study suggest that RV-A genomic biases are the result of the evolution of genome composition in relation to host adaptation and the ability to escape antiviral cell responses.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21255687 PMCID: PMC7172681 DOI: 10.1016/j.meegid.2011.01.006
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Codon usage in P[4]G2 RV-A strains, displayed as RSCU values.
| AA | Cod | HC | VP4 | VP7 | AA | Cod | HC | VP4 | VP7 | AA | Cod | HC | VP4 | VP7 | AA | Cod | HC | VP4 | VP7 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phe | Ser | UCU | 1.14 | 1.07 | 1.00 | Tyr | |||||||||||||
| UUC | 1.08 | 0.20 | 0.14 | UCC | 1.32 | 0.40 | 0.27 | UAC | 1.12 | 0.04 | 0.50 | UGC | 1.08 | 0.03 | 0.76 | ||||
| Leu | TER | UAA | ** | ** | ** | TER | UGA | ** | ** | ** | |||||||||
| UUG | 0.78 | 0.46 | 0.94 | UCG | 0.30 | 0.55 | 0.59 | UAG | ** | ** | ** | Trp | UGG | 1.00 | 1.00 | 1.00 | |||
| CUU | 0.78 | 1.14 | 0.39 | Pro | CCU | 1.16 | 0.31 | 0.36 | His | Arg | CGU | 0.48 | 0.04 | 0.64 | |||||
| CUC | 1.20 | 0.44 | 0.37 | CCC | 1.28 | 0.29 | 0.00 | CAC | 1.16 | 0.35 | 0.00 | ||||||||
| Gln | CGA | 0.66 | 0.02 | 1.78 | |||||||||||||||
| CUG | 2.40 | 0.35 | 0.70 | CCG | 0.44 | 0.01 | 0.72 | CAG | 1.46 | 0.58 | 0.19 | CGG | 1.20 | 0.41 | 0.64 | ||||
| Ile | AUU | 1.08 | 1.79 | 1.45 | Thr | Asn | Ser | AGU | 0.90 | 1.99 | 0.33 | ||||||||
| AUC | 1.41 | 0.31 | 0.25 | ACC | 1.44 | 0.18 | 0.25 | AAC | 1.06 | 0.15 | 0.23 | AGC | 1.44 | 0.03 | 0.26 | ||||
| AUA | 0.51 | 0.90 | 1.30 | ACA | 1.12 | 1.47 | 1.42 | Lys | Arg | ||||||||||
| Met | AUG | 1.00 | 1.00 | 1.00 | ACG | 0.44 | 0.37 | 0.82 | AAG | 1.14 | 0.35 | 0.10 | AGG | 1.26 | 0.84 | 0.35 | |||
| Val | Ala | Asp | GAU | 0.92 | 1.57 | 1.22 | Gly | GGU | 0.64 | 2.03 | 0.98 | ||||||||
| GUC | 0.96 | 0.75 | 0.01 | GCC | 1.60 | 0.02 | 0.13 | GAC | 1.08 | 0.43 | 0.78 | GGC | 1.36 | 0.26 | 0.24 | ||||
| GCA | 0.92 | 1.74 | 1.11 | Glu | |||||||||||||||
| GUG | 1.84 | 0.28 | 0.45 | GCG | 0.44 | 0.50 | 0.28 | GAG | 1.16 | 0.59 | 0.43 | GGG | 1.00 | 0.05 | 0.26 | ||||
RSCU, relative synonymous codon usage; AA, amino acid; Cod, codons; HC, human cells; TER, termination codon. More frequent codons in both VP4 [VP8*] and VP7 with respect to human cells are shown in bold. Codon CGC (Arg) not used in VP4 [VP8*] and VP7 P[4]G2 RV-A isolated in Brazil are shown in italics.
Fig. 1Effective number of codons used in each ORF plotted against the GC3S. The curve plots the relationship between GC3S and ENC in absence of selection. Black square dots show the results obtained for RV-A strains. All of them lie below the expected curve. The results found for VP4 and VP7 are shown in (A) and (B), respectively.
Codon usage in RV-A strains of different genotypes, expressed by RSCU values.
| AA | Cod | HC | OC | IM | IC | NSP | Full | AA | Cod | HC | OC | IM | IC | NSP | Full |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phe | Ser | UCU | 1.14 | 1.30 | 1.02 | 1.13 | 1.66 | 1.30 | |||||||
| UUC | 1.08 | 0.47 | 0.42 | 0.48 | 0.48 | 0.49 | UCC | 1.32 | 0.26 | 0.43 | 0.29 | 0.28 | 0.28 | ||
| Leu | |||||||||||||||
| UUG | 0.78 | 0.99 | 1.25 | 1.14 | 1.32 | 1.17 | UCG | 0.30 | 0.54 | 0.55 | 0.62 | 0.54 | 0.57 | ||
| CUU | 0.78 | 0.47 | 1.37 | 0.60 | 0.92 | 0.70 | Pro | CCU | 1.16 | 0.61 | 0.26 | 0.43 | 0.90 | 0.54 | |
| CUC | 1.20 | 0.29 | 0.32 | 0.18 | 0.39 | 0.25 | CCC | 1.28 | 0.27 | 0.02 | 0.12 | 0.20 | 0.15 | ||
| CUG | 2.40 | 0.42 | 0.48 | 0.46 | 0.45 | 0.48 | CCG | 0.44 | 0.44 | 0.38 | 0.60 | 0.35 | 0.51 | ||
| Ile | AUU | 1.08 | 1.13 | 1.80 | 1.02 | 1.80 | 1.24 | Thr | |||||||
| AUC | 1.41 | 0.22 | 0.30 | 0.23 | 0.27 | 0.24 | ACC | 1.44 | 0.24 | 0.12 | 0.24 | 0.24 | 0.23 | ||
| AUA | 0.51 | 1.65 | 0.90 | 1.75 | 0.93 | 1.51 | ACA | 1.12 | 1.73 | 2.11 | 1.63 | 1.52 | 1.66 | ||
| Met | AUG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | ACG | 0.44 | 0.89 | 0.43 | 0.78 | 0.61 | 0.76 | |
| Val | Ala | ||||||||||||||
| GUC | 0.96 | 0.33 | 0.71 | 0.38 | 0.32 | 0.37 | GCC | 1.60 | 0.26 | 0.32 | 0.33 | 0.17 | 0.28 | ||
| GCA | 0.92 | 1.76 | 1.36 | 1.62 | 2.03 | 1.69 | |||||||||
| GUG | 1.84 | 0.94 | 0.74 | 0.80 | 0.90 | 0.86 | GCG | 0.44 | 0.73 | 0.75 | 0.61 | 0.45 | 0.63 | ||
| Tyr | |||||||||||||||
| UAC | 1.12 | 0.54 | 0.87 | 0.53 | 0.67 | 0.57 | UGC | 1.08 | 0.65 | 0.20 | 0.61 | 0.83 | 0.69 | ||
| TER | UAA | ** | ** | ** | ** | ** | ** | TER | UGA | ** | ** | ** | ** | ** | ** |
| UAG | ** | ** | ** | ** | ** | ** | Trp | UGG | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | |
| His | Arg | CGU | 0.48 | 0.12 | 0.46 | 0.60 | 0.74 | 0.53 | |||||||
| CAC | 1.16 | 0.31 | 0.20 | 0.34 | 0.52 | 0.38 | |||||||||
| Gln | CGA | 0.66 | 0.87 | 0.31 | 0.50 | 0.59 | 0.56 | ||||||||
| CAG | 1.46 | 0.80 | 0.72 | 0.72 | 0.62 | 0.71 | |||||||||
| Asn | Ser | AGU | 0.90 | 0.85 | 0.55 | 0.58 | 0.92 | 0.71 | |||||||
| AAC | 1.06 | 0.47 | 0.62 | 0.48 | 0.44 | 0.49 | AGC | 1.44 | 0.22 | 0.39 | 0.11 | 0.31 | 0.21 | ||
| Lys | Arg | ||||||||||||||
| AAG | 1.14 | 0.48 | 0.34 | 0.51 | 0.52 | 0.57 | AGG | 1.26 | 0.75 | 0.12 | 0.67 | 0.92 | 0.70 | ||
| Asp | GAU | 0.92 | 1.38 | 1.54 | 1.43 | 1.63 | 1.47 | Gly | GGU | 0.64 | 1.44 | 1.03 | 1.31 | 1.45 | 1.34 |
| GAC | 1.08 | 0.62 | 0.46 | 0.57 | 0.37 | 0.53 | GGC | 1.36 | 0.34 | 0.34 | 0.26 | 0.20 | 0.28 | ||
| Glu | |||||||||||||||
| GAG | 1.16 | 0.70 | 0.54 | 0.54 | 0.53 | 0.57 | GGG | 1.00 | 0.32 | 0.23 | 0.41 | 0.31 | 0.37 | ||
RSCU, relative synonymous codon usage; AA, amino acid; Cod, codons; HC, human cells; OC, outer capsid shell proteins; IM, intermediate protein shell; IC, inner capsid shell proteins; NSP, non-structural proteins; Full, full genome; TER, termination codon. More frequent codons with respect to human cells found in all genome regions studied are shown in bold. Frequencies sharply reduced with respect to frequencies found in human cells are shown in italics.
Relative abundance of dinucleotides in VP4 [VP8*] and VP7 proteins from P[4]G2 RV-A Brazilian strains and summary of COA.
| VP4 [VP8] | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| UU | UC | UA | UG | CU | CC | CA | CG | ||
| Mean ± S.D. | 1.490 ± 0.035 | 0.823 ± 0.024 | 1.665 ± 0.021 | 0.846 ± 0.020 | 0.610 ± 0.022 | 0.381 ± 0.012 | 1.157 ± 0.023 | 0.230 ± 0.035 | |
| Axis 1 | −0.261 | 0.384 | −0.427 | 0.038 | −0.010 | 0.560 | 0.127 | 0.453 | |
| 0.040 | <0.01 | <0.001 | 0.764 | 0.928 | <0.0001 | 0.317 | <0.001 | ||
Mean values of 58 P[4]G2 RV-A strains’ relative dinucleotide ratios ± standard deviation.
Correlation analysis between the first axis in COA and the sixteen dinucleotides frequencies in VP4 [VP8*] and VP7 proteins is shown.
Position of codons in each of the four major axes of COA for RV-A VP4 [VP8*] and VP7 proteins.
| Axis 1 | Axis 2 | Axis 3 | Axis 4 | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Codon | Value | Aminoacid | Codon | Value | Aminoacid | Codon | Value | Aminoacid | Codon | Value | Aminoacid | |
| VP4 | GCC | −4.183 | Ala | UGC | −7.683 | Cys | CCG | −6.889 | Pro | GCC | −2.634 | Ala |
| UGC | 0.660 | Cys | CCG | 0.898 | Pro | GCC | 2.071 | Ala | GGG | 1.967 | Gly | |
| VP7 | GUC | 0.610 | Val | GGG | 0.212 | Gly | GUC | −0.960 | Val | GUC | −0.100 | Val |
| AGG | 1.488 | Arg | GUC | 4.447 | Val | UUC | 0.337 | Phe | UUC | 0.279 | Phe | |