| Literature DB >> 28830350 |
Bidhan Ch Bera1, Nitin Virmani2, Naveen Kumar3, Taruna Anand1, S Pavulraj1, Adam Rash4, Debra Elton4, Nicola Rash4, Sandeep Bhatia3, Richa Sood3, Raj Kumar Singh5, Bhupendra Nath Tripathi1.
Abstract
BACKGROUND: Equine influenza is a major health problem of equines worldwide. The polymerase genes of influenza virus have key roles in virus replication, transcription, transmission between hosts and pathogenesis. Hence, the comprehensive genetic and codon usage bias of polymerase genes of equine influenza virus (EIV) were analyzed to elucidate the genetic and evolutionary relationships in a novel perspective.Entities:
Keywords: Codon usage bias; Equine influenza virus; Evolution; H3N8; Polymerase genes
Mesh:
Substances:
Year: 2017 PMID: 28830350 PMCID: PMC5568313 DOI: 10.1186/s12864-017-4063-1
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Amino acid substitutions in the polymerase genes of EIV H3N8 among different lineages/clades
| Genes | Residue no. | Pre-divergent | Eurasian | Florida clade 1 | Florida clade 2 |
|---|---|---|---|---|---|
| PA | 62 | I | V | V | V |
| 64 | E | E | E | D | |
| 86 | M | M | M | I | |
| 158 | K | K | K | R/G/K | |
| 213 | R | R | K | K | |
| 231 | A | V | V | V | |
| 237 | E | E | E | K | |
| 269 | R | K | K | K | |
| 270 | I | I/M | M | M | |
| 321 | S/N | S | S | N | |
| 337 | A | A | T | T | |
| 343 | A | A | E | E | |
| 345 | L | L | I | I | |
| 353 | K | K | R | R | |
| 354 | T/I | T/D | I | I | |
| 388 | S | N | S | S | |
| 409 | S | S | N/S | S | |
| 432 | I | I | V | V | |
| 450 | V | V/F | I/V | I | |
| 476 | A | A | A | T | |
| 532 | L | F | F | F | |
| 626 | K | K | K | R | |
| PA-X | 213 | G | G | S | S |
| PB1 | 114 | V | V | I | I |
| 119 | V | V | V | M | |
| 154 | D | D | G | G | |
| 198 | K | R | R | R | |
| 221 | A | A | T/A | T | |
| 317 | M | M | I | I | |
| 329 | Q | Q | Q | R | |
| 377 | D | D | D | E | |
| 618 | E | E | E | D | |
| 738 | E | E/D | D | D | |
| PB1-F2 | 4 | G | G | G | E |
| 18 | T | T | I | I | |
| 21 | R | R | K | R | |
| 41 | R | R | H | H | |
| 79 | R | R | R | Q | |
| PB2 | 12 | S | S | L | L |
| 105 | T | T | T | A | |
| 251 | R | R/K | R/K | K | |
| 344 | V | M | M | M | |
| 511 | V | I | I | I | |
| 590 | G | S | S | S |
Fig. 1Phylogenetic analyses of polymerase genes of EIVs. The trees were generated by the maximum likelihood model implemented in the software MEGA 5.0. The reliability of the trees was assessed by bootstrap with 1000 replications. The bootstrap values higher than 0.7 are highlighted with solid black dots. The distinct Florida clade 1 and clade 2 sublineages of EIVs have been represented by colored boxes
The Relative synonymous codon usage (RSCU) patterns of polymerase genes of EIVs and their host species
| Amino acid | Codon | PB2 | PB1 | PA | Horse | Donkey | Dog | Amino acid | Codon | PB2 | PB1 | PA | Horse | Donkey | Dog |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 0.98 | 0.63 | 1.22 | 0.83 | 0.88 | 0.82 | Ser | UCA |
| 1.58 | 1.18 | 0.80 | 0.83 | 0.81 |
| UUC | 1.02 | 1.38 | 0.77 | 1.17 | 1.12 | 1.16 | UCG |
|
|
|
|
|
| ||
| Leu | UUA |
| 0.72 | 1.08 |
|
|
| AGU | 0.76 | 0.68 | 1.25 | 0.86 | 1.10 | 0.89 | |
| UUG | 1.45 | 1.29 | 0.92 | 0.72 | 0.82 | 0.68 | AGC | 1.24 | 1.32 | 0.74 | 1.48 | 1.27 | 1.56 | ||
| CUU | 1.16 | 0.65 | 1.35 | 0.73 | 0.80 | 0.67 | Arg | AGA | 1.4 | 1.57 | 1.13 | 1.30 | 1.49 | 1.20 | |
| CUC | 0.88 | 0.79 | 1.05 | 1.32 |
| 1.25 | AGG | 0.6 |
| 0.87 | 1.32 |
| 1.32 | ||
| CUA | 0.95 | 1.11 | 0.66 |
|
|
| CGU | 0.63 |
|
|
|
|
| ||
| CUG | 1 | 1.44 | 0.92 |
|
|
| CGC | 0.99 |
| 1.13 | 1.15 | 0.79 | 1.26 | ||
| Ile | AUU | 1.31 | 0.91 | 1.24 | 0.92 |
| 0.96 | CGA | 0.98 |
|
| 0.61 | 0.74 | 0.67 | |
| AUC |
| 0.85 | 0.83 |
|
|
| CGG | 1.4 |
|
| 1.08 | 0.62 | 1.31 | ||
| AUA | 1.18 | 1.24 | 0.93 |
|
|
| Cys | UGU |
| 1.19 |
| 0.89 | 0.62 | 0.85 | |
| Val | GUU | 0.76 | 0.91 | 0.96 | 0.60 | 0.64 |
| UGC |
| 0.81 | 1.4 | 1.11 | 1.38 | 1.10 | |
| GUC | 0.66 | 1.12 | 0.63 | 1.08 | 1.42 | 1.1 | His | CAU | 1.4 | 1.11 | 0.92 | 0.81 | 0.89 | 0.78 | |
| GUA | 1.18 | 1.08 | 1.04 |
|
|
| CAC | 0.6 | 0.89 | 1.07 | 1.19 | 1.11 | 1.22 | ||
| GUG | 1.4 | 0.89 | 1.34 |
|
|
| Gln | CAA | 1.25 | 1.24 | 1.25 |
| 0.84 |
| |
| Pro | CCU | 1.1 | 1.13 |
| 1.19 | 0.83 | 1.08 | CAG | 0.75 | 0.76 | 0.74 | 1.48 | 1.16 | 1.46 | |
| CCC | 0.96 | 0.75 | 0.85 | 1.38 |
| 1.47 | Asn | AAU | 1.2 | 1.15 | 0.97 | 0.84 | 0.66 | 0.87 | |
| CCA |
|
|
| 0.97 | 1.06 | 1.05 | AAC | 0.8 | 0.85 | 1.02 | 1.16 | 1.34 | 1.12 | ||
| CCG |
|
|
|
|
|
| Lys | AAA | 1.37 | 1.32 | 1.33 | 0.79 | 0.79 | 0.79 | |
| Thr | ACU | 0.83 | 1.07 | 0.77 | 0.94 | 0.82 | 0.89 | AAG | 0.63 | 0.89 | 0.67 | 1.21 | 1.21 | 1.13 | |
| ACC | 0.93 | 0.85 | 1.11 | 1.58 |
| 1.58 | Asp | GAU | 1.15 | 1.27 | 1.25 | 0.83 | 0.84 | 0.86 | |
| ACA |
|
|
| 0.96 | 0.79 | 1.05 | GAC | 0.85 | 0.73 | 0.74 | 1.17 | 1.16 | 1.09 | ||
| ACG |
|
|
|
| 0.61 |
| Glu | GAA | 1.36 | 1.34 | 1.16 | 0.76 | 0.84 | 0.79 | |
| Ala | GCU | 0.94 | 1.03 | 1.26 | 1.05 | 1.20 | 1 | GAG | 0.64 | 0.66 | 0.83 | 1.24 | 1.16 | 1.23 | |
| GCC | 0.93 | 0.94 | 0.81 |
|
|
| Gly | GGU | 0.7 | 0.74 | 0.84 | 0.65 | 0.87 | 0.65 | |
| GCA |
|
| 1.44 | 0.77 | 0.77 | 0.81 | GGC | 0.67 |
|
| 1.43 | 1.42 | 1.45 | ||
| GCG |
|
|
|
|
|
| GGA |
| 1.53 |
| 0.95 | 0.85 | 1.02 | ||
| Tyr | UAU | 0.76 | 0.96 | 1.23 | 0.75 | 0.63 | 0.79 | GGG | 0.64 | 1.23 | 0.96 | 0.97 | 0.86 | 1.05 | |
| UAC | 1.24 | 1.04 | 0.76 | 1.25 | 1.37 | 1.15 | |||||||||
| Ser | UCU |
| 0.68 | 1.38 | 1.09 | 1.10 | 1.09 | ||||||||
| UCC | 0.84 | 1.57 | 1.27 | 1.43 | 1.46 | 1.52 |
Over- (RSCU ≥1.6) and under-represented (RSCU ≤0.6) codons are displayed in bold and italics, respectively
Fig. 3Magnitude of significant determining factors of codon usage bias in polymerase genes of EIVs. ENc-plot: the red dotted line represents the expected curve of positions of strains when the codon usage was only determined by the GC3s composition. Neutrality plot: the red dotted line is the linear regression of GC12 against GC3. PR2 bias plot: AU-bias [A3/(A3 + U3)] at the third codon position of the four-codon amino acids of entire genes were plotted against the GC-bias [G3/(G3 + C3)] and the centre of the plot represents no bias between the influence of the mutation pressure and natural selection
Frequency of tRNA genes in equine cells for most preferentially used codons in polymerase genes of EIVs
| AA | Preferentially used codons | tRNA isotypes in equine cells | Total count | ||
|---|---|---|---|---|---|
| PB1 | PB2 | PA | |||
| Ala | GCA | GCA | GCA | AGC (27), GGC (0), CGC (8), | 45 |
| Gly | GGA | GGA | GGA | ACC (0), GCC (10), CCC (8), | 23 |
| Pro | CCA | CCA | CCA | AGG (10), GGG (0), CGG (3), | 20 |
| Thr | ACA | ACA | ACA | AGT (9), GGT (0), CGT (3), | 19 |
| Val | GTC | GTC | GTG | AAC (12), | 37 |
| Ser | TCA | TCA | TCT |
| 32 |
| Arg | AGA/CGG | CGA/CGG | CGA | ACG (10), GCG (1), | 33 |
| Leu | TTG | CTG | CTT |
| 26 |
| Phe | TTC | TTC | TTT |
| 13 |
| Asn | AAT | AAT | AAC |
| 22 |
| Lys | AAA | AAA | AAA | CTT (18), | 33 |
| Asp | GAT | GAT | GAT |
| 11 |
| Glu | GAA | GAA | GAA | CTC (50), | 61 |
| His | CAT | CAT | CAC |
| 13 |
| Gln | CAA | CAA | CAA | CTG (10), | 16 |
| Ile | ATT | ATA | ATT |
| 24 |
| Tyr | TAC | TAC | TAT |
| 15 |
| Cys | TGC | TGT | TGC |
| 26 |
Codons which are likely to be paired with respective anticodon are displayed in bold
Fig. 4Correspondence analysis (COA) of the synonymous codon usage in EIVs polymerase genes. a COA generated axes contributions. The relative and cumulative inertia of the first 20 factors from a COA of the synonymous codon usage frequencies, (b) COA of the synonymous codon usage towards codons. This analysis was built on the RSCU values of the 59 synonymous codons. The positions of each codon were plotted on the first two-main-dimensional coordinates. Different base ended codons were color labelled. c COA of the synonymous codon usage in coding sequences of polymerase genes of EIV isolates. The positions of each polymerase genes of EIV isolates were plotted on the first two-main-axes
Fig. 5The RCDI analysis of phylogenetic derived clusters of EIVs polymerase genes in relation to their host species, Equus caballus
Fig. 2Comparison of phylogenetic derived clusters of EIVs polymerase genes based on their effective number of codon (ENc) values