| Literature DB >> 16269197 |
E Mateu1, I Díaz, L Darwich, J Casal, M Martín, J Pujols.
Abstract
ORF5 sequences of porcine reproductive and respiratory syndrome virus (PRRSV) were analysed to determine genetic diversity, codon usage, positive and negative selection sites and potential changes in the predicted glycoprotein 5 (GP5). A hypothetical GP5 containing all selected sites was constructed to determine its characteristics. These sequences corresponded to isolates obtained 10 years apart (1991-1995, 18 strains) and a second set (n = 46) from 2000 to 2005. Similarity to Lelystad virus (LV) decreased from 95.5% in 1991-1995 to 89.5% in 2000-2005. Three highly variable regions were found in ORF5. Codon usage was different in both sets for leucine, glutamine, serine and proline. Thus, 2000-2005 sequences used codons more similar to those present in highly expressed pig genes compared to the 1991-1995 set. Twenty four sites of positive selection and 20 sites of negative selection were found in GP5, most of them in transmembrane regions. Additional glycosylation in N37 of GP5 was common in 2000-2005 but some sequences lack a glycosylation site in N46. The hypothetical GP5 was only 88.1% similar to LV and was less hydrophobic. Taking together these results suggest that PRRSV is still adapting to pig cells.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16269197 PMCID: PMC7172362 DOI: 10.1016/j.virusres.2005.09.008
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 3.303
Codon usage in the ORF5 gene of Spanish sequences of porcine reproductive and respiratory syndrome strains (1991–1995 and 2000–2005)
AA: aminoacid; N: number of effective codons; RSCU: synonymous codon usage measure. In bold is shown the most frequent codon for each aminoacid. Boxes show the main changes between 1991–1995 and 2000–2005 sequences.
Potential sites for positive selection of mutations in Spanish strains of porcine reproductive and respiratory syndrome virus GP5
| Position in GP5 | Lelystad virus | Spanish strains 2000–2005 ( | Spanish strains 1991–1995 ( | ||||
|---|---|---|---|---|---|---|---|
| Aminoacid | Aminoacid (strains) | NS/S | Probability | Aminoacid (strains) | NS/S | Probability | |
| 10 | Phenilalanine | Serine (14) | 18/3 | 1.3 × 10−20 | No change | 0/0 | N.A. |
| 12 | Threonine | Isoleucine (12) | 20/0 | 1.8 × 10−13 | No change | 0/0 | N.A. |
| 13 | Proline | Glutamine (12) | 21/18 | 1.3 × 10−18 | Leu (1) | 2/8 | 0.03 |
| Leucine (7) | 3.2 × 10−6 | ||||||
| 21 | Phenilalanine | Serine (5) | 6/0 | 7.5 × 10−9 | No change | 0/0 | N.A. |
| 32 | Alanine | Valine (15) | 19/0 | 9.6 × 10−17 | Valine (5) | 5/0 | 3.2 × 10−7 |
| 36 | Glycine | Aspartic acid (12) | 17/2 | 2.1 × 10−17 | Aspartic (2) | 3/0 | 8.0 × 10−3 |
| 37 | Aspartic acid | Asparagine (38) | 0/47 | 1.5 × 10−53 | Asparagine (5) | 9/0 | 1.5 × 10−7 |
| Serine (5) | 9.7 × 10−4 | Serine (4) | 8.9 × 10−6 | ||||
| 46 | Asparagine | Aspartic acid (12) | 15/14 | 1.5 × 10−12 | No change | 0/0 | N.A. |
| 56 | Aspartic acid | Glycine (16) | 42/0 | 6.7 × 10−11 | Glutamic acid (4) | 7/1 | 2.6 × 10−6 |
| Glutamic acid (14) | 4.2 × 10−15 | ||||||
| 60 | Serine | Asparagine (16) | 33/0 | 6.2 × 10−33 | Asparagine (1) | 2/0 | 0.03 |
| 63 | Glycine | Aspartic acid (18) | 45/0 | 1.1 × 10−20 | Aspartic acid (11) | 12/0 | 3.2 × 10−19 |
| Serine (12) | 5.7 × 10−12 | ||||||
| 96 | Glycine | Serine (13) | 15/10 | 7.8 × 10−19 | Serine (2) | 2/1 | 8.0 × 10−4 |
| 100 | Threonine | Isoleucine (11) | 24/0 | 9.1 × 10−11 | Isoleucine (3) | 5/0 | 3.4 × 10−4 |
| 101 | Alanine | Threonine (30) | 46/0 | 1.1 × 10−29 | Threonine (8) | 9/0 | 1.6 × 10−9 |
| 106 | Glycine | Lysine (16) | 26/3 | 4.8 × 10−15 | Lysine (4) | 3/1 | 3.8 × 10−7 |
| 111 | Cysteine | Serine (36) | 47/0 | 1.4 × 10−54 | Serine (15) | 15/0 | 2.1 × 10−27 |
| 116 | Alanine | Valine (17) | 26/1 | 1.5 × 10−18 | Phenilalanine (1) | 1/0 | 0.02 |
| 119 | Phenilalanine | Leucine (9) | 17/10 | 7.0 × 10−12 | Leucine (3) | 3/1 | 1.3 × 10−6 |
| 122 | Phenilalanine | Leucine (30) | 35/0 | 9.3 × 10−42 | Leucine (9) | 9/0 | 9.9 × 10−17 |
| 123 | Valine | Alanine (13) | 19/11 | 8.6 × 10−12 | No change | 0/0 | N.A. |
| 143 | Phenilalanine | Histidine (9) | 12/9 | 4.1 × 10−14 | No change | 0/0 | N.A. |
| 154 | Valine | Isoleucine (39) | 42/0 | 4.9 × 10−66 | Isoleucine (10) | 10/0 | 1.7 × 10−15 |
| 172 | Aspartic acid | Glycine (34) | 41/5 | 2.6 × 10−39 | Glycine (10) | 10/6 | 5.7 × 10−10 |
| 174 | Asparagine | Aspartic acid (35) | 43/0 | 1.7 × 10−52 | Aspartic acid (10) | 11/0 | 1.8 × 10−17 |
Probability: probability, according to a binomial distribution, that in a set of n independent sequences with a mutation, a given number of strains shared the same non-synonymous mutation. For strains detected in the period 2000–2005 only mutations with probabilities lower than 1.0 × 10−9 are shown.
NS/S: non synonymous/synonymous mutations.
Potential sites for negative selection of mutations in Spanish strains of porcine reproductive and respiratory syndrome virus GP5 obtained in 2000–2005
| Position in GP5 | Aminoacid | S/NS | Probability |
|---|---|---|---|
| 8 | Glycine | 20/2 | 2.0 × 10−24 |
| 73 | Proline | 17/1 | 1.3 × 10−21 |
| 79 | Leucine | 33/4 | 1.3 × 10−54 |
| 85 | Threonine | 15/4 | 9.6 × 10−17 |
| 89 | Phenilalanine | 16/3 | 3.3 × 10−26 |
| 91 | Aspartic acid | 31/4 | 3.69 × 10−51 |
| 108 | Tyrosine | 34/8 | 3.6 × 10−53 |
| 109 | Valine | 21/7 | 3.9 × 10−22 |
| 112 | Serine | 22/8 | 3.9 × 10−33 |
| 113 | Valine | 34/8 | 4.6 × 10−37 |
| 135 | Alanine | 36/5 | 8.4 × 10−42 |
| 137 | Arginine | 28/5 | 9.3 × 10−26 |
| 140 | Arginine | 25/7 | 1.9 × 10−21 |
| 153 | Arginine | 35/5 | 2.5 × 10−31 |
| 160 | Proline | 16/3 | 1.2 × 10−18 |
| 163 | Valine | 25/4 | 5.8 × 10−29 |
| 164 | Glutamic acid | 39/3 | 4.9 × 10−66 |
| 169 | Alanine | 21/7 | 3.9 × 10−22 |
| 194 | Threonine | 14/4 | 1.5 × 10−15 |
| 200 | Glutamic acid | 35/8 | 7.4 × 10−55 |
S/NS: synonymous/non-synonymous mutations.
Probability: probability, according to a binomial distribution, that in a set of n independent sequences with a mutation, a given number of strains shared a synonymous mutation.
Significant changes (p < 0.05) in codon usage for negatively selected sites of ORF5 of porcine reproductive and respiratory strains isolated in 1991–1995 or 2000–2005
| Position in GP5 | Aminoacids | Codons 1991–1995 | Codons 2000–2005 |
|---|---|---|---|
| 73 | Proline | CCG (18/18) | CCG (30/46) |
| CCA (15/46) | |||
| CCC (1/46) | |||
| 79 | Leucine | CTC (12/18) | CTC (11/44) |
| CTT (6/18) | CTT (33/44) | ||
| 85 | Threonine | ACA (18/18) | ACA (29/45) |
| ACG (13/45) | |||
| ACC (3/45) | |||
| 89 | Phenilalanine | TTT (18/18) | TTT (29/46) |
| TTC (17/46) | |||
| 91 | Aspartic acid | GAC (11/18) | GAC (13/45) |
| GAT (7/18) | GAT (32/45) | ||
| 109 | Valine | GTA (15/17) | GTA (20/41) |
| GTG (2/17) | GTG (21/41) | ||
| 112 | Serine | AGC (15/18) | AGC (18/40) |
| AGT (3/18) | AGT (22/40) | ||
| 135 | Alanine | GCC (7/17) | GCC (7/43) |
| GCT (10/17) | GCT (34/43) | ||
| GCG (2/43) | |||
| 140 | Arginine | CGT (13/17) | CGC (16/40) |
| CGC (4/17) | CGC (24/40) | ||
| 153 | Arginine | AGA (8/18) | AGA (8/43) |
| AGG (10/18) | AGG (35/43) | ||
| 160 | Proline | CCA (18/18) | CCA (29/45) |
| CCG (11/45) | |||
| CCC (5/45) | |||
| 163 | Valine | GTA (13/15) | GTA (18/43) |
| GTG (2/15) | GTG (16/43) | ||
| GTC (9/43) | |||
| 164 | Glutamic acid | GAA (16/18) | GAA (6/45) |
| GAG (2/18) | GAG (39/45) | ||
| 169 | Alanine | GCC (10/18) | GCC (20/41) |
| GCT (8/18) | GCT (21/41) | ||
Differences in the denominator to 18 (1991–1995 sequences) or to 46 (2000–2005 strains) reflect non-synonymous mutations.
Fig. 1Bootstrapped tree (neighbor-joining method) of predicted aminoacid sequences of GP5 of PRRS virus. In italics (light colour) sequences from 1991 to 1995 period; in bold type, sequences from 2000 to 2005. Spanish strains 1991–1995 = Suarez, CRESA-VP21 and Olot91. Spanish Strains 2000–2005 = AF49XXX and CReSA. All other strains correspond to European non-Spanish isolates from 1991 to 1995. Bootstrap values are shown in italics close to the nodes.
Fig. 2Comparison with other non-Spanish European strains of the hypothetical GP5 containing all predicted sites for positive and negative selection in Spanish PRRSV sequences. The hypothetical GP5 was BLASTed and the 10 closest non-Spanish matches were included in the alignment along with Lelystad virus GP5 (LV) and an European-type strain isolated in USA. Protein sequences are represented by its GenBank accession number followed by the country of origin (AT: Austria, DK: Denmark, IT: Italy, US: United States of America) and the year of isolation. Amino acids are presented with a one-letter code in the row corresponding to LV. (- - -) Discontinuous underlined: predicted signal peptide. (—) Continuous underlined: predicted N-glycosylation sites. () Boxed segments: predicted transmembrane regions in the hypothetical GP5. () Negative selection sites. Positive selection sites are marked with the one-letter aminoacid code in the hypothetical GP5. Selection site: the last row shows whether or not the predicted selection sites were found in other non-Spanish European sequences. Dots (·) indicate negative selection site found in 11/11 non-Spanish European sequences. Aminoacid symbol (X) indicates positive selection site found in at least one non-Spanish European sequence. Encircled aminoacid symbol (®) indicates positive selection site found in ≥8/11 non-Spanish European sequences. (*) Predicted positive or negative selection site not found in non-Spanish European sequences.