| Literature DB >> 21185049 |
John Aaskov1, Anita Jones, Wilson Choi, Kym Lowry, Emerald Stewart.
Abstract
A sequence of thirty-six nucleotides in the nsP3 gene of Ross River virus (RRV), coding for the amino acid sequence HADTVSLDSTVS, was duplicated some time between 1969 and 1979 coinciding with the appearance of a new lineage of this virus and with a major outbreak of Epidemic Polyarthritis among residents of the Pacific Islands. This lineage of RRV continues to circulate throughout Australia and both earlier lineages, which lacked the duplicated element, now are extinct. Multiple copies of several other elements also were observed in this region of the nsP3 gene in all lineages of RRV. Multiple copies of one of these, coding for the amino acid sequence P*P*PR, were detected in the C-terminal region of the nsP3 protein of all alphaviruses except those of African origin. The fixation of duplications and insertions in 3' region of nsP3 genes from all lineages of alphaviruses, suggests they provide some fitness advantage.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21185049 PMCID: PMC7173084 DOI: 10.1016/j.virol.2010.11.025
Source DB: PubMed Journal: Virology ISSN: 0042-6822 Impact factor: 3.616
Amino acid repeat motifs in the nsP3 proteins of Ross River virus and their presence in the nsP3 proteins of other alphaviruses.
| Semliki Forest complex | WEE complex | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Motif | Lineage I | Lineage 2 | Lineage 3 | |||||||||||||||||
| RRV | RRV | RRV | RRV | RRV | RRV | RRV | RRV | GETV | SFV | MAYV | CHIKV | ONNV | BFV | SINV | AURV | WEEV | VEE | EEV | ||
| T48 | NB5092 | F9073 | MCLE | OREG | QML1 | SNP51 | PW14 | A7 | 06–021 | SC650 | BH2193 | SA.AR86 | 10315 | 71V-1658 | OAX131 | PE3.0803 | ||||
| 1959 | 1969 | 1979 | 1983 | 1989 | 2004 | 2009 | 2009 | |||||||||||||
| 332 | H | H | H | H | H | H | H | |||||||||||||
| A | A | A | A | A | A | A | ||||||||||||||
| D | D | D | D | D | D | D | ||||||||||||||
| T | T | T | T | T | T | T | ||||||||||||||
| V | V | V | V | V | V | V | ||||||||||||||
| S | S | S | S | S | S | S | ||||||||||||||
| L | L | L | L | L | L | L | ||||||||||||||
| D | D | D | D | D | D | D | ||||||||||||||
| S | S | S | S | S | S | S | ||||||||||||||
| T | T | T | T | T | T | T | ||||||||||||||
| V | V | V | V | V | V | V | ||||||||||||||
| S | L/S | L/S | L/S | L/S | L/S | L/S | ||||||||||||||
| 383 | P | P | P | P | P | P | P | |||||||||||||
| V | V/I/V/T | V/I/V/I | V | V/I/V | I/V | V/I/V | ||||||||||||||
| P | P | P | P | P | P | P | ||||||||||||||
| P | P/A/A/T | P/A/A/T | P | A/S/K | R/A/K | V/A/K | ||||||||||||||
| P | P | P | P/L | P | P | P | ||||||||||||||
| R | R | R/H/R/R | R | R | R | A/R | ||||||||||||||
| 487 | V | V | V | V | V | V | V | V | V | |||||||||||
| E | E | E | E | E | E | E | E | E | ||||||||||||
| F | F/L | F/L | F/L | F/L | F/L | F/L | F/L | F/L | ||||||||||||
| P | P | P | P | P | P | P | P | P | ||||||||||||
| W | W | W | W | W | W | W | W | W | ||||||||||||
| A | A/E | A/E | A/E | A/E | A/E | A/E | A/E | A/E | ||||||||||||
| P | P | P | P | P | P | P | P | P | ||||||||||||
| E | E | E | E | E | E | E | E | E | ||||||||||||
| D | D | D | D | D | D | D | D | D | ||||||||||||
| L | L/V | L/V | L/I | L/I | L/I | L/I | L/I | L/I | ||||||||||||
| 521 | D | D | D | D | D | D | D | D | D | D/G | ||||||||||
| –/K | –/K | –/K | –/K | –/K | –/K | –/K | –/K | |||||||||||||
| I | I | I | I | I | I | I | I | I | I | |||||||||||
| Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | T | T | T | T | T | ||||||
| F | F | F | F | F | F | F | F | F | F | F | F | F | F | F | ||||||
| G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | ||||||
| D | D | D | D | D | D | D | D | D | D | D | D | D | D | D | ||||||
Amino acid numbering from the N-terminal of RRV T48 nsP3.
Single copy of the motif in italics.
Multiple copies of motifs in bold type. Motif sequence from left to right from N-terminal to C-terminal of the nsP3 protein e.g HADTVSLDSTVL followed by HADTVSLDSTVS.
Spaces indicate the motif was not observed in that virus.
Fig. 1Duplicated elements in the nsP3 protein/gene of Ross River virus strain F9073. (A) Duplicated amino acid sequences are represented in the same colour. Underlined sequences appear to be repeats within a repeat and are found nowhere else in nsP3. (B) Possible sites at which a 36 nucleotide element of the RRV T48 genome could have been inserted in the parental genome to give rise to the duplicated amino acid sequence. Amino acids coded by nucleotides of interest are shown above and below the nucleotide sequences. Codons which differ between RRV T48 (no repeat) and F9073 (duplicated element) are shown in pink. Similar nucleotide sequences flanking a putative insertion site are highlighted. Nucleotide numbering is from the 5′ end of the nsP3 gene.
Fig. 2Predicted secondary structure of the region of the RRV nsP3 gene in which a 36 nucleotide element was duplicated. (A) RRV T48 (B) RRV F9073 with the element duplicated. Nucleotide numbering refers to the position in the nsP3 gene of the respective viruses.
Fig. 3Variation in the amino acid sequences of nsP3 proteins of different lineages within families of alphaviruses. Repeated elements are shown in bold type and what appear to be inserts of foreign sequence are shaded in grey.
Alphaviruses analysed in this study.
| Virus | Strain(lineage) | Year of isolation | Source | Location | Accession number | Amino acids in nsP3 |
|---|---|---|---|---|---|---|
| AURAV | BeAR 10315 | 1959 | Brazil | 544 | ||
| BFV | BH2193 | 1974 | Australia | 470 | ||
| CHIKV | 06–021 | 2006 | Human | Reunion | 530 | |
| ALSA-1 | 1986 | India | 495 | |||
| EEEV | NJ-60 (I) | 1959 | USA | 559 | ||
| PE24.0111 (II) | 2000 | Mosq. | Peru | 539 | ||
| PE17.0547 (III) | 1998 | Mosq. | Peru | 536 | ||
| PE3.0803 (IIIA) | 1996 | Mosq. | Peru | 535 | ||
| BeAR436087 (IV) | 1985 | Brazil | 545 | |||
| GETV | Porcine | Korea | 524 | |||
| GETV | Sagiyama M6/Mag32 | 1956 | Japan | 524 | ||
| MAYV | 492 | |||||
| ONNV | SG650 | 1996 | Human | Uganda | 569 | |
| RRV | T48 | 1959 | Australia | 538 | ||
| NB5092 | 1969 | Australia | 538 | |||
| F9073 | 1979 | Human | Fiji | 550 | ||
| MCLE | 1983 | Human | Australia | 550 | ||
| OREG | 1989 | Human | Australia | 550 | ||
| QML 1 | 2004 | Human | Australia | 550 | ||
| SNP 51 | 2009 | Human | Australia | 550 | ||
| PW 14 | 2009 | Human | Australia | 550 | ||
| SFV | A7 | 475 | ||||
| SK | 1970 | Finland | 482 | |||
| SINV | S.A.AR86 | South Africa | 543 | |||
| SW6562 | Mosq. | Australia | 523 | |||
| Ockelbo Edsbyn | 558 | |||||
| VEEV | 71–180 (1AB) | 1971 | Equine | USA | 557 | |
| PMCHo5 (1C) | 557 | |||||
| 8131 (1D) | Human | Peru | 557 | |||
| OAX131 (1E) | 562 | |||||
| WEEV | 71V-1658 | 1971 | USA | Equine | 532 | |
| AG80–646 | 1980 | Argentina | 529 |
Oligonucleotide primers used to amplify and sequence the nsP3 gene of Ross River virus.
| P3537 | CAGGGCGAGAGGGTAGAATGG | 3534–3554 |
| cP4200 | CATTTTCTCGCCACCGCTCTG | 4175–4195 |
| cP4486 | GCGTCCGTGGTGTCCATTGC | 4460–4479 |
| TCACTTGAGTCTGATTTGATACGGG | 4581–4605 | |
| P4774 | GCATTGGGTGAGAGTATGGACAG | 4773–4795 |
| ATTTGCTTCTGATACTGTCCATACTCTC | 4782–4809 | |
| P4854 | GTTCCGTGTCTGTGTAGGTATGC | 4851–4873 |
| P4932 | GTGTGCTCTTCATTCCCTTTACC | 4931–4953 |
| P5307 | GCTGTTGTAGCGGAGAGAGTGG | 5304–5325 |
| cP6097 | CCTCTGTCGGGTAATTGGCTTC | 6075–6096 |
P — sense primer.
cP — complimentary primer.
Numbering refers to that for nucleotides in RRV T48.