| Literature DB >> 10500282 |
J D Meissner1, C Y Huang, M Pfeffer, R M Kinney.
Abstract
The 5' nontranslated region (5'NTR) and nonstructural region nucleotide sequences of nine enzootic Venezuelan equine encephalitis (VEE) virus strains were determined, thus completing the genomic RNA sequences of all prototype strains. The full-length genomes, representing VEE virus antigenic subtypes I-VI, range in size from 11.3 to 11.5 kilobases, with 48-53% overall G+C contents. Size disparities result from subtype-related differences in the number and length of direct repeats in the C-terminal nonstructural protein 3 (nsP3) domain coding sequence and the 3'NTR, while G+C content disparities are attributable to strain-specific variations in base composition at the wobble position of the polyprotein codons. Highly-conserved protein components and one nonconserved protein domain constitute the VEE virus replicase polyproteins. Approximately 80% of deduced nsP1 and nsP4 amino acid residues are invariant, compared to less than 20% of C-terminal nsP3 domain residues. In two enzootic strains, C-terminal nsP3 domain sequences degenerate into little more than repetitive serine-rich blocks. Nonstructural region sequence information drawn from a cross-section of VEE virus subtypes clarifies features of alphavirus conserved sequence elements and proteinase recognition signals. As well, whole-genome comparative analysis supports the reclassification of VEE subtype-variety IF and subtype II viruses.Entities:
Mesh:
Substances:
Year: 1999 PMID: 10500282 PMCID: PMC7126981 DOI: 10.1016/s0168-1702(99)00078-7
Source DB: PubMed Journal: Virus Res ISSN: 0168-1702 Impact factor: 3.303
Reference information for Venezuelan equine encephalitis (VEE) virus strains
| Virus | Abbreviation | Subtype | Origin | GenBank Accession Number | |
|---|---|---|---|---|---|
| Mena II | MENA | IE | Panama, 1962 | AF075252 | |
| 78V-3531 | 78V-3531 | IF | Brazil, 1976 | AF075257 | |
| Everglades Fe3-7c | EVE | II | Florida, USA, 1963 | AF075251 | |
| Mucambo BeAn 8 | MUC | IIIA | Brazil, 1954 | AF075253 | |
| Tonate CaAn 410d | TON | IIIB | French Guiana, 1973 | AF075254 | |
| 71D-1252 | 71D-1252 | IIIC | Peru, 1971 | AF075255 | |
| PixunaBeAr35645 | PIX | IV | Brazil, 1961 | AF075256 | |
| Cabassou CaAr 508 | CAB | V | French Guiana, 1968 | AF075259 | |
| AG80-663 | AG80-663 | VI | Argentina, 1980 | AF075258 |
Strain name abbreviation (Karabatsos, 1985).
Subtype-variety classification scheme for the VEE antigenic complex (Young and Johnson, 1969, France et al., 1979, Calisher et al., 1982, Kinney et al., 1983, Calisher et al., 1985, Roehrig and Mathews, 1985, Roehrig et al., 1991).
Previously sequenced VEE viruses in bold, isolation references included in sequencing reports referenced.
Vaccine strain derived from TRD virus by passage in tissue culture (Berge et al., 1961).
Two subtype IE strains were isolated from the same patient in consecutive years, the latter in 1962 (M.A. Grayson, personal communication).
Fig. 1(A) Schematic VEE virus 5′NTR and nonstructural region subgenomic map (not drawn to scale), with ranges of 5′NTR and nsP coding sequence lengths above and deduced amino acid sequence lengths below individual nsP designations. The C-terminal nsP3 domain is shaded. Asterisks indicate stop codons. (B) Total genome lengths and nonstructural region landmarks of prototype VEE viruses and representative alphaviruses. Nucleotide positions for the ends of the nsP3 and nonstructural polyprotein coding sequences include respective stop codons. Polyprotein codon position and total genomic%G+C content are also listed (right). The C-terminal nsP3 domain coding sequence is excluded from nonstructural region%G+C content calculations. For strain abbreviations, see Table 1.
Fig. 2VEE virus conserved sequence elements. (A) 5′NTR sequence alignment, with both conserved nucleotides (bold) and consensus sequence (CONS) derived from nucleotides present in at least 5 of 6 VEE virus subtypes. A dash indicates a gap used to improve alignment. 5′NTR sequences of 71–180 and 3880 viruses are identical to TRD virus (Kinney et al., 1992a, Kinney et al., 1992b). Previously reported sequences are underlined. See Table 1 for strain abbreviations. (B) Proposed secondary structures for the VEE virus 51-nt CSE (left) and additional VEE virus nsP1 coding region CSEs (right). The 51-nt CSE nucleotide positions and putative stem sequence are based on TRD virus. Remaining non-stem 51-nt CSE sequence is a composite of all VEE virus strains (Y=A/U, V=A/G/C, H=A/C/U, N=A/C/G/U). The C-G pairs replaced by U-A pairs in certain strains are boxed. The shaded boxes indicate additional pairs which could hydrogen bond in subtype IE MENA virus. For the putative hairpin beginning at TRD nt-67, the sequence is that of 71D-1252 virus, although a similar structure with slightly lower stability can be modeled for other VEE virus strains. TRD virus sequence is used for the other two hairpins. To facilitate comparison with earlier studies (Ou et al., 1983, Niesters and Strauss, 1990a), an older method of calculating the free energy of duplex formation is used (Tinoco et al., 1973), relying on a strict interpretation of Table 1 in this reference without modifications proposed in the text. ΔG=free energy at 25°C.
Individual nonstructural protein (nsP) coding sequence identities, adding increasingly more divergent VEE virus (and alphavirus) strain sequences
| nsP1 | N-term. nsP2 | C-term. nsP2 | N-term. nsP3 | nsP4 | |
|---|---|---|---|---|---|
| %nt(%aa) | %nt(%aa) | %nt(%aa) | %nt(%aa) | %nt(%aa) | |
| TRD, TC-83, 71–180, P676, 3880, EVE | 89 (98) | 84 (96) | 84 (96) | 85 (98) | 86 (98) |
| +MENA, 68U201 | 77 (93) | 70 (93) | 68 (88) | 70 (92) | 70 (93) |
| +MUC, TON, 71D-1252 | 67 (87) | 61 (86) | 56 (78) | 59 (82) | 60 (87) |
| +CAB, PIX, 78V-3531, AG80-663 | 60 (79) | 55 (79) | 48 (66) | 55 (75) | 55 (81) |
| +EEE | 49 (60) | 52 (72) | 38 (48) | 44 (56) | 50 (72) |
| +SIN | 41 (48) | 44 (56) | 27 (30) | 35 (40) | 45 (63) |
NA EEE virus strain 82V-2136 sequence, SIN strain HRsp sequence.
Essential alphavirus nonstructural protein (or cognate ‘Sindbis-like supergroup’ protein) features — VEE virus nonstructural protein sequence correlates
| Nonstructural protein or protein domain | (putative) Function(s) | Conserved residues/motifs/domains | Comments |
|---|---|---|---|
| nsP1 | membrane-associated methytransferase, guanylytransferase | VEE virus C-terminal residues vary — C-terminal truncations well-tolerated in SIN and SF virus constructs | |
| N-terminal nsP2 domain | RNA helicase, ATPase, GTPase | 6–7 conserved segments, including | ‘Invariant’ nsP2 V-389 is isoleucine in strain 71D-1252 |
| C-terminal nsP2 domain | Papain-like proteinase | Catalytic dyad residues | Other plus strand RNA viral papain-like proteinases have flexible catalytic dyad spacing |
| N-terminal nsP3 domain | Phosphoprotein necessary for minus-strand, 26S mRNA synthesis | Only aa similarity is with rubella virus and coronavirus proteins of undetermined functions | |
| nsP4 | RNA-dependent RNA polymerase | N-terminal |
Invariant VEE virus residues in bold, numbering based on TRD virus nonstructural protein sequences.
References: nsP1 (Mi et al., 1989, Mi and Stollar, 1991, Rozanov et al., 1992, Laakkonen et al., 1994, Peranen et al., 1995, Wang et al., 1996, Ahola et al., 1997, Pfeffer et al., 1997); nsP2 (Gorbalenya et al., 1988, Gorbalenya et al., 1989, Gorbalenya et al., 1991, Hodgman, 1988, Ding and Schlesinger, 1989, Hardy and Strauss, 1989, Strauss et al., 1992, Rikkonen et al., 1994); nsP3 (Gorbalenya et al., 1991, LaStarza et al., 1994a, LaStarza et al., 1994b); nsP4 (Kamer and Argos, 1984, Koonin, 1991, Shirako and Strauss, 1998).
Fig. 3Alignment of VEE virus C-terminal nsP3 domain sequences (single letter code) from the residue corresponding to TRD virus nsP3 V-331 to the putative nsP3/nsP4 proteinase cleavage site. A dash indicates a gap introduced to improve alignment. Asterisk indicates nsP3 opal stop codon. Strains with >80% nsP3 amino acid sequence identity to strain(s) selected for alignment are omitted. Consensus sequence (CONS) includes residues conserved in at least 7 of the 9 strains selected for alignment, with boxed residues conserved by all sequenced VEE virus strains. Domains 1–4 are as designated by Oberste et al. (1996). The direct repeats present in the various strains are indicated on the right, with lines drawn to the downstream copy of the direct repeat in the alignment (underlined) the upstream copy of the direct repeat is omitted from the alignment. Degenerate, repetitive portions of 78V-3531 virus and AG80-663 virus C-terminal nsP3 domain sequences excluded from the alignment (flanked by tildes) are given on the lower right. For strain abbreviations, see Table 1. IABCDII=Present in all VEE virus subtype IAB, IC, ID, and II strains. IE=present in all subtype IE strains.
Fig. 4Maximum-parsimony phylogenetic tree for VEE virus prototype strains derived from branch-and-bound search in PAUP V3.1.1 (Swofford, 1993) using codon first and second position nucleotides from combined nonstructural (excluding the C-terminal nsP3 domain coding sequence) and structural regions, rooted with EEE virus strain 82V-2136 (Weaver et al., 1993) as the outgroup (7056 total characters, 1255 parsimony-informative characters). Percentages to the left of internal nodes indicate bootstrap support for 1000 pseudoreplicates (Felsenstein, 1985), using 10 random-addition heuristic searches per pseudoreplicate.
| VE-72 | 5′-ATGGAGAARGTTCACGTTGAYATCGAGG |
| cVE-2511 | 5′-TCRTGRTTRAARTGNACYTTNAGRCACATCAT |
| VE-2008 | 5′-CAYGGHGGRGCNYTGAAYACNGAYGARGARTACTA or |
| VE-2518 | 5′-GGDGAYCCKAARCARTGYGGYTYYTYYAAYATGATGTG |
| cVE-4506 | 5′-TCHTCVGAGATRCATATCTCYTCBRYNGCYTCYCT |
| VE-4477 | 5′-GCRGATGTRGCYATMTAYTGYMGRGAYAARAARTGGGA |
| cVE-7155 | 5′-ACYACBGCRTCDATRATYTTNACYTCCATRTTYARCCA |