| Literature DB >> 31142666 |
Jiaxin Ling1, Teemu Smura2, Jan O Lundström3,4, John H-O Pettersson3,5,6,7, Tarja Sironen2,8, Olli Vapalahti2,8, Åke Lundkvist3, Jenny C Hesson3.
Abstract
Bird-hosted viruses have the potential to be transported over large areas of the world and to be transmitted in distant geographical regions. Sindbis virus (SINV) is a mosquito-borne alphavirus that is locally amplified in a bird-mosquito enzootic cycle and distributed all over the Old World and Australia/Oceania. Sindbis virus genotype I (SINV-I) is the cause of disease outbreaks in humans in South Africa as well as in northern Europe. To trace the evolutionary history and potential strain-disease association of SINV-I, we sequenced 36 complete genomes isolated from field material in Europe, as well as in Africa and the Middle East, collected over 58 years. These were analyzed together with 30 additional published whole SINV-I genomes using Bayesian analysis. Our results suggested that SINV-I was introduced only once to northern Europe from central Africa, in the 1920s. After its first introduction to Sweden, it spread east and southward on two separate occasions in the 1960s and 1970s. Another introduction from central Africa to southern/central Europe seems to have occurred, and where these two introductions meet, one recombination event was detected in central Europe. In addition, another recombinant strain was found in central Africa, where the most divergent SINV-I strains also originated.IMPORTANCE This study shows that only a single introduction of SINV into a new geographical area is required for spread and establishment, provided that the requisite vector(s) and reservoir(s) of epizootological and epidemiological importance are present. Furthermore, we present the first report of recombination between two strains of SINV in nature. Our study increases the knowledge on new introductions and dispersal of arboviruses in general and of SINV in particular.Entities:
Keywords: Sindbis virus; evolution; phylogeny
Mesh:
Substances:
Year: 2019 PMID: 31142666 PMCID: PMC6675900 DOI: 10.1128/JVI.00620-19
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
FIG 1Spread of SINV-I from Africa to Europe. Shown is a Bayesian phylogenetic tree of SINV-I based on concatenated nucleotide ORF sequences (MrBayes). The colors indicate Sweden (green); Finland (blue); Germany (red); Egypt, Israel, and Saudi Arabia (pink); Kenya and the Central African Republic (orange); Slovak Republic (light blue); Italy (gray); and South Africa (purple). The roots in orange indicate that SINV-I originated in central Africa. Main SINV-I dispersal routes from Africa to and within northern/central Europe are indicated with dashed lines. L, N, E, and S represent locations in Sweden (Lövånger, Nedre Dalälven, Edsbyn, and Sundsvall, respectively). Posterior probability values above 0.95 are shown for the branches. Dots indicate the sequences generated from this study.
FIG 2Bayesian phylogenetic tree of SINV-I based on the complete open reading frame (ORF) amino acid sequences resulting from MrBayes. Dots indicate the sequences generated from this study. Posterior probability values above 0.8 are shown for the branches.
Identities of nucleotide and amino acid sequences of SINV-I strains within different clades
| Identity category | % identity | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| CDS | nsP1 | nsP2 | nsP3 | nsP4 | CP | E3 | E2 | 6K | E1 | |
| Total (nt) | 87.9–100 | 91.8–100 | 85.8–100 | 84.4–100 | 88–100 | 88.7–100 | 86.6–100 | 87.6–99.9 | 88.1–100 | 89.1–100 |
| Total (aa) | 96.3–100 | 97.1–100 | 96.9–100 | 91.1–100 | 97.5–100 | 92.2–100 | 94.5–100 | 95.3–100 | 94.2–100 | 96.2–100 |
| Within clade A (nt) | 99.1–100 | 99.2–100 | 98.9–100 | 99–100 | 98.9–100 | 96.8–100 | 98.8–100 | 99–99.9 | 97.5–100 | 99–100 |
| Within clade A (aa) | 99.1–100 | 98.9–100 | 99.3–100 | 89.3–100 | 99.3–100 | 94.4–100 | 98.2–100 | 99–100 | 96.2–98.1 | 99.3–100 |
| Within clade B (nt) | 99.1–99.9 | 99.1–99.8 | 99.1–99.9 | 99.3–99.9 | 99.1–100 | 98.8–100 | 99.4–100 | 99–99.8 | 98.8–100 | 99.1–99.9 |
| Within clade B (aa) | 99.7–100 | 99.3–99.8 | 99.8–100 | 99.6–100 | 99.7–100 | 99.2–100 | 100 | 99.8–100 | 100 | 99.1–100 |
| Within clade C (nt) | 98.3–99.8 | 98.6–99.9 | 98.1–99.6 | 97.6–99.6 | 98.7–99.7 | 98.7–100 | 97.6–100 | 98.3–99.8 | 95.4–100 | 98.4–100 |
| Within clade C (aa) | 98.9–99.6 | 98.5–100 | 98.5–99.6 | 97.6–99.5 | 98.9–100 | 99.2–100 | 96.4–100 | 98.8–100 | 96.2–100 | 98.6–100 |
| Within clade D (nt) | 87.9–100 | 96.1–100 | 94.3–100 | 95.7–100 | 96.1–100 | 95.7–100 | 97–100 | 94.9–100 | 93.2–100 | 97–100 |
| Within clade D (aa) | 96.3–100 | 97.9–100 | 97.7–100 | 96.6–100 | 98–100 | 97.7–100 | 100 | 97.3–100 | 98.1–100 | 98.6–100 |
| Within clade E (nt) | 99.2–99.6 | 99.4–99.7 | 98.9–99.8 | 98.5–99.3 | 99.3–99.4 | 99.1–99.7 | 98.8–99.4 | 99.3–99.8 | 93.1–98.8 | 99.3–100 |
| Within clade E (aa) | 99.6–99.9 | 99.8–100 | 99.8–100 | 99–99.6 | 99.8–100 | 99.2–100 | 100 | 99.8–100 | 94.2–98.1 | 94.8–100 |
nt, nucleotides; aa, amino acids.
FIG 3Dated phylogenetic tree of clades A and B. Posterior probability values above 0.95 are shown in the nodes. The values on the branches indicate years before 2016 and 95% highest posterior density (HPD) values.
FIG 4Bayesian phylogenetic tree of SINV-I based on partial E2 sequences. Posterior probability values above 0.8 are shown for the branches.
FIG 5Comparison of amino acid variations across the SINV-I genome among the strains in clades A, B, and C. The colors of the strains indicate Sweden (green), Finland (blue), Germany (red), Kenya and the Central African Republic (orange), and South Africa (purple).
FIG 6Bayesian phylogenetic trees based on the complete ORFs and the separated genes (nsP1 to -4, C, E3, E2, 6K/TF, and E1) of fully sequenced SINV-I genomes. Sequences that have undergone recombination are highlighted in red. CDS, coding sequence.
Recombination analysis for strains H7_Culex_pipiens_Germany_2013 and NVS_305_Culex_vansomereni_Kenya_2007
| Recombination strain | Event | Minor parental sequence (breakpoint positions [bp]) | Major parental sequence | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| RDP | GENECONV | Bootscan | Maxchi | Chimera | SiSscan | Phylpro | LARD | 3Seq | ||||
| H7_Culex_pipiens_Germany_2013 | 1 | 15Z03121_Culex_torrentium_Germany_2015 (6897–10655) | GREIS_Hyalomma_marginatum_Italy_1975 | 1.07E−78 | 2.76E−70 | 2.46E−76 | 1.35E−33 | 8.76E−34 | 4.41E−39 | NS | NS | 3.43E−12 |
| E594_Aedes__rossicus_Sweden_2002 | SK2_SM7V1_Cricetus_cricetus_SlovakRepublic_1972 | |||||||||||
| 84M140_Culi_morsitans_Sweden_1984 | J02363_Egypt_1952 | |||||||||||
| 83M108_Culex_pipiens_torrentium_Sweden_1983 | AZ_16_Nycticorax_nycticorax_Russia_1977 | |||||||||||
| 2 | 15Z03121_Culex_torrentium_Germany_2015 (1512–1930) | AZ_16_Nycticorax_nycticorax_Russia_1977 | 1.35E−23 | 2.90E−22 | 6.01E−20 | 1.86E−05 | 1.83E−05 | 4.61E−05 | NS | NS | 3.43E−12 | |
| E594_Aedes_rossicus_Sweden_2002 | SK2_SM7V1_Cricetus_cricetus_SlovakRepublic_a_1972 | |||||||||||
| 84M140_Culi_morsitans_Sweden_1984 | GREIS_Hyalomma_marginatum_Italy_1975 | |||||||||||
| NVS_305_Culex_vansomereni_Kenya_2007 | 1 | Kiihtelysvaara2002_Human_Finland_2002 (5258–7890) | MP684_Mansonia_fuscopennata_Uganda_1960 | 7.77E−06 | 2.56E−06 | 0.001478066 | 1.63E−07 | 0.034855155 | 1.21E−10 | NS | NS | 8.71E−09 |
| E594_Aedes__rossicus_Sweden_2002 | SAAR_18141_Cule_univittatus_South_Afria_1976 | |||||||||||
| 84M140_Culi_morsitans_Sweden_1984 | Culex_univittatus_South_Africa_1974 | |||||||||||
| 2 | JQ771793_Ilomantsi2005M_mosquito_Finland_2005 (8274–596) | SAAR_18141_Cule_univittatus_South_Afria_1976 | 4.45E−06 | 5.18E−05 | 0.000194763 | 2.12E−06 | 2.27E−06 | 3.71E−06 | NS | NS | 0.010431756 | |
| E594_Aedes_rossicus_Sweden_2002 | Culex_univittatus_South_Africa_1974 | |||||||||||
| 84M140_Culi_morsitans_Sweden_1984 | MP684_Mansonia_fuscopennata_Uganda_1960 | |||||||||||
The minor parent is the parent contributing the smaller fraction of sequence. The major parent is the parent contributing the larger fraction of sequence. Breakpoint positions (in base pairs) are relative to E594_Aedes_rossicus_Sweden_2002. NS indicates that no significant P value was recorded for this recombination event using this method.
FIG 7Recombination scheme for strains H7_Culex_pipiens_Germany_2013 and NVS_305_Culex_vansomereni_Kenya_2007. The recombination events were checked by using RDP in RDP3, and the window size (the number of polymorphic sites per window) was set to 30.
Sindbis virus genotype I strains included in this study
| Isolate | Yr of isolation | Location(s) (region) | Isolation source | E2 GenBank accession no. | GenBank accession no. | Reference(s) |
|---|---|---|---|---|---|---|
| E597 | 2002 | Sweden (N) | This study | |||
| E594 | 2002 | Sweden (N) | This study | |||
| 84M140 | 1984 | Sweden (E) | This study | |||
| 83M108 | 1983 | Sweden (E) | This study | |||
| 83M107 | 1983 | Sweden (E) | This study | |||
| Vmork1/92 | 1983 | Norway | This study, | |||
| 86828 | 1985 | Sweden (E) | This study | |||
| 95M116 | 1995 | Sweden (N) | This study, | |||
| 86752 | 1985 | Sweden (S) | This study | |||
| 85M78 | 1985 | Sweden (E) | This study, | |||
| 85M94 | 1985 | Sweden (E) | This study, | |||
| E917 | 2002 | Sweden (N) | This study | |||
| 85M68 | 1985 | Sweden (E) | This study, | |||
| 86520P3 | 1985 | Sweden (S) | This study | |||
| E945 | 2002 | Sweden (N) | This study | |||
| 09_M_991 | 2009 | Sweden (N) | This study, | |||
| F231 | 2003 | Sweden (N) | This study | |||
| 09_M_519 | 2009 | Sweden (N) | This study, | |||
| 09_M_564 | 2009 | Sweden (N) | This study, | |||
| 09_M_526 | 2009 | Sweden (N) | This study, | |||
| 85M134 | 1985 | Sweden (E) | This study | |||
| 09_M_358 | 2009 | Sweden (N) | This study, | |||
| SAAR_18141 | 1976 | South Africa | This study, | |||
| AR18132 | 1974 | South Africa | This study, | |||
| MP684 | 1960 | Uganda | This study, | |||
| 4251SM6 | 1969 | Cameroon | Mosquito | This study; R. M. Kinney and M. Pfeffer, unpublished data | ||
| SAAR_6071 | 1964 | South Africa | This study, | |||
| SK2_SM7V1 | 1972 | Slovakia | This study, | |||
| AZ_16 | 1977 | Russia, Azerbaijan | This study, | |||
| GREIS | 1975 | Italy | This study, | |||
| M1855 | 1967 | Israel | This study, | |||
| R2_SM7V1 | 1975 | Slovakia | This study, | |||
| 1038 | 1964 | Israel | This study, | |||
| SA80_480 | 1980 | Saudi Arabia | This study, | |||
| SA80_394 | 1980 | Saudi Arabia | This study, | |||
| SA80_370 | 1980 | Saudi Arabia | This study, | |||
| BNI_10865 | 2016 | Germany | ||||
| 15Z03121 | 2015 | Germany | ||||
| LEIV_9298 | 1983 | Russia | Mosquito | |||
| Kiihtelysvaara2002 | 2002 | Finland | ||||
| Johannes2002 | 2002 | Finland | ||||
| Edsbyn | 1982 | Sweden | Mosquito | |||
| Ilomantsi2002B | 2002 | Finland | ||||
| Ilomantsi2002A | 2002 | Finland | ||||
| 28_9 | 2009 | Germany | ||||
| 5_3 | 2009 | Germany | ||||
| Ilomantsi2002C | 2002 | Finland | ||||
| Lovanger | 2013 | Sweden (L) | ||||
| Z158523 | 2013 | Germany | ||||
| Berlin2010A | 2010 | Germany | ||||
| Ilomantsi2005M | 2005 | Finland | Mosquito | |||
| NVS_305 | 2007 | Kenya | ||||
| ArB7761 | 1977 | Central African Republic | ||||
| HB741570 | 1975 | Central African Republic | V. Tricou, E. Nakoune, B. Selekon, M. Kazanji, and N. Berthet, unpublished data | |||
| Babanki | 1969 | Cameroon | Mosquito | |||
| KSM_1008 | 2007 | Kenya | ||||
| BUD_12001 | 2010 | Kenya | ||||
| Girdwood_2017 | 1962 | South Africa | ||||
| Girdwood_SA | 1962 | South Africa | ||||
| YN87448 | 1998 | China | Unknown | G. L. Zhou, G. D. Liang, L. Li, S. H. Fu, Q. Jin, H. L. Zhang, W. L. Huang, and Y. D. Hou, unpublished data | ||
| Prototype AR399 | 1952 | Egypt | ||||
| S_A_AR86 | 1954 | South Africa | ||||
| H7 | 2013 | Germany | ||||
| BONI_584 | 2013 | Kenya | ||||
| BONI_566 | 2013 | Kenya | ||||
| ArB489 | 1985 | Central African Republic | Arthropod |
Letters in parentheses indicate regions within Sweden shown in Fig. 1 (N, Nedre Dalälven; E, Edsbyn; S, Sundsvall; L, Lövånger).
Letters in parentheses indicate the isolation source M, mammal; B, bird; T, tick; A, amphibian.
This strain was isolated from a human in 1962 and sequenced twice.
Primers used in this study
| Oligonucleotide | Sequence | Positions | Length of the product (bp) |
|---|---|---|---|
| SINV F1 | GAATCRAACAGCCGACCAAT | 25–44 | 641 |
| SINV R1 | GTCGGCCCAGTTAGTGTTGT | 646–665 | |
| SINV F2 | ACTGGATTGGCTTTGACACC | 583–602 | 485 |
| SINV R2 | CGGGATATACGTGCACACAG | 1048–1067 | |
| SINV F3 | CAAACAATAGCGAGGGCTTC | 979–998 | 793 |
| SINV R3 | GAGGTTGGTGAAACGACGAT | 1752–1771 | |
| SINV F4 | TGGCAGACAAAGACATCGAG | 1606–1625 | 681 |
| SINV R4 | GTGCCGTGACAGTTGACTTG | 2267–2286 | |
| SINV F5 | GCGATGCGTTAAGAAGGAAG | 2096–2115 | 1,088 |
| SINV R5 | ACTGGCAACCGGTAAGTACG | 3164–3183 | |
| SINV F6 | GAGGACTGGGAAGCTGAACA | 3030–3049 | 1,062 |
| SINV R6 | TCCRTCTCTTGTACCCTCRT | 4072–4091 | |
| SINV F7 | GRCCAGAKTGYGTCTCAAGCA | 3959–3978 | 600 |
| SINV R7 | TCGATTCGTTCCTTCCACTT | 4539–4558 | |
| SINV F8 | ATCAAGTCTGTCGCCATTCC | 4401–4420 | 1,086 |
| SINV R8 | CATGGATACCCCACCAAAAG | 5467–5486 | |
| SINV F9 | TTTAGCGGATCGGACAATTC | 5238–5257 | 1,298 |
| SINV R9 | GCGGTGACGAACTCAGTAG | 6517–6535 | |
| SINV F10 | CTGGAYTCAGCGACATTCAA | 6423–6442 | 1,127 |
| SINV R10 | TTGCTCTGGGCAAAAGTTCT | 7530–7549 | |
| SINV F11 | GCCCTGCTAGATGAAACGAA | 7416–7435 | 1,171 |
| SINV R11 | TATCGTAGGCCTCGTGGTTC | 8567–8586 | |
| SINV F12 | GGATAACTCAGGTCGGGTTG | 8312–8331 | 768 |
| SINV R12 | TTACCGTGAACGGGAGGTAG | 9060–9079 | |
| SINV F13 | AGCGTGACGGTTAGCATAGC | 8964–8983 | 909 |
| SINV R13 | CAGCGAAGTTGGAATTACGG | 9853–9872 | |
| SINV F14 | TACCATCGCCATCCTGTGTA | 9714–9733 | 949 |
| SINV R14 | CTGGTTTCATCGCTCCGTAT | 10 643–10 662 | |
| SINV F15 | ACGGAGTTACACCAGGAACG | 10 516–10 535 | 1,121 |
| SINV R15 | TATGCACCAYGCTTCCTCAG | 11 617–11 636 |
The reference genome was from reference 17.
Model comparison by BEAST analysis
| Molecular clock model | Demographic model | Mutation rate | 95% HPD | Log marginal likelihood (SS) | Log marginal likelihood (PS) |
|---|---|---|---|---|---|
| Relaxed clock, exponential | Coalescent exponential population | 1.045 × 10−4 | 5.6508 × 10−5, 1.6026 × 10−4 | −23,632.61 | −23,632.33 |
| Relaxed clock, log normal | Coalescent exponential population | 1.5723 × 10−4 | 6.6297 × 10−5, 2.6341 × 10−4 | −23,632.24 | −23,631.94 |
| Relaxed clock, exponential | Coalescent Bayesian skyline | 1.225 × 10−4 | 6.8056 × 10−5, 1.8561 × 10−4 | −23,622.27 | −23,621.7 |
| Relaxed clock, normal | Coalescent Bayesian skyline | 1.257 × 10−4 | 7.4641 × 10−5, 1.8208 × 10−4 | −23,623.27 | −23,623.38 |
| Strict | Coalescent exponential population | 5.4535 × 10−5 | 4.3983 × 10−5, 6.5605 × 10−5 | −23,757.07 | −23,756.55 |
| Strict | Coalescent Bayesian skyline | 5.476 × 10−5 | 4.2365 × 10−5, 6.7008 × 10−5 | −23,746.58 | −23,746.95 |
95% HPD, 95% highest posterior density; SS, stepping-stone sampling; PS, path sampling.