| Literature DB >> 27760182 |
Yann J K Bertrand1, Magnus Johansson2,3,4, Peter Norberg5.
Abstract
The hypothesis of wide spread reticulate evolution in Tick-Borne Encephalitis virus (TBEV) has recently gained momentum with several publications describing past recombination events involving various TBEV clades. Despite a large body of work, no consensus has yet emerged on TBEV evolutionary dynamics. Understanding the occurrence and frequency of recombination in TBEV bears significant impact on epidemiology, evolution, and vaccination with live vaccines. In this study, we investigated the possibility of detecting recombination events in TBEV by simulating recombinations at several locations on the virus' phylogenetic tree and for different lengths of recombining fragments. We derived estimations of rates of true and false positive for the detection of past recombination events for seven recombination detection algorithms. Our analytical framework can be applied to any investigation dealing with the difficult task of distinguishing genuine recombination signal from background noise. Our results suggest that the problem of false positives associated with low detection P-values in TBEV, is more insidious than generally acknowledged. We reappraised the recombination signals present in the empirical data, and showed that reliable signals could only be obtained in a few cases when highly genetically divergent strains were involved, whereas false positives were common among genetically similar strains. We thus conclude that recombination among wild-type TBEV strains may occur, which has potential implications for vaccination with live vaccines, but that these events are surprisingly rare.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27760182 PMCID: PMC5070875 DOI: 10.1371/journal.pone.0164435
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Synoptic diagram presenting the methods and analytical framework deployed during in the simulation.
The analytical protocol designed to estimate the rates of true and false positive for the detection of recombination in simulated data consists in several steps: STEPS 1–3 derive the parameters for the simulation from empirical sequence alignments. STEPS 4–8 simulate alignments that are similar to the empirical data in term of tree topology, divergence dates, number of strains, number of sites and rate heterogeneity among sites. Several stochastic processes are added in order to model lineage specific substitution rate variation (STEP 4) and model the effect of purifying selection (STEP 6). The simulated datasets with and without recombinations are finally analyzed with RDP4 (STEP 9).
Fig 2Selected lineages for simulating recombination events in the three TBEV subtypes: FE- (a), S-(b) and W-(c).
For each strain the corresponding GenBank accession number appears in parentheses. Trees have been derived from BEAST analyses using the time stamped ALN1-FE, ALN1-S and ALN1-W alignments. For some selected nodes, the 95% HPD for their divergence times is shown between square brackets. All dates correspond to years before present. Posterior probabilities of branches are indicated by coloring the nodes in different shades of grey. Uncolored nodes bear branch support values below 0.75.
Results of the recombination analyses using RDP4 applied to empirical data (W-, S- and FE-ALN2 alignments) at detection P-value of 0.05.
| Detection methods | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Subtype | Event | Start | End | Recombinant(s) | Minor parent(s) | Major parent(s) | 3Seq | Bootscan | Chimaera | GenConv | Maxchi | RDP | SiSscan |
| 1 | 2902 | 6848 | Shkotovo-94 | Primo212, Primo183, Primo18, Primo202, Primo208, Primo253, Primo270, Primo274, Primo320, Primo332, Primo345, Primo437, Primo69, Primo7503, Primo75, Primo823, Primo828, Primo82, Primo86, Primo895, Primo909, Kiparis-94, | Primo94, 1230, Dalnegorsk, Irkutsk-1861, Primi89, Primo1153, Primo2239, Primo501, Primo633, Primo7396, Primo87, Primo91, SofjinKSY, Sofjin-Chumakov, Sofjin-HO, Spassk-72 | 1.5E-29 | 7.9E-45 | 3.5E-19 | 1.3E-37 | 1.0E-21 | 8.9E-39 | 5.8E-30 | |
| 2 | 3796 | 4839 | Primo196 | Primo1153, 1230, Dalnegorsk, Irkutsk-1861, Primi89, Primo2239, Primo501, Primo633, Primo739, Primo87, Primo91, Primo94, SofjinKSY, Sofjin-Chumakov, Sofjin-HO, Spassk-72 | Primo52, Glubinnoe-2004 | 1.4E-16 | 2.1E-27 | 3.1E-11 | 5.7E-25 | 6.4E-11 | 2.7E-27 | 1.3E-12 | |
| 3 | 6867 | 7806 | Primo196 | Primo1153, 1230, Dalnegorsk, Irkutsk-1861, Primi89, Primo2239, Primo501, Primo633, Primo739, Primo87, Primo91, Primo94, SofjinKSY, Sofjin-Chumakov, Sofjin-HO, Spassk-72 | Primo52, Glubinnoe-2004 | 1.2E-18 | 5.5E-31 | 1.4E-09 | 2.0E-28 | 1.0E-09 | 6.5E-31 | 5.0E-11 | |
| 4 | 5541 | 5946 | Primo52 | Sofjin-HO, 1230, Primi89 | Primo196, Glubinnoe-2004 | 7.3E-08 | 2.7E-06 | ||||||
| 5 | 435 | 990 | Glubinnoe-2004 | 205 | Primo196, Primo52 | 1.4E-02 | 2.4E-06 | 1.8E-04 | 6.7E-05 | 3.9E-03 | 2.8E-06 | 1.4E-04 | |
| 6 | 6488 | 6671 | 205, 9024 | Primo69, Kiparis-94, Primo183, Primo18, Primo202, Primo208, Primo212, Primo253, Primo270, Primo274, Primo320, Primo332, Primo345, Primo437, Primo750, Primo75 Primo823, Primo828, Primo82, Primo86, Primo895, Primo909, Shkotovo-94 | Primo2239 | 1.16E-03 | 3.3E-05 | ||||||
| 7 | 8343 | 9123 | MDJ-03, MDJ-02 | Xinjiang-01 | Senzhang | 1.4E-04 | 9.1E-03 | 1.92E-03 | 1.5E-04 | ||||
| 8 | 7952 | 8342 | MDJ-01, Senzhang | MDJ-02, MDJ-03 | Xinjiang-01 | 1.5E-03 | 3.6E-08 | 3.6E-06 | |||||
| 9 | 8343 | 9294 | MDJ-01 | Senzhang, WH2012 | Xinjiang-01, MDJ-03 | 2.3E-05 | |||||||
| 10 | 6690 | 9219 | Primo501, Primo1153 | Primo2239 | Primo87 | 1.1E-03 | 1.1E-04 | 4.6E-02 | 6.3E-04 | ||||
| 11 | 2190 | 2613 | Glubinnoe-2004 | Sofjin-HO, Dalnegorsk, Primo196, Primo1153, Primo2239, Primo501, Primo633, Primo739, Primo87, Primo91, Primo94, Sofjin-Chumakov | Primo52 | 1.9E-03 | 1.5E-03 | 2.7E-02 | 3.3E-03 | ||||
| 12 | 6462 | 6550 | Primo633 | Zabaikalye | Sofjin-Chumakov, Irkutsk-1861, Primi89, Primo91, SofjinKSY | 3.0E-03 | 3.4E-03 | ||||||
| 13 | 3381 | 3413 | MDJ-02 | Primo52 | MDJ-03 | 8.7E-03 | |||||||
| 14 | 9155 | 9541 | Glubinnoe-2004 | Primo2239 | MDJ-02, MDJ-03 | 3.2E-02 | |||||||
| 15 | 4536 | 4913 | Primo253 | Svetlogorie | Primo94 | 3.2E-02 | |||||||
| 16 | 9338 | 10105 | Xinjiang-01 | Svetlogorie | WH2012 | 3.4E-02 | 1.8E-03 | 2.4E-05 | |||||
| 1 | 5200 | 5244 | 2689 | Est54, Latvia-1-96 | MucAr-M14-10, Aina, Cht-653, Kolarovo-2008, Vasilchenko, Lesopark, IR99-22f7, Irkutsk-BR-683-11 | 2.8E-05 | 4.7E-06 | ||||||
| 2 | 6375 | 6459 | Zabai | Vasilchenko, Aina | Irkutsk-12 | 2.0E-05 | 1.0E-04 | ||||||
| 3 | 3784 | 4075 | Kolarovo-2008 | MucAr-M14-10, 2689, Zausaev | Cht-22, Cht-653, Irkutsk-12 | 2.2E-02 | 7.5E-03 | 1.6E-02 | 1.2E-04 | ||||
| 4 | 8566 | 8718 | Kolarovo-2008 | Zausaev | Cht-653 | 8.6E-03 | 1.9E-04 | ||||||
| 5 | 108 | 1284 | Kolarovo-2008 | Zausaev5 | Cht-22, Vasilchenko | 1.0E-02 | 3.0E-03 | ||||||
| 6 | 7645 | 7867 | Kolarovo-2008 | 2689 | Cht-22 | 3.4E-02 | |||||||
| 7 | 7280 | 8215 | Zausaev | 2689 | MucAr-M14—10 | 2.6E-09 | 1.6E-05 | 4.1E-07 | 9.0E-05 | 8.5–04 | |||
| 8 | 1998 | 2301 | Zausaev, MucAr-M14-10 | Latvia-1-96, | Irkutsk-12, Aina, Cht-653, Sakhalin-6-11, Vasilchenko | 1.3E-02 | |||||||
| 9 | 8562 | 9550 | Kolarovo-2008 | Irkutsk-12, Cht-22, Sakhalin-6-11, Zabaikalye-1-09 | Vasilchenko, Aina | 5.7E-04 | 4.4E-08 | ||||||
| 10 | 7187 | 8199 | Kolarovo-2008 | Cht-653, Aina, Irkutsk-BR-683-11 | Tomsk-PT122 | 2.0E-05 | 2.1E-04 | ||||||
| 11 | 587 | 1174 | Kolarovo-2008 | IR99-22f7, Lesopark 11, Zausaev | Tomsk-PT122 | 4.7E-02 | 2.5E-05 | 2.4E-03 | |||||
| 12 | 4518 | 5226 | Aina, Cht-653, Vasilchenko | Buzuuchuk | Tomsk-PT122, Kolarovo-2008, MGL-Selenge-13-12 | 6.3E-07 | 2.9E-05 | 3.4E-06 | 7.2E-06 | 2.4E-07 | 1.9E-08 | ||
| 13 | 3860 | 4258 | Vasilchenko, Aina, 653, Tomsk-PT122 | Buzuuchuk | Irkutsk-12, Sakhalin-6-11 | 5.4E-05 | 1.7E-03 | 1.0E-02 | 1.8E-03 | 1.2E-05 | |||
| 14 | 5805 | 6240 | Vasilchenko, Aina, Cht-653 | Buzuuchuk | Tomsk, Kolarovo-2008 | 9.4E-06 | 2.4E-03 | 3.1E-04 | 2.9E-03 | 1.2E-05 | 4.1E-05 | ||
| 15 | 3525 | 3859 | Irkutsk-BR-683-11, Cht-22, Irkutsk-12, Sakhalin-6-11, Zabaikalye-1-09 | 2689 | Aina, Cht-653, Vasilchenko | 6.8E-04 | 7.92E-04 | ||||||
| 16 | 1701 | 2250, | Buzuuchuk | Aina, Cht-22, Cht-653, Irkutsk-12, MGL-Selenge-13-14, Sakhalin-6-11, Tomsk-PT122, Vasilchenko, Zabaikalye-1-09 | IR99-22f7, 2689, Lesopark, Zausaev | 4.7E-02 | 5.7E-07 | ||||||
| 17 | 9454 | 9894 | Buzuuchuk, Tomsk-PT122, Vasilchenko | Aina | IR99-22f7, MucAr-M14-10, Zausaev | 1.5E-04 | |||||||
| 18 | 7050 | 7635 | Buzuuchuk | Aina | MucAr-M14-10, Lesopark, Zausaev | 7.0E-03 | 4.5E-02 | 4.9E-04 | |||||
| 19 | 8030 | 8624 | Buzuuchuk | Aina, Cht-653, Vasilchenko | 2689, Lesopark, MucAr-M14-10 | 6.3E-03 | 3.0E-03 | 3.1E-06 | |||||
| 20 | 3926 | 4155 | Latvia-1-96 | Buzuuchuk | Lesopark, IR99-22f7, MucAr-M14-10 | 8.7E-03 | 1.0E-04 | ||||||
| 21 | 5091 | 5564 | Kolarovo-2008 | MGL-Selenge-13-12, Cht-22, Irkutsk-12, Irkutsk-BR-683-11, MGL-Selenge-13-14, Sakhalin-6-11, Zabaikalye-1-09 | Tomsk-PT122 | 5.0E-04 | 2.8E-02 | 8.9E-03 | 1.4E-03 | ||||
| 1 | 946 | 1702 | Joutseno | Est3476 | AS33 | 1.3E-05 | 3.9E-02 | 6.0E-04 | 6.7E-04 | 4.1E-03 | 3.5E-07 | ||
| 2 | 6997 | 8593 | Joutseno, AS33 | Est3476 | Stockholm, Norwegian, Toro-2003 | 1.3E-07 | 4.57E-02 | 1.7E-07 | 1.5E-06 | 3.2E-02 | 2.0E-11 | ||
| 3 | 4719 | 5836 | Stockholm, Norwegian, Toro-2003 | Kumlinge-A52 | Ljubljana-I | 8.1E-03 | |||||||
*The prefixes “Primo” and “Primi” refer to the strains called “Primorye” and “Primirye” respectively.
The start of ORFs is used as the origin for reporting the location of recombination breakpoints along the alignment.
Range of durations between the recombination tMRCA of the donor and receiver clades for the simulated events.
| Subtype | FE | S | W | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| event | 1 | 2 | 3 | 4 | 1 | 2 | 3 | 1 | 2 | 3 |
| Sampled durations | [435,1077] | [578,894] | [0,74] | [1187,1524] | [304,421] | [211,302] | [115,207] | [20,26] | [0,17] | [98,151] |
Table displaying for each simulated recombination event, the range of sampled durations in years between the time of the recombination event and the tMRCA of the donor and receiver clades. tMCRA is set to the median age of the split between the two clades based on posterior density regions at 95% for the node ages in the empirical BEAST trees from subtype alignments (ALN1-FE, ALN1-S and ALN1-W).
Fig 3Collection localities for the E-sequences used to build the tree in Fig I in S1 file.
Strains origins for three clades of interest (W1, FE4 and FE5) are indicated with arrows. Because strain origins are reported with various levels of precision (from local to national level), this map should only be used as an indication of the patchy record of TBEV genetic diversity. This map also shows that the territory comprised between the Irkutsk and Zabaikalsky regions represent a hot spot of genetic diversity with the co-circulation of X1-, X2-, FE- and S-strains. Some well known foci such as the presence of the S-TBEV in Finland and the isolation of all three sub-types in Estonia and Latvia are not included on the map as the associated sequences are too short to yield reliable phylogenetic signal. Sampling intensity is given in number of sequences collected in the same locality. The map was generated using an equidistant cylindrical projection with the Basemap toolkit available from the python package Matplotlib (http://matplotlib.org/basemap/users/cyl.html?highlight=cylindrical).
Reappraisal of the RDP4 identified putative recombination events.
| Event | Length (bp) | Region | No recombination mean node age (95% HPD) | Recombinationmean node age (95% HPD) | tMRCA mean node age (95% HPD) | Duration: tMRCA to recombination event | Simulation category | |
|---|---|---|---|---|---|---|---|---|
| Subtype FE | 1 | 3946 | NS1-NS4B | 646 [450,973] | 41 [25,65] | 1071 [749,1579] | 1030 | deep |
| 2 | 1043 | NS2A-NS3 | 920 [620,1542] | 100 [39,147] | 1294 [818,1922] | 1194 | deep | |
| 3 | 939 | NS4B-NS5 | - | - | - | - | - | |
| 4 | 405 | NS3 | 1292 [721,1696] | 106 [75,271] | 1574 [1016,2254] | 1468 | deep | |
| 5 | 555 | PrM-E | 1358 [793,2060] | 274 [63,464] | 1597 [1063,2495] | 1323 | deep | |
| 8 | 390 | NS5 | MDJ-01: 58 [18,88], Senzhang: 190 [79,182] | MDJ-01: 6 [2,49], Senzhang: 70 [59,235] | 488 | MDJ-01: 482, Senzhang: 412 | intermediate-deep |
The events in subtype FE correspond to those presented in Table 1. Event 2 and 3 are considered to be derived from the same event. The length of the recombining fragment and the genome region spanned by the fragment are indicated. The “no recombination mean node age” refers to the divergence date for the recombining strain in the recombination free alignment, whereas the “recombination mean node age” reports the divergence time of the strain in the alignment region covering the recombination. The time of the most common recent ancestor of the parental strains that produced the recombinant (highlighted by a red circle in (Fig J in S1 file) is given by the tMCRA. The tMCRA was calculated on the largest alignment, that is, on the recombination free sequences. The duration was calculated as the time from the tMRCA to the time of the recombination event. All dates and duration are reported in years. Events are assigned to simulation categories based on their duration values.