| Literature DB >> 31392031 |
Maude Jacquot1, Pavuluri P Rao2, Sarita Yadav3, Kyriaki Nomikou4, Sushila Maan5, Y Krishna Jyothi6, Narasimha Reddy7, Kalyani Putty7, Divakar Hemadri8, Karam P Singh9, Narender Singh Maan5, Nagendra R Hegde2, Peter Mertens3,10, Roman Biek1.
Abstract
For segmented viruses, rapid genomic and phenotypic changes can occur through the process of reassortment, whereby co-infecting strains exchange entire segments creating novel progeny virus genotypes. However, for many viruses with segmented genomes, this process and its effect on transmission dynamics remain poorly understood. Here, we assessed the consequences of reassortment for selection on viral diversity through time using bluetongue virus (BTV), a segmented arbovirus that is the causative agent of a major disease of ruminants. We analysed ninety-two BTV genomes isolated across four decades from India, where BTV diversity, and thus opportunities for reassortment, are among the highest in the world. Our results point to frequent reassortment and segment turnover, some of which appear to be driven by selective sweeps and serial hitchhiking. Particularly, we found evidence for a recent selective sweep affecting segment 5 and its encoded NS1 protein that has allowed a single variant to essentially invade the full range of BTV genomic backgrounds and serotypes currently circulating in India. In contrast, diversifying selection was found to play an important role in maintaining genetic diversity in genes encoding outer surface proteins involved in virus interactions (VP2 and VP5, encoded by segments 2 and 6, respectively). Our results support the role of reassortment in driving rapid phenotypic change in segmented viruses and generate testable hypotheses for in vitro experiments aiming at understanding the specific mechanisms underlying differences in fitness and selection across viral genomes.Entities:
Keywords: India; bluetongue virus; evolution; reassortment; selection
Year: 2019 PMID: 31392031 PMCID: PMC6680063 DOI: 10.1093/ve/vez027
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Rates of nucleotide substitution and root height of each segment of BTV.
| CoV | Substitution rate (subs/site/year) | Estimated date of MRCA | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mean | Median | 95% HPD | 95% HPD | Mean | Median | 95% HPD | 95% HPD | Mean | Median | 95% HPD | 95% HPD | |
| Seg-1 | 3.17 | 3.13 | 2.37 | 4.12 | 1.14E-03 | 1.13E-03 | 7.99E-04 | 1.47E-03 | 1953 | 1954 | 1941 | 1961 |
| Seg-2 | 4.60 | 4.59 | 3.98 | 5.26 | 1.46E-02 | 1.45E-02 | 1.06E-02 | 1.86E-02 | 1937 | 1938 | 1918 | 1954 |
| Seg-3 | 4.43 | 4.31 | 2.64 | 6.26 | 1.34E-03 | 1.33E-03 | 9.55E-04 | 1.76E-03 | 1956 | 1957 | 1949 | 1961 |
| Seg-4 | 3.04 | 2.96 | 2.04 | 4.21 | 1.28E-03 | 1.27E-03 | 9.09E-04 | 1.63E-03 | 1956 | 1957 | 1947 | 1961 |
| Seg-5 | 3.88 | 3.82 | 2.64 | 5.05 | 1.13E-03 | 1.12E-03 | 8.50E-04 | 1.41E-03 | 1955 | 1955 | 1947 | 1961 |
| Seg6 | 4.95 | 5.03 | 3.57 | 5.79 | 9.40E-03 | 9.27E-03 | 5.96E-03 | 1.31E-02 | 1946 | 1946 | 1927 | 1961 |
| Seg-7 | 5.36 | 5.30 | 3.97 | 6.79 | 2.28E-03 | 2.25E-03 | 1.54E-03 | 3.11E-03 | 1954 | 1956 | 1944 | 1961 |
| Seg-8 | 3.05 | 2.97 | 1.98 | 4.21 | 1.22E-03 | 1.21E-03 | 8.54E-04 | 1.63E-03 | 1956 | 1957 | 1949 | 1961 |
| Seg-9 | 3.45 | 3.39 | 2.44 | 4.68 | 7.81E-04 | 7.77E-04 | 5.29E-04 | 1.05E-03 | 1950 | 1951 | 1934 | 1961 |
| Seg-10 | 2.28 | 2.21 | 1.45 | 3.17 | 1.06E-03 | 1.04E-03 | 7.31E-04 | 1.41E-03 | 1957 | 1957 | 1950 | 1961 |
Parameters were estimated based on ninety-two time-stamped Indian isolates in program BEAST v1.8.4 (Drummond et al. 2012) under a relaxed molecular clock and a flexible skygrid tree prior. Date of MRCA was deduced from the estimated mean root height.
Coefficient of variation.
Most recent common ancestor.
Highest posterior density.
Figure 1.Phylogeny of Indian BTV Seg-2 and amino-acid based clusters of all genomic segments. Shown is a time-scaled maximum clade credibility (MCC) tree based on Seg-2 for ninety-two Indian BTV isolates, obtained using Bayesian phylogenetic inference in program BEAST (Drummond et al. 2012). Seg-2 sequences were assigned to amino-acid based clusters (Pessia et al. 2015), as highlighted on the tree, which corresponded almost perfectly to serotype (shown on the very right). The same clustering approach was used for all other segments in the genome, for which cluster memberships is shown as vignettes on the right side of the tree. Colours chosen to represent clusters are used consistently across all figures of the manuscript.
Figure 2.Seg-2 diversity in Indian BTV samples collected over four decades. The proportion of sequences falling into each of ten phylogenetic clusters for Seg-2 (largely equivalent to serotype, see Fig. 1) is shown for each decade: 1985–1994 (n = 8), 1995–2004 (n = 15), 2005–2014 (n = 23), 2015 and after (n = 44). Multiple Seg-2 types co-circulate within the same decade and dominant types vary between decades.
Figure 3.Changes in the phylogenetic diversity of Indian BTV Seg-5 sequences across time and space indicates that a single variant has risen to dominance in recent decades. (A) Seg-5 time-scaled MCC tree of the ninety-two Indian BTV isolates, which was obtained using Bayesian phylogenetic inference in program BEAST (Drummond et al. 2012). Sequences could be divided into four amino-acid based clusters (Pessia et al. 2015) represented by different colours on the tree. According to ancestral state reconstruction of Seg-5 amino acid sequences, the branch defining the largest cluster (shown in red) only includes a single unique non-synonymous change in codon 322 (Asp -> Glu). The inferred amino acid states in this position across other branches of the phylogeny are also shown. (B) Changes in the proportions of samples belonging to each of the four Seg-5 clusters over the past four decades, documenting the continuous rise in frequency of the red cluster, which has become the only variant found among samples collected since 2014. (C) The spatial distribution of Seg-5 clusters before and after 2005, confirming that the recent dominance of the red cluster applies across the Indian sub-continent.
Figure 4.Multi-dimensional scaling (MDS) plot reflecting correlations in time to most recent common ancestor (TMRCA) between pairs of Indian BTV virus segments. In the absence of reassortment, segments are expected to exhibit highly correlated TMRCAs due to their evolutionary histories being tightly linked. MDS allows to depict the overall level of cross-correlation between all segments in two-dimensional space (Rambaut et al. 2008), where overlap between observations is indicative of shared evolutionary history (i.e. linkage) between segments. In contrast, segments broken up by reassortment are expected to occupy different regions in the plot. Temporal estimation was done separately for each of the ten BTV segments. Clouds of points reflect phylogenetic uncertainty based on 500 trees sampled in program BEAST (Drummond et al. 2012) for each segment, with pairwise comparisons to other segments being limited to viruses sampled in the same year. Only the first two dimensions of the scaling are shown.
Figure 5.Tree topology inconsistencies between pairs of Indian BTV segments. Inconsistencies in tree topologies, indicative of genome reassortment, were identified using the program GiRaF (Nagarajan and Kingsford 2011). The number of independent runs (out of 50) with evidence of inconsistency is shown between each possible pair of segments (columns) for each BTV sample (rows). Samples are ordered according to the Seg-2 MCC tree (i.e. by serotype, Fig. 1). Segment pairs were ordered by reassortment pattern similarity using hclust, the Ward minimum variance clustering method, and Ward’s clustering criterion (Murtagh and Legendre 2014).
Population genetics analysis of the BTV isolates studied.
|
|
|
|
| PAML M3 parameter estimates | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| nuc | aa | nuc | aa | nuc | aa | nuc | aa |
|
|
|
|
|
|
| |
| Seg-1 | 92 | 1125 | 126 | 0.06 | 0.02 | 0.05 | 0.01 | −0.32 | −1.37 | 0.00 | 0.06 | 0.55 | 0.77 | 0.21 | 0.02 | 0.04 |
| Seg-2 | 92 | 2496 | 854 | 0.17 | 0.17 | 0.42 | 0.50 | 5.29 | 6.64 | 0.01 | 0.04 | 0.15 | 0.25 | 0.51 | 0.24 | 0.13 |
| Seg-3 | 92 | 749 | 40 | 0.05 | 0.01 | 0.04 | 0.00 | −1.09 | −1.96 | 0.00 | 0.03 | NA | 0.79 | 0.21 | 0.00 | 0.01 |
| Seg-4 | 92 | 578 | 103 | 0.06 | 0.03 | 0.04 | 0.02 | −1.06 | −1.19 | 0.02 | 0.02 | 0.40 | 0.65 | 0.26 | 0.09 | 0.07 |
| Seg-5 | 92 | 443 | 68 | 0.05 | 0.02 | 0.05 | 0.02 | −0.47 | −0.98 | NA | 0.02 | 0.33 | 0.00 | 0.91 | 0.09 | 0.06 |
| Seg-6 | 92 | 928 | 269 | 0.11 | 0.10 | 0.28 | 0.22 | 4.82 | 4.04 | 0.00 | 0.01 | 0.07 | 0.35 | 0.49 | 0.16 | 0.05 |
| Seg-7 | 92 | 337 | 20 | 0.06 | 0.01 | 0.14 | 0.02 | 4.01 | 2.18 | 0.00 | 0.00 | 0.03 | 0.27 | 0.58 | 0.15 | 0.01 |
| Seg-8 | 92 | 319 | 73 | 0.06 | 0.04 | 0.03 | 0.02 | −1.42 | −1.40 | 0.01 | 0.39 | 7.59 | 0.83 | 0.17 | 0.00 | 0.11 |
| Seg-9 | 93 | 370 | 135 | 0.07 | 0.08 | 0.08 | 0.08 | 0.25 | 0.01 | 0.08 | 0.08 | 0.81 | 0.40 | 0.29 | 0.31 | 0.31 |
| Seg-9 NS4 | 93 | 57 | 11 | 0.05 | 0.03 | 0.04 | 0.01 | −0.26 | −1.58 | 0.03 | NA | NA | 1.00 | 0.00 | 0.00 | 0.08 |
| Seg-10 | 92 | 172 | 25 | 0.05 | 0.02 | 0.03 | 0.01 | −1.20 | −2.02 | 0.00 | 0.25 | 0.25 | 0.88 | 0.11 | 0.01 | 0.05 |
| Seg-10 NS5 | 92 | 29 | 24 | 0.03 | 0.08 | 0.02 | 0.05 | −1.30 | −1.02 | 0.00 | 9.98 | 73.49 | 0.27 | 0.70 | 0.03 | 6.56 |
S, θ, θ, and D statistics were computed in MEGA v7.0 (Kumar, Stecher, and Tamura 2016) at the nucleotide (nuc) and amino-acid (aa) levels. ω0, ω1, ω2 and p, p and p are parameters estimated for the M3 model (discrete) which best fits data in PAML (Yang 1997). Mean d/d ratios were estimated using the single likelihood ancestor counting (SLAC) method (Kosakovsky Pond, Frost, and Muse 2005).
N, number of sequences; S, number of segregating sites; D, Tajima's D; ω, d/d; p, proportion of sites; NA, not applicable because p = 0.
Detection of positively and negatively selected codons.
| FUBAR | FEL | MEME | ||||||
|---|---|---|---|---|---|---|---|---|
| Total number of codons in the alignments | Number of sites under positive/ diversifying selection | Number and percentage of sites under negative/ purifying selection | Number of sites under positive/ diversifying selection | Number and percentage of sites under negative/ purifying selection | Number of sites under episodic positive/diversifying selection | |||
| Seg-1 | 1,302 | 0 | 936 | 71.89% | 0 | 489 | 37.56% | 6 |
| Seg-2 | 978 | 0 | 816 | 83.44% | 3 | 689 | 70.45% | 4 |
| Seg-3 | 901 | 0 | 689 | 76.47% | 0 | 389 | 43.17% | 2 |
| Seg-4 | 644 | 0 | 390 | 60.56% | 0 | 175 | 27.17% | 5 |
| Seg-5 | 552 | 0 | 307 | 55.62% | 0 | 134 | 24.28% | 1 |
| Seg-6 | 528 | 0 | 487 | 92.23% | 0 | 448 | 84.85% | 1 |
| Seg-7 | 349 | 0 | 304 | 87.11% | 0 | 219 | 62.75% | 1 |
| Seg-8 | 354 | 0 | 155 | 43.79% | 0 | 76 | 21.47% | 2 |
| Seg-9 | 330 | 1 | 76 | 23.03% | 1 | 70 | 21.21% | 6 |
| Seg-9 NS4 | 77 | 0 | 13 | 16.88% | 0 | 1 | 1.30% | 0 |
| Seg-10 | 229 | 0 | 83 | 36.24% | 0 | 12 | 5.24% | 0 |
| Seg-10 NS5 | 59 | 8 | 0 | 0.00% | 7 | 0 | 0.00% | 1 |
Positively and negatively selected codons were estimated by the FUBAR, FEL, and MEME methods of the HyPhy suit (Kosakovsky Pond, Frost, and Muse 2005).
Codons with posterior probabilities >0.9 are reported.
Codons with P-values of <0.05 are reported.