Literature DB >> 25815768

Time-calibrated phylogenomics of the classical swine fever viruses: genome-wide bayesian coalescent approach.

Taehyung Kwon1, Sook Hee Yoon1, Kyu-Won Kim2, Kelsey Caetano-Anolles3, Seoae Cho4, Heebal Kim1.   

Abstract

The phylogeny of classical swine fever virus (CSFV), the causative agent of classical swine fever (CSF), has been investigated extensively. However, no evolutionary research has been performed using the whole CSFV genome. In this study, we used 37 published genome sequences to investigate the time-calibrated phylogenomics of CSFV. In phylogenomic trees based on Bayesian inference (BI) and Maximum likelihood (ML), the 37 isolates were categorized into five genetic types (1.1, 1.2, 2.1, 2.3, and 3.4). Subgenotype 1.1 is divided into 3 groups and 1 unclassified isolate, 2.1 into 4 groups, 2.3 into 2 groups and 1 unclassified isolate, and subgenotype 1.2 and 3.4 consisted of one isolate each. We did not observe an apparent temporal or geographical relationship between isolates. Of the 14 genomic regions, NS4B showed the most powerful phylogenetic signal. Results of this evolutionary study using Bayesian coalescent approach indicate that CSFV has evolved at a rate of 13×.010-4 substitutions per site per year. The most recent common ancestor of CSFV appeared 2770.2 years ago, which was about 8000 years after pig domestication. The effective population size of CSFV underwent a slow increase until the 1950s, after which it has remained constant.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25815768      PMCID: PMC4376735          DOI: 10.1371/journal.pone.0121578

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Classical swine fever (CSF), also known as hog cholera is a highly contagious viral disease of domestic pig and wild boar that causes watery diarrhea and weakness. The high mortality rate of CSF leads to significant economic losses in the global swine industry [1]. From the 1990s to the early 2000s, sporadic outbreaks of CSF in European swine industries were reported [2-4]; for instance, 117, 48, and 429 farms were contaminated in Germany, Belgium, and the Netherlands respectively. In the case of the Netherlands, economic losses were calculated to be almost 2.3 billion US dollars [5, 6]. Many countries are currently considered to be CSF free or low risk by the United States Department of Agriculture (USDA), including Australia, Canada, and several European countries. Given the potential for devastating economic impact of a CSF outbreak on the swine industry, circumspect governmental regulations are in place to prevent CSF exposure in large, dense populations of pigs. Nevertheless, CSF outbreaks in small populations of wild boar are still observed in several regions like India [7]. As populations in the swine industry become denser and larger, more government attention to CSF is needed. Additionally, countries that are not CSF free, such as India or China, account for a large portion of the global swine industry [8], which means that global attention to the CSF is necessary as well. Classical swine fever virus (CSFV), the causative agent of CSF, is a member of the genus Pestivirus within the family Flaviviridae [1, 9]. In viral epidemiology, mortality rate is defined as the number of deaths over the number of individuals infected during a specific period. [10]. While the mortality rate of CSF is high, the severity differs by case, especially depending on the age of the animal and virulence of the virus. While symptoms in older animal tend to be milder, the mortality rate of young pigs is almost 90 percent [1, 9]. CSF can often be confused with American swine fever (ASF), which produces clinical symptoms similar in pigs to that of CSF, including haemorrhagic fever. However, despite these similarities, CSF virus is a small single-stranded RNA virus, while ASF virus is a large double-stranded DNA virus of genus Asfivirus in the family Asfarviridae [11]. While it is difficult to diagnose these viruses from clinical signs and lesions alone, confirmatory experiments like RT-PCR is necessary [12]. CSFV is also structurally close to the bovine viral diarrhea (BVD) virus and the border disease (BD) virus, which also belong to the genus Pestivirus [9]. Although these viruses are capable of infecting pigs, they do not have the ability to spread out without their original hosts and can easily be genetically differentiated from CSFV [13]. CSFV has a single positive-stranded RNA genome of approximately 12.3 kb including 11.7 kb open reading frame (ORF), which is flanked by the 5’ and 3’untranslated regions (UTRs). The ORF encodes 12 polypeptides, which contain four structural proteins (C, Erns, E1, E2) and seven non-structural proteins (p7, NS2, NS3, NS4A, NS4B, NS5A, NS5B) with autoprotease (Npro) at N-terminus [14, 15]. The UTRs contain signals for viral replication, transcription, and translation [15]. Ji et al. [16] found that virulence of CSFV is associated with mutations in seven protein genes (Npro, C, Erns, E1, E2, p7, NS4B) related to viral entry, release, replication and host cell interaction [17]. CSFVs have been classified under three major genotypes (1, 2, and 3), each comprised of three to four subgenotypes (1.1–1.4; 2.1–2.3; 3.1–3.4) [18, 19]. Since the first report of CSFV in the United States at 1833 [15] and in South Korea at 1908 [20], many global phylogenetic studies have been published based on the genome [21-23] and particular genes such as 5’ UTR, E2, and NS5B (e.g., [24-31]). However, as described in Floegel et al. and Ji et al. [32, 33], accumulation of mutations in viruses have caused variability in virulence between genotypes. The temporal dynamics of global CSFVs also remain unclear. The only previous study analyzed a small number genes (5’UTR, Npro, and E2) [34], and found that the most recent common ancestor of the virus existed in the year 1892. To fully comprehend the phylogeny and evolutionary dynamics of CSFV, as mentioned in previous phylogenomic studies [35], it is necessary to conduct a genomic scale analysis. Here, we performed genome wide study on the time-calibrated phylogenomics of the virus. We analyzed 37 available public genome sequences that sampled during a time period of 68 years (1945–2012). The objectives of this study are (1) to analyze the characteristics of CSFV genome sequences; (2) to reconstruct the genome wide phylogeny of the global CSFVs using two different analyses methods, Bayesian inference (BI) and maximum likelihood (ML); and (3) to elucidate the evolutionary mechanisms such as selection pressure, substitution rate, divergence times, and effective population size changes using Bayesian coalescent approach.

Materials and Methods

Sequence and selection pressure analyses

We used the published nucleotide and amino acid sequences of 37 CSFV isolates from NCBI, discovered around the world from 1945 to 2012. These isolates consisted of 20 Asian isolates (China, 14; South Korea, 2; Taiwan, 3; Japan, 1), 16 European isolates (Germany, 7; France, 3; Italy, 1; Lithuania, 1; Spain, 1; Denmark, 1; Bulgaria, 1; Croatia, 1), and 1 American isolate (USA, 1). Each isolate belongs to one of the following subgenotypes: 1.1, 1.2, 2.1, 2.3, and 3.4. The information of 37 samples is shown in Table 1.
Table 1

Information of the isolates used in this study.

IsolateSubgenotypeCountryYearAccession no.
CSFV-GZ-20091.1China2009HQ380231
CF1141.1China2001AF333000
Shimen1.1China1945AF092448
SWH1.1China2004DQ127910
C-ZJ-20081.1China2008HM175885
JL1(06)1.1China2006EU497410
LOM1.1Japan1980EU789580
Eystrup1.1Germany1964NC_002657
Alfort/1871.1France1968X87939
CAP1.1France1978X96550
Brescia1.1Italy1945AF091661
Glentorf1.1Denmark1968U45478
BRESCIAX1.2USA2001AY578687
YC11WB2.1South Korea2011KC149990
PC11WB2.1South Korea2011KC149991
HNLY-20112.1China2011JX262391
Heb520102.1China2010JQ268754
HNSD-20122.1China2012JX218094
SXCDK2.1China2009GQ923951
SXYL20062.1China2006GQ122383
GXWZ022.1China2002AY367767
HEBZ2.1China2009GU592790
Zj08012.1China2008FJ529205
96TD2.1Taiwan1996AY554397
0406/CH/01/TWN2.1Taiwan2001AY568569
Paderborn2.1Germany1997GQ902941
CSFV/2.1/dp/CSF1048/2009/LT/Penevezys2.1Lithuania2009HQ148063
CSFV/2.3/wb/XXX0609/2004/Uelzen2.3Germany2004GU324242
CSFV/2.3/wb/CSF1045/2009/Roesrath2.3Germany2009GU233734
CSFV/2.3/wb/XXX0608/2005/Euskirchen2.3Germany2005GU233732
CSFV/2.3/wb/CSF1046/2009/Hennef2.3Germany2009GU233733
CSFV/2.3/dp/CSF857/2006/Borken2.3Germany2006GU233731
Alfort/T2.3France1980J04358
Sp012.3Spain2001FJ265020
CSFV/2.3/dp/CSF864/2007/BG/Jambul2.3Bulgaria2007HQ148062
CSFV/2.3/dp/CSF0821/2002/HR/Novska2.3Croatia2002HQ148061
94.4/IL/94/TWN3.4Taiwan1994AY646427
Both nucleotide and amino acid sequences of the CSFV whole genome and 14 regions (5’UTR-Npro-C-Erns-E1-E2-p7-NS2-NS3-NS4A-NS4B-NS5A-NS5B-3’UTR) were aligned with MAFFT 7 [36]. The following values were calculated from aligned sequences using Tree puzzle 5.2 [37] and BIOEDIT 7.2.3.0 [37, 38]: total sites (including gaps), conserved sites, and average identities. During the process, we depicted variable site plots for nucleotide and amino acid sequence. Results of model tests using Modeltest 3.7 [39] for the genome and genomic regions are summarized in Table 2.
Table 2

The best fit evolutionary models estimated for CSFV genomic regions with Modeltest 3.7.

Genomic regionModel-lnLBase frequencies (A,C,G)Substitution matrix (A-C, A-G, A-T, C-G, C-T)PinvarNst
Whole genomeGTR+I+G66705.170.31, 0.22, 0.260.99, 11.00, 1.90, 0.38, 25.040.496
5'UTRSYM+I+G1451.45Equal0.96, 3.90, 1.20, 0.00, 9.470.536
Npro GTR+G2755.610.30, 0.23, 0.273.67, 26.01, 3.40, 0.72, 62.760.006
CGTR+I1595.190.38, 0.18, 0.271.42, 9.28, 1.63, 0.16, 26.640.556
Erns GTR+I+G3562.380.32, 0.22, 0.271.90, 13.50, 2.24, 0.46, 38.940.456
E1GTR+I3163.490.29, 0.23, 0.251.12, 17.99, 2.95, 0.56, 27.720.566
E2GTR+I+G6623.000.28, 0.22, 0.271.01, 9.54, 1.59, 0.33, 19.160.506
p7TrN+G1072.380.30, 0.19, 0.231.00, 12.65, 1.00, 1.00, 22.660.006
NS2GTR+I+G7931.160.30, 0.21, 0.250.85, 9.80, 1.42, 0.29, 18.590.426
NS3GTR+I+G9710.910.31, 0.22, 0.270.91, 12.27, 2.36, 0.34, 29.500.596
NS4ATIM+I947.920.34, 0.21, 0.231.00, 57.99, 5.13, 5.13, 111.190.556
NS4BGTR+I+G5770.450.30, 0.22, 0.271.23, 13.30, 2.18, 0.29, 28.530.546
NS5AGTR+I+G8511.480.32, 0.22, 0.260.90, 9.71, 1.71, 0.41, 21.260.426
NS5BGTR+I+G11343.280.33, 0.21, 0.260.96, 11.95, 2.35, 0.57, 35.590.496
3'UTRTVM+G1190.080.31, 0.21, 0.160.24, 5.22, 1.04, 0.60, 5.220.006
To evaluate selective pressure acting on CSFVs, the relative rates of nonsynonymous and synonymous substitution (ω = dN/dS) across coding region of the viral genome were also estimated using ClustalX 1.81 [40] PAL2NAL [41], and CodeML of PAML 4.7 package [42]. A dN/dS ratio of < 1 indicated purifying selection, dN/dS = 1 suggested an absence of selection (i.e., neutral evolution), and dN/dS > 1 indicated positive selection.

Phylogenomic tree reconstruction

Phylogenomic trees were reconstructed using two different analytical methods—BI and ML. The best-fit model for the CSFV whole genome was determined using Akaike’s information criterion (AIC) within Modeltest 3.7 [39]. BI analysis using MrBayes 3.1.2 [43] was performed with following options: nst = 6, rates = invgamma, number of generation = 20,000,000, and burn-in = 20,000. Bayesian posterior probability (BPP) values were shown on the BI tree [44]. ML analysis using Phyml 3.0 [45] was also conducted with the following parameters: model of nucleotide substitution = GTR, replicates = 500, pinvar = estimated, and number of substitution rate categories = 6. All trees were visualized in Figtree 1.4 [46]. In order to screen for congruent tree topologies with genome tree topologies, all 14 genomic regions were analyzed in the same methods as genome data. The best-fit models for each region used in analysis are summarized in Table 2.

Estimation of substitution rate, divergence times, and population size changes

Using BEAST 1.7.4 [47], the mean rate of nucleotide substitution, time of the most recent common ancestor (tMRCA), and change in effective population size of the CSFV were estimated with the result of the model test. We combined three molecular clock models (strict, relaxed uncorrelated exponential, and relaxed uncorrelated log-normal) with five demographic models (constant size, exponential growth, expansion growth, logistic growth, and Bayesian skyline) to make 15 datasets for simulation. Next, each dataset was simulated with the following options: generation = 400,000,000, burn-in of 10%, and ESSs > 100. By comparing the highest Bayesian factors (log10 Bayesian factor > 2) which were based on the relative marginal likelihood of 15 models, the relaxed uncorrelated exponential clock and expansion growth population model was selected as the best-fit evolutionary model. The resulting convergence was analyzed using Tracer 1.5 [48]. Trees were summarized as maximum clade credibility (MCC) tree using the TreeAnnotator 1.7.4 [49] and visualized using Figtree 1.4 [46]. For each tree node, estimated divergence times and 95% highest posterior density (HPD) intervals, which summarize the statistical uncertainties, were indicated. The change of effective population size was plotted using Bayesian skyline plot (BSP) analyses [50].

Results

Features of the entire genome and 14 individual regions of the 37 CSFVs are summarized in Table 3. The whole genome alignment (including insertions) was 12,301 bps in length, and revealed relatively low similarities; 7,645 (62.1%) of the nucleotide sites were conserved. The differences in nucleotide and amino acid sequence at each site of the CSFV genome alignment are shown in Fig. 1. Both nucleotide and amino acid variations were evenly distributed throughout the genomes, though higher amino acid similarities were observed in three regions (NS3, NS4A, and NS4B). Pairwise comparisons also revealed that the average identities among the complete genome sequences were 89.3% for nucleotide sequences. Of the 14 individual regions, E2 was the most variable (average sequence identities of 87.8 and 92.3% for nucleotides and amino acids, respectively), while 5’ UTR was the most conserved (average sequence identities of 94.2% for nucleotides).
Table 3

Summary of alignment of CSFV genomic regions.

Genomic regionTotal sites including gaps, nt/aaConserved sites(%), nt/aaAverage identities(%), nt/aaω value (dN/dS)
Whole genome12301/NA7645(62.1%)/NA89.3/NANA
ORF11694/38987224(61.8%)/2949(75.7%)89.1/94.70.067
5'UTR374/NA287(76.7%)/NA94.2/NANA
Npro 504/168301(59.7%)/117(69.6%)89.6/93.20.137
C297/99183(61.6%)/69(69.7%)88.7/93.70.093
Erns 681/227423(62.1%)/167(73.6%)89.4/93.70.081
E1585/195360(61.5%)/153(78.5%)88.7/95.30.069
E21119/373654(58.4%)/261(70.0%)87.8/92.30.104
p7210/70126(60.0%)/51(72.9%)88.1/94.20.071
NS21371/457803(58.6%)/318(69.6%)88.0/92.70.089
NS32049/6831400(68.3%)/609(89.2%)90.7/98.60.020
NS4A192/64120(62.5%)/55(85.9%)89.6/98.30.032
NS4B1041/347653(62.7%)/280(80.7%)89.3/96.60.049
NS5A1491/497858(57.5%)/332(66.8%)88.6/92.10.094
NS5B2154/7181343(62.3%)/537(74.8%)89.4/95.00.065
3'UTR230/NA133(57.8%)/NA89.2/NANA
Fig 1

Variable site plots for nucleotide (a) and amino acid (b) sequence of 37 CSFVs.

The number of differences at each site represents the number of variable isolates estimated with multiple sequence alignment. Each color indicates a different genomic region.

Variable site plots for nucleotide (a) and amino acid (b) sequence of 37 CSFVs.

The number of differences at each site represents the number of variable isolates estimated with multiple sequence alignment. Each color indicates a different genomic region. The mean ratio of nonsynonymous/synonymous substitution (dN/dS) derived from entire data sets was calculated and indicated that purifying selection acted on the CSFV genome sequences (Table 3, Fig. 2). The dN/dS value of entire coding sequences was 0.067 and all values for each component gene were lower than 1. Particularly, the highest dN/dS ratio was observed in Npro (0.137), while the lowest one was shown in NS3 (0.020).
Fig 2

Pairwise dN/dS (ω) values of the protein coding gene sequences of 37 CSFVs.

37 CSFV isolates were classified into one of five subgenotypes (1.1, 1.2, 2.1, 2.3, and 3.4). BI and ML methods presented similar configurations regarding the phylogenomics of CSFV, and also supported the topology of the maximum clade credibility (MCC) tree (Fig. 3). The phylogenomic tree revealed two sister-group relationships, subgenotype 1.1 + 1.2 + 3.4 and subgenotype 2.1 + 2.3, with a branching pattern of [{3.4, (1.1, 1.2)}, (2.1, 2.3)]. Subgenotype 1.1 consisted of 11 CSFVs collected from Asia (China and Japan) and Europe (Denmark, France, Germany, and Italy) during 1945–2009; those 11 viruses were divided into 3 groups and 1 unclassified isolate. Subgenotype 2.1 was grouped with 9 isolates from Asia (South Korea, China, and Taiwan) and Europe (Germany and Lithuania) during 1996–2012; individuals were clustered to 4 groups. On the other hand, all viruses in subgenotype 2.3 originated from European countries (France, Germany, Spain, Croatia, and Bulgaria) between 1980 and 2009; individuals were categorized to 2 groups and 1 unclassified isolate. Subgenotype 1.2 and 3.4 had only one member derived from USA in 2001 and Taiwan in 1994.
Fig 3

Bayesian maximum clade credibility phylogenetic tree based on the whole genome sequences of 37 CSFVs.

With the BI and ML methods, identical topology was produced. Divergence times (in years) are positioned below the nodes; the 95% HPD intervals are indicated in brackets. The confidence of the phylogenetic analysis is presented above the nodes: left numbers represent Bayesian posterior probabilities (≥ 0.80) and right ones represent ML bootstrap values (≥ 60%). Subgenotypes and groups are indicated above the corresponding nodes using squares and circles.

Bayesian maximum clade credibility phylogenetic tree based on the whole genome sequences of 37 CSFVs.

With the BI and ML methods, identical topology was produced. Divergence times (in years) are positioned below the nodes; the 95% HPD intervals are indicated in brackets. The confidence of the phylogenetic analysis is presented above the nodes: left numbers represent Bayesian posterior probabilities (≥ 0.80) and right ones represent ML bootstrap values (≥ 60%). Subgenotypes and groups are indicated above the corresponding nodes using squares and circles. To identify the most significant phylogenetic marker of CSFV among the 14 regions, phylogenetic trees for the 14 regions were reconstructed based on both BI and ML methods, and these individual trees were compared to the whole genome trees. NS4B gene trees were the most similar to the genomic tree, thus, NS4B gene was chosen as the most significant phylogenetic marker for CSFV (Fig. 4). The overall tree topologies of E2 gene, which was used as the marker of CSFV phylogeny, were very different from those of genome tree (S1 and S2 Figs.); most groupings were disrupted in the E2 trees.
Fig 4

The BI tree based on NS4B sequences of 37 CSFVs.

The robustness of the phylogenetic analysis is showed above the nodes: left numbers are Bayesian posterior probabilities (≥0.80) and right ones are ML bootstrap values (≥60%). Subgenotypes and groups are marked above the corresponding nodes using squares and circles.

The BI tree based on NS4B sequences of 37 CSFVs.

The robustness of the phylogenetic analysis is showed above the nodes: left numbers are Bayesian posterior probabilities (≥0.80) and right ones are ML bootstrap values (≥60%). Subgenotypes and groups are marked above the corresponding nodes using squares and circles.

Substitution rate, divergence times, and population size changes

Results of analyses performed by BEAST 1.7.4 revealed that the relaxed uncorrelated exponential clock and expansion growth population model was the best-fit for simulating evolution of CSFV genome. Under this model, evolutionary rate was estimated as 1.03×10-4 substitution/site/year, and the tMRCA of 37 CSFVs was estimated as 2770.2 years ago (95% HPD 223.5–8611.6). According to the MCC tree, subgenotype 3.4 appeared first 2203.0 (95% HPD = 612.8–4342.1) years ago, followed by sequential divergence of subgenotype 1.2 (1255.1 years ago; 95% HPD = 106.6–2157.1), 2.3 (663.6 years ago; 95% HPD = 37.1–1456.1), 2.1 (617.8 years ago; 95% HPD = 63.9–2428.4), and 1.1 (545.8 years ago; 95% HPD = 92.1–1269.7) (Fig. 3). In BSP, the effective population size of CSFV slowly increased until the 1950s, after which it plateaued (Fig. 5).
Fig 5

Bayesian skyline plot based on the entire genome sequences of 37 CSFV isolate.

The plot depicts changes in the effective population size (N ). The dark line shows the effective population size estimated through time. The upper and lower lines indicate the 95% HPD range of BSP.

Bayesian skyline plot based on the entire genome sequences of 37 CSFV isolate.

The plot depicts changes in the effective population size (N ). The dark line shows the effective population size estimated through time. The upper and lower lines indicate the 95% HPD range of BSP.

Discussion

Our research aimed to characterize the CSFV genome and elucidate its evolutionary features using genomic data. Our results revealed a high degree of genetic diversity between the 37 CSFV genome sequences. This feature confirmed the views of Domingo et al. [51] who proposed that RNA viruses have the high level of mutation rates of 10-3 to 10-5 nucleotide substitutions per site per replication cycle due to their inaccurate RNA replication. Of the 14 genomic regions, our analyses showed that E2 was significantly (p-value < 2.2 × 10-16) variable, while 5’UTR was the most conserved. The E2 protein, a glycoprotein formed on the virus membrane, has been suggested as a virulence determinant [52, 53]. In CSFV, it acts as an immunogenic protein which induces the neutralizing antibodies and protection reactions of the host [54]. 5’UTR of CSFV contains an internal ribosome entry site (IRES) where translation initiation occurs, which is indispensable for the virus [55, 56]. Domains in IRES tend to be highly conserved between several viruses in genus Pestivirus, including CSFV [55, 57]. These two regions (E2 and 5’UTR) have been considered as the most important targets for CSFV studies such as PCR detection and immunology as well as evolutionary phylogenetics. Because the sequence dissimilarity may affect PCR diagnosis and vaccination efficiency as well as investigations on immunology, epidemiology, and phylogenetic evolution, it is necessary to continuously monitor the genome sequence variability. In agreement with the interpretations of Ji et al. [33], our results on the basis of complete coding genome sequences were also indicative of purifying selection acting on the CSF viruses; the mean ratio of nonsynonymous/synonymous substitution (ω = dN/dS) values for each genomic region were low in all cases (all, ω < 1). Of those, the Npro presented the highest dN/dS values, while the NS3 indicated the lowest ones. Next, the phylogeny of CSFV was studied to provide further information regarding epidemiology and evolution. Despite of the importance of CSFV to the pig industry and recent rapid increase of CSFV data, most phylogenetic studies describing the virus have focused only on limited genome region sequences such as E2 [29, 30, 58, 59], 5’UTR [25, 26, 29, 30], and NS5B [31, 60]. However, expanding information of genome-wide phylogeny has potential to improve our knowledge on its emergence and transmission. Our phylogenomic analysis of 37 CSFV genome sequences revealed that no apparent correlation between time and/or country and the evolution of the virus. Within the subgenotype 1.1, the isolates appeared in samples taken in four European countries (Denmark, France, Germany, and Italy) and two Asian countries (China and Japan) during 1945–2009. Subgenotype 2.1 viruses originated from six countries (South Korea, China, Taiwan, Germany, and Lithuania) during 1996–2012. Moreover, isolates of subgenotype 2.3 were found in five countries (France, Germany, Spain, Croatia, and Bulgaria) during 1980–2009. These configurations were also in concordance with the views of other authors [23, 35] and may be largely due to their rapid spread via the frequent international trade in livestock. The mixed population structure can make vaccine development and local regulation more difficult. Thus, it is essential to continuously monitor the structural changes of the mixed population. In order to trace the most appropriate phylogenetic marker, we compared 14 individual gene trees with complete genome trees. Although E2, NS5B, and 5’UTR are generally considered suitable for elucidating the CSFV phylogeny, our findings postulated that topologies of NS4B trees were the most similar to those of the complete genome trees, rather than E2, NS5B, and 5’UTR. Thus, we suggest that NS4B has the strongest signal to infer the genetic relationships of these viruses. Among three frequently used markers, topologies of E2 tree was more apparent than those of the other two regions. Given that research has shown that the complete E2 sequence is suitable for phylogenetic analysis of CSFV [61], comparison of topologies between NS4B and E2 full-length sequences can support the discriminatory power of NS4B as a phylogenetic marker (S1 and S2 Figs.). Both of those regions are regarded as significant determinant in viral activity. That is, E2 is related to viral entry to the target cell [62], E2 has been known as a virulence determinant of CSFV, and as the most immunogenic factor among the viral protein [63, 64]. The nonstructural protein translated from NS4B was reported to contain Toll/Interleukin-1 receptor (TIR) domain in Brescia strain which is considerably virulent, and the viral replication of Brescia strain was shown to be decreased by mutation in that domain [65]. Additionally, after artificial re-injection of the GPE- vaccine, amino acid sequence changes in NS4B contributed to the recovery of virulence of the virus [66]. Thus, NS4B was considered to be essential in viral replication and to have interaction with immune system, and mutations in NS4B significantly affected the virulence of CSFV [65]. Finally, we attempted to elucidate the evolutionary mechanisms and features of CSFV including estimation of evolutionary rate, divergence times, and population size changes. Regarding the evolutionary dynamics of CSFV, there was only one previous study [34]. They utilized only three genes (5’UTR, Npro, and E2) sequences of 35 pestiviruses including six CSFVs, and reported that the most recent common ancestor of CSFV existed 1825 years ago; no analysis of substitution rate and population size changes was performed. In contrast to this small number of genes analyzed from a limited selection of representative viruses, we analyzed the complete genome sequences with an available year of isolation in order to co-estimate an overall substitution rate, population size changes as well as divergence times for CSFVs. The mean evolutionary rate estimated in the present study was 1.03×10-4 (95% HPD 2.03×–2.61×10-4) substitutions/site/year; that calculation was within the range of 10-2 to 10-5 nucleotide substitutions/site/year for nearly all RNA viruses [67]. However, this value was lower than the results from previous studies based on E2 sequences; the mean evolutionary rates were calculated as 3.3×10-3 and 2.41×10-3 substitutions/site/year in two previous studies [18, 62]. Because E2 was relatively variable and less conserved compared to the whole genome, the mean rate of substitution of the whole genome could be lower than that of E2. The fast evolutionary rates of RNA viruses including CSFV are affected by a combination of forces such as lack of proof-reading, small genome size, short generation times, and the extent of natural selection [68]. As a result, it is possible that RNA virus raises viral population adaptation, survival, and fitness, allowing them to rapidly spread to new hosts and novel environments [69]. The tMRCA of CSFV was 2770.2 (95% HPD 223.5–8611.6) years ago which was about 8000 years later than the domestication of wild boar [70, 71]. In terms of the effective population size changes of CSFVs, our bayesian skyline plot (BSP) analysis depicted that a population increase until the middle of 1990s, after which population size appears to have evolved at a near constant rate population size until the present. The trend of effective population size can be attributed to the introduction of vaccine to prevent further global CSF outbreaks [72]. This plateau after 1950s also can be explained by huge slaughter of pigs during global outbreak of porcine reproductive and respiratory syndrome (PRRS) and swine influenza (SI) [73-75]. The present study is the first of its kind to use the complete CSFV genome to investigate the temporal dynamics of the virus. CSFV is still one of the most acute pathogens in the global swine industry. Accordingly, global strategies are essential for prevention and control of this virus. Results of the present study expand the limited information available on CSFV evolutionary dynamics, which may be crucial for the control of this virus, as well as improve our knowledge of its epidemiology and evolution.

The BI tree on the basis of E2 sequences of 37 CSFVs.

Bayesian posterior probabilities above 0.80 are shown on the nodes. (TIFF) Click here for additional data file.

The ML tree derived from E2 sequences of 37 CSFVs.

ML bootstrap values above 60% are presented on the nodes. (TIFF) Click here for additional data file.
  66 in total

1.  The classical swine fever epidemic 1997-1998 in The Netherlands: descriptive epidemiology.

Authors:  A R Elber; A Stegeman; H Moser; H M Ekker; J A Smak; F H Pluimers
Journal:  Prev Vet Med       Date:  1999-12-01       Impact factor: 2.670

2.  Localization of pestiviral envelope proteins E(rns) and E2 at the cell surface and on isolated particles.

Authors:  F Weiland; E Weiland; G Unger; A Saalm ºller; H J Thiel
Journal:  J Gen Virol       Date:  1999-05       Impact factor: 3.891

3.  Classical swine fever (CSF): a historical review of research and vaccine production on the Isle of Riems.

Authors:  V Kaden; B Riebe
Journal:  Berl Munch Tierarztl Wochenschr       Date:  2001 Jul-Aug       Impact factor: 0.328

4.  Experimental non-transmissible marker vaccines for classical swine fever (CSF) by trans-complementation of E(rns) or E2 of CSFV.

Authors:  H G P van Gennip; A Bouma; P A van Rijn; M N Widjojoatmodjo; R J M Moormann
Journal:  Vaccine       Date:  2002-02-22       Impact factor: 3.641

5.  Selection of classical swine fever virus with enhanced pathogenicity reveals synergistic virulence determinants in E2 and NS4B.

Authors:  Tomokazu Tamura; Yoshihiro Sakoda; Fumi Yoshino; Takushi Nomura; Naoki Yamamoto; Yuka Sato; Masatoshi Okamatsu; Nicolas Ruggli; Hiroshi Kida
Journal:  J Virol       Date:  2012-06-06       Impact factor: 5.103

6.  Origins and evolutionary genomics of the 2009 swine-origin H1N1 influenza A epidemic.

Authors:  Gavin J D Smith; Dhanasekaran Vijaykrishna; Justin Bahl; Samantha J Lycett; Michael Worobey; Oliver G Pybus; Siu Kit Ma; Chung Lam Cheung; Jayna Raghwani; Samir Bhatt; J S Malik Peiris; Yi Guan; Andrew Rambaut
Journal:  Nature       Date:  2009-06-25       Impact factor: 49.962

7.  Classical swine fever virus isolates from Cuba form a new subgenotype 1.4.

Authors:  Alexander Postel; Stefanie Schmeiser; Carmen Laura Perera; Lester Josué Pérez Rodríguez; Maria Teresa Frias-Lepoureau; Paul Becher
Journal:  Vet Microbiol       Date:  2012-08-03       Impact factor: 3.293

8.  Complete sequence of a subgroup 3.4 strain of classical swine fever virus from Taiwan.

Authors:  Yu-Ju Lin; Maw-Sheng Chien; Ming-Chung Deng; Chin-Cheng Huang
Journal:  Virus Genes       Date:  2007-08-25       Impact factor: 2.332

9.  Genetic diversity of the envelope glycoprotein E2 of classical swine fever virus: recent isolates branched away from historical and vaccine strains.

Authors:  Ning Chen; Hongxia Hu; Zhanfeng Zhang; Jiangbing Shuai; Lingli Jiang; Weihuan Fang
Journal:  Vet Microbiol       Date:  2007-09-22       Impact factor: 3.293

10.  PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments.

Authors:  Mikita Suyama; David Torrents; Peer Bork
Journal:  Nucleic Acids Res       Date:  2006-07-01       Impact factor: 16.971

View more
  5 in total

1.  Phylodynamics of classical swine fever virus in Brazil.

Authors:  Antônio Augusto Fonseca Júnior; Mateus Laguardia-Nascimento; Aline Aparecida Silva Barbosa; Valdenia Lopes da Silva Gonçalves; Tânia Rosária Pereira Freitas; Anselmo Vasconcelos Rivetti Júnior; Marcelo Fernandes Camargos
Journal:  Braz J Microbiol       Date:  2022-04-08       Impact factor: 2.214

2.  Deciphering the emergence, genetic diversity and evolution of classical swine fever virus.

Authors:  Liliam Rios; Liani Coronado; Dany Naranjo-Feliciano; Orlando Martínez-Pérez; Carmen L Perera; Lilian Hernandez-Alvarez; Heidy Díaz de Arce; José I Núñez; Llilianne Ganges; Lester J Pérez
Journal:  Sci Rep       Date:  2017-12-20       Impact factor: 4.379

3.  Evaluation of classical swine fever E2 (CSF-E2) subunit vaccine efficacy in the prevention of virus transmission and impact of maternal derived antibody interference in field farm applications.

Authors:  Jing-Yuan Chen; Chi-Ming Wu; Zeng-Weng Chen; Chih-Ming Liao; Ming-Chung Deng; Min-Yuan Chia; Chienjin Huang; Maw-Sheng Chien
Journal:  Porcine Health Manag       Date:  2021-01-11

4.  Time-calibrated phylogenomics of the porcine epidemic diarrhea virus: genome-wide insights into the spatio-temporal dynamics.

Authors:  Jisung Jang; Sook Hee Yoon; Wonseok Lee; Jihyun Yu; Joon Yoon; Seunghyun Shim; Heebal Kim
Journal:  Genes Genomics       Date:  2018-04-21       Impact factor: 1.839

5.  Atypical Porcine Pestivirus Circulation and Molecular Evolution within an Affected Swine Herd.

Authors:  Alba Folgueiras-González; Robin van den Braak; Bartjan Simmelink; Martin Deijs; Lia van der Hoek; Ad de Groof
Journal:  Viruses       Date:  2020-09-25       Impact factor: 5.048

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.