| Literature DB >> 25815768 |
Taehyung Kwon1, Sook Hee Yoon1, Kyu-Won Kim2, Kelsey Caetano-Anolles3, Seoae Cho4, Heebal Kim1.
Abstract
The phylogeny of classical swine fever virus (CSFV), the causative agent of classical swine fever (CSF), has been investigated extensively. However, no evolutionary research has been performed using the whole CSFV genome. In this study, we used 37 published genome sequences to investigate the time-calibrated phylogenomics of CSFV. In phylogenomic trees based on Bayesian inference (BI) and Maximum likelihood (ML), the 37 isolates were categorized into five genetic types (1.1, 1.2, 2.1, 2.3, and 3.4). Subgenotype 1.1 is divided into 3 groups and 1 unclassified isolate, 2.1 into 4 groups, 2.3 into 2 groups and 1 unclassified isolate, and subgenotype 1.2 and 3.4 consisted of one isolate each. We did not observe an apparent temporal or geographical relationship between isolates. Of the 14 genomic regions, NS4B showed the most powerful phylogenetic signal. Results of this evolutionary study using Bayesian coalescent approach indicate that CSFV has evolved at a rate of 13×.010-4 substitutions per site per year. The most recent common ancestor of CSFV appeared 2770.2 years ago, which was about 8000 years after pig domestication. The effective population size of CSFV underwent a slow increase until the 1950s, after which it has remained constant.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25815768 PMCID: PMC4376735 DOI: 10.1371/journal.pone.0121578
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Information of the isolates used in this study.
| Isolate | Subgenotype | Country | Year | Accession no. |
|---|---|---|---|---|
| CSFV-GZ-2009 | 1.1 | China | 2009 | HQ380231 |
| CF114 | 1.1 | China | 2001 | AF333000 |
| Shimen | 1.1 | China | 1945 | AF092448 |
| SWH | 1.1 | China | 2004 | DQ127910 |
| C-ZJ-2008 | 1.1 | China | 2008 | HM175885 |
| JL1(06) | 1.1 | China | 2006 | EU497410 |
| LOM | 1.1 | Japan | 1980 | EU789580 |
| Eystrup | 1.1 | Germany | 1964 | NC_002657 |
| Alfort/187 | 1.1 | France | 1968 | X87939 |
| CAP | 1.1 | France | 1978 | X96550 |
| Brescia | 1.1 | Italy | 1945 | AF091661 |
| Glentorf | 1.1 | Denmark | 1968 | U45478 |
| BRESCIAX | 1.2 | USA | 2001 | AY578687 |
| YC11WB | 2.1 | South Korea | 2011 | KC149990 |
| PC11WB | 2.1 | South Korea | 2011 | KC149991 |
| HNLY-2011 | 2.1 | China | 2011 | JX262391 |
| Heb52010 | 2.1 | China | 2010 | JQ268754 |
| HNSD-2012 | 2.1 | China | 2012 | JX218094 |
| SXCDK | 2.1 | China | 2009 | GQ923951 |
| SXYL2006 | 2.1 | China | 2006 | GQ122383 |
| GXWZ02 | 2.1 | China | 2002 | AY367767 |
| HEBZ | 2.1 | China | 2009 | GU592790 |
| Zj0801 | 2.1 | China | 2008 | FJ529205 |
| 96TD | 2.1 | Taiwan | 1996 | AY554397 |
| 0406/CH/01/TWN | 2.1 | Taiwan | 2001 | AY568569 |
| Paderborn | 2.1 | Germany | 1997 | GQ902941 |
| CSFV/2.1/dp/CSF1048/2009/LT/Penevezys | 2.1 | Lithuania | 2009 | HQ148063 |
| CSFV/2.3/wb/XXX0609/2004/Uelzen | 2.3 | Germany | 2004 | GU324242 |
| CSFV/2.3/wb/CSF1045/2009/Roesrath | 2.3 | Germany | 2009 | GU233734 |
| CSFV/2.3/wb/XXX0608/2005/Euskirchen | 2.3 | Germany | 2005 | GU233732 |
| CSFV/2.3/wb/CSF1046/2009/Hennef | 2.3 | Germany | 2009 | GU233733 |
| CSFV/2.3/dp/CSF857/2006/Borken | 2.3 | Germany | 2006 | GU233731 |
| Alfort/T | 2.3 | France | 1980 | J04358 |
| Sp01 | 2.3 | Spain | 2001 | FJ265020 |
| CSFV/2.3/dp/CSF864/2007/BG/Jambul | 2.3 | Bulgaria | 2007 | HQ148062 |
| CSFV/2.3/dp/CSF0821/2002/HR/Novska | 2.3 | Croatia | 2002 | HQ148061 |
| 94.4/IL/94/TWN | 3.4 | Taiwan | 1994 | AY646427 |
The best fit evolutionary models estimated for CSFV genomic regions with Modeltest 3.7.
| Genomic region | Model | -lnL | Base frequencies (A,C,G) | Substitution matrix (A-C, A-G, A-T, C-G, C-T) | Pinvar | Nst |
|---|---|---|---|---|---|---|
| Whole genome | GTR+I+G | 66705.17 | 0.31, 0.22, 0.26 | 0.99, 11.00, 1.90, 0.38, 25.04 | 0.49 | 6 |
| 5'UTR | SYM+I+G | 1451.45 | Equal | 0.96, 3.90, 1.20, 0.00, 9.47 | 0.53 | 6 |
| Npro | GTR+G | 2755.61 | 0.30, 0.23, 0.27 | 3.67, 26.01, 3.40, 0.72, 62.76 | 0.00 | 6 |
| C | GTR+I | 1595.19 | 0.38, 0.18, 0.27 | 1.42, 9.28, 1.63, 0.16, 26.64 | 0.55 | 6 |
| Erns | GTR+I+G | 3562.38 | 0.32, 0.22, 0.27 | 1.90, 13.50, 2.24, 0.46, 38.94 | 0.45 | 6 |
| E1 | GTR+I | 3163.49 | 0.29, 0.23, 0.25 | 1.12, 17.99, 2.95, 0.56, 27.72 | 0.56 | 6 |
| E2 | GTR+I+G | 6623.00 | 0.28, 0.22, 0.27 | 1.01, 9.54, 1.59, 0.33, 19.16 | 0.50 | 6 |
| p7 | TrN+G | 1072.38 | 0.30, 0.19, 0.23 | 1.00, 12.65, 1.00, 1.00, 22.66 | 0.00 | 6 |
| NS2 | GTR+I+G | 7931.16 | 0.30, 0.21, 0.25 | 0.85, 9.80, 1.42, 0.29, 18.59 | 0.42 | 6 |
| NS3 | GTR+I+G | 9710.91 | 0.31, 0.22, 0.27 | 0.91, 12.27, 2.36, 0.34, 29.50 | 0.59 | 6 |
| NS4A | TIM+I | 947.92 | 0.34, 0.21, 0.23 | 1.00, 57.99, 5.13, 5.13, 111.19 | 0.55 | 6 |
| NS4B | GTR+I+G | 5770.45 | 0.30, 0.22, 0.27 | 1.23, 13.30, 2.18, 0.29, 28.53 | 0.54 | 6 |
| NS5A | GTR+I+G | 8511.48 | 0.32, 0.22, 0.26 | 0.90, 9.71, 1.71, 0.41, 21.26 | 0.42 | 6 |
| NS5B | GTR+I+G | 11343.28 | 0.33, 0.21, 0.26 | 0.96, 11.95, 2.35, 0.57, 35.59 | 0.49 | 6 |
| 3'UTR | TVM+G | 1190.08 | 0.31, 0.21, 0.16 | 0.24, 5.22, 1.04, 0.60, 5.22 | 0.00 | 6 |
Summary of alignment of CSFV genomic regions.
| Genomic region | Total sites including gaps, nt/aa | Conserved sites(%), nt/aa | Average identities(%), nt/aa | ω value (dN/dS) |
|---|---|---|---|---|
| Whole genome | 12301/NA | 7645(62.1%)/NA | 89.3/NA | NA |
| ORF | 11694/3898 | 7224(61.8%)/2949(75.7%) | 89.1/94.7 | 0.067 |
| 5'UTR | 374/NA | 287(76.7%)/NA | 94.2/NA | NA |
| Npro | 504/168 | 301(59.7%)/117(69.6%) | 89.6/93.2 | 0.137 |
| C | 297/99 | 183(61.6%)/69(69.7%) | 88.7/93.7 | 0.093 |
| Erns | 681/227 | 423(62.1%)/167(73.6%) | 89.4/93.7 | 0.081 |
| E1 | 585/195 | 360(61.5%)/153(78.5%) | 88.7/95.3 | 0.069 |
| E2 | 1119/373 | 654(58.4%)/261(70.0%) | 87.8/92.3 | 0.104 |
| p7 | 210/70 | 126(60.0%)/51(72.9%) | 88.1/94.2 | 0.071 |
| NS2 | 1371/457 | 803(58.6%)/318(69.6%) | 88.0/92.7 | 0.089 |
| NS3 | 2049/683 | 1400(68.3%)/609(89.2%) | 90.7/98.6 | 0.020 |
| NS4A | 192/64 | 120(62.5%)/55(85.9%) | 89.6/98.3 | 0.032 |
| NS4B | 1041/347 | 653(62.7%)/280(80.7%) | 89.3/96.6 | 0.049 |
| NS5A | 1491/497 | 858(57.5%)/332(66.8%) | 88.6/92.1 | 0.094 |
| NS5B | 2154/718 | 1343(62.3%)/537(74.8%) | 89.4/95.0 | 0.065 |
| 3'UTR | 230/NA | 133(57.8%)/NA | 89.2/NA | NA |
Fig 1Variable site plots for nucleotide (a) and amino acid (b) sequence of 37 CSFVs.
The number of differences at each site represents the number of variable isolates estimated with multiple sequence alignment. Each color indicates a different genomic region.
Fig 2Pairwise dN/dS (ω) values of the protein coding gene sequences of 37 CSFVs.
Fig 3Bayesian maximum clade credibility phylogenetic tree based on the whole genome sequences of 37 CSFVs.
With the BI and ML methods, identical topology was produced. Divergence times (in years) are positioned below the nodes; the 95% HPD intervals are indicated in brackets. The confidence of the phylogenetic analysis is presented above the nodes: left numbers represent Bayesian posterior probabilities (≥ 0.80) and right ones represent ML bootstrap values (≥ 60%). Subgenotypes and groups are indicated above the corresponding nodes using squares and circles.
Fig 4The BI tree based on NS4B sequences of 37 CSFVs.
The robustness of the phylogenetic analysis is showed above the nodes: left numbers are Bayesian posterior probabilities (≥0.80) and right ones are ML bootstrap values (≥60%). Subgenotypes and groups are marked above the corresponding nodes using squares and circles.
Fig 5Bayesian skyline plot based on the entire genome sequences of 37 CSFV isolate.
The plot depicts changes in the effective population size (N ). The dark line shows the effective population size estimated through time. The upper and lower lines indicate the 95% HPD range of BSP.