| Literature DB >> 26375730 |
Ming-Hua Sung1, Ming-Chung Deng2, Yi-Hsuan Chung3, Yu-Liang Huang2, Chia-Yi Chang2, Yu-Ching Lan4, Hsin-Lin Chou4, Day-Yu Chao5.
Abstract
Since 2010, a new variant of PEDV belonging to Genogroup 2 has been transmitting in China and further spreading to the Unites States and other Asian countries including Taiwan. In order to characterize in detail the temporal and geographic relationships among PEDV strains, the present study systematically evaluated the evolutionary patterns and phylogenetic resolution in each gene of the whole PEDV genome in order to determine which regions provided the maximal interpretative power. The result was further applied to identify the origin of PEDV that caused the 2014 epidemic in Taiwan. Thirty-four full genome sequences were downloaded from GenBank and divided into three non-mutually exclusive groups, namely, worldwide, Genogroup 2 and China, to cover different ranges of secular and spatial trends. Each dataset was then divided into different alignments by different genes for likelihood mapping and phylogenetic analysis. Our study suggested that both nsp3 and S genes contained the highest phylogenetic signal with substitution rate and phylogenetic topology similar to those obtained from the complete genome. Furthermore, the proportion of nodes with high posterior support (posterior probability >0.8) was similar between nsp3 and S genes. The nsp3 gene sequences from three clinical samples of swine with PEDV infections were aligned with other strains available from GenBank and the results suggested that the virus responsible for the 2014 PEDV outbreak in Taiwan clustered together with Clade I from the US within Genogroup 2. In conclusion, the current study identified the nsp3 gene as an alternative marker for a rapid and unequivocal classification of the circulating PEDV strains which provides complementary information to the S gene in identifying the emergence of epidemic strain resulting from recombination.Entities:
Keywords: Evolutionary rate; Likelihood mapping; Nsp3; Phylogenetic signal; Porcine epidemic diarrhea virus; Spike; Taiwan
Mesh:
Substances:
Year: 2015 PMID: 26375730 PMCID: PMC7106162 DOI: 10.1016/j.meegid.2015.09.011
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Oligonucleotide primers used for amplifications of the PEDV nsp3 gene by PCR and sequencing.
| Primer ID | Sequence (5′ to 3′) | Position | Used to amplify fragment |
|---|---|---|---|
| Nsp3-F2 | TCCCACCGATGGTAATAGTG | 2646–2665 | PCR forward primer |
| Nsp3-R2 | TGAACAGACACAAAAACCAGAAG | 8088–8110 | PCR reverse primer |
| Nsp3-F2-1 | TTGGGTGATGTGTCGGCTTG | 3706–3725 | Sequencing primer |
| Nsp3-R2-1 | GCTTCTTACAGAACTTAGAACC | 6960–6981 | Sequencing primer |
| Nsp3-F2-2 | AGGAAGATGTTCAACAAGTTTC | 4700–4721 | Sequencing primer |
| Nsp3-R2-2 | ACACTGTAATTAAATTACGTGAC | 6176–6197 | Sequencing primer |
Positions correspond to the PEDV CV777 strain (GenBank accession no. NC_003436).
Determination of phylogenetic signal/noise by likelihood mapping analysis.
| Dataset | Gene | Length | % Noise | Constant sites (%) | Alpha |
|---|---|---|---|---|---|
| Worldwide | Nsp1 | 330 | 32.4 | 90 | 0.18 |
| Worldwide | Nsp2 | 2355 | 9.8 | 89.1 | 0.13 |
| Worldwide | Nsp3 | 4863 | 3.2 | 91.9 | 0.03 |
| Worldwide | Nsp4 | 1443 | 13.5 | 92.5 | 0.34 |
| Worldwide | 3C-like protease (nsp5) | 906 | 24.2 | 94 | 0.02 |
| Worldwide | Nsp6 | 840 | 17.4 | 91.9 | 0.38 |
| Worldwide | Nsp7 | 249 | 55.9 | 94 | 0.52 |
| Worldwide | Nsp8 | 585 | 15.1 | 93.2 | 0.4 |
| Worldwide | Nsp9 | 324 | 30.1 | 94.1 | 0.03 |
| Worldwide | Nsp10 | 405 | 47.4 | 96.8 | 0.02 |
| Worldwide | RNA-dependent RNA polymerase (nsp12) | 2781 | 10 | 93.5 | 0.02 |
| Worldwide | Helicase (nsp13) | 1557 | 12.9 | 93.8 | 0.02 |
| Worldwide | Exoribonuclease (nsp14) | 1785 | 6.1 | 93.1 | 0.02 |
| Worldwide | Uridylate-specific endoribonuclease (nsp15) | 1017 | 16.9 | 93.6 | 0.03 |
| Worldwide | Putative 2′-O-methyl transferase (nsp16) | 903 | 20.8 | 94.6 | 0.03 |
| Worldwide | S | 4180 | 4.1 | 85.7 | 0.1 |
| Worldwide | ORF3 | 675 | 16.7 | 77.9 | 0.09 |
| Worldwide | E | 231 | 26.6 | 80.5 | 3.13 |
| Worldwide | M | 681 | 27.9 | 91.1 | 0.02 |
| Worldwide | N | 1326 | 16.6 | 87 | 0.16 |
| Worldwide | 5′(nsp1–2) | 2685 | 7 | 89.2 | 0.1 |
| Worldwide | 3′OEMN | 2911 | 8.7 | 85.4 | 0.02 |
Strong phylogenetic signal (< 30% noise).
Average of constant sites among all genes is 90.6%, which is statistically significantly lower than that from Genogroup 2 dataset.
Fig. 1Correlation between nucleotide length and phylogenetic noise. Length of each gene tested for phylogenetic noise is plotted on the x-axis, and the phylogenetic noise as measured using TREE-PUZZLE program is plotted on the y-axis. Open circles indicate measurements from the worldwide dataset, filled circles indicate measurements from the Genogroup 2 dataset, and filled triangles indicate measurements from the China dataset. A linear regression line was plotted against each dataset (dashed = worldwide, solid = Genogroup 2, intermittent = China) with statistical significance (worldwide: R2 = 0.82, p < 0.001; Genogroup 2: R2 = 0.68, p < 0.001; China: R2 = 0.82, p < 0.001) by Pearson correlation test. The gray filled region at the bottom of the graph denoted phylogenetic noise < 30%.
Bayesian estimation of the molecular clock, evolutionary rate, and time to most recent common ancestor (TMRCA).
| Dataset | Gene | Clock model | Bayes factor | Evolutionary rate (substitutions/site/year) | Lower HPD | Upper HPD | TMRCA (year) | Lower HPD | Upper HPD |
|---|---|---|---|---|---|---|---|---|---|
| Worldwide (N = 34) | Complete | Strict | 1.31E-02 | 1.14E-02 | 1.48E-02 | 119.8 | 102.2 | 134.5 | |
| Relaxed | 494.9 | 1.37E-02 | 9.77E-03 | 1.80E-02 | 38.4 | 35 | 45.8 | ||
| NSP3 | Strict | 2.16E-02 | 1.81E-02 | 2.50E-02 | 60.4 | 53.5 | 67.6 | ||
| Relaxed | 146.2 | 1.63E-02 | 1.09E-02 | 2.19E-02 | 37 | 35 | 41.2 | ||
| S | Strict | 2.24E-02 | 1.89E-02 | 2.63E-02 | 91.5 | 77.7 | 106.9 | ||
| Relaxed | 288.1 | 2.28E-02 | 1.50E-02 | 3.10E-02 | 36.1 | 35 | 38.7 | ||
| E | Strict | 5.81E-02 | 4.30E-02 | 7.42E-02 | 35.7 | 35 | 37 | ||
| Relaxed | 39.9 | 2.90E-02 | 1.77E-02 | 4.14E-02 | 36 | 35 | 38.3 | ||
| Genogroup2 (N = 24) | Complete | Strict | 3.93E-02 | 3.14E-02 | 4.78E-02 | 8.8 | 7.5 | 10.1 | |
| Relaxed | 357 | 4.59E-02 | 2.66E-02 | 6.71E-02 | 3.2 | 2.3 | 4.5 | ||
| NSP3 | Strict | 5.81E-02 | 4.42E-02 | 7.38E-02 | 4.6 | 3.8 | 5.3 | ||
| Relaxed | 108.9 | 4.50E-02 | 2.56E-02 | 6.51E-02 | 2.9 | 2.2 | 3.8 | ||
| S | Strict | 6.07E-02 | 4.63E-02 | 7.78E-02 | 6.3 | 5.2 | 7.5 | ||
| Relaxed | 103.8 | 5.67E-02 | 3.24E-02 | 8.46E-02 | 3.2 | 2.3 | 4.4 | ||
| E | Strict | 1.29E-01 | 7.89E-02 | 1.85E-01 | 2.8 | 2.2 | 3.4 | ||
| Relaxed | 15.3 | 8.27E-02 | 4.01E-02 | 1.32E-01 | 2.6 | 2.1 | 3.2 | ||
| China (N = 20) | Complete | Strict | 3.35E-02 | 2.95E-02 | 3.78E-02 | 29.6 | 27.8 | 31.3 | |
| Relaxed | 464 | 1.37E-02 | 9.83E-03 | 1.80E-02 | 26.9 | 26 | 29 | ||
| NSP3 | Strict | 3.50E-02 | 2.82E-02 | 4.22E-02 | 26.6 | 26 | 27.8 | ||
| Relaxed | 97.3 | 2.31E-03 | 1.95E-03 | 2.65E-03 | 26.2 | 26 | 26.9 | ||
| S | Strict | 5.15E-02 | 4.01E-02 | 6.35E-02 | 26.5 | 26 | 27.5 | ||
| Relaxed | 159.3 | 3.05E-02 | 1.79E-02 | 4.43E-02 | 27.2 | 26 | 29.8 | ||
| E | Strict | 6.90E-02 | 4.97E-02 | 9.03E-02 | 26.5 | 26 | 27.5 | ||
| Relaxed | 40.1 | 2.63E-02 | 1.40E-02 | 4.00E-02 | 26.8 | 26 | 28.9 |
Fig. 2Bayesian maximum clade credibility phylogenetic trees based on the spike (S, left), nsp3 (middle), and E (right) genes for the worldwide dataset according to the indicated substitution models and molecular clock suggested by the ModelTest and Bayes factors. Branch lengths are scaled in years. Branch with posterior support > 0.8 are labeled inside the node. Sequences colored in red clustered within Genogroup 1 and black within Genogroup 2 based on the full-genome phylogenetic analysis (Huang et al., 2013). Each strain is labeled based on the accession number on GeneBank followed by the host, country and year, which could refer to supplementary Table S1 for details.
Sum of the posterior probabilities of all internal nodes for all three different genes from the worldwide, Genogroup 2 and China databases.
| Dataset | Gene | Sum of posterior probabilities | Proportion of nodes with posterior probability > 0.8 |
|---|---|---|---|
| Worldwide (N = 34) | NSP3 | 27.48 | 0.76(25/33) |
| S | 26.79 | 0.73(24/33) | |
| E | 17.85 | 0.45(15/33) | |
| Genogroup2 (N = 24) | NSP3 | 17.5 | 0.52(12/23) |
| S | 17.32 | 0.61(14/23) | |
| E | 6.75 | 0.13(3/23) | |
| China (N = 20) | NSP3 | 17.82 | 0.84(16/19) |
| S | 18.55 | 0.84(16/19) | |
| E | 13.48 | 0.63(12/19) |
Fig. 3Bayesian maximum clade credibility phylogenetic trees with three PEDV strains collected during the 2014 Taiwan epidemic based on the nsp3 for the worldwide dataset (A) and North America dataset (B) according to the substitution model of GTR + I + G and GTR + I suggested by jModeltest, respectively. Branch with posterior support > 0.8 are labeled inside the node. Sequences within each clade are colored consistently across genes. Three local Taiwan PEDV strains are marked with asterisk. North American PEDV strains are labeled based on the accession number on GeneBank (KJ645635–708) followed by the host, country and year.