| Literature DB >> 32407507 |
Wan-Ting He1, Xiang Ji2,3,4, Wei He1, Simon Dellicour5,6, Shilei Wang1, Gairu Li1, Letian Zhang1, Marius Gilbert6, Henan Zhu2,3, Gang Xing7, Michael Veit8, Zhen Huang9, Guan-Zhu Han10, Yaowei Huang7, Marc A Suchard2,3, Guy Baele5, Philippe Lemey5, Shuo Su1.
Abstract
The emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has shown once again that coronavirus (CoV) in animals are potential sources for epidemics in humans. Porcine deltacoronavirus (PDCoV) is an emerging enteropathogen of swine with a worldwide distribution. Here, we implemented and described an approach to analyze the epidemiology of PDCoV following its emergence in the pig population. We performed an integrated analysis of full genome sequence data from 21 newly sequenced viruses, along with comprehensive epidemiological surveillance data collected globally over the last 15 years. We found four distinct phylogenetic lineages of PDCoV, which differ in their geographic circulation patterns. Interestingly, we identified more frequent intra- and interlineage recombination and higher virus genetic diversity in the Chinese lineages compared with the USA lineage where pigs are raised in different farming systems and ecological environments. Most recombination breakpoints are located in the ORF1ab gene rather than in genes encoding structural proteins. We also identified five amino acids under positive selection in the spike protein suggesting a role for adaptive evolution. According to structural mapping, three positively selected sites are located in the N-terminal domain of the S1 subunit, which is the most likely involved in binding to a carbohydrate receptor, whereas the other two are located in or near the fusion peptide of the S2 subunit and thus might affect membrane fusion. Finally, our phylogeographic investigations highlighted notable South-North transmission as well as frequent long-distance dispersal events in China that could implicate human-mediated transmission. Our findings provide new insights into the evolution and dispersal of PDCoV that contribute to our understanding of the critical factors involved in CoVs emergence.Entities:
Keywords: BEAST; Bayesian inference; evolution; phylogeographic; porcine deltacoronavirus; recombination
Mesh:
Substances:
Year: 2020 PMID: 32407507 PMCID: PMC7454817 DOI: 10.1093/molbev/msaa117
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.Characteristics of isolated PDCoVs and phylogenetic analysis of full genomes. (A) Cytopathic effects and multistep replication curves of PDCoVs in LLC-PK1 cells at 6, 12, 24, and 36 hpi. Newly isolated viruses SD2018/10 and AH2019/H were used to infect LLC-PK1 cells at a MOI of 0.1. Magnification: 200×. (B) The viral titers (TCID50/ml) were determined at 6, 12, 24, and 36 hpi. The graph shows the mean of three different experiments. (C) Phylogeny reconstructed using 119 PDCoV genomes using the SplitsTree5 software with the Kimura 2-parameter model. Strains in blue regions correspond to the Thailand lineage, in red correspond to the Early China lineage, in orange to the USA lineage, and in green to regions of the China lineage.
Fig. 2.Recombinant features of PDCoV phylogenies. (A–C) A rescaled structure of the PDCoV genome and bootscanning recombination analysis based on the variable genomic sites. The dashed line indicates 70% bootstrap support. The likely recombination region is shaded in a different color and colored broken lines represent different lineages: blue indicates the Thailand lineage, red the Early China lineage, orange the USA lineage, and green the China lineage. (A) Breakpoint at nucleotide 4989. (B) A breakpoint at nucleotide 13681. (C) A breakpoint at nucleotide 22826. (D–G) ML phylogenetic trees inferred for the different recombinant regions: nucleotides 1–4989, 4990–13681, 13682–22825, and 22826–26451. The ML tree was reconstructed using RAxML (v8.4.10) using a general time-reversible model accommodating among-site rate heterogeneity. In total, 1,000 bootstraps were evaluated to assess support values. The Thailand lineage is indicated in blue; the Early China lineage in red; the USA lineage in orange; and the China lineage in green. The red dots indicate strains that were sequenced as part of this study.
Fig. 3.Frequent intralineage recombination events in China and Early China lineages. (A) Summary of GARD results. Colored boxes indicate fragments resulting from GARD-inferred breakpoints with corrected ΔAIC values shown on the right. (B) ML phylogenetic trees inferred for China and Early China lineage PDCoV ORF1ab using RAxML under the general time-reversible substitution model with gamma-distributed rates across sites. In total, 1,000 bootstraps were evaluated to assess support values. Trees were reconstructed for both regions separated by the breakpoint at position 11570.
Fig. 4.Demographic history of PDCoV in pigs. (A) Demographic history inferred via a skygrid coalescent tree prior. The intervals represent 95% HPD of the product of generation time and effective population size Ne(t). The middle line tracks the inferred median of Ne(t). (B) MCC tree of the S gene constructed using BEAST (version 1.10.5) under the skygrid nonparametric coalescent model. The red dots indicate strains were sequenced in this study.
Selection Analysis of the PDCoV S Protein.
| Site | FUBAR (Post.Pro) | MEME ( | FEL ( | SLAC ( |
|---|---|---|---|---|
| 107 | 0.979 | 0.02 | 0.015 | 0.026 |
| 149 | 0.998 | 0.01 | 0.004 | 0.027 |
| 183 | 0.968 | 0.04 | 0.027 | 0.089 |
| 630 | 0.98 | 0.00 | 0.012 | 0.007 |
| 698 | 0.99 | 0.02 | 0.012 | 0.087 |
Fig. 5.Location of selected amino acids in the structure of S protein. (A) Cartoon representation of an S monomer. The S1 subunit is represented in blue and the S2 subunit in green. Selected amino acids are shown as red spheres. Leu 107 and His 149 are located in the N-terminal domain of S1 (S1-NTD) that binds to unidentified sugars, but close to the C-terminal domain of S1 (S1-CTD) that contains the binding site for the protein receptor. (B) Hydrogen bonds formed between Thr 183 and Thr 186 that might stabilize a loop. (C) Part of the structure of S2 with individual elements drawn in different colors. Central helix N (CH-N) is represented in blue, the central helix C (CH-C) in magenta, the fusion peptide (FP) in cyan, and the heptad repeat N in orange. Arg 669 and Arg 673, shown as red sticks, are presumed proteolytic cleavage sites. The selected amino acids Ala 630 and Ser 698 are shown as red spheres. (D) Detail of S2 showing the selected amino acid Ala 630 in proximity (6.3 Å) to Leu 720 in the fusion peptide.
Fig. 6.Spatiotemporal diffusion of China PDCoV lineage within China as estimated from discrete (A) and continuous (B) phylogeographic reconstructions. (A) The discrete phylogeographic analysis was performed with the Bayesian stochastic search variable selection (BSSVS) approach, for which we displayed the intensity of the transition rates associated with a BF support higher than 3. (B) Continuous phylogeographic analyses were performed with the DRW diffusion model (only including a latitudinal drift; see the text for further details). For the continuous reconstruction, we mapped the MCC tree and 95% HPD regions based on trees subsampled from the post burn-in posterior distribution of trees. Nodes of the tree are colored according to a color scale ranging from brown (tMRCA) to green (most recent sampling time). Ninety-five percent highest posterior density (HPD) regions were computed for successive time layers, superimposed using the same color scale reflecting time cropped using Chinese international borders (see supplementary fig. S6, Supplementary Material online, for noncropped 95% HPD polygons as well as a comparison with the continuous phylogeographic reconstruction obtained under the relaxed random walk model). On both maps, subnational Chinese province borders are represented by white lines.