Literature DB >> 29704627

Newly emerged porcine enteric alphacoronavirus in southern China: Identification, origin and evolutionary history analysis.

Xinliang Fu1, Bo Fang1, Yixing Liu1, Mengkai Cai1, Junming Jun1, Jun Ma2, Dexin Bu1, Lifang Wang2, Pei Zhou2, Heng Wang3, Guihong Zhang4.   

Abstract

Coronaviruses have a wide host range and can cause a variety of diseases with varying severity in different animals. Several enteric coronaviruses have been identified that are associated with diarrhea in swine and that have caused substantial economic losses. In this study, a newly emerged porcine enteric alphacoronavirus (PEAV), PEAV-GD-CH/2017, was identified from suckling piglets with diarrhea in southern China, and a full-length genome sequence of PEAV was obtained for systematic analysis. The novel PEAV sequence was most identical to that of bat-HKU2, and the differences between them were comprehensively compared, especially the uniform features of the S protein, which was shown to have a close relationship with betacoronaviruses and to perhaps represent unrecognized betacoronaviruses. In addition, Bayesian analysis was conducted to address the origin of PEAV, and the divergence time between PEAV and bat-HKU2 was estimated at 1926, which indicates that PEAV is not newly emerged and may have circulated in swine herds for several decades since the interspecies transmission of this coronavirus from bat to swine. The evolutionary rate of coronaviruses was estimated to be 1.93 × 10-4 substitutions per site per year for the RdRp gene in our analysis. For the origin of PEAV, we suspect that it is the result of the interspecies transmission of bat-HKU2 from bat to swine. Our results provide valuable information about the uniform features, origin and evolution of the novel PEAV, which will facilitate further investigations of this newly emerged pathogen.
Copyright © 2018 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Coronavirus; Diarrhea; Evolutionary analysis; Origin; PEAV

Mesh:

Year:  2018        PMID: 29704627      PMCID: PMC7106130          DOI: 10.1016/j.meegid.2018.04.031

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


Introduction

Coronaviruses (CoVs) are enveloped viruses with a single-stranded, positive-sense RNA genome, they belong to the family Coronaviridae, and they are found in a wide variety of animals in which they can cause respiratory, hepatic, enteric and neurological diseases of varying severity (Weiss and Navas-Martin, 2005; Woo et al., 2006). CoVs are separated into four distinct genera based on genotypic and serological characterization: alpha-CoV, beta-CoV, gamma-CoV and delta-CoV (Su et al., 2016). To date, several enteric CoVs that are attributed to diarrhea in swine have been identified and have caused substantial economic losses. Transmissible gastroenteritis virus (TGEV) and porcine epidemic diarrhea virus (PEDV) belong to alpha-CoV, and both of them cause life-threatening acute enteric disease in suckling piglets (Pensaert and de Bouck, 1978; Zhang et al., 2017). Porcine hemagglutinating encephalomyelitis virus (PHEV) is a beta-CoV that primarily affects pigs under 3 weeks of age (Pensaert and Callebaut, 1974; Rho et al., 2011). Porcine deltacoronavirus (PDCoV) is a newly identified enteric coronavirus in swine and belongs to delta-CoV (Wang et al., 2014a). The outbreak of severe acute respiratory syndrome (SARS) and the identification of SARS-CoV-like viruses from wild animals in China have boosted interest in the discovery of novel CoVs in both humans and animals. For example, human coronaviruses NL63 and HKU1 were discovered in 2004 and 2005, respectively, and MERS-CoV emerged in 2012 (Fouchier et al., 2004; Woo et al., 2005; Zaki et al., 2012). For animal CoVs, SARS-CoV-like viruses and bat-CoV-HKU2 were discovered in horseshoe bats; novel delta-CoVs, in birds and swine; and additional novel CoVs, in bats and other animals (Chu et al., 2008; Dong et al., 2007; Lau et al., 2005, 2007; Wang et al., 2014b; Woo et al., 2012). Recently, a novel bat-HKU2-like coronavirus that can cause diarrhea in suckling piglets was discovered in swine by two research groups in China (Gong et al., 2017; Pan et al., 2017). This novel enteric coronavirus shares high nucleotide identities (approximately 95%) with the reported bat-HKU2 strains at the full genome level and is tentatively named porcine enteric alphacoronavirus (PEAV) (Gong et al., 2017). In this retrospective study, we report the identification of this newly emerged PEAV from a pig farm in Guangdong Province, China, which outbreaks of severe diarrhea in suckling piglets in March 2017. We analyzed and described the genome characteristic of this novel PEAV systematically and the phylogenetic relationship of this virus with other groups of CoVs. Bayesian analysis was also conducted to address the origin and evolutionary history of PEAV, and our results indicate that PEAV emerged approximately 91 years ago and may have circulated in swine herds for several decades.

Materials and methods

Sample collection and disease diagnosis

In March 2017, an acute diarrheal outbreak of newborn-piglet diarrhea occurred in a commercial pig farm in Guangdong Province, China. The clinical manifestations included vomiting, acute watery diarrhea and dehydration in ill suckling piglets. Small intestinal and fecal samples were collected from ill pigs and submitted to the Animal Disease Detection Diagnosis Center of Southern China Agricultural University for pathogen detection. The small intestinal samples were homogenized with phosphate-buffered saline (PBS; 0.1 M, pH 7.4) and subsequently centrifuged at 10,000 ×g for 10 min at 4 °C. The fecal samples were resuspended with PBS and centrifuged as described above. Both supernatants were collected for RNA extraction using a TaKaRa MiniBEST Universal RNA Extraction Kit (TaKaRa, Dalian, China), and first-strand cDNA was synthesized using a PrimeScript™ 1st Strand cDNA Synthesis Kit (TaKaRa, Dalian, China) following the manufacturer's instructions. PCR was used for the detection of common enteric viral pathogens as previously described, including PEDV, TGEV, PDCoV and porcine group A rotaviruses (RVAs) (Amimo et al., 2013; Kim et al., 2000; Liu and Wang, 2016; Song et al., 2015). However, all samples were negative for PEDV, TGEV, PDCoV and RVAs. Subsequently, we suspected PEAV infection and conducted a retrospective study of these samples after the report of PEAV in Guangdong (Gong et al., 2017).

PEAV detection and complete genome sequencing

A pair of primers (forward: 5′-TTTTGGTTCTTACGGGCTGTT-3′; reverse: 5′-CAAACTGTACGCTGGTCAACT-3′) based on RNA-dependent RNA polymerase (RdRp) gene of a known bat-HKU2 strain (EF203065) was designed for PEAV detection. After PEAV was detected, 18 pairs of primers were designed based on the bat-HKU2 genome to amplify the full genome (these primer sequences are available on request), and the PCR-amplified products were analyzed by electrophoresis on 1.5% agarose gels and purified using a MiniBEST DNA Extraction Kit (TaKaRa, Dalian, China). The purified PCR product was cloned into the pMD18-T (TaKaRa, Dalian, China) vector for sequencing. Sequences of fragments were assembled using the DNAStar program to produce the final viral genome sequence and used for further analysis.

Genome analysis and phylogenetic analysis

The complete genome sequence of PEAV and the deduced amino acid sequences of the open reading frames (ORFs) were compared to those of other known CoVs as previously reported (Woo et al., 2012). Multiple sequence alignments were performed by MAFFT, and a phylogenetic tree based on the full-length genome nucleotide sequences of PEAV and of other representative CoVs was constructed using the neighbor-joining method with 1000 bootstrap replicates in MEGA 5.0 (Tamura et al., 2011). Consideration the extensive divergence between the nucleotide sequences of different coronavirus genera, phylogenetic trees for the ORF1ab, RdRp, S, M, and N proteins were also constructed based on the corresponding amino acid sequences. Bootscan analysis was also performed to detect if a potential recombination event occurred for PEAV using Simplot 3.5.1 with the genome sequence of PEAV as the query. Prediction of transmembrane domains was performed using TMHMM (http://www.cbs.dtu.dk/services/TMHMM/).

Evolutionary dynamics and estimation of the divergence time of PEAV

The Bayesian Markov chain Monte Carlo (MCMC) method was used to infer the divergence time of PEAV with other members of CoVs in BEAST 1.8.3 as described previously (Drummond and Rambaut, 2007; Fu et al., 2018; Woo et al., 2012). Specifically, analyses were performed under the GTR + I + Г nucleotide substitution model for the RdRp gene (2781 bp) and using an unrelaxed lognormal distribution molecular clock with a constant size model. The MCMC algorithm was run for a 100 million step chain and sampled every 10,000 states, and 10% of the chain was removed as burn-in. The maximum clade credibility (MCC) tree was inferred by the Tree Annotator program included in the BEAST package. The mean time of the most recent common ancestor (TMRCA) and the highest posterior density (HPD) regions at 95% were calculated in Tracer 1.6, and posterior probability values provided an assessment of the degree of support for the key node of the tree. The nucleotide substitution rate (per site per year) for coronaviruses was also estimated in this analysis.

Results

Diagnosis and detection of PEAV

All samples were negative for RT-PCR detection of common enteric viruses, including PEDV, TGEV, PDCoV and RVAs. Subsequently, a newly emerged PEAV that can cause diarrhea in suckling piglets was reported in Guangdong, China (Gong et al., 2017); we suspected PEAV infection and conducted a retrospective study of these samples. Considering the high nucleotide identities (approximately 95%) of PEAV with reported bat-HKU2 strains (Gong et al., 2017), we designed a pair of primers based on RNA-dependent RNA polymerase (RdRp) gene of a known bat-HKU2 strain for PEAV detection. To our surprise, an expected 750 bp fragment was amplified from all samples, and the PCR products were further sequenced. The sequences of the PCR products were subjected to BLAST searches in the GenBank database, showed the highest identity to bat-HKU2 strains (approximately 97%), and corresponded to nucleotide positions 12,837–13,570 in the bat-HKU2 genome. The full-length genome of PEAV was finally obtained by segment amplification and named PEAV-GD-CH/2017 (MG742313).

Genome and S protein feature analysis

The genomic structure of PEAV is organized with the same gene order as that of bat-HKU2, namely, 5′-ORF1a/1b (ORF1ab)-S-ORF3-E-M-N-NS7a-3′ (Fig. 1 ), and the genome sequence length of PEAV-GD-CH/2017 is 27,155 nt, excluding the poly (A) tail, which is similar to previous reports (Gong et al., 2017; Pan et al., 2017). The G + C content of PEAV ranges from 39.34% to 39.41% (Table 1 ), and the genome nucleotide identities of PEAV-GD-CH/2017 with PEAV-GDS04 (MF167434) and PEAV-GD-01(MF370205) are 99.7% and 99.8%, respectively. All three known PEAV strains are most identical to bat-HKU2 and BtRF-AlphaCoV/YN2012, with approximately 95.0% and 87.5% nucleotide identities, respectively. In addition, comparison of the genomic features of PEAV and of other coronaviruses and the amino acid identities between the predicted ORF1ab, RdRp, S, E, M and N proteins of PEAV and the corresponding proteins of other coronaviruses are summarized in Table 1. Notably, most of these PEAV proteins share higher identities to alpha-CoVs (group B) than the other three groups of coronaviruses, except the S protein, which shares only approximately 25% amino acid identity to that of alpha-CoVs (Table 1). The putative transcription regulatory sequence (TRS) motif, 5’-AACUAAA-3′, precedes each ORF of PEAV (Table 2 ) and has the same TRS sequence as bat-HKU2 and HCoV-NL63 (Lau et al., 2007; Pyrc et al., 2004). The coding potential and putative TRS sequence for each ORF of PEAV are summarized in Table 2. Similar to bat-HKU2, one ORF was observed between the S and E genes, which encodes a putative 229-amino acid nonstructural protein, NS3 (Lau et al., 2007). The NS3 protein of PEAV shares 94% amino acid identity to that of bat-HKU2 but only 42% and 35% identities to those of HCoV-NL63 and PEDV, respectively.
Fig. 1

Diagram of the structural organization of the PEAV genome. The putative cleavage sites S1/S2 and S2′ in the S protein are shown by arrows, and the numbers indicate the amino acid positions in the S protein of PEAV. The S protein is composed of two subunits: the S1 receptor-binding subunit, and the S2 fusion subunit. NTD: N-terminal domain of S1; C-domain: C-terminal domain of S1; FP: putative fusion peptide; TM: transmembrane domain; E: endodomain. Not drawn to scale.

Table 1

Comparison of the genomic features of PEAV and other coronaviruses and amino acid identities between the predicted ORF1ab, RdRp, S, E, M and N proteins of PEAV and the corresponding proteins of other coronaviruses.

CoronavirusesaGenome
Features
Pairwise amino acid identity (%)
Size (bases)G + C content (%)ORF1abRdRpSEMN
Alpha-CoV group A
 TGEV28,61437.5855.775.625.227.652.441.7
 FIPV29,35538.1455.575.525.527.652.442.7
 PRCV27,55037.4655.775.524.027.654.641.5
Alpha-CoV group B
 HCoV-229E27,31738.2660.980.825.151.356.946.7
 HCoV-NL6327,55334.4660.078.925.549.358.449.7
 PEDV28,03342.0260.178.025.247.364.647.1
 Bat-CoV HKU227,16539.2898.399.185.297.396.193.9
 BtRF-CoV YN201226,97537.8094.598.978.696.096.988.0
 PEAV-GD-0127,15539.4399.799.998.498.798.799.7
 PEAV-GDS0427,15439.3499.599.498.197.398.299.5
 PEAV-GD-CH27,15539.41NAbNANANANANA
Beta-CoV group A
 HCoV-HKU129,92632.0636.156.626.925.035.126.8
 HCoV-OC4330,74636.6535.957.627.724.035.628.6
 MHV3,161641.7836.556.426.625.037.429.9
 PHEV30,48037.2535.657.427.225.337.327.3
Beta-CoV group B
 SARS-CoV29,75140.7637.859.725.825.332.125.2
Beta-CoV group C
 Bat-CoV HKU530,48243.1938.259.026.422.733.229.5
Beta-CoV group D
 Bat-CoV HKU929,11441.0536.558.026.018.434.423.1
Gamma-CoV
 IBV27,67937.9336.359.321.117.121.525.6
Delta-CoV
 PDCoV25,40443.2832.350.223.218.322.019.6

TGEV, porcine transmissible gastroenteritis virus; FIPV, feline infectious peritonitis virus; PRCV, porcine respiratory coronavirus; HCoV-229E, human coronavirus 229E; HCoV-NL63, human coronavirus NL63; PEDV, porcine epidemic diarrhea virus; PEAV, porcine enteric alphacoronavirus; HCoV-HKU1, human coronavirus HKU1; HCoV-OC43, human coronavirus OC43; MHV, murine hepatitis virus; PHEV, porcine hemagglutinating encephalomyelitis virus; SARS-CoV, severe acute respiratory syndrome coronavirus; IBV, infectious bronchitis virus; PDCoV, porcine deltacoronavirus.

NA, data not available for analysis.

Table 2

Coding potential and putative transcription regulatory sequences (TRSs) of PEAV.

CoronavirusesORFsStart-end (nucleotide position)No. of nucleotidesNo. of amino acidsPutative TRS
Nucleotide position in the genomeTRS
PEAV1ab297–20,482 (shift at 12,434)20,186672869AACUAAAC(220a)AUG
S20,479–23,8713393113020,473AACUAAAUG
NS323,871–24,56069022923,826AACUAAAC(37)AUG
E24,541–24,7682287524,532AACUAAAC(1)AUG
M24,777–25,46368722824,768AACUAAAC(1)AUG
N25,475–26,602112837525,463AACUAAAC(4)AUG
NS7a26,614–26,9133009926,606AACUAAACAUG

Number means the number of nucleotides from the TRS to AUG.

Diagram of the structural organization of the PEAV genome. The putative cleavage sites S1/S2 and S2′ in the S protein are shown by arrows, and the numbers indicate the amino acid positions in the S protein of PEAV. The S protein is composed of two subunits: the S1 receptor-binding subunit, and the S2 fusion subunit. NTD: N-terminal domain of S1; C-domain: C-terminal domain of S1; FP: putative fusion peptide; TM: transmembrane domain; E: endodomain. Not drawn to scale. Comparison of the genomic features of PEAV and other coronaviruses and amino acid identities between the predicted ORF1ab, RdRp, S, E, M and N proteins of PEAV and the corresponding proteins of other coronaviruses. TGEV, porcine transmissible gastroenteritis virus; FIPV, feline infectious peritonitis virus; PRCV, porcine respiratory coronavirus; HCoV-229E, human coronavirus 229E; HCoV-NL63, human coronavirus NL63; PEDV, porcine epidemic diarrhea virus; PEAV, porcine enteric alphacoronavirus; HCoV-HKU1, human coronavirus HKU1; HCoV-OC43, human coronavirus OC43; MHV, murine hepatitis virus; PHEV, porcine hemagglutinating encephalomyelitis virus; SARS-CoV, severe acute respiratory syndrome coronavirus; IBV, infectious bronchitis virus; PDCoV, porcine deltacoronavirus. NA, data not available for analysis. Coding potential and putative transcription regulatory sequences (TRSs) of PEAV. Number means the number of nucleotides from the TRS to AUG. The S protein is the main determinant during coronavirus infection, as it possesses both receptor-binding and fusion functions; it is also the crucial determinant of tissue tropism and host range (Millet and Whittaker, 2015). However, the S protein of PEAV is very unique, similar to that of bat-HKU2; because the amino acid identities to the S proteins of all known coronaviruses are lower than 28%, we systematically analyzed the S protein of PEAV and compared it with those of other coronaviruses. The S protein of PEAV contains 1130 amino acid residues, and the insertion of two amino acid residues (serine and isoleucine) at positions 12 and 13 was observed compared to that of bat-HKU2. Two putative cleavage sites, S1/S2 (VRR↓MTFE) and S2′ (ESR↓SAIEDLLF), were found at positions 546 and 673 in the S protein of PEAV, respectively (Fig. 1). Interestingly, the arginine at cleavage site S2′ is conserved in the S proteins of almost all four genera of coronaviruses, and this cleavage site have a remarkably conserved motif, E-D-L-L-F; in contrast, the arginine (position 545) at cleavage sites S1/S2 is conserved in S proteins from several beta-CoVs (Table S1). The PEAV S protein is predicted to have a transmembrane domain from positions 1069 to 1091, followed by a short cytoplasmic tail (endodomain), which contains conserved cysteine residues (Fig. 1). Pairwise comparison of the amino acid sequences of S proteins of PEAV and bat-HKU2 revealed more mutations at the S1 subunit (122 mutations) than the S2 subunit (26 mutations), particularly in the NTD (amino-terminal domain), which may be related to tissue tropism and host range changes and may result in interspecies transmission from bat to swine.

Phylogenetic analysis and recombination analysis

Phylogenetic analysis was conducted to address the evolutionary relationship and the potential recombination of PEAV with other coronaviruses based on the nucleotide sequences of the whole genome and the amino acid sequences of ORF1ab, RdRp, S, M and N proteins, respectively (Fig. 2, Fig. 3). Obviously, all PEAV strains cluster with bat-HKU2 and BtRF-AlphaCoV/YN2012 and form a distinct lineage (defined as HKU2-like, not shown in the tree) closely related to other alpha-CoVs that belong to group 1b based on the whole genome level (Fig. 2). The same result can also be observed from the phylogenetic tree that was constructed based on the amino acid sequences of ORF1ab, RdRp, M and N proteins (Fig. 3). However, the evolutionary relationship of PEAV exhibited a uniform feature when phylogenetic analysis was conducted based on the S protein. All PEAV strains cluster with bat-HKU2 and BtRF-AlphaCoV/YN2012 along with a newly identified rat-CoV, LRNV. These strains form a distinct lineage and cluster with beta-CoV but are separate from all four known subgroups of beta-CoVs; we defined this distinct lineage as the beta-like group (Fig. 3). These results are consistent with identical amino acid analysis and with those of a previous report (Pan et al., 2017). We also conducted recombination analysis to evaluate if recombination has occurred in the PEAV genome, especially in the S gene, but no significant single recombination event was observed when the genome sequence of PEAV was used as the query (Fig. S1). Additionally, recombination was not observed in bat-HKU2 and LRNV genomes in previous studies (Lau et al., 2007; Wang et al., 2015). Noteworthy, another large difference between PEAV and bat-HKU2 is in N protein, it shows distant phylogenetic relationship comparing with analysis of ORF1ab, RdRp, E and M protein (Fig. 3), which is consistent with analysis of amino acid identity (Table 1). 22 amino acid mutations were found during pairwise comparison of the amino acid sequences of N proteins of PEAV and bat-HKU2, and most mutations located in carboxyl terminal, however, N protein is highly conserved among different PEAV strains.
Fig. 2

Phylogenetic analysis of PEAV with other four genera of coronaviruses based on full-length genome sequences. The tree was constructed by the neighbor-joining method with 1000 bootstrap replicates in MEGA 5.0 after multiple sequence alignments by MAFFT. Alpha-CoV and beta-CoV subgroups are shown in the tree, and the PEAV strain (PEAV-GD-CH/2017) identified in this study is indicated with a solid black circle.

Fig. 3

Phylogenetic analysis of the ORF1ab, RdRp, M, N and S proteins of PEAV based on the amino acid sequences of these proteins. These trees were constructed using the neighbor-joining method with 1000 bootstrap replicates in MEGA 5.0. The amino acid lengths of the ORF1ab, RdRp, M, N and S proteins used in this analysis are 6262 aa, 927 aa, 229 aa, 342 aa and 1130 aa, respectively. The PEAV strains are shown in bold in these trees.

Origin and the divergence time of PEAV

Because the RdRp gene is the most conserved gene between all coronaviruses, the RdRp gene was used for Bayesian analysis to address the divergence time and evolutionary history of PEAV in this study. The MCC tree constructed based on the RdRp gene has a topology similar to that of the phylogenetic tree that was constructed based on the whole genome and the RdRp protein, with high posterior probability values supporting each key node, and the mean TMRCA was estimated with 95% HPD values (Fig. 4 ). Based on our analysis, the mean TMRCA of bovine-CoV and HCoV-OC43 was estimated at 1914 (95% HPD, 1841 to 1981), and the mean TMRCA of human and civet SARSr-CoV was estimated at 2001 (95% HPD, 1998 to 2003). In addition, the divergence time between HKU15 and PDCoV was estimated at 1986 (95% HPD, 1970 to 1994). All of these results are highly consistent with those of previous studies (Lau et al., 2010; Vijgen et al., 2005; Woo et al., 2017) and indicate that our Bayesian analysis is unbiased. The mean TMRCA of PEAV and bat-HKU2 was estimated at 1926 (95% HPD, 1864 to 1984), approximately 91 years ago, which indicates that PEAV is not newly emerged and may have circulated in swine herds for several decades since its interspecies transmission from bat to swine. PEAV clusters with bat-HKU2; these coronaviruses have a common ancestor with another bat-CoV, BtRf-AlphaCoV/YN2012, and the divergence time between them was estimated at 1783 (95% HPD, 1620 to 1943). All of these bat-HKU2-like coronaviruses are closely related to HCoV-229E and HCoV-NL63 and emerged at approximately 277 (95% HPD, 931 BC to 1434). In addition, the TMRCA for alpha-CoV, beta-CoV and gamma-CoV were also estimated in our analysis at approximately 827 BC (95% HPD, 2626 BC to 1042), 1419 BC (95% HPD, 3561 BC to 867) and 977 BC (95% HPD, 3313 BC to 1090), respectively. In addition, the TMRCA for all coronaviruses was estimated at 3914 BC (95% HPD, 8637 BC to 45 BC), approximately 6000 years ago, which indicates that coronaviruses have had a very long evolutionary history since their emergence. The mean evolutionary rate of CoVs was estimated to be 1.93 × 10−4 (95% HPD, 1.27 × 10−4 to 3.57 × 10−4) nucleotide substitutions per site per year for the RdRp gene based on Bayesian analysis, which is consistent with the results of a previous report (Woo et al., 2012).
Fig. 4

Bayesian maximum clade credibility (MCC) phylogenetic tree was constructed in BEAST 1.8.3 using the Markov chain Monte Carlo (MCMC) method based on the RdRp gene (2781 bp). The mean TMRCA (time of the most recent common ancestor) was estimated for each key node with 95% HPD (highest posterior density) and is shown in brackets. High posterior probability values are shown for each key node and provide an assessment of the degree of support for the node on the tree. BC dates are identified with a suffix, while AD dates are not.

Phylogenetic analysis of PEAV with other four genera of coronaviruses based on full-length genome sequences. The tree was constructed by the neighbor-joining method with 1000 bootstrap replicates in MEGA 5.0 after multiple sequence alignments by MAFFT. Alpha-CoV and beta-CoV subgroups are shown in the tree, and the PEAV strain (PEAV-GD-CH/2017) identified in this study is indicated with a solid black circle. Phylogenetic analysis of the ORF1ab, RdRp, M, N and S proteins of PEAV based on the amino acid sequences of these proteins. These trees were constructed using the neighbor-joining method with 1000 bootstrap replicates in MEGA 5.0. The amino acid lengths of the ORF1ab, RdRp, M, N and S proteins used in this analysis are 6262 aa, 927 aa, 229 aa, 342 aa and 1130 aa, respectively. The PEAV strains are shown in bold in these trees. Bayesian maximum clade credibility (MCC) phylogenetic tree was constructed in BEAST 1.8.3 using the Markov chain Monte Carlo (MCMC) method based on the RdRp gene (2781 bp). The mean TMRCA (time of the most recent common ancestor) was estimated for each key node with 95% HPD (highest posterior density) and is shown in brackets. High posterior probability values are shown for each key node and provide an assessment of the degree of support for the node on the tree. BC dates are identified with a suffix, while AD dates are not. For the origin of PEAV, we conjecture that the interspecies transmission of bat-HKU2 from bat to swine occurred approximately 90 years ago. As wild boars have been reported as reservoirs for various pathogens, and can transmit these pathogens into domestic swine, such as porcine circovirus type 2 (PCV2), classical swine fever virus (CSFV) and Hepatitis E virus (HEV) (Adlhoch et al., 2009; Firth et al., 2009; Goller et al., 2016), but whether wild boars plays an important role during the interspecies transmission of bat-HKU2 needs to further investigate.

Discussion

Coronaviruses are important pathogens that have a wide host range and cause different kinds of diseases in a variety of animals; many novel coronaviruses have been identified in both humans and animals since the outbreak of SARS in 2003 (Fouchier et al., 2004; Lau et al., 2005, 2007; Wang et al., 2014b; Woo et al., 2005, 2012; Zaki et al., 2012). Several enteric coronaviruses that can cause diarrhea in swine have been identified and have circulated in swine herds for a long time; PEDV, TGEV and PHEV are examples of these viruses (Pensaert and de Bouck, 1978; Rho et al., 2011; Zhang et al., 2017). In particular, large-scale outbreaks of PEDV in China and the USA, with high rates of illness and death in suckling piglets, caused substantial economic losses in late 2010 and 2013, respectively (Huang et al., 2013; Wang et al., 2013). A newly enteric coronavirus, PDCoV, was identified in the USA in 2014, and this coronavirus caused clinical signs in swine similar to those of PEDV (Wang et al., 2014a). In this study, a novel PEAV (PEAV-GD-CH/2017) strain was identified from suckling piglets with diarrhea, and this strain shares high identities with the other two PEAV strains that were previously reported (Gong et al., 2017; Pan et al., 2017). These novel PEAVs are most identical to bat-HKU2, with 95% nucleotide identity, and have the same genome organization and TRS motif for each ORF. The greatest difference between PEAV and bat-HKU2 is their S proteins, which share 85% amino acid identity each other, a value much lower compared with those of other proteins (Table 1). This difference is caused by amino acid mutations in the S protein, particularly in the NTD in the S1 subunit, which has been proven to be the key factor determining issue tropism and the host range of coronaviruses (Lu et al., 2015). In addition to its low amino acid identity with the S protein of HKU2-like coronavirus, it shares low amino acid identity (lower than 28%) with S proteins of all known coronaviruses. Thus, clarifying the origin of the S proteins of PEAV and HKU2-like coronavirus is important for determining the origin and evolutionary history of these coronaviruses. A previous study showed that the extreme NTD in the S1 subunit of PEAV is structurally similar to that of NL63, while the rest of the S1 subunit is structurally similar to that of MHV (Pan et al., 2017). In addition, a short peptide in the S protein of bat-HKU2 was found to be homologous to a corresponding peptide within the receptor-binding motif (RBM) in the S1 subunit of SARS-CoV (Lau et al., 2007). We also analyzed the arginine (position 545) at cleavage sites S1/S2 of PEAV and found that it is conserved in several beta-CoVs in this study. Moreover, the phylogenetic tree based on the S protein presents a uniform evolutionary relationship; these bat-HKU2-like coronaviruses cluster with a newly identified rat-CoV, LRNV, which represents a novel species of coronaviruses. All of these strains form a distinct lineage and cluster with beta-CoVs but are separate from all four known subgroups of beta-CoVs (Fig. 3), which may indicate that these strains are part of a novel subgroup of beta-CoVs. These results suggest that PEAV and HKU2-like coronaviruses may have some relations with beta-CoVs and most likely resulted from recombination with the backbone of alpha-CoV and the S gene from an unrecognized beta-CoV. Another large difference between PEAV and bat-HKU2 is N protein, which share about 93.9% amino acid identity, as N protein is a multifunctional protein for coronaviruses, which is involving in virus replication, budding and pathogenesis et al. (McBride et al., 2014). While the role of these mutations in N protein between PEAV and bat-HKU2 should further investigate. The origin and emergence time of a newly emerged pathogen are important issues to answer to determine the evolutionary history and plan methods of prevention for these new pathogens. For example, previous SARS research reported that the interspecies transfer of SARS-like coronaviruses from bats to the amplifying host (e.g., civet) occurred in 1998 and that interspecies transfer from civet to humans occurred in 2002 (Chinese, 2004; Hon et al., 2008; Lau et al., 2010; Song et al., 2005). These results provide insight into the origin and evolutionary history of SARS coronavirus. The origin and divergence time of other coronaviruses have also been estimated previously; the divergence time of bovine-CoV and HCoV-OC43 could be dated back to the end of the 19th century to the beginning of the 20th century and was estimated at 1910 (Vijgen et al., 2006). The TMRCA for all PDCoV strains was reported at 1991, approximately 24 years before PDCoV was identified (Woo et al., 2017). In this study, we also addressed the emergence time and evolutionary history of PEAV and of the other coronaviruses based on the RdRp gene by Bayesian analysis. In anticipation, the mean divergence time of bovine-CoV and HCoV-OC43 was estimated at 1914, and the mean TMRCA of human and civet SARSr-CoV was estimated at 2001 in our analysis (Fig. 4). These results are highly consistent with those of a previous report discussed above and further indicate that our analysis is unbiased. The emergence time of PEAV was estimated at 1926 (95% HPD, 1864 to 1984) based on our analysis, which indicates that PEAV is not newly emerged and may have circulated in swine herds for several decades since interspecies transmission from bat to swine occurred. In addition, these HKU2-like coronaviruses have a common ancestor with HCoV-NL63 and HCoV-229E, and the divergence time was estimated at 277, which indicates that these HKU2-like coronaviruses have a long evolutionary history. The mean TMRCA for alpha-CoV, beta-CoV, and gamma-CoV, as well as those of all coronaviruses estimated in this study, were later compared with those of a previous report (Woo et al., 2012). Nevertheless, the mean TMRCA coincides with the regions with 95% HPD to each other. The evolutionary rate of different coronaviruses was estimated previously; 4.3 × 10−4 substitutions per site per year was estimated for HCoV-OC43 (Vijgen et al., 2005), and the mean evolutionary rate for group 1b coronaviruses was estimated to be 3 × 10−4 substitutions per site per year (Pyrc et al., 2006). In addition, the evolutionary rate for all coronaviruses was estimated to be 1.3 × 10−4 substitutions per site per year (Woo et al., 2012), which is estimated to be 1.93 × 10−4 (95% HPD, 1.27 × 10−4 to 3.57 × 10−4) nucleotide substitutions per site per year for the RdRp gene in this study, and all of these results are comparable to each other. Bats and birds are supposed to be the reservoir hosts for coronaviruses; in particular, bats are the reservoir hosts and gene pools of alpha-CoVs and beta-CoVs, while birds are the reservoir hosts and gene pools of gamma-CoVs and delta-CoVs (Woo et al., 2012). However, whether the first coronaviruses occurred in bats or birds is still unknown. To date, the generally acknowledged evolutionary model for coronaviruses is as follows: the ancestor of bat-CoV was transmitted to another species of bat and generated alpha-CoV and beta-CoV. Interspecies transmission of these bat-CoVs to other bat species and other mammals then occurred, and these coronaviruses are circulating in these hosts. Similarly, the ancestor of bird-CoV was transmitted to another species of birds and generated gamma-CoV and delta-CoV. Interspecies transmission of these bird-CoVs to other bird species and accidentally to some mammalian species (e.g., pig and whale) then occurred (Woo et al., 2012). Bat is also supposed to be the origin of other swine pathogens, such as porcine circovirus type 3 (PCV3), which was suspected to be generated from the interspecies transmission of bat-associated circovirus from bat to swine (Fu et al., 2018). Based on the evolutionary relationship and molecular features of PEAV and bat-HKU2-CoV, as well as the important role of bat in the ecology of coronaviruses, we conjecture that the origin of PEAV is the result of the interspecies transmission of bat-HKU2-CoV from bat to swine approximately 90 years ago. In summary, the novel PEAV was identified from suckling piglets with diarrhea in southern China, and the full-length genome of PEAV-GD-CH/2017 was obtained in this study. The genome and S protein features of PEAV was systematic analyzed, as well as the evolutionary relationship of PEAV with other coronaviruses, which indicated PEAV may recombination with unrecognized beta-CoV. PEAV emerged approximately 90 years ago and origin from the interspecies transmission of bat-HKU2 from bat to swine, and wild boars may plays an important role in this process. Thus, epidemiological investigations of PEAV should be further conducted in both swine and wild boars to better understand and clarify the origin and evolutionary history of PEAV. Importantly, considering this infectious coronavirus and its serious clinical implications for suckling piglets (Pan et al., 2017), the development of an effective vaccine for PEAV is urgently needed for the prevention of this disease. The following are the supplementary data related to this article.

Table S1

Comparison of S protein cleavage sites (S1/S2 and S2′) of coronaviruses. Arginine shown in blue with the blue arrow indicatesthe cleavage sitesof S1/S2, while arginine shown in red indicatesthe arginine in HKU2-like CoVs that is conserved in several beta-CoVs. Arginine shown in pink with the pink arrow indicates the cleavage site of S2′ that is conserved in almost all four genera of coronavirusesand is followedbya remarkably conserved motif, E-D-L-L-F.

Fig. S1

Bootscan analysis ofthe full-length genome of PEAV. Bootscanning was conducted with Simplot 3.5.1, and the genome sequence of PEAV was used as the query sequence. The red line denotes alpha-CoV HKU2 (EF203064), the green line denotes beta-CoV SARS (NC004718), the yellow line denotes gamma-CoV IBV (AY646283),and the blue line denotes delta-CoV HKU15 (NC016990). No significant single recombination eventwas observed in this analysis.
  44 in total

1.  Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus.

Authors:  Patrick C Y Woo; Susanna K P Lau; Carol S F Lam; Candy C Y Lau; Alan K L Tsang; John H N Lau; Ru Bai; Jade L L Teng; Chris C C Tsang; Ming Wang; Bo-Jian Zheng; Kwok-Hung Chan; Kwok-Yung Yuen
Journal:  J Virol       Date:  2012-01-25       Impact factor: 5.103

2.  Characterization and complete genome sequence of a novel coronavirus, coronavirus HKU1, from patients with pneumonia.

Authors:  Patrick C Y Woo; Susanna K P Lau; Chung-ming Chu; Kwok-hung Chan; Hoi-wah Tsoi; Yi Huang; Beatrice H L Wong; Rosana W S Poon; James J Cai; Wei-kwang Luk; Leo L M Poon; Samson S Y Wong; Yi Guan; J S Malik Peiris; Kwok-yung Yuen
Journal:  J Virol       Date:  2005-01       Impact factor: 5.103

3.  Insights into the epidemic characteristics and evolutionary history of the novel porcine circovirus type 3 in southern China.

Authors:  X Fu; B Fang; J Ma; Y Liu; D Bu; P Zhou; H Wang; K Jia; G Zhang
Journal:  Transbound Emerg Dis       Date:  2017-11-26       Impact factor: 5.005

4.  Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event.

Authors:  Leen Vijgen; Els Keyaerts; Elien Moës; Inge Thoelen; Elke Wollants; Philippe Lemey; Anne-Mieke Vandamme; Marc Van Ranst
Journal:  J Virol       Date:  2005-02       Impact factor: 5.103

Review 5.  Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus.

Authors:  Susan R Weiss; Sonia Navas-Martin
Journal:  Microbiol Mol Biol Rev       Date:  2005-12       Impact factor: 11.056

6.  Severe acute respiratory syndrome coronavirus-like virus in Chinese horseshoe bats.

Authors:  Susanna K P Lau; Patrick C Y Woo; Kenneth S M Li; Yi Huang; Hoi-Wah Tsoi; Beatrice H L Wong; Samson S Y Wong; Suet-Yi Leung; Kwok-Hung Chan; Kwok-Yung Yuen
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-16       Impact factor: 11.205

7.  BEAST: Bayesian evolutionary analysis by sampling trees.

Authors:  Alexei J Drummond; Andrew Rambaut
Journal:  BMC Evol Biol       Date:  2007-11-08       Impact factor: 3.260

Review 8.  Host cell proteases: Critical determinants of coronavirus tropism and pathogenesis.

Authors:  Jean Kaoru Millet; Gary R Whittaker
Journal:  Virus Res       Date:  2014-11-22       Impact factor: 3.303

Review 9.  Epidemiology, Genetic Recombination, and Pathogenesis of Coronaviruses.

Authors:  Shuo Su; Gary Wong; Weifeng Shi; Jun Liu; Alexander C K Lai; Jiyong Zhou; Wenjun Liu; Yuhai Bi; George F Gao
Journal:  Trends Microbiol       Date:  2016-03-21       Impact factor: 17.079

10.  Discovery of a novel swine enteric alphacoronavirus (SeACoV) in southern China.

Authors:  Yongfei Pan; Xiaoyan Tian; Pan Qin; Bin Wang; Pengwei Zhao; Yong-Le Yang; Lianxiang Wang; Dongdong Wang; Yanhua Song; Xiangbin Zhang; Yao-Wei Huang
Journal:  Vet Microbiol       Date:  2017-09-28       Impact factor: 3.293

View more
  24 in total

1.  Insight into the Epidemiology and Evolutionary History of Novel Goose Astrovirus-Associated Gout in Goslings in Southern China.

Authors:  Xinliang Fu; Zhanpeng Hou; Wenjun Liu; Nan Cao; Yu Liang; Bingxin Li; Danli Jiang; Wanyan Li; Danning Xu; Yunbo Tian; Yunmao Huang
Journal:  Viruses       Date:  2022-06-15       Impact factor: 5.818

2.  Cryo-electron Microscopy Structure of the Swine Acute Diarrhea Syndrome Coronavirus Spike Glycoprotein Provides Insights into Evolution of Unique Coronavirus Spike Proteins.

Authors:  Hongxin Guan; Youwang Wang; Vanja Perčulija; Abdullah F U H Saeed; Yichang Liu; Jinyu Li; Syed Sajid Jan; Yu Li; Ping Zhu; Songying Ouyang
Journal:  J Virol       Date:  2020-10-27       Impact factor: 5.103

3.  Rampant C→U Hypermutation in the Genomes of SARS-CoV-2 and Other Coronaviruses: Causes and Consequences for Their Short- and Long-Term Evolutionary Trajectories.

Authors:  P Simmonds
Journal:  mSphere       Date:  2020-06-24       Impact factor: 4.389

4.  Cryo-EM structures of HKU2 and SADS-CoV spike glycoproteins provide insights into coronavirus evolution.

Authors:  Jinfang Yu; Shuyuan Qiao; Runyu Guo; Xinquan Wang
Journal:  Nat Commun       Date:  2020-06-17       Impact factor: 14.919

5.  Development of a Conventional RT-PCR Assay for Rapid Detection of Porcine Deltacoronavirus with the Same Detection Limit as a SYBR Green-Based Real-Time RT-PCR Assay.

Authors:  Lei Ma; Fanwen Zeng; Bihong Huang; Feng Cong; Ren Huang; Jingyun Ma; Pengju Guo
Journal:  Biomed Res Int       Date:  2018-11-06       Impact factor: 3.411

6.  Complete Genome Sequence of a Novel Swine Acute Diarrhea Syndrome Coronavirus, CH/FJWT/2018, Isolated in Fujian, China, in 2018.

Authors:  Kai Li; Hao Li; Zhen Bi; Jun Gu; Wang Gong; Suxian Luo; Fanfan Zhang; Deping Song; Yu Ye; Yuxin Tang
Journal:  Microbiol Resour Announc       Date:  2018-12-06

7.  Point-of-care diagnostic assay for rapid detection of porcine deltacoronavirus using the recombinase polymerase amplification method.

Authors:  Lei Ma; Fanwen Zeng; Bihong Huang; Yujun Zhu; Miaoli Wu; Fengjiao Xu; Li Xiao; Ren Huang; Jingyun Ma; Feng Cong; Pengju Guo
Journal:  Transbound Emerg Dis       Date:  2019-04-01       Impact factor: 5.005

8.  Conserved Genomic Terminals of SARS-CoV-2 as Co-evolving Functional Elements and Potential Therapeutic Targets.

Authors:  Agnes P Chan; Yongwook Choi; Nicholas J Schork
Journal:  bioRxiv       Date:  2020-07-06

9.  Genetic characterization and phylogenetic analysis of porcine deltacoronavirus (PDCoV) in Shandong Province, China.

Authors:  Wenchao Sun; Li Wang; Haixin Huang; Wei Wang; Liang Cao; Jinyong Zhang; Min Zheng; Huijun Lu
Journal:  Virus Res       Date:  2020-01-18       Impact factor: 3.303

10.  Swine acute diarrhea syndrome coronavirus (SADS-CoV) antagonizes interferon-β production via blocking IPS-1 and RIG-I.

Authors:  Zhihai Zhou; Yuan Sun; Xiaoling Yan; Xiaoyu Tang; Qianniu Li; Yaorong Tan; Tian Lan; Jingyun Ma
Journal:  Virus Res       Date:  2019-12-26       Impact factor: 3.303

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.