| Literature DB >> 32121102 |
Giovanni Franzo1, Eric Delwart2,3, Robert Fux4, Ben Hause5, Shuo Su6, JiYong Zhou7, Joaquim Segalés8,9,10.
Abstract
The discovery of a globally distributed porcine circovirus (Porcine circovirus 3; PCV-3) has led to intense research activity and the production of a large amount of molecular data. Different research groups have proposed several, not always concordant, genotypes for this virus. While such categories could aid an easier interpretation of PCV-3 molecular epidemiology, any classification, to be useful in practical settings, must be univocal and of help in the understanding of underlying biological features and epidemiology. Based on these premises, the possibility of defining PCV-3 genotypes was evaluated on the broadest available dataset of PCV-3 complete genome (n = 357) and open reading frame 2 (ORF2, n = 653) sequences. Genetic distance and phylogenetic clustering were selected as the main objective criteria. Additional factors, including the number of within-cluster sequences, host and geographic clustering, concordance between different genomic regions, and analysis method were also taken in account to generate a classification that could be effectively applied in research and diagnostic settings. A maximum within-genotype genetic distance of 3% at the complete genome and 6% at the ORF2 levels, bootstrap support higher than 90%, and concordance between analysis methods allowed us to clearly define two clades which could be potentially defined as genotypes. Further subdivision was not suggested due to the absence of a meaningful association between PCV-3 and its biological/epidemiological features. Nevertheless, since one of the clades included two strains only, thus far we formally propose the definition of only one PCV-3 genotype (PCV-3a). The established criteria will allow us to automatically recognize other genotypes when more strain sequences are characterized.Entities:
Keywords: ORF2; PCV-3; classification; genome; genotypes
Mesh:
Year: 2020 PMID: 32121102 PMCID: PMC7150946 DOI: 10.3390/v12030265
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Count of sequences included in the present study classified based on the collection continent/region, country, and host. ORF2: open reading frame 2.
| Continent/Region | Country | Host | Count | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Bovine | Dog | Mouse | Unknown | Swine | Wild Boar | ||||
| Complete genome | Asia | China | 4 | 5 | 4 | 2 | 243 |
| |
| Japan | 3 |
| |||||||
| South Korea | 21 |
| |||||||
| Taiwan | 2 |
| |||||||
| Thailand | 2 |
| |||||||
| Central America | Mexico | 1 |
| ||||||
| Europe | Denmark | 2 |
| ||||||
| Germany | 14 |
| |||||||
| Hungary | 1 |
| |||||||
| Italy | 5 |
| |||||||
| Spain | 13 |
| |||||||
| Sweden | 1 |
| |||||||
| Unknown | Unknown | 11 | 1 |
| |||||
| North America | USA | 1 | 9 |
| |||||
| Russia | Russia | 2 |
| ||||||
| South America | Brazil | 7 | 1 |
| |||||
| Colombia | 2 |
| |||||||
|
|
|
|
|
|
|
|
| ||
| ORF2 | Asia | China | 16 | 18 | 4 | 6 | 482 |
| |
| Japan | 3 |
| |||||||
| South Korea | 26 |
| |||||||
| Taiwan | 2 |
| |||||||
| Thailand | 3 |
| |||||||
| Central America | Mexico | 2 |
| ||||||
| Europe | Denmark | 2 |
| ||||||
| France | 2 |
| |||||||
| Germany | 24 |
| |||||||
| Hungary | 1 |
| |||||||
| Italy | 5 |
| |||||||
| Spain | 14 |
| |||||||
| Sweden | 1 |
| |||||||
| United Kingdom | 2 |
| |||||||
| Unknown | Unknown | 12 | 6 |
| |||||
| North America | USA | 1 | 9 |
| |||||
| Russia | Russia | 2 |
| ||||||
| South America | Brazil | 7 | 1 |
| |||||
| Colombia | 2 |
| |||||||
|
|
|
|
|
|
|
|
| ||
Figure 1Phylogenetic trees obtained on complete genome (upper figure) and ORF 2 (lower figure) using the maximum likelihood (ML), neighbour joining (NJ), and Bayesian inference (BI) approach. Different clades have been colour coded.
Results of the phylogeny–trait (i.e., region and host) association performed on both original and balanced datasets. The global statistics (AI and PS) and the feature-specific ones (MC) are reported. AI: association index; PS: parsimony score; MC: monophyletic clade size.
| \ | Statistic | Feature | Observed Mean | Lower Observed 95% CI | Upper Observed 95% CI | Null Mean | Lower Null 95% CI | Upper Null 95% CI | ||
|---|---|---|---|---|---|---|---|---|---|---|
| Geographic location | All sequences | AI | Global | 10.624 | 9.048 | 12.338 | 20.408 | 19.203 | 21.617 | 0.000 |
| PS | 90.378 | 86.000 | 94.000 | 116.707 | 115.063 | 118.105 | 0.000 | |||
| MC | Asia | 3.001 | 3.000 | 3.000 | 1.375 | 1.090 | 2.002 | 0.001 | ||
| Central America | 2.000 | 2.000 | 2.000 | 1.025 | 1.000 | 1.102 | 0.001 | |||
| Europe | 34.865 | 25.000 | 50.000 | 14.750 | 11.833 | 20.940 | 0.001 | |||
| North America | 3.631 | 3.000 | 5.000 | 1.660 | 1.249 | 2.111 | 0.001 | |||
| Russia | 2.769 | 2.000 | 3.000 | 1.150 | 1.003 | 1.958 | 0.001 | |||
| South America | 1.996 | 2.000 | 2.000 | 1.000 | 1.000 | 1.000 | 0.001 | |||
| Unknown | 1.973 | 2.000 | 2.000 | 1.000 | 1.000 | 1.000 | 0.001 | |||
| Balanced dataset | AI | Global | 7.251 | 5.826 | 8.717 | 13.574 | 12.407 | 14.658 | 0.000 | |
| PS | 62.552 | 59.000 | 66.000 | 89.229 | 85.724 | 92.304 | 0.000 | |||
| MC | Asia | 1.348 | 1.000 | 2.000 | 1.313 | 1.017 | 2.006 | 1.000 | ||
| Central America | 2.000 | 2.000 | 2.000 | 1.118 | 1.000 | 1.812 | 0.026 | |||
| Europe | 6.630 | 5.000 | 11.000 | 4.367 | 3.541 | 5.536 | 0.018 | |||
| North America | 3.602 | 3.000 | 5.000 | 2.424 | 2.011 | 3.149 | 0.107 | |||
| Russia | 3.022 | 2.000 | 4.000 | 1.388 | 1.039 | 2.017 | 0.001 | |||
| South America | 2.000 | 2.000 | 2.000 | 1.004 | 1.000 | 1.000 | 0.002 | |||
| Unknown | 1.873 | 1.000 | 2.000 | 1.003 | 1.000 | 1.000 | 0.001 | |||
| Host | All sequences | AI | Global | 5.290 | 4.037 | 6.617 | 9.511 | 8.757 | 10.269 | 0.000 |
| PS | 41.388 | 39.000 | 44.000 | 53.611 | 52.749 | 53.914 | 0.000 | |||
| MC | Bovine | 108.773 | 98.000 | 146.000 | 30.972 | 25.001 | 40.500 | 0.002 | ||
| Canine | 3.612 | 2.000 | 4.000 | 1.107 | 1.013 | 1.339 | 0.001 | |||
| Mouse | 6.969 | 6.000 | 7.000 | 1.084 | 1.007 | 1.271 | 0.001 | |||
| Domestic pig | 1.827 | 1.000 | 2.000 | 1.080 | 1.005 | 1.284 | 0.006 | |||
| Wild boar | 2.000 | 2.000 | 2.000 | 1.004 | 1.000 | 1.014 | 0.001 | |||
| Balanced dataset | AI | Global | 3.082 | 2.345 | 3.900 | 7.543 | 6.736 | 8.316 | 0.000 | |
| PS | 31.276 | 28.000 | 34.000 | 50.186 | 47.890 | 52.021 | 0.000 | |||
| MC | Bovine | 4.007 | 3.000 | 5.000 | 1.589 | 1.166 | 2.145 | 0.002 | ||
| Canine | 17.428 | 17.000 | 19.000 | 4.366 | 3.386 | 6.276 | 0.001 | |||
| Mouse | 6.662 | 5.000 | 9.000 | 1.472 | 1.110 | 2.062 | 0.001 | |||
| Domestic pig | 2.310 | 2.000 | 4.000 | 1.481 | 1.110 | 2.083 | 0.160 | |||
| Wild boar | 2.000 | 2.000 | 2.000 | 1.029 | 1.000 | 1.110 | 0.007 |
Figure 2Phylogenetic tree reconstructed using the BI on the complete (left figure) and partial (right figure) ORF2 dataset. The collection continent and host have been colour-coded.
Figure 3Plot reporting the alignment of the amino acids of the cap gene with respect to their position in the phylogenetic tree. Different amino acids have been colour-coded. For ease of interpretation, only the results of ML-based analysis are reported.