| Literature DB >> 30976080 |
Amber Papineau1,2, Yohannes Berhane1, Todd N Wylie3,4, Kristine M Wylie3,4, Samuel Sharpe5, Oliver Lung6,7.
Abstract
The complete genome of a novel coronavirus was sequenced directly from the cloacal swab of a Canada goose that perished in a die-off of Canada and Snow geese in Cambridge Bay, Nunavut, Canada. Comparative genomics and phylogenetic analysis indicate it is a new species of Gammacoronavirus, as it falls below the threshold of 90% amino acid similarity in the protein domains used to demarcate Coronaviridae. Additional features that distinguish the genome of Canada goose coronavirus include 6 novel ORFs, a partial duplication of the 4 gene and a presumptive change in the proteolytic processing of polyproteins 1a and 1ab.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30976080 PMCID: PMC6459860 DOI: 10.1038/s41598-019-42355-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Genome organization of Canada goose coronavirus. Purple indicates untranslated regions, blue indicates putative proteins, green indicates coding region of mature non-structural proteins (NSP) and red indicates transcription regulatory sequences (TRS). The stem loop-like motif and octamer motif are contained within the 3′ UTR. Genome organization figure was constructed using GeneiousTM (Biomatters, v 9.1.8). *Indicate ACoV 4b homologues. Proteins are named numerically from the 5′ end of the genome, with the exception of the structural genes, which are denoted by their common names.
Putative viral proteins of Canada goose coronavirus.
| Protein | Top Match in NCBI | Top match - aa % identity* | Size (aa) | Distance between TRS and start codon (nt) |
|---|---|---|---|---|
| 1a | 1a-Infectious bronchitis virus strain B1648 | 43 | 3825 | 480 |
| 1ab | 1ab-Infectious bronchitis virus strain ck/CH/LJL/05I | 57 | 6510 | 480 |
| Spike | Spike-Infectious bronchitis virus strain N2-75 | 53 | 1184 | 82 |
| 3 | n/a | n/a | 53 | 0 |
| 4a | n/a | n/a | 55 | 3 |
| Envelope | Envelope-Infectious bronchitis virus strain IS-1494 | 69 | 100 | n/a |
| Membrane | Membrane-Duck Coronavirus isolate DK/GD/2014 | 72 | 235 | 74 |
| 5b | 4b-Infectious bronchitis virus strain Georgia 1998 Vaccine | 41 | 88 | n/a |
| 6 | n/a | n/a | 63 | 5 |
| 7a | 4b-Duck Coronavirus isolate DK/GD/2014 | 23 | 92 | 3 |
| 7b | n/a | n/a | 69 | n/a |
| 8a | 5a-Duck Coronavirus isolate DK/GD/2014 | 37 | 65 | 4 |
| 8b | 5b-Duck Coronavirus isolate DK/GD/2014 | 46 | 85 | n/a |
| Nucleocapsid | Nucleocapsid-Goose Coronavirus | 94 | 414 | 94 |
| 10 | ORFxg-Goose Coronavirus | 92 | 97 | 0 |
| 11 | ORFyg-Goose Coronavirus | 81 | 180 | 91 |
*Matches below 20% coverage not shown.
Non-structural proteins size and cleavage site of gammacoronaviruses.
| Protein | CGCoV | TCoV | IBV | SW1 | ||||
|---|---|---|---|---|---|---|---|---|
| Cleavage site | Size aa | Cleavage site | Size aa | Cleavage site | Size aa | Cleavage site | Size aa | |
| NSP1/2 | AG^GH | 609 | AG^GK | 673 | AG^GK | 673 | VD^GD | 636 |
| NSP3 | AG^GV | 1532 | AG^GV | 1594 | AG^GI | 1592 | LG^GV | 1586 |
| NSP4 | LQ^AG | 503 | LQ^AG | 514 | LQ^SG | 514 | LQ^AG | 537 |
| NSP5 | LQ^SN | 307 | LQ^SS | 307 | LQ^SS | 307 | LQ^SN | 303 |
| NSP6 | VQ^SK | 295 | VQ^SK | 297 | VQ^AK | 293 | VQ^SK | 303 |
| NSP7 | LQ^AV | 83 | LQ^SV | 83 | LQ^SV | 83 | LQ^AV | 83 |
| NSP8 | LQ^NN | 212 | LQ^NN | 210 | LQ^NN | 210 | LQ^NN | 198 |
| NSP9 | LQ^GK | 111 | LQ^SK | 111 | LQ^SK | 111 | LQ^HG | 112 |
| NSP10 |
|
|
|
|
|
|
|
|
| NSP11 |
|
|
|
|
|
|
|
|
| NSP12 |
|
|
|
|
|
|
|
|
| NSP13 | LQ^SC | 599 | LQ^SC | 601 | LQ^SC | 600 | LQ^AS | 601 |
| NSP14 | LQ^SN | 522 | LQ^GT | 521 | LQ^GT | 514 | LQ^SQ | 528 |
| NSP15 | LQ^SI | 338 | LQ^SI | 338 | LQ^SI | 338 | LQ^SL | 349 |
| NSP16 | LQ^SG | 298 | LQ^SA | 302 | LQ^SA | 302 | LQ^SD | 312 |
*Amino acids present in CGCoV where putative protease cleavage sites were observed in TCoV, IBV and SW1.
Figure 2The phylogeny of gammacoronavirus spike and nucleocapsid proteins. A maximum likelihood tree built, using the amino acid sequences of the spike protein (a) and nucleocapsid protein (b) domains aligned with ClustalW[31], in MEGA X using the Jones-Taylor-Thornton (JTT) substitution model and 1000 bootstraps[32]. IBV Infectious Bronchitis virus, TCoV Turkey Coronavavirus, PCoV Pigeon Coronavirus, DCoV Duck Coronavirus.
Figure 3The phylogeny of Canada goose coronavirus. A maximum likelihood tree built, using the concatenated amino acid sequences of the replicase and helicase protein domains aligned with ClustalW[31], in MEGA X using the Jones-Taylor-Thornton (JTT) substitution model and 1000 bootstraps[32]. Numbers at nodes indicate the bootstrap value.
Comparison of the amino acid pairwise identity of 7 conserved coronavirus domains in the poly1ab protein of Canada goose coronavirus to other gammacoronaviruses.
| Domain | aa % identity to IBV | aa % identity to TCoV | aa % identity to DCoV | aa % Identity to SW1 |
|---|---|---|---|---|
| ADP-ribose-1″-phosphatase | 42 | 43 | 38 | 23 |
| 3C-like Protease | 56 | 58 | 57 | 49 |
| RdRp | 80 | 80 | 83 | 69 |
| Helicase 1 | 89 | 90 | 92 | 78 |
| Exonuclease | 78 | 72 | 77 | 56 |
| Endoribonuclease | 53 | 53 | 54 | 41 |
| Ribose-2′-O methyltransferase | 74 | 77 | 76 | 65 |
| Average | 67 | 68 | 68 | 54 |