| Literature DB >> 35769892 |
Wen Wang1, Jun-Hua Tian2, Xiao Chen3, Rui-Xue Hu1, Xian-Dan Lin4, Yuan-Yuan Pei1, Jia-Xin Lv1, Jiao-Jiao Zheng1, Fa-Hui Dai1, Zhi-Gang Song1, Yan-Mei Chen1, Yong-Zhen Zhang1.
Abstract
Over the last several decades, no emerging virus has had a profound impact on the world as the SARS-CoV-2 that emerged at the end of 2019 has done. To know where severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) originated from and how it jumped into human population, we immediately started a surveillance investigation in wild mammals in and around Wuhan when we determined the agent. Herein, coronaviruses were screened in the lung, liver, and intestinal tissue samples from fifteen raccoon dogs, seven Siberian weasels, three hog badgers, and three Reeves's muntjacs collected in Wuhan and 334 bats collected around Wuhan. Consequently, eight alphacoronaviruses were identified in raccoon dogs, while nine betacoronaviruses were found in bats. Notably, the newly discovered alphacoronaviruses shared a high whole-genome sequence similarity (97.9 per cent) with the canine coronavirus (CCoV) strain 2020/7 sampled from domestic dog in the UK. Some betacoronaviruses identified here were closely related to previously known bat SARS-CoV-related viruses sampled from Hubei province and its neighbors, while the remaining betacoronaviruses exhibited a close evolutionary relationship with SARS-CoV-related bat viruses in the RdRp gene tree and clustered together with SARS-CoV-2-related bat coronaviruses in the M, N and S gene trees, but with relatively low similarity. Additionally, these newly discovered betacoronaviruses seem unlikely to bind angiotensin-converting enzyme 2 because of the deletions in the two key regions of their receptor-binding motifs. Finally, we did not find SARS-CoV-2 or its progenitor virus in these animal samples. Due to the high circulation of CCoVs in raccoon dogs in Wuhan, more scientific efforts are warranted to better understand their diversity and evolution in China and the possibility of a potential human agent.Entities:
Keywords: CCoV; SARS-related coronavirus; Wuhan; bats; raccoon dog
Year: 2022 PMID: 35769892 PMCID: PMC9214087 DOI: 10.1093/ve/veac046
Source DB: PubMed Journal: Virus Evol ISSN: 2057-1577
Figure 1.Sampling locations in Wuhan and its surrounding regions in Hubei province, China. SARS-CoV-related and SARS-CoV-2-related viruses identified in China are marked, corresponding to the three lineages in Supplementary Fig. S3. The scattered dots represent Chinese islands. Location abbreviations are as follows: AH, Anhui; FJ, Fujian; GD, Guangdong; GX, Guangxi; GZ, Guizhou; HuB, Hubei; HeB, Hebei; HeN, Henan; HK, Hong Kong; HN, Hunan; JL, Jilin; JX, Jiangxi; LN, Liaoning; SAX, Shaanxi; SC, Sichuan; SX, Shanxi; YN, Yunnan; ZJ, Zhejiang.
Prevalence of coronaviruses in wild animals by species and location in China.
| Family | Species | Wuhan | Xiaogan | Jingmen | Total |
|---|---|---|---|---|---|
| Hipposideridae |
| – | – | 0/107 | 0/107 |
|
| – | 0/3 | – | 0/3 | |
| Rhinolophidae |
| – | 7/86 | – | 7/86 |
|
| – | 0/4 | – | 0/4 | |
|
| – | 0/2 | – | 0/2 | |
|
| – | 2/8 | – | 2/8 | |
|
| – | 0/51 | – | 0/51 | |
| Vespertilionidae |
| – | 0/2 | – | 0/2 |
|
| – | 0/4 | – | 0/4 | |
|
| – | 0/67 | – | 0/67 | |
| Canidae |
| 8/15 | – | – | 8/15 |
| Cervidae |
| 0/3 | – | – | 0/3 |
| Mustelidae |
| 0/3 | – | – | 0/3 |
|
| 0/7 | – | – | 0/7 | |
| Total | 14 | 8/28 | 9/227 | 0/107 | 17/362 |
Note: ‘-’ means that no animals were captured.
Figure 2.ML phylogenetic trees were constructed based on the nt sequences of the whole genome and RdRp, S, M, and N genes of raccoon dog CoVs and other CoVs. Numbers (>70) above or below branches indicate percentage bootstrap values. The trees were mid-point rooted for clarity only. The scale bar represents the number of substitutions per site.
Percent identities of GH8-2 to Alphacoronavirus-1 reference strains.
| Identity (%) to GH8-2 | |||||||
|---|---|---|---|---|---|---|---|
| AphaCoV-1 | Strain | Accession number | Whole genome (nt) | RdRp (nt/aa) | S | M (nt/aa) | N (nt/aa) |
| CCoV-I | 23/03 | KP849472 | 87.3 | 94.6/98.1 | 56.5/47.9 | 87.8/89.0 | 88.1/89.5 |
| CCoV-II | CCoV-HuPn-2018 | MW591993 | 93.4 | 93.8/98.4 | 90.2/93.9 | 92.5/93.9 | 96.0/96.5 |
| CCoV-II | HLJ-071 | KY063616 | 92.1 | 94.1/98.9 | 82.2/83.2 | 92.5/95.4 | 95.6/96.6 |
| CCoV-II | Z19 | MZ420153 | 93.6 | 93.7/98.5 | 90.2/93.7 | 96.7/97.7 | 96.9/99.0 |
| CCoV-II | 1-71 | JQ404409 | 92.8 | 97.1/99.4 | 82.9/83.3 | 94.2/95.8 | 96.8/97.9 |
| CCoV-II | 2020/7 | MT906865 | 97.9 | 97.5/99.4 | 98.3/98.1 | 96.1/96.9 | 99.0/99.0 |
| CCoV-II | 2020/15 | MT906864 | 90.7 | 96.1/99.4 | 70.8/68.7 | 93.0/95.8 | 96.9/97.9 |
| CCoV-II | A76 | JN856008 | 91.0 | 94.3/99.0 | 71.3/68.5 | 92.6/94.3 | 97.3/97.4 |
| CCoV-II | CB/05 | KP981644 | 93.5 | 95.8/98.9 | 83.3/84.0 | 93.7/97.0 | 97.0/97.9 |
| CCoV-II | CCoV/NTU336/F/2008 | GQ477367 | 94.5 | 95.2/99.1 | 93.6/95.9 | 92.6/93.9 | 97.5/99.0 |
| CCoV-II | K378 | KC175340 | 92.6 | 96.9/99.2 | 82.8/82.7 | 94.2/95.8 | 96.7/97.7 |
| CCoV-II | S378 | KC175341 | 92.6 | 96.9/99.2 | 82.8/82.8 | 94.2/95.8 | 96.7/97.7 |
| CCoV-II | SD-A1 | OM451122 | 91.9 | 97.4/99.0 | 79.4/77.6 | 93.5/94.7 | 89.4/89.8 |
| CCoV-II | HeB-G1 | OM451123 | 92.7 | 98.1/99.6 | 80.7/79.9 | 92.6/94.3 | 90.3/91.6 |
| CCoV-II | TN-449 | JQ404410 | 92.1 | 94.2/98.9 | 82.1/83.1 | 92.5/95.4 | 95.6/96.6 |
| CCoV-II | 171 | KC175339 | 84.9 | 92.1/97.4 | 82.4/82.9 | 82.1/85.8 | 79.2/78.6 |
| FCoV-I | C1Je | DQ848678 | 79.2 | 85.9/95.3 | 54.6/46.7 | 81.6/84.0 | 78.9/78.5 |
| FCoV-I | Black | EU186072 | 79.4 | 87.1/95.9 | 54.8/46.5 | 82.3/85.5 | 79.1/78.6 |
| FCoV-II | 79-1146 | NC_002306 | 84.9 | 92.1/97.3 | 82.4/82.9 | 82.1/85.8 | 79.2/78.6 |
| PRCV | ISU-1 | DQ811787 | 91.2 | 93.8/98.3 | 88.8/92.6 | 91.3/95.0 | 92.3/93.2 |
| TGEV | Purdue | AJ271965 | 92.0 | 93.9/98.7 | 89.1/91.5 | 92.1/95.1 | 94.3/94.5 |
Figure 3.Schematic of the annotated GH8-2 genome in comparison to those of representative alphacoronavirus-1.
Coding of potential and putative transcription regulatory sequences of the GH8-2 genome.
| ORF | Location (nt) | Length(nt) | Length (aa) | TRS location | TRS sequence |
|---|---|---|---|---|---|
| 1ab | 312-20,359 | 20,049 | 6,682 | 94 | TCGAA |
| S | 20,356-24,729 | 4,374 | 1,457 | 20,324 | AGTTA |
| 3a | 24,793-25,029 | 237 | 78 | 24,785 | AAGAA |
| 3c | 25,175-25,909 | 735 | 244 | 24,977 | ATATG |
| E | 25,896-26,144 | 249 | 82 | 25,853 | CGGTT |
| M | 26,155-26,943 | 789 | 262 | 26,146 | TTGAA |
| N | 26,956-28,104 | 1,149 | 382 | 26,944 | TATAA |
| 7a | 28,109-28,414 | 306 | 101 | 28,101 | ACGAA |
| 7b | 28,419-29,060 | 642 | 213 |
Underlined and bold type indicates the conserved nt in the TRS core sequence.
Prediction of the putative polyprotein pp1ab cleavage sites in GH8-2.
| Cleavage | First–last amino acid residues | Protein size (aa) | Putative functional domain(s) |
|---|---|---|---|
| nsp1 | 1Met–Gly110 | 110 | |
| nsp2 | 111Ala–Gly879 | 769 | |
| nsp3 | 880Gly–Gly2386 | 1507 | ADRP, PL1pro, PL2pro |
| nsp4 | 2387Ser–Gln2876 | 490 | |
| nsp5 | 2877Ser–Gln3178 | 302 | 3CLpro |
| nsp6 | 3179Ala–Gln3472 | 294 | |
| nsp7 | 3473Ser–Gln3555 | 83 | |
| nsp8 | 3556Ser–Gln3750 | 195 | |
| nsp9 | 3751Asn–Gln3861 | 111 | |
| nsp10 | 3862Ala–Gln3996 | 135 | |
| nsp11 | 3997Ser–Asp4015 | 19 | Short peptide at the end of ORF1a |
| nsp12 | 3997Ser–Gln4925 | 929 | RdRp |
| nsp13 | 4926Ala–Gln5524 | 599 | Hel |
| nsp14 | 5525Ala–Gln6043 | 519 | ExoN |
| nsp15 | 6044Ser–Gln6382 | 339 | NendoU |
| nsp16 | 6383Ser–Pro6682 | 300 | O-MT |
Identities of nt and aa sequences of new bat coronaviruses to representative sarbecoviruses.
| RdRp (nt/aa) | S (nt/aa) | M (nt/aa) | N (nt/aa) | |||||
|---|---|---|---|---|---|---|---|---|
| Strain | XG-104 | XG-145 | XG-104 | XG-145 | XG-104 | XG-145 | XG-104 | XG-145 |
| HN2021A | 98.3/99.1 | 92.4/99.2 | 95.6/98.8 | 75.8/82.2 | 98.4/100 | 82.7/89.6 | 95.6/97.9 | 91.4/94.0 |
| HN2021G | 97.6/99.4 | 92.8/99.5 | 96.4/98.7 | 75.4/82.2 | 99.4/99.6 | 83.3/89.6 | 97.5/98.3 | 93.0/94.3 |
| RsSHC014 | 97.7/99.3 | 93.0/99.7 | 73.7/77.9 | 78.4/81.2 | 84.7 /89.6 | 93.2 /96.4 | 91.9/93.6 | 97.2/98.3 |
| WIV1 | 97.6/99.1 | 92.9/99.6 | 74.1/78.0 | 78.5/81.1 | 84.7/89.6 | 93.2 /96.4 | 91.8/93.6 | 97.1/98.3 |
| Rs672/2006 | 97.2/99.4 | 93.1/99.7 | 75.8/81.4 | 87.8/95.2 | 84.5/89.2 | 93.1/96.8 | 91.9/93.6 | 97.7/99.3 |
| SARS-CoV Tor2 | 97.2/99.1 | 93.2/99.5 | 73.5/77.1 | 78.0/80.5 | 84.5/89.6 | 93.4/97.7 | 91.7/93.8 | 96.8/98.6 |
| SZ3 | 97.2/99.0 | 93.2/99.4 | 73.4/76.8 | 77.9/80.2 | 84.5/89.2 | 93.7/97.7 | 91.7/93.8 | 96.8/98.6 |
| Cp/Yunnan2011 | 94.0/98.7 | 92.8/99.2 | 75.4/81.8 | 85.0/92.9 | 86.2/90.5 | 92.3/97.7 | 91.6/93.8 | 97.1/99.3 |
| bat-SL-CoVZC45 | 92.7/98.6 | 92.6/99.0 | 95.3/99.0 | 75.5/82.5 | 97.6/100 | 83.0/89.6 | 95.1/96.7 | 90.7/92.8 |
| JSB_Rsin | 92.4/98.7 | 99.4/99.7 | 76.2/82.6 | 94.6/95.7 | 85.0/90.1 | 95.3/99.1 | 92.5/93.6 | 99.3/99.5 |
| Rf1 | 92.3/98.3 | 92.2/98.6 | 75.5/81.6 | 82.0/88.6 | 83.8/89.2 | 91.9/96.8 | 89.6/91.7 | 94.1/96.9 |
| BtRs-BetaCoV/HuB2013 | 92.2/98.9 | 97.5/99.7 | 76.3/82.2 | 87.7/91.5 | 85.0/90.1 | 95.3/99.1 | 92.0/93.8 | 98.3/99.8 |
| HKU3-1 | 91.7/98.3 | 92.4/98.8 | 76.2/83.1 | 85.4/93.9 | 84.7/90.1 | 95.0/99.1 | 91.7/92.6 | 96.2/97.9 |
| SARS-CoV-2 Wuhan-Hu-1 | 88.3/96.2 | 87.3/96.2 | 77.2/82.1 | 72.7/76.6 | 94.5/98.7 | 83.8/90.5 | 89.4/92.9 | 88.6/91.4 |
| RaTG13 | 88.0/96.2 | 86.9/96.2 | 77.5/81.8 | 73.0/76.5 | 94.3 /99.1 | 83.2 /89.6 | 89.4/93.6 | 88.2/91.6 |
| bat/Yunnan/RpYN06/2020 | 88.2/96.2 | 87.0/96.2 | 92.4/98.0 | 75.3/82.1 | 93.2/99.1 | 83.0/89.6 | 90.0/93.3 | 88.7/91.9 |
| RacCS203 | 87.8/96.1 | 86.9/96.1 | 74.9/78.4 | 73.2/78.9 | 94.3/98.6 | 83.5/89.6 | 91.1/95.2 | 88.9/93.1 |
| BM48-31 | 87.8/97.6 | 87.4/98.2 | 68.6/71.9 | 71.0/75.9 | 76.8/87.4 | 79.9/91.4 | 77.5/88.5 | 78.7/88.7 |
| Rc-o319 | 86.9/96.6 | 86.4/96.7 | 72.9/77.6 | 71.0/75.9 | 85.9/91.5 | 85.6/93.7 | 86.7/89.7 | 87.1/90.6 |
| PCoV_GX-P2V | 86.4/96.0 | 86.4/96.0 | 76.8/81.5 | 73.4/76.4 | 91.5 /98.2 | 82.1/89.6 | 89.5/92.3 | 88.1/93.0 |
| MP789 | 85.1/96.1 | 84.6/96.1 | 80.4/86.8 | 72.1/76.6 | 94.2/99.6 | 82.9/89.2 | 89.7/93.1 | 88.2/91.9 |
Figure 4.ML phylogenetic trees were based on the nt sequences of the RdRp, S, M, and N genes of sarbecoviruses. Numbers (>70) above or below branches indicate percentage bootstrap values. The trees were mid-point rooted for clarity only. The scale bar represents the number of substitutions per site. All viruses found in this study are labeled in red. SARS-CoV and SARS-CoV-related virus groups are labeled in red, and SARS-CoV-2 and SARS-CoV-2 related virus group are labeled in green. The GenBank and GISAID accession numbers are available in Supplementary Table S3.