Literature DB >> 32315281

Possible Bat Origin of Severe Acute Respiratory Syndrome Coronavirus 2.

Susanna K P Lau, Hayes K H Luk, Antonio C P Wong, Kenneth S M Li, Longchao Zhu, Zirong He, Joshua Fung, Tony T Y Chan, Kitty S C Fung, Patrick C Y Woo.   

Abstract

We showed that severe acute respiratory syndrome coronavirus 2 is probably a novel recombinant virus. Its genome is closest to that of severe acute respiratory syndrome-related coronaviruses from horseshoe bats, and its receptor-binding domain is closest to that of pangolin viruses. Its origin and direct ancestral viruses have not been identified.

Entities:  

Keywords:  COVID-19; SARS-CoV-2; bats; coronavirus disease; genomic analysis; recombinant virus; recombination; respiratory infections; severe acute respiratory syndrome coronavirus 2; viruses; zoonoses

Mesh:

Year:  2020        PMID: 32315281      PMCID: PMC7323513          DOI: 10.3201/eid2607.200092

Source DB:  PubMed          Journal:  Emerg Infect Dis        ISSN: 1080-6040            Impact factor:   6.883


Seventeen years after the severe acute respiratory syndrome (SARS) epidemic, an outbreak of pneumonia, now called coronavirus disease (COVID-19), was reported in Wuhan, China. Some of the early case-patients had a history of visiting the Huanan Seafood Wholesale Market, where wildlife mammals are sold, suggesting a zoonotic origin. The causative agent was rapidly isolated from patients and identified to be a coronavirus, now designated as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) by the International Committee on Taxonomy of Viruses (). SARS-CoV-2 has spread rapidly to other places; 113,702 cases and 4,012 deaths had been reported in 110 countries/areas as of March 10, 2020 (). In Hong Kong, 130 cases and 3 deaths had been reported. SARS-CoV-2 is a member of subgenus Sarbecovirus (previously lineage b) in the family Coronaviridae, genus Betacoronavirus, and is closely related to SARS-CoV, which caused the SARS epidemic during 2003, and to SARS-related-CoVs (SARSr-CoVs) in horseshoe bats discovered in Hong Kong and mainland China (–). Whereas SARS-CoV and Middle East respiratory syndrome coronavirus were rapidly traced to their immediate animal sources (civet and dromedaries, respectively), the origin of SARS-CoV remains obscure. SARS-CoV-2 showed high genome sequence identities (87.6%–87.8%) to SARSr-Rp-BatCoV-ZXC21/ZC45, detected in Rhinolophus pusillus bats from Zhoushan, China, during 2015 (). A closer-related strain, SARSr-Ra-BatCoV-RaTG13 (96.1% genome identity with SARS-CoV-2), was recently reported in Rhinolophus affinis bats captured in Pu’er, China, during 2013 (). Subsequently, Pangolin-SARSr-CoV/P4L/Guangxi/2017 (85.3% genome identities to SARS-CoV-2) and related viruses were also detected in smuggled pangolins captured in Nanning, China, during 2017 () and Guangzhou, China, during 2019 (). To elucidate the evolutionary origin and pathway of SARS-CoV-2, we performed an in-depth genomic, phylogenetic, and recombination analysis in relation to SARSr-CoVs from humans, civets, bats, and pangolins ().

The Study

We downloaded 4 SARS-CoV-2, 16 human/civet-SARSr-CoV, 63 bat-SARSr-CoV and 2 pangolin-SARSr-CoV genomes from GenBank and GISAID (https://www.gisaid.org). We also sequenced the complete genome of SARS-CoV-2 strain HK20 (GenBank accession no. MT186683) from a patient with COVID-19 in Hong Kong. We performed genome, phylogenetic, and recombination analysis as described (). The 5 SARS-CoV-2 genomes had overall 99.8%–100% nt identities with each other. These genomes showed 96.1% genome identities with SARSr-Ra-BatCoV-RaTG13, 87.8% with SARSr-Rp-BatCoV-ZC45, 87.6% with SARSr-Rp-BatCoV-ZXC21, 85.3% with pangolin-SARSr-CoV/P4L/Guangxi/2017, and 73.8%–78.6% with other SARSr-CoVs, including human/civet-SARSr-CoVs (Table 1).
Table 1

Comparison of pairwise percentage amino acid identities between SARS-CoV-2 HK20, 4 other strains of SARS-CoV-2, and other SARSr-CoVs with complete genomes available*

Virusnsp1nsp2nsp3nsp4nsp5nsp6nsp7nsp8nsp9nsp10nsp12nsp13nsp14nsp15nsp16SpikeNTDCTDS2ORF3ORF8EMNGenome
SARS-CoV-2 HKU-SZ-005b_2020/Guangdong/202010010010010010010010010010010010010010010010099.310010010010099.2100100100100
SARS-CoV-2 Wuhan-Hu-1/Hubei/201910010010010010010010010010010010010010010010010010010010010010010010010099.9
SARS-CoV-2 WIV02/Hubei/201910010010010010010010010010010010010010010099.799.310010010010010010010010099.8
SARS-CoV-2 HKU-SZ-002a_2020/Guangdong/202010010010010010010010010010010010010010010010099.310010010010099.210010010099.8
SARSr-Ra-BatCoV RaTG13/R.affinis/Yunnan/201396.798.396.799.699.399.710010010010099.699.799.297.710096.798.689.399.197.895.010099.599.096.1
SARSr-Rp-BatCoV bat-SL-CoVZC45/R.pusillus/Zhejiang/201595.695.393.996.899.097.910097.597.397.195.999.294.589.098.080.266.365.590.890.994.210098.694.387.8
SARSr-Rp-BatCoV bat-SL-CoVZXC21/R.pusillus/Zhejiang/201595.696.493.096.299.097.698.896.096.597.895.698.794.788.298.079.665.264.590.592.094.210098.694.387.6
Pangolin-SARSr-CoV Guangxi/P4L/201790.688.485.590.297.196.210097.597.397.197.998.297.094.597.791.788.086.895.689.587.610098.293.885.3
SARSr-Rm-BatCoV Longquan-140/R.monoceros/Zhejiang/201283.982.875.981.295.886.610097.597.394.395.799.094.989.094.375.849.366.090.561.593.490.589.580.1
SARSr-Rs-BatCoV Rs4231/R.sinicus/Yunnan/201385.067.976.381.296.187.210097.097.397.896.499.395.689.093.376.352.277.287.573.857.993.490.190.879.5
SARSr-Rs-BatCoV Rs3367/R.sinicus/Yunnan/201285.067.476.180.496.186.910097.597.397.896.499.395.488.793.076.853.776.687.874.257.994.790.190.579.5
SARSr-Rs-BatCoV Rs9401/R.sinicus/Yunnan/201584.468.376.380.896.187.210097.597.397.896.199.395.689.093.076.653.776.187.774.257.993.489.690.579.5
SARSr-Rs-BatCoV RsSHC014/R.sinicus/Yunnan/201185.067.975.981.095.887.210097.597.397.896.499.395.189.092.677.053.777.787.974.258.794.790.190.379.4
SARSr-Ra-BatCoV LYRa11/R.affinis/Yunnan/201182.268.076.180.895.886.610097.596.596.496.198.895.189.093.076.049.375.188.376.057.993.489.691.079.4
SARSr-Rs-BatCoV Rs4237/R.sinicus/Yunnan/201383.967.776.181.096.187.698.897.597.397.896.799.595.489.094.675.050.065.089.172.057.994.790.590.879.4
SARSr-Rs-BatCoV Rs4247/R.sinicus/Yunnan/201383.967.776.381.096.187.698.897.597.397.896.699.395.689.094.675.249.765.589.372.457.093.490.190.579.4
Human SARS-CoV TOR2/Toronto/Mar200384.468.376.080.096.187.298.897.597.397.196.499.795.188.793.375.451.974.187.172.494.790.590.579.4
Human SARS-CoV GZ02/Guangdong/Feb200383.968.376.179.896.187.298.897.597.397.196.499.395.188.793.675.752.674.687.373.129.794.790.590.579.4
SARSr-Rs-BatCoV Rs4081/R.sinicus/Yunnan/201285.067.976.381.096.187.610097.597.397.896.499.295.889.094.374.448.164.089.072.058.794.790.190.379.3
SARS-As-BatCoV As6526/Aselliscus stoliczkanus/Yunnan/201484.467.976.381.095.887.210097.597.397.896.199.395.689.094.675.149.766.088.971.658.794.791.090.079.3
Human SARS-CoV BJ01/Beijing84.468.376.080.496.187.298.897.597.397.196.499.595.188.793.375.652.274.187.372.494.790.590.579.3
Civet SARS-CoV SZ3/Shenzhen/201384.468.576.180.696.187.298.897.597.397.196.499.595.188.793.675.751.975.187.372.429.794.790.190.579.3
Civet SARS-CoV SZ16/Shenzhen/201384.468.576.080.696.187.298.897.597.397.196.499.595.188.793.675.751.975.187.372.429.794.790.190.579.3
SARSr-Ra-BatCoV YN2018A/R.affinis/Yunnan?/201684.467.976.380.896.186.910097.597.397.896.599.595.489.064.175.249.765.589.171.357.994.789.690.379.2
SARSr-Rs-BatCoV Rs4255/R.sinicus/Yunnan/201385.067.976.180.896.187.210097.597.397.896.199.395.688.794.374.548.165.089.072.058.794.789.290.379.2
SARSr-Rs-BatCoV Rp3/R.sinicus/Guangxi/200482.867.775.480.695.887.210096.096.597.896.299.295.488.493.375.049.365.589.374.556.294.790.190.379.2
SARSr-Rs-BatCoV Rs4084/R.sinicus/Yunnan/201284.468.376.380.495.887.210097.597.397.896.298.595.188.792.676.953.777.287.974.294.790.190.079.2
Civet SARS-CoV PC4–227/Guangdong/200484.468.276.080.696.186.998.897.096.597.196.299.394.988.793.075.452.275.186.772.494.790.190.079.2
SARSr-Rs-BatCoV HuB2013/R.sinicus/Hubei/201383.969.376.081.096.186.610097.597.397.896.199.095.688.294.074.947.866.089.475.349.693.491.091.079.1
SARSr-Rm-BatCoV Rm1/R.macrotis/Hubei/200480.069.174.880.695.886.610097.096.597.195.998.894.987.093.074.447.665.089.074.557.093.491.090.079.1
SARSr-Rs-BatCoV Anlong-103/Guizhou/R.sinicus/201384.468.275.880.896.186.998.897.597.397.196.599.395.689.093.675.150.264.589.172.029.794.789.689.679.1
Civet SARS-CoV civet020/Guangdong/200484.468.075.980.696.187.297.697.097.397.196.499.394.788.793.375.452.275.186.772.429.794.790.190.079.1
SARSr-Rs-BatCoV HKU3-12/R.sinicus/Hong Kong/200783.968.875.381.895.886.610097.596.597.895.998.794.788.293.375.849.066.590.573.557.094.791.089.179.0
SARSr-Rs-BatCoV HKU3-7/R.sinicus/Guangdong/200683.968.575.581.495.887.210096.597.397.895.899.294.587.994.075.749.066.590.373.557.093.490.190.379.0
SARSr-Rs-BatCoV HKU3–8/R.sinicus/Guangdong/200683.968.575.681.495.487.210096.597.397.895.899.294.587.994.075.949.366.590.573.592.189.690.379.0
SARSr-Rs-BatCoV HKU3–1/R.sinicus/Hong Kong/200583.969.075.281.495.487.210096.597.397.895.698.794.788.293.375.749.066.590.373.157.094.791.089.579.0
SARSr-Rs-BatCoV HKU3–3/R.sinicus/Hong Kong/200583.969.075.281.495.487.210096.597.397.895.698.794.788.293.375.749.066.590.373.157.094.791.089.579.0
SARSr-Rs-BatCoV HKU3–9/R.sinicus/Hong Kong/200683.969.075.281.495.187.210096.597.397.895.698.894.787.693.375.749.066.590.272.757.094.791.089.379.0
SARSr-Rf-BatCoV Rf4092/R.ferrumequinum/Yunnan/201284.467.776.181.096.187.298.896.597.397.896.599.595.189.093.072.639.966.088.671.328.994.789.690.579.0
SARSr-Rs-BatCoV Anlong-112/Guizhou/R.sinicus/201383.968.276.080.896.187.298.897.597.397.896.599.395.689.093.675.150.264.589.172.029.794.789.689.679.0
SARSr-Rs-BatCoV HKU3–2/R.sinicus/Hong Kong/200583.969.075.381.495.487.210096.597.397.895.698.894.787.693.375.749.066.590.373.157.094.791.089.878.9
SARSr-Rs-BatCoV HKU3–13/R.sinicus/Hong Kong/200783.969.075.181.495.487.210096.597.397.895.698.894.587.693.375.749.066.590.272.757.094.791.089.378.9
UNVERIFIED: SARSr-Rp-BatCoV F46/R.pusillus/Yunnan/201281.768.375.880.896.186.298.897.596.597.196.499.395.689.693.672.639.966.088.970.926.694.789.690.578.9
Human SARS-CoV GZ0401/Guangdong/Dec200384.468.245.980.696.187.297.697.097.397.196.299.394.988.793.375.351.975.186.764.429.794.784.790.078.9
SARSr-Rl-BatCoV SC2018/Rhinolophus sp./201683.369.374.981.295.886.910097.597.397.896.198.895.486.792.374.447.466.088.970.955.494.790.190.778.8
SARSr-Rf-BatCoV YNLF_31C/R.ferrumequinum/Yunnan/201378.969.075.880.895.886.697.697.097.397.196.299.295.688.793.073.449.365.586.270.929.794.789.690.078.7
Civet SARS-CoV B039/Guangdong/200484.468.275.980.696.187.298.897.097.397.196.499.594.788.793.375.452.275.186.772.429.794.790.190.078.6
Civet SARS-CoV civet007/Guangdong/200484.468.075.980.696.187.298.897.097.397.196.499.594.988.793.075.351.975.186.672.029.794.790.190.078.6
Civet SARS-CoV A022/Guangdong/200484.468.276.080.696.187.298.897.097.397.196.499.594.988.792.675.351.975.186.672.029.794.790.189.878.5
SARSr-Rf-BatCoV Rf1/R.ferrumequinum/Hubei/200478.968.875.381.696.186.610097.097.397.196.299.094.988.291.973.650.066.086.268.731.393.489.689.178.4
SARSr-Rp-BatCoV Rp/Shaanxi2011/R.pusillus/Shaanxi/201184.469.375.881.296.186.610097.597.397.896.199.296.088.493.674.748.066.089.172.458.793.490.591.078.4
SARSr-Rf-BatCoV Jiyuan-84/R.ferrumequinum/ Henan/201277.868.875.381.495.486.910097.097.397.196.299.095.889.091.973.349.766.085.870.228.993.489.689.878.3
SARSr-Cp-BatCoV Cp/Yunnan2011/C. plicata/Yunnan/201181.768.076.581.096.186.610097.096.597.896.098.094.789.393.074.549.065.588.772.057.994.791.490.378.2
SARSr-Ra-BatCoV YN2018B/R.affinis/Yunnan?/201685.068.276.380.696.187.210097.597.397.896.499.395.689.093.076.653.476.187.873.857.994.790.190.878.1
SARSr-Rs-BatCoV Rs7327/R.sinicus/Yunnan/201484.467.976.280.896.187.210097.597.397.896.299.295.689.093.076.753.776.187.873.857.094.790.190.578.1
SARSr-Rs-BatCoV Rs4874/R.sinicus/Yunnan/201385.067.976.381.096.187.610097.597.397.896.298.795.388.793.376.552.276.687.974.257.994.789.690.378.1
SARSr-Rs-BatCoV WIV16/R.sinicus/Yunnan/201385.067.976.281.096.187.610097.597.397.896.199.394.788.793.376.552.276.687.974.257.994.790.190.378.1
SARSr-Ra-BatCoV YN2018D/R.affinis/Yunnan?/201685.068.276.381.096.187.210097.597.397.896.599.395.489.094.074.448.164.089.172.757.994.776.190.577.9
SARSr-Rf-BatCoV SX2013/R.ferrumequinum/Shanxi/201378.369.175.381.295.886.610096.597.397.195.999.095.889.091.973.349.766.085.870.930.592.189.689.877.9
SARSr-Rf-BatCoV HeB2013/R.ferrumequinum/Hebei/201378.969.175.181.695.886.610097.097.397.196.199.095.888.791.973.449.766.085.970.530.593.489.689.877.7
SARSr-Rs-BatCoV YN2013/R.sinicus/Yunnan/201383.967.976.180.696.187.210097.597.397.896.499.395.888.493.375.150.964.089.071.329.794.791.090.877.5
SARSr- Rs-BatCoV Rs672/2006/R.sinicus/Guizhou/200685.067.968.380.495.487.210097.597.397.896.598.895.688.294.374.247.864.089.071.658.794.789.690.577.5
SARSr-Rs-BatCoV GX2013/R.sinicus/Guangxi/201383.368.375.980.295.887.210097.596.597.196.599.395.489.094.075.149.766.589.072.729.794.791.489.677.3
SARSr-Rf-BatCoV 16BO133/R.ferrumequinum/Korea/201678.969.167.681.295.185.910096.597.397.196.198.594.988.292.671.239.065.086.668.093.489.688.876.9
SARSr-Rf-BatCoV JL2012/R.ferrumequinum/Jilin/201278.969.074.881.294.885.998.896.597.397.196.298.395.187.992.671.239.565.086.668.092.189.689.376.9
SARSr-Rf-BatCoV JTMC15/R.ferrumequinum/Jilin/201378.969.067.281.294.885.998.896.597.397.196.298.394.988.292.671.239.365.086.668.090.890.189.376.4
SARSr-Rs-BatCoV BtKY72/Rhinolophus sp./Kenya/200782.263.271.682.295.482.194.095.597.395.095.096.792.887.390.671.646.672.782.563.619.194.786.986.275.5
SARSr-Rb-BatCoV BM48–31/BGR/2008/R.blasii/Bulgaria/200881.762.471.381.094.183.895.296.598.295.095.297.793.589.988.671.344.169.783.462.592.185.987.675.2

*SARS-CoV-2 strains are highlighted in tan, and SARSr-CoV strains that had the highest end identity for each gene with SARS-CoV-2 are highlighted in red. SARSr-CoV strains were listed in descending order according to complete genome identities to SARS-CoV-2 HK20/Hong Kong/2020.. Pairwise amino acid identities could not be calculated for some strains because of completely or partially truncated ORF8. CTD, C-terminal domain; E, envelope; M, matrix; N, nucleocapsid; nsp, nonstructural protein; NTD, N-terminal domain; ORF, open reading frame; S, spike; SARS-CoV- 2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus; –, not applicable..

*SARS-CoV-2 strains are highlighted in tan, and SARSr-CoV strains that had the highest end identity for each gene with SARS-CoV-2 are highlighted in red. SARSr-CoV strains were listed in descending order according to complete genome identities to SARS-CoV-2 HK20/Hong Kong/2020.. Pairwise amino acid identities could not be calculated for some strains because of completely or partially truncated ORF8. CTD, C-terminal domain; E, envelope; M, matrix; N, nucleocapsid; nsp, nonstructural protein; NTD, N-terminal domain; ORF, open reading frame; S, spike; SARS-CoV- 2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus; –, not applicable.. Most predicted proteins of SARS-CoV-2 showed high amino acid sequence identities with that of SARSr-Ra-BatCoV RaTG13, except the receptor-binding domain (RBD) region. SARS-CoV-2 possessed an intact open reading frame 8 without the 29-nt deletion found in most human SARS-CoVs. The concatenated conserved replicase domains for coronavirus species demarcation by the International Committee on Taxonomy of Viruses showed >92.9% aa identities (threshold >90% for same species) between SARS-CoV-2 and other SARSr-CoVs, supporting their classification under the same coronavirus species (Table 2) ().
Table 2

Percentage amino acid identity between 7 conserved domains of the replicase polyprotein for species demarcation in SARS-CoV-2 and selected members of the subgenus Sarbecovirus*

Virus% Amino acid identity compared with that for SARS-CoV-2
ADRPnsp5nsp12nsp13nsp14nsp15nsp16Seven concatenated 
domains
Human SARS-CoV TOR2/Toronto/Mar200379.396.196.499.895.188.793.395.0
Civet SARS-CoV SZ3/Shenzhen/201379.396.196.499.795.188.793.695.0
Civet SARS-CoV PC4–136/Guangdong/200478.496.196.499.394.788.793.394.8
Human SARS-CoV GZ040278.495.896.299.594.951.793.394.8
SARSr-Rs-BatCoV HKU3–1/R.sinicus/Hong Kong/200579.395.495.698.894.788.293.394.5
SARSr-Rp-BatCoV Rp/Shaanxi2011/R.pusillus/Shaanxi/201178.496.196.199.396.088.493.695.0
SARSr-Rs-BatCoV Rs672/2006/R.sinicus/Guizhou/200680.295.496.599.095.688.294.395.0
SARSr-Rs-BatCoV WIV180.295.896.299.595.489.093.095.0
SARSr-Rf-BatCoV YNLF_31C/R.ferrumequinum/Yunnan/201380.295.896.299.395.688.793.095.0
SARSr-Rf-BatCoV 16BO133/R.ferrumequinum/Korea/201679.395.196.198.794.988.292.694.6
SARSr-Rf-BatCoV JTMC15/R.ferrumequinum/Jilin/201379.394.896.298.594.988.292.694.5
SARSr-Rs-BatCoV BtKY72/Rhinolophus sp./Kenya/200774.895.495.096.892.887.390.692.9
SARSr-Rm-BatCoV Longquan-140/R.monoceros/Zhejiang/201279.395.895.799.294.989.094.394.8
SARSr-Rb-BatCoV BM48–31/BGR/2008/R.blasii/Bulgaria/200878.494.195.297.893.589.988.693.5
SARSr-Rp-BatCoV ZXC21/R.pusillus/Zhejiang/201588.399.095.698.894.788.298.095.4
SARSr-Rp-BatCoV ZC45/R.pusillus/Zhejiang/201792.899.095.999.394.589.098.095.8
Pangolin-SARSr-CoV Guangxi/P4L/201785.597.197.998.297.094.597.796.9
SARSr-Ra-BatCoV RaTG13/R.affinis/Yunnan/201396.799.399.699.799.297.710099.2

*ADRP, ADP-ribose-1′′ phosphatase; nsp, nonstuctural protein; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus.

*ADRP, ADP-ribose-1′′ phosphatase; nsp, nonstuctural protein; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus. Unlike other members of the subgenus Sarbecovirus, SARS-CoV-2 has a spike protein that contains a unique insertion that results in a potential cleavage site at the S1/S2 junction, which might enable proteolytic processing that enhances cell–cell fusion. SARS-CoV-2 was demonstrated to use the same receptor, human angiotensin-converting enzyme 2 (hACE2), as does SARS-CoV (). The predicted RBD region of SARS-CoV-2 spike protein, corresponding to aa residues 318–513 of SARS-CoV (), showed the highest (97% aa) identities with pangolin-SARSr-CoV/MP789/Guangdong and 74.1%–77.7% identities with human/civet/bat-SARSr-CoVs known to use hACE2 (Table 1). Moreover, similar to the human/civet/bat-SARSr-CoV hACE2-using viruses, the 2 deletions (5 aa and 12 aa) found in all other SARSr-BatCoVs () were absent in SARS-CoV-2 RBD (Appendix Figure 1). Of the 5 critical residues needed for RBD-hACE2 interaction in SARSr-CoVs (), 3 (F472, N487, and Y491) were present in SARS-CoV-2 RBD and pangolin SARSr-CoV/MP789/2019-RBD. Phylogenetic analysis showed that the RNA-dependent RNA polymerase gene of SARS-CoV-2 is most closely related to that of SARSr-Ra-BatCoV RaTG13, whereas its predicted RBD is closest to that of pangolin-SARSr-CoVs (Figure 1). This finding suggests a distinct evolutionary origin for SARS-CoV-2 RBD, possibly as a result of recombination. Moreover, the SARS-CoV-2 RBD was also closely related to SARSr-Ra-BatCoV RaTG13 and the hACE2-using cluster containing human/civet-SARSr-CoVs and Yunnan SARSr-BatCoVs previously successfully cultured in VeroE6 cells (,).
Figure 1

Geographic and phylogenetic comparisons of SARS-CoV-2 isolates with closely related viruses. A) Locations in China where SARS-CoV-2 first emerged (Wuhan), and where closely related viruses were found, including SARSr-Ra-BatCoV RaTG13 (Pu’er), Pangolin-SARSr-CoVs (Guangzhou and Nanning), and SARSr-Rp-BatCoV ZC45 (Zhoushan). Time of sampling and percentage genome identities to SARS-CoV-2 are shown. *Guangzhou and Nanning. The geographic origin of smuggled pangolins remains unknown. B, C) Phylogenetic analyses of RdRp (B) and RBD (C) domains of SARSr-CoVs. Trees were constructed by using maximum-likelihood methods with Jones-Taylor-Thornton plus gamma plus invariant sites (RdRp) and Whelan and Goldman plus gamma (RBD) substitution models. A total of 745 aa residues for RdRp and 177 aa residues for RBD were included in the analyses. Numbers at nodes represent bootstrap values, which were calculated from 1,000 trees. Only bootstrap values >70% are shown. Purple indicates SARS-CoV-2 (strain HK20 in bold); teal indicates SARSr-Ra-BatCoV RaTG13; pink indicates pangolin SARSr-CoVs; green indicates SARSr-Rp-BatCoVs ZXC21 and ZC45; red indicates SARSr-Rs-BatCoV Rs3367; black indicates SARSr-Rs-BatCoV Longquan-140; gray indicates remaining SARSr-BatCoVs. Dots indicate SARSr-BatCoVs reported to use angiotensin-converting enzyme 2 as receptor. Scale bars indicate estimated number of amino acid substitutions per 200 aa residues for RdRp and per 20 aa residues for RBD. SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus; NA, not available; RBD, receptor-binding domain; RdRp, RNA-dependent RNA polymerase.

Geographic and phylogenetic comparisons of SARS-CoV-2 isolates with closely related viruses. A) Locations in China where SARS-CoV-2 first emerged (Wuhan), and where closely related viruses were found, including SARSr-Ra-BatCoV RaTG13 (Pu’er), Pangolin-SARSr-CoVs (Guangzhou and Nanning), and SARSr-Rp-BatCoV ZC45 (Zhoushan). Time of sampling and percentage genome identities to SARS-CoV-2 are shown. *Guangzhou and Nanning. The geographic origin of smuggled pangolins remains unknown. B, C) Phylogenetic analyses of RdRp (B) and RBD (C) domains of SARSr-CoVs. Trees were constructed by using maximum-likelihood methods with Jones-Taylor-Thornton plus gamma plus invariant sites (RdRp) and Whelan and Goldman plus gamma (RBD) substitution models. A total of 745 aa residues for RdRp and 177 aa residues for RBD were included in the analyses. Numbers at nodes represent bootstrap values, which were calculated from 1,000 trees. Only bootstrap values >70% are shown. Purple indicates SARS-CoV-2 (strain HK20 in bold); teal indicates SARSr-Ra-BatCoV RaTG13; pink indicates pangolin SARSr-CoVs; green indicates SARSr-Rp-BatCoVs ZXC21 and ZC45; red indicates SARSr-Rs-BatCoV Rs3367; black indicates SARSr-Rs-BatCoV Longquan-140; gray indicates remaining SARSr-BatCoVs. Dots indicate SARSr-BatCoVs reported to use angiotensin-converting enzyme 2 as receptor. Scale bars indicate estimated number of amino acid substitutions per 200 aa residues for RdRp and per 20 aa residues for RBD. SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; SARSr-CoV, severe acute respiratory syndrome–related coronavirus; NA, not available; RBD, receptor-binding domain; RdRp, RNA-dependent RNA polymerase. To identify putative recombination events, we performed sliding window analysis using SARS-CoV-2-HK20 as query and SARSr-Ra-BatCoV RaTG13, pangolin-SARSr-CoV/P4L/Guangxi/2017, SARSr-Rp-BatCoV ZC45, SARSr-Rs-BatCoV Rs3367, and SARSr-Rs-BatCoV Longquan-140 as potential parents (Figure 2; Appendix Figures 2). A similarity plot showed that SARS-CoV-2 is most closely related to SARSr-Ra-BatCoV RaTG13 in the entire genome, except for its RBD, which is closest to pangolin-SARSr-CoV/MP789/Guangdong, and shows potential recombination breakpoints. Moreover, different regions of SARS-CoV-2 genome showed different similarities to pangolin-SARSr-CoV/P4L/Guangxi/2017, SARSr-Rp-BatCoV ZC45, SARSr-Rs-BatCoV Rs3367, and SARSr-Rs-BatCoV Longquan-140, as supported by phylogenetic analysis (Appendix Figures 2, 3).
Figure 2

Bootscan analysis and nucleotide sequence alignment for SARS-CoV-2 isolates and closely related viruses. A) Boot scan analysis using the partial spike gene (positions 22397–23167) of SARS-CoV-2 strain HK20 as query sequence. Bootscanning was conducted with Simplot version 3.5.1 (https://sray.med.som) (F84 model; window size, 100 bp; step, 10 bp) on nucleotide alignment, generated with ClustalX (http://www.clustal.org). B) Multiple alignment of nucleotide sequences from genome positions 22300 to 23700. Yellow indicates receptor binding domain; orange indicates receptor binding motif; pink indicates bases conserved between SARS-CoV-2 HK20 and Pangolin-SARSr-CoV/MP789/Guangdong/2019; and blue indicates bases conserved between SARS-CoV-2 HK20 and SARSr-Ra-BatCoVs RaTG13.

Bootscan analysis and nucleotide sequence alignment for SARS-CoV-2 isolates and closely related viruses. A) Boot scan analysis using the partial spike gene (positions 22397–23167) of SARS-CoV-2 strain HK20 as query sequence. Bootscanning was conducted with Simplot version 3.5.1 (https://sray.med.som) (F84 model; window size, 100 bp; step, 10 bp) on nucleotide alignment, generated with ClustalX (http://www.clustal.org). B) Multiple alignment of nucleotide sequences from genome positions 22300 to 23700. Yellow indicates receptor binding domain; orange indicates receptor binding motif; pink indicates bases conserved between SARS-CoV-2 HK20 and Pangolin-SARSr-CoV/MP789/Guangdong/2019; and blue indicates bases conserved between SARS-CoV-2 HK20 and SARSr-Ra-BatCoVs RaTG13. Sequence alignment around the RBD supported potential recombination between SARSr-Ra-BatCoV RaTG13 and pangolin-SARSr-CoV/MP789/Guangdong/2019 and the receptor-binding motif region showing exceptionally high sequence similarity to that of pangolin-SARSr-CoV/MP789/Guangdong/2019. This finding suggested that SARS-CoV-2 might be a recombinant virus between viruses closely related to SARSr-Ra-BatCoV RaTG13 and pangolin-SARSr-CoV/MP789/Guangdong/2019.

Conclusions

Despite the close relatedness of SARS-CoV-2 to bat and pangolin viruses, none of the existing SARSr-CoVs represents its immediate ancestor. Most of the genome region of SARS-CoV-2 is closest to SARSr-Ra-BatCoV-RaTG13 from an intermediate horseshoe bat in Yunnan, whereas its RBD is closest to that of pangolin-SARSr-CoV/MP789/Guangdong/2019 from smuggled pangolins in Guangzhou. Potential recombination sites were identified around the RBD region, suggesting that SARS-CoV-2 might be a recombinant virus, with its genome backbone evolved from Yunnan bat virus–like SARSr-CoVs and its RBD region acquired from pangolin virus–like SARSr-CoVs. Because bats are the major reservoir of SARSr-CoVs and the pangolins harboring SARSr-CoVs were captured from the smuggling center, it is possible that pangolin SARSr-CoVs originated from bat viruses as a result of animal mixing, and there might be an unidentified bat virus containing an RBD nearly identical to that of SARS-CoV-2 and pangolin SARSr-CoV. Similar to SARS-CoV, SARS-CoV-2 is most likely a recombinant virus originated from bats. The ability of SARS-CoV-2 to emerge and infect humans is likely explained by its hACE2-using RBD region, which is genetically similar to that of culturable Yunnan SARSr-BatCoVs and human/civet-SARSr-CoVs. Most SARSr-BatCoVs have not been successfully cultured in vitro, except for some Yunnan strains that had human/civet SARS-like RBDs and were shown to use hACE2 (,). For example, SARSr-Rp-BatCoV ZC45, which has an RBD that is more divergent from that of human/civet-SARSr-CoVs, did not propagate in VeroE6 cells (). Factors that determine hACE2 use among SARSr-CoVs remain to be elucidated. Although the Wuhan market was initially suspected to be the epicenter of the epidemic, the immediate source remains elusive. The close relatedness among SARS-CoV-2 strains suggested that the Wuhan outbreak probably originated from a point source with subsequent human-to-human transmission, in contrast to the polyphyletic origin of Middle East respiratory syndrome coronavirus (). If the Wuhan market was the source, a possibility is that bats carrying the parental SARSr-BatCoVs were mixed in the market, enabling virus recombination. However, no animal samples from the market were reported to be positive. Moreover, the first identified case-patient and other early case-patients had not visited the market (), suggesting the possibility of an alternative source. Because the RBD is considered a hot spot for construction of recombinant CoVs for receptor and viral replication studies, the evolutionarily distinct SARS-CoV-2 RBD and the unique insertion of S1/S2 cleavage site among Sarbecovirus species have raised the suspicion of an artificial recombinant virus. However, there is currently no evidence showing that SARS-CoV-2 is an artificial recombinant, which theoretically might not carry signature sequences. Further surveillance studies in bats are needed to identify the possible source and evolutionary path of SARS-CoV-2.

Appendix

Additional information on possible bat origin of severe acute respiratory syndrome coronavirus virus 2.
  14 in total

1.  Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events.

Authors:  Susanna K P Lau; Kenneth S M Li; Yi Huang; Chung-Tong Shek; Herman Tse; Ming Wang; Garnet K Y Choi; Huifang Xu; Carol S F Lam; Rongtong Guo; Kwok-Hung Chan; Bo-Jian Zheng; Patrick C Y Woo; Kwok-Yung Yuen
Journal:  J Virol       Date:  2010-01-13       Impact factor: 5.103

2.  Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2.

Authors:  Wenhui Li; Chengsheng Zhang; Jianhua Sui; Jens H Kuhn; Michael J Moore; Shiwen Luo; Swee-Kee Wong; I-Chueh Huang; Keming Xu; Natalya Vasilieva; Akikazu Murakami; Yaqing He; Wayne A Marasco; Yi Guan; Hyeryun Choe; Michael Farzan
Journal:  EMBO J       Date:  2005-03-24       Impact factor: 11.598

3.  Polyphyletic origin of MERS coronaviruses and isolation of a novel clade A strain from dromedary camels in the United Arab Emirates.

Authors:  Susanna K P Lau; Renate Wernery; Emily Y M Wong; Sunitha Joseph; Alan K L Tsang; Nissy Annie Georgy Patteril; Shyna K Elizabeth; Kwok-Hung Chan; Rubeena Muhammed; Jöerg Kinne; Kwok-Yung Yuen; Ulrich Wernery; Patrick C Y Woo
Journal:  Emerg Microbes Infect       Date:  2016-12-21       Impact factor: 7.163

4.  Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor.

Authors:  Xing-Yi Ge; Jia-Lu Li; Xing-Lou Yang; Aleksei A Chmura; Guangjian Zhu; Jonathan H Epstein; Jonna K Mazet; Ben Hu; Wei Zhang; Cheng Peng; Yu-Ji Zhang; Chu-Ming Luo; Bing Tan; Ning Wang; Yan Zhu; Gary Crameri; Shu-Yi Zhang; Lin-Fa Wang; Peter Daszak; Zheng-Li Shi
Journal:  Nature       Date:  2013-10-30       Impact factor: 49.962

5.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

Review 6.  Molecular epidemiology, evolution and phylogeny of SARS coronavirus.

Authors:  Hayes K H Luk; Xin Li; Joshua Fung; Susanna K P Lau; Patrick C Y Woo
Journal:  Infect Genet Evol       Date:  2019-03-04       Impact factor: 3.342

7.  The species Severe acute respiratory syndrome-related coronavirus: classifying 2019-nCoV and naming it SARS-CoV-2.

Authors: 
Journal:  Nat Microbiol       Date:  2020-03-02       Impact factor: 17.745

8.  Discovery of a rich gene pool of bat SARS-related coronaviruses provides new insights into the origin of SARS coronavirus.

Authors:  Ben Hu; Lei-Ping Zeng; Xing-Lou Yang; Xing-Yi Ge; Wei Zhang; Bei Li; Jia-Zheng Xie; Xu-Rui Shen; Yun-Zhi Zhang; Ning Wang; Dong-Sheng Luo; Xiao-Shuang Zheng; Mei-Niang Wang; Peter Daszak; Lin-Fa Wang; Jie Cui; Zheng-Li Shi
Journal:  PLoS Pathog       Date:  2017-11-30       Impact factor: 6.823

9.  Genomic characterization and infectivity of a novel SARS-like coronavirus in Chinese bats.

Authors:  Dan Hu; Changqiang Zhu; Lele Ai; Ting He; Yi Wang; Fuqiang Ye; Lu Yang; Chenxi Ding; Xuhui Zhu; Ruicheng Lv; Jin Zhu; Bachar Hassan; Youjun Feng; Weilong Tan; Changjun Wang
Journal:  Emerg Microbes Infect       Date:  2018-09-12       Impact factor: 7.163

10.  A 193-amino acid fragment of the SARS coronavirus S protein efficiently binds angiotensin-converting enzyme 2.

Authors:  Swee Kee Wong; Wenhui Li; Michael J Moore; Hyeryun Choe; Michael Farzan
Journal:  J Biol Chem       Date:  2003-12-11       Impact factor: 5.157

View more
  90 in total

1.  Modelling scenarios of the epidemic of COVID-19 in Canada.

Authors:  Nick H Ogden; Aamir Fazil; Julien Arino; Philippe Berthiaume; David N Fisman; Amy L Greer; Antoinette Ludwig; Victoria Ng; Ashleigh R Tuite; Patricia Turgeon; Lisa A Waddell; Jianhong Wu
Journal:  Can Commun Dis Rep       Date:  2020-06-04

Review 2.  Challenges in Veterinary Vaccine Development.

Authors:  Sunil Thomas; Ann Abraham; Alina Rodríguez-Mallon; Sasimanas Unajak; John P Bannantine
Journal:  Methods Mol Biol       Date:  2022

Review 3.  Overview of Coronaviruses in Veterinary Medicine.

Authors:  Susan R Compton
Journal:  Comp Med       Date:  2021-08-19       Impact factor: 0.982

4.  Random-Forest-Bagging Broad Learning System With Applications for COVID-19 Pandemic.

Authors:  Choujun Zhan; Yufan Zheng; Haijun Zhang; Quansi Wen
Journal:  IEEE Internet Things J       Date:  2021-03-17       Impact factor: 10.238

5.  What We Need to Consider During and After the SARS-CoV-2 Pandemic.

Authors:  Willy A Valdivia-Granda; Jürgen A Richt
Journal:  Vector Borne Zoonotic Dis       Date:  2020-05-29       Impact factor: 2.133

6.  Virome of Bat Guano from Nine Northern California Roosts.

Authors:  Yanpeng Li; Eda Altan; Gabriel Reyes; Brian Halstead; Xutao Deng; Eric Delwart
Journal:  J Virol       Date:  2021-01-13       Impact factor: 5.103

7.  The PRRA insert at the S1/S2 site modulates cellular tropism of SARS-CoV-2 and ACE2 usage by the closely related Bat RaTG13.

Authors:  Shufeng Liu; Prabhuanand Selvaraj; Christopher Z Lien; Ivette A Nunez; Wells W Wu; Chao-Kai Chou; Tony T Wang
Journal:  J Virol       Date:  2021-03-08       Impact factor: 6.549

8.  Susceptibility of white-tailed deer (Odocoileus virginianus) to SARS-CoV-2.

Authors:  Mitchell V Palmer; Mathias Martins; Shollie Falkenberg; Alexandra Buckley; Leonardo C Caserta; Patrick K Mitchell; Eric D Cassmann; Alicia Rollins; Nancy C Zylich; Randall W Renshaw; Cassandra Guarino; Bettina Wagner; Kelly Lager; Diego G Diel
Journal:  J Virol       Date:  2021-03-10       Impact factor: 6.549

Review 9.  Original Hosts, Clinical Features, Transmission Routes, and Vaccine Development for Coronavirus Disease (COVID-19).

Authors:  Ting Wu; Shuntong Kang; Wenyao Peng; Chenzhe Zuo; Yuhao Zhu; Liangyu Pan; Keyun Fu; Yaxian You; Xinyuan Yang; Xuan Luo; Liping Jiang; Meichun Deng
Journal:  Front Med (Lausanne)       Date:  2021-07-06

10.  Investigating the Presence of SARS CoV-2 in Free-Living and Captive Animals.

Authors:  Lorena Jemeršić; Ivana Lojkić; Nina Krešić; Tomislav Keros; Tajana Amšel Zelenika; Luka Jurinović; Damir Skok; Ingeborg Bata; Jadranko Boras; Boris Habrun; Dragan Brnić
Journal:  Pathogens       Date:  2021-05-21
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.