Literature DB >> 32284615

The proximal origin of SARS-CoV-2.

Kristian G Andersen1,2, Andrew Rambaut3, W Ian Lipkin4, Edward C Holmes5, Robert F Garry6,7.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32284615      PMCID: PMC7095063          DOI: 10.1038/s41591-020-0820-9

Source DB:  PubMed          Journal:  Nat Med        ISSN: 1078-8956            Impact factor:   87.241


× No keyword cloud information.
To the Editor — Since the first reports of novel pneumonia (COVID-19) in Wuhan, Hubei province, China[1,2], there has been considerable discussion on the origin of the causative virus, SARS-CoV-2[3] (also referred to as HCoV-19)[4]. Infections with SARS-CoV-2 are now widespread, and as of 11 March 2020, 121,564 cases have been confirmed in more than 110 countries, with 4,373 deaths[5]. SARS-CoV-2 is the seventh coronavirus known to infect humans; SARS-CoV, MERS-CoV and SARS-CoV-2 can cause severe disease, whereas HKU1, NL63, OC43 and 229E are associated with mild symptoms[6]. Here we review what can be deduced about the origin of SARS-CoV-2 from comparative analysis of genomic data. We offer a perspective on the notable features of the SARS-CoV-2 genome and discuss scenarios by which they could have arisen. Our analyses clearly show that SARS-CoV-2 is not a laboratory construct or a purposefully manipulated virus.

Notable features of the SARS-CoV-2 genome

Our comparison of alpha- and betacoronaviruses identifies two notable genomic features of SARS-CoV-2: (i) on the basis of structural studies[7-9] and biochemical experiments[1,9,10], SARS-CoV-2 appears to be optimized for binding to the human receptor ACE2; and (ii) the spike protein of SARS-CoV-2 has a functional polybasic (furin) cleavage site at the S1–S2 boundary through the insertion of 12 nucleotides[8], which additionally led to the predicted acquisition of three O-linked glycans around the site.

1. Mutations in the receptor-binding domain of SARS-CoV-2

The receptor-binding domain (RBD) in the spike protein is the most variable part of the coronavirus genome[1,2]. Six RBD amino acids have been shown to be critical for binding to ACE2 receptors and for determining the host range of SARS-CoV-like viruses[7]. With coordinates based on SARS-CoV, they are Y442, L472, N479, D480, T487 and Y4911, which correspond to L455, F486, Q493, S494, N501 and Y505 in SARS-CoV-2[7]. Five of these six residues differ between SARS-CoV-2 and SARS-CoV (Fig. 1a). On the basis of structural studies[7-9] and biochemical experiments[1,9,10], SARS-CoV-2 seems to have an RBD that binds with high affinity to ACE2 from humans, ferrets, cats and other species with high receptor homology[7].
Fig. 1

Features of the spike protein in human SARS-CoV-2 and related coronaviruses.

a, Mutations in contact residues of the SARS-CoV-2 spike protein. The spike protein of SARS-CoV-2 (red bar at top) was aligned against the most closely related SARS-CoV-like coronaviruses and SARS-CoV itself. Key residues in the spike protein that make contact to the ACE2 receptor are marked with blue boxes in both SARS-CoV-2 and related viruses, including SARS-CoV (Urbani strain). b, Acquisition of polybasic cleavage site and O-linked glycans. Both the polybasic cleavage site and the three adjacent predicted O-linked glycans are unique to SARS-CoV-2 and were not previously seen in lineage B betacoronaviruses. Sequences shown are from NCBI GenBank, accession codes MN908947, MN996532, AY278741, KY417146 and MK211376. The pangolin coronavirus sequences are a consensus generated from SRR10168377 and SRR10168378 (NCBI BioProject PRJNA573298)[29,30].

Features of the spike protein in human SARS-CoV-2 and related coronaviruses.

a, Mutations in contact residues of the SARS-CoV-2 spike protein. The spike protein of SARS-CoV-2 (red bar at top) was aligned against the most closely related SARS-CoV-like coronaviruses and SARS-CoV itself. Key residues in the spike protein that make contact to the ACE2 receptor are marked with blue boxes in both SARS-CoV-2 and related viruses, including SARS-CoV (Urbani strain). b, Acquisition of polybasic cleavage site and O-linked glycans. Both the polybasic cleavage site and the three adjacent predicted O-linked glycans are unique to SARS-CoV-2 and were not previously seen in lineage B betacoronaviruses. Sequences shown are from NCBI GenBank, accession codes MN908947, MN996532, AY278741, KY417146 and MK211376. The pangolin coronavirus sequences are a consensus generated from SRR10168377 and SRR10168378 (NCBI BioProject PRJNA573298)[29,30]. While the analyses above suggest that SARS-CoV-2 may bind human ACE2 with high affinity, computational analyses predict that the interaction is not ideal[7] and that the RBD sequence is different from those shown in SARS-CoV to be optimal for receptor binding[7,11]. Thus, the high-affinity binding of the SARS-CoV-2 spike protein to human ACE2 is most likely the result of natural selection on a human or human-like ACE2 that permits another optimal binding solution to arise. This is strong evidence that SARS-CoV-2 is not the product of purposeful manipulation.

2. Polybasic furin cleavage site and O-linked glycans

The second notable feature of SARS-CoV-2 is a polybasic cleavage site (RRAR) at the junction of S1 and S2, the two subunits of the spike[8] (Fig. 1b). This allows effective cleavage by furin and other proteases and has a role in determining viral infectivity and host range[12]. In addition, a leading proline is also inserted at this site in SARS-CoV-2; thus, the inserted sequence is PRRA (Fig. 1b). The turn created by the proline is predicted to result in the addition of O-linked glycans to S673, T678 and S686, which flank the cleavage site and are unique to SARS-CoV-2 (Fig. 1b). Polybasic cleavage sites have not been observed in related ‘lineage B’ betacoronaviruses, although other human betacoronaviruses, including HKU1 (lineage A), have those sites and predicted O-linked glycans[13]. Given the level of genetic variation in the spike, it is likely that SARS-CoV-2-like viruses with partial or full polybasic cleavage sites will be discovered in other species. The functional consequence of the polybasic cleavage site in SARS-CoV-2 is unknown, and it will be important to determine its impact on transmissibility and pathogenesis in animal models. Experiments with SARS-CoV have shown that insertion of a furin cleavage site at the S1–S2 junction enhances cell–cell fusion without affecting viral entry[14]. In addition, efficient cleavage of the MERS-CoV spike enables MERS-like coronaviruses from bats to infect human cells[15]. In avian influenza viruses, rapid replication and transmission in highly dense chicken populations selects for the acquisition of polybasic cleavage sites in the hemagglutinin (HA) protein[16], which serves a function similar to that of the coronavirus spike protein. Acquisition of polybasic cleavage sites in HA, by insertion or recombination, converts low-pathogenicity avian influenza viruses into highly pathogenic forms[16]. The acquisition of polybasic cleavage sites by HA has also been observed after repeated passage in cell culture or through animals[17]. The function of the predicted O-linked glycans is unclear, but they could create a ‘mucin-like domain’ that shields epitopes or key residues on the SARS-CoV-2 spike protein[18]. Several viruses utilize mucin-like domains as glycan shields involved immunoevasion[18]. Although prediction of O-linked glycosylation is robust, experimental studies are needed to determine if these sites are used in SARS-CoV-2.

Theories of SARS-CoV-2 origins

It is improbable that SARS-CoV-2 emerged through laboratory manipulation of a related SARS-CoV-like coronavirus. As noted above, the RBD of SARS-CoV-2 is optimized for binding to human ACE2 with an efficient solution different from those previously predicted[7,11]. Furthermore, if genetic manipulation had been performed, one of the several reverse-genetic systems available for betacoronaviruses would probably have been used[19]. However, the genetic data irrefutably show that SARS-CoV-2 is not derived from any previously used virus backbone[20]. Instead, we propose two scenarios that can plausibly explain the origin of SARS-CoV-2: (i) natural selection in an animal host before zoonotic transfer; and (ii) natural selection in humans following zoonotic transfer. We also discuss whether selection during passage could have given rise to SARS-CoV-2.

1. Natural selection in an animal host before zoonotic transfer

As many early cases of COVID-19 were linked to the Huanan market in Wuhan[1,2], it is possible that an animal source was present at this location. Given the similarity of SARS-CoV-2 to bat SARS-CoV-like coronaviruses[2], it is likely that bats serve as reservoir hosts for its progenitor. Although RaTG13, sampled from a Rhinolophus affinis bat[1], is ~96% identical overall to SARS-CoV-2, its spike diverges in the RBD, which suggests that it may not bind efficiently to human ACE2[7] (Fig. 1a). Malayan pangolins (Manis javanica) illegally imported into Guangdong province contain coronaviruses similar to SARS-CoV-2[21]. Although the RaTG13 bat virus remains the closest to SARS-CoV-2 across the genome[1], some pangolin coronaviruses exhibit strong similarity to SARS-CoV-2 in the RBD, including all six key RBD residues[21] (Fig. 1). This clearly shows that the SARS-CoV-2 spike protein optimized for binding to human-like ACE2 is the result of natural selection. Neither the bat betacoronaviruses nor the pangolin betacoronaviruses sampled thus far have polybasic cleavage sites. Although no animal coronavirus has been identified that is sufficiently similar to have served as the direct progenitor of SARS-CoV-2, the diversity of coronaviruses in bats and other species is massively undersampled. Mutations, insertions and deletions can occur near the S1–S2 junction of coronaviruses[22], which shows that the polybasic cleavage site can arise by a natural evolutionary process. For a precursor virus to acquire both the polybasic cleavage site and mutations in the spike protein suitable for binding to human ACE2, an animal host would probably have to have a high population density (to allow natural selection to proceed efficiently) and an ACE2-encoding gene that is similar to the human ortholog.

2. Natural selection in humans following zoonotic transfer

It is possible that a progenitor of SARS-CoV-2 jumped into humans, acquiring the genomic features described above through adaptation during undetected human-to-human transmission. Once acquired, these adaptations would enable the pandemic to take off and produce a sufficiently large cluster of cases to trigger the surveillance system that detected it[1,2]. All SARS-CoV-2 genomes sequenced so far have the genomic features described above and are thus derived from a common ancestor that had them too. The presence in pangolins of an RBD very similar to that of SARS-CoV-2 means that we can infer this was also probably in the virus that jumped to humans. This leaves the insertion of polybasic cleavage site to occur during human-to-human transmission. Estimates of the timing of the most recent common ancestor of SARS-CoV-2 made with current sequence data point to emergence of the virus in late November 2019 to early December 2019[23], compatible with the earliest retrospectively confirmed cases[24]. Hence, this scenario presumes a period of unrecognized transmission in humans between the initial zoonotic event and the acquisition of the polybasic cleavage site. Sufficient opportunity could have arisen if there had been many prior zoonotic events that produced short chains of human-to-human transmission over an extended period. This is essentially the situation for MERS-CoV, for which all human cases are the result of repeated jumps of the virus from dromedary camels, producing single infections or short transmission chains that eventually resolve, with no adaptation to sustained transmission[25]. Studies of banked human samples could provide information on whether such cryptic spread has occurred. Retrospective serological studies could also be informative, and a few such studies have been conducted showing low-level exposures to SARS-CoV-like coronaviruses in certain areas of China[26]. Critically, however, these studies could not have distinguished whether exposures were due to prior infections with SARS-CoV, SARS-CoV-2 or other SARS-CoV-like coronaviruses. Further serological studies should be conducted to determine the extent of prior human exposure to SARS-CoV-2.

3. Selection during passage

Basic research involving passage of bat SARS-CoV-like coronaviruses in cell culture and/or animal models has been ongoing for many years in biosafety level 2 laboratories across the world[27], and there are documented instances of laboratory escapes of SARS-CoV[28]. We must therefore examine the possibility of an inadvertent laboratory release of SARS-CoV-2. In theory, it is possible that SARS-CoV-2 acquired RBD mutations (Fig. 1a) during adaptation to passage in cell culture, as has been observed in studies of SARS-CoV[11]. The finding of SARS-CoV-like coronaviruses from pangolins with nearly identical RBDs, however, provides a much stronger and more parsimonious explanation of how SARS-CoV-2 acquired these via recombination or mutation[19]. The acquisition of both the polybasic cleavage site and predicted O-linked glycans also argues against culture-based scenarios. New polybasic cleavage sites have been observed only after prolonged passage of low-pathogenicity avian influenza virus in vitro or in vivo[17]. Furthermore, a hypothetical generation of SARS-CoV-2 by cell culture or animal passage would have required prior isolation of a progenitor virus with very high genetic similarity, which has not been described. Subsequent generation of a polybasic cleavage site would have then required repeated passage in cell culture or animals with ACE2 receptors similar to those of humans, but such work has also not previously been described. Finally, the generation of the predicted O-linked glycans is also unlikely to have occurred due to cell-culture passage, as such features suggest the involvement of an immune system[18].

Conclusions

In the midst of the global COVID-19 public-health emergency, it is reasonable to wonder why the origins of the pandemic matter. Detailed understanding of how an animal virus jumped species boundaries to infect humans so productively will help in the prevention of future zoonotic events. For example, if SARS-CoV-2 pre-adapted in another animal species, then there is the risk of future re-emergence events. In contrast, if the adaptive process occurred in humans, then even if repeated zoonotic transfers occur, they are unlikely to take off without the same series of mutations. In addition, identifying the closest viral relatives of SARS-CoV-2 circulating in animals will greatly assist studies of viral function. Indeed, the availability of the RaTG13 bat sequence helped reveal key RBD mutations and the polybasic cleavage site. The genomic features described here may explain in part the infectiousness and transmissibility of SARS-CoV-2 in humans. Although the evidence shows that SARS-CoV-2 is not a purposefully manipulated virus, it is currently impossible to prove or disprove the other theories of its origin described here. However, since we observed all notable SARS-CoV-2 features, including the optimized RBD and polybasic cleavage site, in related coronaviruses in nature, we do not believe that any type of laboratory-based scenario is plausible. More scientific data could swing the balance of evidence to favor one hypothesis over another. Obtaining related viral sequences from animal sources would be the most definitive way of revealing viral origins. For example, a future observation of an intermediate or fully formed polybasic cleavage site in a SARS-CoV-2-like virus from animals would lend even further support to the natural-selection hypotheses. It would also be helpful to obtain more genetic and functional data about SARS-CoV-2, including animal studies. The identification of a potential intermediate host of SARS-CoV-2, as well as sequencing of the virus from very early cases, would similarly be highly informative. Irrespective of the exact mechanisms by which SARS-CoV-2 originated via natural selection, the ongoing surveillance of pneumonia in humans and other animals is clearly of utmost importance.
  18 in total

1.  Laboratory-acquired severe acute respiratory syndrome.

Authors:  Poh Lian Lim; Asok Kurup; Gowri Gopalakrishna; Kwai Peng Chan; Christopher W Wong; Lee Ching Ng; Su Yun Se-Thoe; Lynette Oon; Xinlai Bai; Lawrence W Stanton; Yijun Ruan; Lance D Miller; Vinsensius B Vega; Lyn James; Peng Lim Ooi; Chew Suok Kai; Sonja J Olsen; Brenda Ang; Yee-Sin Leo
Journal:  N Engl J Med       Date:  2004-04-22       Impact factor: 91.245

2.  Spike protein, S, of human coronavirus HKU1: role in viral life cycle and application in antibody detection.

Authors:  Che-Man Chan; Patrick C Y Woo; Susanna K P Lau; Herman Tse; Hong-Lin Chen; Feng Li; Bo-Jian Zheng; Ling Chen; Jian-Dong Huang; Kwok-Yung Yuen
Journal:  Exp Biol Med (Maywood)       Date:  2008-10-10

3.  Proteolytic activation of the spike protein at a novel RRRR/S motif is implicated in furin-dependent entry, syncytium formation, and infectivity of coronavirus infectious bronchitis virus in cultured cells.

Authors:  Yoshiyuki Yamada; Ding Xiang Liu
Journal:  J Virol       Date:  2009-06-24       Impact factor: 5.103

4.  Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor.

Authors:  Xing-Yi Ge; Jia-Lu Li; Xing-Lou Yang; Aleksei A Chmura; Guangjian Zhu; Jonathan H Epstein; Jonna K Mazet; Ben Hu; Wei Zhang; Cheng Peng; Yu-Ji Zhang; Chu-Ming Luo; Bing Tan; Ning Wang; Yan Zhu; Gary Crameri; Shu-Yi Zhang; Lin-Fa Wang; Peter Daszak; Zheng-Li Shi
Journal:  Nature       Date:  2013-10-30       Impact factor: 49.962

5.  Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus.

Authors:  Yushun Wan; Jian Shang; Rachel Graham; Ralph S Baric; Fang Li
Journal:  J Virol       Date:  2020-03-17       Impact factor: 5.103

Review 6.  Coronavirus reverse genetic systems: infectious clones and replicons.

Authors:  Fernando Almazán; Isabel Sola; Sonia Zuñiga; Silvia Marquez-Jurado; Lucia Morales; Martina Becares; Luis Enjuanes
Journal:  Virus Res       Date:  2014-06-12       Impact factor: 3.303

7.  Furin cleavage of the SARS coronavirus spike glycoprotein enhances cell-cell fusion but does not affect virion entry.

Authors:  Kathryn E Follis; Joanne York; Jack H Nunberg
Journal:  Virology       Date:  2006-03-07       Impact factor: 3.616

8.  Viral Metagenomics Revealed Sendai Virus and Coronavirus Infection of Malayan Pangolins (Manis javanica).

Authors:  Ping Liu; Wu Chen; Jin-Ping Chen
Journal:  Viruses       Date:  2019-10-24       Impact factor: 5.048

9.  An interactive web-based dashboard to track COVID-19 in real time.

Authors:  Ensheng Dong; Hongru Du; Lauren Gardner
Journal:  Lancet Infect Dis       Date:  2020-02-19       Impact factor: 25.071

10.  Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses.

Authors:  Michael Letko; Andrea Marzi; Vincent Munster
Journal:  Nat Microbiol       Date:  2020-02-24       Impact factor: 17.745

View more
  1413 in total

1.  COVID-19 infection alters kynurenine and fatty acid metabolism, correlating with IL-6 levels and renal status.

Authors:  Tiffany Thomas; Davide Stefanoni; Julie A Reisz; Travis Nemkov; Lorenzo Bertolone; Richard O Francis; Krystalyn E Hudson; James C Zimring; Kirk C Hansen; Eldad A Hod; Steven L Spitalnik; Angelo D'Alessandro
Journal:  JCI Insight       Date:  2020-07-23

2.  The Heterogeneous Landscape and Early Evolution of Pathogen-Associated CpG Dinucleotides in SARS-CoV-2.

Authors:  Andrea Di Gioacchino; Petr Šulc; Anastassia V Komarova; Benjamin D Greenbaum; Rémi Monasson; Simona Cocco
Journal:  SSRN       Date:  2020-05-27

Review 3.  Medication use during COVID-19: Review of recent evidence.

Authors:  T S Brandon Ng; Kori Leblanc; Darwin F Yeung; Teresa S M Tsang
Journal:  Can Fam Physician       Date:  2021-03       Impact factor: 3.275

4.  Systematic Review of Important Viral Diseases in Africa in Light of the 'One Health' Concept.

Authors:  Ravendra P Chauhan; Zelalem G Dessie; Ayman Noreddin; Mohamed E El Zowalaty
Journal:  Pathogens       Date:  2020-04-20

Review 5.  Emergency surgery during the COVID-19 pandemic: what you need to know for practice.

Authors:  B De Simone; E Chouillard; S Di Saverio; L Pagani; M Sartelli; W L Biffl; F Coccolini; A Pieri; M Khan; G Borzellino; F C Campanile; L Ansaloni; F Catena
Journal:  Ann R Coll Surg Engl       Date:  2020-04-30       Impact factor: 1.891

6.  WHO report into COVID pandemic origins zeroes in on animal markets, not labs.

Authors:  Amy Maxmen
Journal:  Nature       Date:  2021-04       Impact factor: 49.962

7.  The COVID lab-leak hypothesis: what scientists do and don't know.

Authors:  Amy Maxmen; Smriti Mallapaty
Journal:  Nature       Date:  2021-06-08       Impact factor: 49.962

8.  How a torrent of COVID science changed research publishing - in seven charts.

Authors:  Holly Else
Journal:  Nature       Date:  2020-12       Impact factor: 49.962

9.  Using circular economy principles to recycle materials in guiding the design of a wet scrubber-reactor for indoor air disinfection from coronavirus and other pathogens.

Authors:  Andrei Shishkin; Gaurav Goel; Janis Baronins; Jurijs Ozolins; Clare Hoskins; Saurav Goel
Journal:  Environ Technol Innov       Date:  2021-02-12

10.  What We Need to Consider During and After the SARS-CoV-2 Pandemic.

Authors:  Willy A Valdivia-Granda; Jürgen A Richt
Journal:  Vector Borne Zoonotic Dis       Date:  2020-05-29       Impact factor: 2.133

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.