| Literature DB >> 35419073 |
Abstract
Several investigations suggested origins of SARS-CoV-2 from the recombination of coronaviruses of various animals, including the bat Rhinolophus affinis and the pangolin Manis javanica, despite the processes describing the adaptation from a reservoir of animals to human are still debated. In this perspective, I will remark two main inconsistencies on the origins of SARS-CoV-2: polypeptide sequence alignment of the S-proteins does not return the expected identity of the receptor-binding motif among most of pangolin-CoVs and SARS-CoV-2; accurate referencing for samplings and sequencing deposition of the ancestral bat coronavirus named RaTG13 was missing since the first reports on the SARS-CoV-2 coronavirus. This contribution aims to stimulate discussion about the origins of SARS-CoV-2 and considers other intermediate hosts as a reservoir for coronavirus.Entities:
Keywords: RaTG13; S-protein; SARS-CoV-2; coronavirus; polypeptide sequence alignment; receptor-binding motif
Year: 2022 PMID: 35419073 PMCID: PMC8982992 DOI: 10.2217/fvl-2021-0233
Source DB: PubMed Journal: Future Virol ISSN: 1746-0794 Impact factor: 1.831
Figure 1.Polypeptide sequence alignment.
(A) Alignment of the whole dataset of the S-proteins of SARS-CoV viruses. Respective names, accession numbers and additional information are reported in Table 1. Black rectangles: insertions -1 to -4, as reported by Zhou et al. [1] and Xiao et al. [5]. Red rectangle: S1/S2 cleavage site. Sequences were aligned using Muscle software [6]. Alignment accuracy was checked manually base by base using BioEdit v7.2.5 [7]. Part of the alignment not shown in this figure is indicated with: […]. (B) Alignment of the receptor-binding motif of the S-proteins of SARS-CoV-2, RaTG13 and of pangolin-CoVs. Blue rectangle: receptor-binding motif; black rectangles: positions of key amino acid residues involved in the interaction with human ACE2; red asterisk: additional key amino acid residue reported in Andersen et al. [8]; green circles: deposited sequences of pangolin-CoVs presenting a full asset of conserved key amino acid residues within the receptor-binding motif of SARS-CoV-2.
If derived from the GISAID database (Table 1), S-protein sequences represented in this figure were predicted with orffinder (https://www.ncbi.nlm.nih.gov/orffinder/) using the deposited whole-genomes of their respective viruses as queries.
Information related with the polypeptide sequences of the S-proteins analyzed in this research.
| Viral strain name | Source | Accession | Submission date | DOI |
|---|---|---|---|---|
| MERS-related coronavirus | GenBank | ANF29184.1 | 30-04-2017 | 10.1093/infdis/jiw236 |
| Sars-CoV-2_WIV04 | GISAID | EPI_ISL_402124 | 11-01-2020 | 10.1038/s41586-020-2012-7 |
| Sars-CoV-2 isolate Wuhan-Hu-1 | GenBank | QHD43416.1 | 18-03-2020 | 10.1038/s41586-020-2012-7 |
| Pangolin-CoV | ViPR/GenBank | QIA48614.1 | 23-04-2020 | 10.1016/j.cub.2020.03.022 |
| Pangolin-CoV | ViPR/GenBank | QIQ54048.1 | 23-04-2020 | 10.1016/j.cub.2020.03.022 |
| Pangolin-CoV | ViPR/GenBank | QIA48632.1 | 23-04-2020 | 10.1016/j.cub.2020.03.022 |
| Pangolin-CoV | ViPR/GenBank | QIA48623.1 | 23-04-2020 | 10.1016/j.cub.2020.03.022 |
| Pangolin-CoV | ViPR/GenBank | QIA48641.1 | 23-04-2020 | 10.1016/j.cub.2020.03.022 |
| Pangolin-CoV | GenBank | MT121216 | 18-05-2020 | 10.1371/journal.ppat.1008421 |
| Pangolin-CoV | GenBank | MT084071.1 | 23-04-2020 | 10.1002/bies.202000240 |
| Pangolin-CoV | GISAID | EPI_ISL_410538 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Pangolin-CoV | GISAID | EPI_ISL_410539 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Pangolin-CoV | GISAID | EPI_ISL_410540 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Pangolin-CoV | GISAID | EPI_ISL_410541 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Pangolin-CoV | GISAID | EPI_ISL_410542 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Pangolin-CoV | GISAID | EPI_ISL_410543 | 17-02-2020 | 10.1038/s41586-020-2169-0 |
| Bat_RatG13 | GISAID | EPI_ISL_402131 | 24-03-2020 | 10.1038/s41586-020-2012-7 |
| Bat coronavirus-former RaTG13 | GenBank | QHR63300.1 | 24-01-2020 | 10.1038/s41586-020-2012-7 |
| SARS coronavirus BJ01 | GenBank | AAP30030.1 | 01-09-2009 | 10.1016/s1672-0229(03)01017-9 |
| Bat SARS-like RsSHC014 | GenBank | AGZ48806.1 | 22-11-2013 | 10.1038/nature12711 |
| Bat SARS-like WIV1 | GenBank | AGZ48828.1 | 22-11-2013 | 10.1038/nature12711 |
| Bat Sars_SL-CoV Rs3367 | GenBank | AGZ48818.1 | 22-11-2013 | 10.1038/nature12711 |
| Recombinant FJ211860.1 | GenBank | ACJ60703.1 | 31-12-2008 | 10.1073/pnas.0808116105 |
| Former Bat-CoV_ZC45_AVP78031.1 | GenBank | AVP78031.1 | 28-03-2018 | 10.1038/s41426-018-0155-5 |
| Bat-SL-CoVZC45 | GenBank | AVP78031.1 | 20-05-2020 | 10.1038/s41426-018-0155-5 |
| Former Bat-Cov_ZXC21_AVP78042.1 | GenBank | AVP78042.1 | 28-03-2018 | 10.1038/s41426-018-0155-5 |
| Bat SL-CoVZXC21 | GenBank | AVP78042.1 | 20-05-2020 | 10.1038/s41426-018-0155-5 |
| Recombinant FJ211859.1 | GenBank | ACJ60694.1 | 26-07-2016 | 10.1073/pnas.0808116105 |
| BtRs-BetaCoV/YN2018A | GenBank | QDF43820.1 | 30-06-2019 | 10.3389/fmicb.2019.01900 |
| Bat SARS-like coronavirus | GenBank | ATO98181.1 | 18-12-2017 | 10.1371/journal.ppat.1006698 |
| Bat SL-coronavirus Rs3262-2 | GenBank | AGZ48783.1 | 22-11-2013 | 10.1038/nature12711 |
| Bat SARS-like Rs4087-2 | GenBank | AGZ48802.1 | 22-11-2013 | 10.1038/nature12711 |
Deposited sequences of pangolin-CoVs reporting a full asset of conserved key amino acid residues within the receptor-binding motif of the S-protein of SARS-CoV-2.
Supplementary data: Partial alignment among genomes of the coronaviruses RaTG13 and SARS-CoV-2 (accession numbers are available in Table 1) and the nucleotide sequence of BtCoV/4991 (Genbank KP876546.1) [9]. The polypeptide sequence is reported under the nucleotide sequences. Alignment of the nucleotide sequences is shown until the end of motif C [10] (red). Cyan, polymorphisms between BtCoV/4991 and SARS-CoV-2; bold: codon responsible of the amino acid substitution H-to-Y occurring between RaTG13 and SARS-CoV-2 (in cyan); green: interacting residues of the palm domain according with Gao et al. [10].