| Literature DB >> 34046923 |
Yuri Deigin1, Rossana Segreto2.
Abstract
RaTG13, MP789, and RmYN02 are the strains closest to SARS-CoV-2, and their existence came to light only after the start of the pandemic. Their genomes have been used to support a natural origin of SARS-CoV-2 but after a close examination all of them exhibit several issues. We specifically address the presence in RmYN02 and closely related RacCSxxx strains of a claimed natural PAA/PVA amino acid insertion at the S1/S2 junction of their spike protein at the same position where the PRRA insertion in SARS-CoV-2 has created a polybasic furin cleavage site. We show that RmYN02/RacCSxxx instead of the claimed insertion carry a 6-nucleotide deletion in the region and that the 12-nucleotide insertion in SARS-CoV-2 remains unique among Sarbecoviruses. Also, our analysis of RaTG13 and RmYN02's metagenomic datasets found unexpected reads which could indicate possible contamination. Because of their importance to inferring SARS-CoV-2's origin, we call for a careful reevaluation of RaTG13, MP789 and RmYN02 sequencing records and assembly methods.Entities:
Keywords: Pangolin CoV MP789; RaTG13; RmYN02; SARS-CoV-2; furin cleavage site; origin; peer review
Mesh:
Substances:
Year: 2021 PMID: 34046923 PMCID: PMC8209872 DOI: 10.1002/bies.202100015
Source DB: PubMed Journal: Bioessays ISSN: 0265-9247 Impact factor: 4.653
FIGURE 1(A) Clustal W multiple sequence alignment of RmYN02 with the strains used in Zhou et al. for comparison. RmYN02's nucleotides coding the PAA amino acids (CCT GCA GCG) are surrounded by a red box. No insertion in RmYN02 is visible; on the contrary, a deletion splitting the nucleotides coding for PAA is observed. (B) Clustal W multiple sequence alignment of RmYN02 with the strains used in Zhou et al. for comparison, with the exception of SARS‐CoV‐2. The deletion characterizing RmYN02 at the S1/S2 junction appears to cause a split of the first nucleotide from the rest of the sequence coding for the PAA amino acids (CCT GCA GCG, surrounded by a red box). (C) Pairwise comparisons of RmYN02 (anchor) to RaTG13, ZC45, and ZXC21. No PAA insertion is observed in RmYN02 in these comparisons. (D) Pairwise comparisons of ZC45 (anchor) to ZXC21, RmYN02, RaTG13, and Pangolin/GD/2019. RmYN02's nucleotides coding the PAA amino acids (surrounded by a red box) are aligned as mutations relative to ZC45 rather than insertions. (E) Phylogenetic tree of SARS‐GZ02, Pangolin/GX/2017, ZC45, ZXC21, RmYN02, RaTG13, and Pangolin/GD/2019 produced by CLUSTAL W based on the alignment of their genomes as in (B)
FIGURE 2Nucleotide and amino acid alignments of RmYN02 with SARS‐CoV‐2, RaTG13, RShSTT182/200 (Cambodia), RacCS203/264/271 (Thailand), Pangolin/GD/2019, RmYN01, RP3, Rf4092, LYRa11, Rs3367, RsSHC014, ZC45, and ZXC21 at the S1/S2 junction of the spike protein. For RmYN02, three alternative versions are provided, besides the ones proposed by Clustal W and Zhou et al