| Literature DB >> 32571797 |
Zhe Liu1,2, Huanying Zheng2, Huifang Lin1,2, Mingyue Li3, Runyu Yuan1,2, Jinju Peng1,2, Qianling Xiong1,2, Jiufeng Sun1,2, Baisheng Li2, Jie Wu2, Lina Yi1,2, Xiaofang Peng1,2, Huan Zhang1,2, Wei Zhang1,2, Ruben J G Hulswit4, Nick Loman5, Andrew Rambaut6, Changwen Ke2, Thomas A Bowden4, Oliver G Pybus7,8, Jing Lu9,2.
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a novel coronavirus first identified in December 2019. Notable features that make SARS-CoV-2 distinct from most other previously identified betacoronaviruses include a receptor binding domain and a unique insertion of 12 nucleotides or 4 amino acids (PRRA) at the S1/S2 boundary. In this study, we identified two deletion variants of SARS-CoV-2 that either directly affect the polybasic cleavage site itself (NSPRRAR) or a flanking sequence (QTQTN). These deletions were verified by multiple sequencing methods. In vitro results showed that the deletion of NSPRRAR likely does not affect virus replication in Vero and Vero-E6 cells; however, the deletion of QTQTN may restrict late-phase viral replication. The deletion of QTQTN was detected in 3 of 68 clinical samples and 12 of 24 in vitro-isolated viruses, while the deletion of NSPRRAR was identified in 3 in vitro-isolated viruses. Our data indicate that (i) there may be distinct selection pressures on SARS-CoV-2 replication or infection in vitro and in vivo; (ii) an efficient mechanism for deleting this region from the viral genome may exist, given that the deletion variant is commonly detected after two rounds of cell passage; and (iii) the PRRA insertion, which is unique to SARS-CoV-2, is not fixed during virus replication in vitro These findings provide information to aid further investigation of SARS-CoV-2 infection mechanisms and a better understanding of the NSPRRAR deletion variant observed here.IMPORTANCE The spike protein determines the infectivity and host range of coronaviruses. Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has two unique features in its spike protein, the receptor binding domain and an insertion of 12 nucleotides at the S1/S2 boundary resulting in a furin-like cleavage site. Here, we identified two deletion variants of SARS-CoV-2 that either directly affect the furin-like cleavage site itself (NSPRRAR) or a flanking sequence (QTQTN), and we investigated these deletions in cell isolates and clinical samples. The absence of the polybasic cleavage site in SARS-CoV-2 did not affect virus replication in Vero or Vero-E6 cells. Our data indicate the PRRAR sequence and the flanking QTQTN sequence are not fixed in vitro; thus, there appears to be distinct selection pressures on SARS-CoV-2 sequences in vitro and in vivo Further investigation of the mechanism of generating these deletion variants and their infectivity in different animal models would improve our understanding of the origin and evolution of this virus.Entities:
Keywords: COVID-19; SARS-CoV-2; deletion mutation; replication kinetics; spike protein
Mesh:
Substances:
Year: 2020 PMID: 32571797 PMCID: PMC7431800 DOI: 10.1128/JVI.00790-20
Source DB: PubMed Journal: J Virol ISSN: 0022-538X Impact factor: 5.103
Sample information and accession numbers for all sequences
| Patient identifier | Sample isolated from: | Passage | Sample name | Sequencing method | Genome Sequence Archive accession no. |
|---|---|---|---|---|---|
| Case1 | BALF | Original | 014 | Metagenomic | SAMC151281 |
| Vero-E6 | 3 | 014/MiSeq | PCR+MiSeq | SAMC150996 | |
| Vero-E6 | 3 | 014/cDNA | Nanopore direct cDNA | SAMC150997 | |
| Vero-E6 | Plaque | 014_Var1 | PCR+Nanopore | SAMC192628 | |
| Vero-E6 | Plaque | 014_Var2 | PCR+Nanopore | SAMC192629 | |
| Case2 | Vero-E6 | 2 | 025/E6 | PCR+Nanopore | SAMC150991 |
| Case3 | Vero | 2 | 028/Vero | PCR+Nanopore | SAMC150988 |
| Vero-E6 | 2 | 028/E6 | PCR+Nanopore | SAMC150992 | |
| Case4 | Vero-E6 | 2 | 029/E6 | PCR+Nanopore | SAMC150975 |
| Case5 | Vero-E6 | 2 | 107/E6 | PCR+Nanopore | SAMC150977 |
| Vero | 2 | 107/Vero | PCR+Nanopore | SAMC150989 | |
| Case6 | Vero-E6 | 2 | 108/E6 | PCR+Nanopore | SAMC150993 |
| Vero | 2 | 108/Vero | PCR+Nanopore | SAMC150995 | |
| Case7 | Vero-E6 | 2 | 112/E6 | PCR+Nanopore | SAMC150976 |
| Vero | 2 | 112/Vero | PCR+Nanopore | SAMC150994 | |
| Case8 | Vero-E6 | 2 | 115/E6 | PCR+Nanopore | SAMC150978 |
| Vero | 2 | 115/Vero | PCR+Nanopore | SAMC150990 | |
| Case9 | Vero-E6 | 2 | 252/E6 | PCR+Nanopore | SAMC150980 |
| Case10 | Vero-E6 | 2 | 262/E6 | PCR+Nanopore | SAMC150981 |
| Case11 | Vero-E6 | 2 | 263/E6 | PCR+Nanopore | SAMC150983 |
| Case12 | Vero-E6 | 2 | 265/E6 | PCR+Nanopore | SAMC150982 |
| Case13 | Vero-E6 | 2 | 272/E6 | PCR+Nanopore | SAMC150984 |
| Case14 | Vero-E6 | 3 | 619/E6 | PCR+Nanopore | SAMC153235 |
| Case15 | Vero-E6 | 2 | 1676/E6 | PCR+Nanopore | SAMC150979 |
| Case16 | Vero-E6 | 3 | 4276/E6 | PCR+Nanopore | SAMC153234 |
| Case17 | Vero-E6 | 2 | F2/E6 | PCR+Nanopore | SAMC150985 |
| Case18 | Vero-E6 | 2 | F4/E6 | PCR+Nanopore | SAMC150986 |
| Case19 | Vero-E6 | 2 | F5/E6 | PCR+Nanopore | SAMC150987 |
| Case20 | Nasopharyngeal swab | Original | 20SF5645 | PCR+Nanopore | SAMC150972 |
| Case21 | Nasopharyngeal swab | Original | ST-N3-D | PCR+Nanopore | SAMC150973 |
| Case22 | Nasopharyngeal swab | Original | SZ-N16-D | PCR+Nanopore | SAMC150974 |
FIG 1Deletion variants identified in SARS-CoV-2 cell strains. (A) High-throughput sequencing of the cell-isolated strain (014) from the first SARS-CoV-2 patient (EPI 403934) in Guangdong, China. Representative reads mapping to the SARS-CoV-2 genome (GenBank accession number MN908947.3 used as reference genome) showed two deletion variants. Redundant proteolytic cleavage sites, including furin cleavage site (PRRARS|V) and cathepsin L site (QSIIAY|T) are marked with red arrows. (B) Sanger sequencing of the 014 cell strains. Heterozygous peaks are highlighted with a red box, and sites with distinct three peaks are marked with * (C) Results of high-throughput sequencing, showing the ratio of deletion variants in original clinical sample SF014 (P0) and in cell strains, after 7 rounds of cell passage (P1 to P7). The size of each square was proportional to the number of reads having these deletions. (D) Phylogenetic tree of genome sequences of all 24 SARS-CoV-2 cell strains (see Table 1). The size of the circles is proportional to the percentage of Var1 (QTQTN deletion at positions 23583 to 23597) in total reads, except for strains 619, 4279, and 014, in which Var2 deletions were detected. The maximum likelihood tree was rooted with the reference genome sequence under GenBank accession number MN908947.3.
FIG 2The replication kinetics of the deletion variants in Vero-E6 and Vero cells. Vero-E6 and Vero cells were infected with the isolated strains 014_Var1, 014_Var2, and 029/E6 (Table 1) at a multiplicity of infection (MOI) of 0.5. Viral RNA was quantified by real-time PCR using GAPDH as an endogenous control. At each time point, the relative fold change in total intracellular viral RNA was measured by comparison with the viral RNA level at 1 hour postinoculation. Data are the mean ± SD of three independent experiments. Asterisks indicate a significant difference (P < 0.05).
QTQTN deletion variant identified in clinical samples
| Sample | Days post-illness onset | REF_depth (×) | ALT_depth (×) | Del variant ratio (%) |
|---|---|---|---|---|
| 20SF5645 | Asymptomatic | 104 | 25 | 19.4 |
| ST-N3-D | 16 | 82 | 8 | 8.8 |
| SZ-N16-D | 30 | 256 | 125 | 32.8 |
At position 23583 to 23597 (Var1).
Cases detected with the recurrence of SARS-CoV-2 after discharge.
REF_depth, sequencing depth for the reference bases; ALT_depth, sequencing depth for the altered bases.
FIG 3Observed deletions near the S1/S2 boundary map to an unresolved region in the cryo-EM structure of SARS-CoV-2 S. Cartoon representation of the SARS-CoV-2 S protein ectodomain, as resolved by Walls and colleagues (4) (PDB ID 6VXX). The S1 and S2 subunits of the different protomers are indicated (white and gray, respectively). The unresolved loop that contains part of deletion Var1 (675QTQTN679) and all of deletion Var2 (679NSPRRAR685) is indicated within each protomer of the trimeric assembly through signposting flanking residues T676 and S689 as spheres in deep teal. Similarly, the first residue of Var1 (Q675), which is resolved in the structure, is colored orange within each of the S protomers. N-linked glycans are shown as blue spheres and the Asn side chains to which the glycans that are linked are presented as sticks. Inset: A zoomed-in side view representation of this local arrangement is shown. T676 and S689, which flank the unresolved loop, and Var1 residue Q675 are numbered and indicated under transparent spheres as deep teal and orange sticks, respectively. A dashed line indicating the approximate position of the connecting unresolved loop is shown. N-linked glycans are presented as in the original image with their residue numbers marked.