| Literature DB >> 32752979 |
Cristina Andrés1, Damir Garcia-Cehic2,3, Josep Gregori3,4, Maria Piñana1, Francisco Rodriguez-Frias3,5,6, Mercedes Guerrero-Murillo2, Juliana Esperalba1, Ariadna Rando7, Lidia Goterris7, Maria Gema Codina7, Susanna Quer2, Maria Carmen Martín1, Magda Campins8, Ricard Ferrer9, Benito Almirante10, Juan Ignacio Esteban2,3,6, Tomás Pumarola6,7, Andrés Antón1,6, Josep Quer2,3.
Abstract
The SARS-CoV-2 spike (S) protein, the viral mediator for binding and entry into the host cell, has sparked great interest as a target for vaccine development and treatments with neutralizing antibodies. Initial data suggest that the virus has low mutation rates, but its large genome could facilitate recombination, insertions, and deletions, as has been described in other coronaviruses. Here, we deep-sequenced the complete SARS-CoV-2 S gene from 18 patients (10 with mild and 8 with severe COVID-19), and found that the virus accumulates deletions upstream and very close to the S1/S2 cleavage site (PRRAR/S), generating a frameshift with appearance of a stop codon. These deletions were found in a small percentage of the viral quasispecies (2.2%) in samples from all the mild and only half the severe COVID-19 patients. Our results suggest that the virus may generate free S1 protein released to the circulation. We suggest that natural selection has favoured a "Don't burn down the house" strategy, in which free S1 protein may compete with viral particles for the ACE2 receptor, thus reducing the severity of the infection and tissue damage without losing transmission capability.Entities:
Keywords: NGS; SARS-CoV-2; deletions; diversity; quasispecies; respiratory virus
Mesh:
Substances:
Year: 2020 PMID: 32752979 PMCID: PMC8284971 DOI: 10.1080/22221751.2020.1806735
Source DB: PubMed Journal: Emerg Microbes Infect ISSN: 2222-1751 Impact factor: 7.163
Characteristics of patients with mild and severe COVID-19. #P16 had clinical symptoms consistent with severe disease, but he was not hospitalized in the ICU.
| Sample Id ( | Real-time PCR cycle threshold (Ct) value | COVID-19 classification | Sample Type | Sex (F = female; M = male) | Age (years) | Days at Intensive Care Unit (ICU) |
|---|---|---|---|---|---|---|
| P01 | 19.00 | Mild | nasopharyngeal aspirate | F | 34 | no admission |
| P02 | 25.00 | mild | nasopharyngeal aspirate | F | 54 | no admission |
| P03 | 16.40 | mild | nasopharyngeal/oropharyngeal swab | F | 42 | no admission |
| P04 | 23.10 | mild | nasopharyngeal/oropharyngeal swab | F | 25 | no admission |
| P05 | 25.98 | mild | nasopharyngeal/oropharyngeal swab | M | 52 | no admission |
| P06 | 21.45 | mild | nasopharyngeal/oropharyngeal swab | F | 42 | no admission |
| P07 | 25.94 | mild | nasopharyngeal/oropharyngeal swab | F | 25 | no admission |
| P14 | 23.71 | mild | nasopharyngeal aspirate | F | 26 | no admission |
| P15 | 27.32 | mild | nasopharyngeal/oropharyngeal swab | M | 41 | no admission |
| P18 | 15.50 | mild | nasopharyngeal/oropharyngeal swab | M | 74 | no admission |
| P08 | No data | severe | nasopharyngeal/oropharyngeal swab | F | 51 | 4 |
| P09 | 25.36 | severe | nasopharyngeal/oropharyngeal swab | M | 49 | 3 |
| P10 | 21.23 | severe | nasopharyngeal/oropharyngeal swab | F | 47 | 16 |
| P11 | 36.01 | severe | nasopharyngeal/oropharyngeal swab | M | 45 | 27 |
| P12 | 31.04 | severe | nasopharyngeal/oropharyngeal swab | M | 51 | 23 |
| P13 | 22.94 | severe | nasopharyngeal aspirate | F | 44 | 55 |
| P16 | 34.35 | severe | nasopharyngeal/oropharyngeal swab | F | 45 | # |
| P17 | 30.77 | severe | nasopharyngeal/oropharyngeal swab | M | 49 | 10 |
Figure 1.Diagram showing location of the deletions found along the Spike gene and protein[29].
List of deletions found along the spike gene.
| Deletion code (Δ=deletion region) | Amplicon at the nucleotide level | Deleted nucleotide positions | Deleted amino acid positions | Number of nts deleted | Patient code | Number of reads with deletions | Total reads | Population frequency (in percentage) |
|---|---|---|---|---|---|---|---|---|
| Δ1 | N01 | 38–49 | 13S-17V | 12 | 234 | 126,140 | 0.19 | |
| Δ2 | N01 | 323–329 | 108T-110L | 2–7 | 7329 | 648,955 | 1.13 | |
| Δ3 | N02 | 420–434 | 140F-145Y | 2–14 | 10,111 | 1,617,589 | 0.63 | |
| Δ4 | N02 | 596–603 | 199G-201F | 2–8 | 1031 | 653,783 | 0.16 | |
| Δ5 | N03 | 724–736 | 242L-246R | 5–6 | 2024 | 259,012 | 0.78 | |
| Δ6 | N03-N04 | 1022–1027 | 341V-343N | 2–4 | 3809 | 1,042,526 | 0.37 | |
| Δ7 | N04 | 1120–1128 | 374F-S-396T | 9 | 387 | 157,405 | 0.25 | |
| Δ8 | N04 | 1177–1180 | 393T-394N | 4 | 338 | 155,414 | 0.22 | |
| Δ9 | N04-N05 | 1283–1324 | 428D-442D | 13–42 | 3835 | 496,156 | 0.77 | |
| Δ10 | N05 | 1368–1376 | 456F-459S | 9 | 245 | 172,903 | 0.14 | |
| Δ11 | N06 | 1444–1452 | 482G-484E | 9 | 521 | 173,278 | 0.30 | |
| Δ12 | N07 | 1865–1870 | 622V-A-624I | 6 | 16,725 | 436,054 | 3.84 | |
| Δ13 | N07 | 1888–1894 | 630T- | 19 | 193 | 148,533 | 0.13 | |
| Δ14 | N07 | 1961–1979 | 654E-660Y | 8 | 1068 | 192,954 | 0.55 | |
| Δ15 | N07 | 1980–2035 | 660Y-679N | 2–34 | 64,978 | 2,923,548 | 2.22 | |
| Δ16 | N08-N09 | 2451–2467 | 817F-823F | 2–16 | 11,739 | 2,176,059 | 0.54 | |
| Δ17 | N10-N11 | 3018–3019 | 1006T-1007Y | 2 | 1544 | 299,893 | 0.51 | |
| Δ18 | N12-N13 | 3499 | 1167G | 1 | 23,359 | 472,695 | 4.94 |
Figure 2.Bar plot of deletions in amplicon N07 in the 18 patients (P01-P18) at the nucleotide level: Panel 1, patients with mild disease; Panel 2, patients with severe disease. The x axis provides the multiple alignment (MA) nucleotide positions and the amplitude of the deletions by subregions, and the y axis shows the frequency of the deletion (percentage) on the right and the number of reads on the left. As no insertions were observed, the MA positions correspond to S gene positions. Dashed lines indicates S1/S2 (left) and S2’ (right) cleavage sites. Bar plots for the 18 patients by amplicons are provided in supplementary material (Figures S1 to S14 for nucleotides and S15 to S27 for amino acids).
List of deletions found in amplicon N07 at the nucleotide level aligned under the reference sequence Wuhan Hu-1 (MN908947.3). Alignment between nucleotides 1974 and 2070 is shown. Nucleotides represented in bold red in the reference sequence indicate the S1/S2 cleavage site R (CGT) / S (AGT).
| Patient MILD/SEVERE | Nucleotide alignments |
|---|---|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Deletions in amplicon N07 at the amino acid (aa) level. wt = wild type (MN908947.3). S, stop; Lost + S, loss of reading frame and appearance of a stop codon; rRF, recover reading frame. Haplotypes that did not lose the TATA box-like sequence are highlighted in yellow, and haplotypes with a deletion upstream of the TATA box-like sequence are highlighted in blue. Cleavage S1/S2 amino acid site between residues 685 / 686 (PRRAR/S). * stop codon.
| Patient MILD/SEVERE | Amino acid alignments | |
|---|---|---|
|
| ||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 3.Based on the life cycle of SARS-CoV, this diagram represents the hypothesis derived from our results. Entry of the virus in the host cell is shown at the top right of the diagram. At the transcription step, two scenarios are depicted: to the left, the viral particle resulting from normal S protein, and to the right the viral particle resulting from truncated S protein. In normal conditions, once the nucleoprotein is freed into the cytoplasm ss + RNA is translated into the non-structural proteins required for transcription. ss + RNA is transcribed into ss-RNA and later into genomic ss + RNA which is encapsidated (left side of the figure). Once the complete viral particle has been formed, it is secreted from the cell by exocytosis. The right side of the figure depicts the situation when a deletion occurs in the S gene during transcription of the complete genome and before subgenomic mRNAs are generated to produce the structural proteins. Translation of a deleted subgenomic spike mRNA would lead to a truncated S protein composed of the S1 domain without S2, which could be shed outside the cell as free S1. The box depicts possible destinations of free S1, which could bind to (1) the ACE2 cell receptor, (2) S1-specific neutralizing antibodies, or (3) free ACE2 receptor. ***The red triangle indicates the deletion in genomic RNA. ***Abbreviations: ACE2, angiotensin converting enzyme 2; mRNA, messenger RNA; NAb; neutralizing antibodies; pp1a, polyprotein 1a; RdRp, RNA-dependent RNA polymerase; S, spike; S1, subunit S1 at the N-terminal domain of the S protein, which includes receptor binding domain (RBD); S2, subunit S2 located at the C-terminal domain of S protein, which includes fusion peptide (FP), heptad repeat (HR) domain 1 and 2, and the transmembrane domain (TM); ss, single stranded; ss + RNA, single-stranded positive sense RNA; TMPRS22, human serine protease TMPRSS2.