| Literature DB >> 34940505 |
Abstract
A novel coronavirus (SARS-CoV-2) emerged towards the end of 2019 that caused a severe respiratory disease in humans called COVID-19. It led to a pandemic with a high rate of morbidity and mortality that is ongoing and threatening humankind. Most of the mutations occurring in SARS-CoV-2 are synonymous or deleterious, but a few of them produce improved viral functions. The first known mutation associated with higher transmissibility, D614G, was detected in early 2020. Since then, the virus has evolved; new mutations have occurred, and many variants have been described. Depending on the genes affected and the location of the mutations, they could provide altered infectivity, transmissibility, or immune escape. To date, mutations that cause variations in the SARS-CoV-2 spike protein have been among the most studied because of the protein's role in the initial virus-cell contact and because it is the most variable region in the virus genome. Some concerning mutations associated with an impact on viral fitness have been described in the Spike protein, such as D614G, N501Y, E484K, K417N/T, L452R, and P681R, among others. To understand the impact of the infectivity and antigenicity of the virus, the mutation landscape of SARS-CoV-2 has been under constant global scrutiny. The virus variants are defined according to their origin, their genetic profile (some characteristic mutations prevalent in the lineage), and the severity of the disease they produce, which determines the level of concern. If they increase fitness, new variants can outcompete others in the population. The Alpha variant was more transmissible than previous versions and quickly spread globally. The Beta and Gamma variants accumulated mutations that partially escape the immune defenses and affect the effectiveness of vaccines. Nowadays, the Delta variant, identified around March 2021, has spread and displaced the other variants, becoming the most concerning of all lineages that have emerged. The Delta variant has a particular genetic profile, bearing unique mutations, such as T478K in the spike protein and M203R in the nucleocapsid. This review summarizes the current knowledge of the different mutations that have appeared in SARS-CoV-2, mainly on the spike protein. It analyzes their impact on the protein function and, subsequently, on the level of concern of different variants and their importance in the ongoing pandemic.Entities:
Keywords: COVID-19 vaccines; SARS-CoV-2 genome; Spike protein; escape mutation; neutralizing antibodies (nAbs); receptor binding domain (RBD); variant of concern (VOC)
Year: 2021 PMID: 34940505 PMCID: PMC8705434 DOI: 10.3390/jdb9040058
Source DB: PubMed Journal: J Dev Biol ISSN: 2221-3759
SARS-CoV-2 genome structure. Polyproteins pp1a and pp1ab synthesize non-structural proteins nsp1–nsp16, which are responsible for the replication of ssRNA. The 3′ third of the genome contains genes that synthesize structural proteins and ORFs. Many of the resulting proteins still have unknown functions. Data were collected from NCBI’s public gene database (https://www.ncbi.nlm.nih.gov/gene accessed on 14 September 2021) and [2,10,11,12,13,14,15,16,17,18,19,20]. Adapted from [2].
| Gene | Transcript | Protein Name(s) | Position in the Genome | Length (aa) | Function |
|---|---|---|---|---|---|
| 5′UTR | 1–265 | ||||
| ORF1a | pp1ab, pp1a [ | nsp1 | 266–805 | 180 | Leader protein. Cellular mRNA degradation, inhibiting IFN signaling [ |
| nsp2 | 806–2719 | 638 | Unknown. | ||
| nsp3, PLpro | 2720–8554 | 1945 | Papain-like protease, adenosine diphosphate-ribose 1″-phosphatase. Blocks host innate immune response, promotes cytokine expression [ | ||
| nsp4 | 8555–10054 | 500 | Double-membrane vesicles formation [ | ||
| nsp5, 3CLpro, Mpro | 10055–10972 | 306 | 3-chymotrypsin-like Cys protease. Main protease. Mediates cleavages downstream of nsp4. Inhibits IFN signaling [ | ||
| nsp6 | 10973–11842 | 290 | Restricting autophagosome expansion. Double-membrane vesicle formation [ | ||
| nsp7 | 11843–12091 | 83 | Cofactor with nsp8 and nsp12 [ | ||
| nsp8 | 12092–12685 | 198 | Replicase. Cofactor with nsp7 and nsp12. Primase [ | ||
| nsp9 | 12686–13024 | 113 | Replicase. ssRNA-binding protein. Dimerization and RNA binding [ | ||
| nsp10 | 13025–13441 | 139 | RNA synthesis protein. Scaffold and cooperation with nsp14 ExoN and nsp16 in methyltransferase activities [ | ||
| ORF1ab | pp1ab [ | nsp11 | 13442–13480 | 13 | Endoribonuclease and 3′-to-5′ exonuclease [ |
| nsp12, RdRP | 13442–16236 | 932 | RNA-dependent RNA polymerase: replication and transcription of the viral genome. Primer dependent RdRp [ | ||
| nsp13 | 16237–18039 | 601 | DNA and RNA helicase/NTPase, 2′-O-ribose methyltransferase. RNA 5′-triphosphatase. RNA helicase 5′ triphosphatase [ | ||
| nsp14 | 18040–19620 | 527 | ExoN. 3′-to-5′ exonuclease. N7-guanine methyltransferase [ | ||
| nsp15 | 19621–20658 | 346 | Endoribonuclease, 3′-to-5′ exonuclease. NendoU. Evasion of dsRNA sensors [ | ||
| nsp16 | 20659–21552 | 298 | 2′-O-ribose methyltransferase [ | ||
| S | Spike (S) | 21563–25384 | 1273 | Structural protein; surface glycoprotein. Mediates virus–host cell binding. | |
| ORF3a | ORF3a | 25393–26220 | 275 | Ion channel activity (viroporin) activates the NLRP3 inflammasome. May play a role in virus replication and pathogenesis. | |
| E | Envelope (E) | 26245–26472 | 75 | Structural protein. Envelope protein. Facilitates assembly and release of the virus. It has ion channel activity required for pathogenesis. | |
| M | Membrane (M) | 26523–27191 | 222 | Structural protein. Membrane glycoprotein. Located in the transmembrane domain; it is the most abundant structural protein. | |
| ORF6 | ORF6 | 27202–27387 | 61 | Suppression of both primary interferon production and interferon signaling [ | |
| ORF7a | ORF7a | 27394–27759 | 121 | Type I transmembrane protein. | |
| ORF7b | ORF7b | 27756–27887 | 43 | Localize to the Golgi compartment. | |
| ORF8 | ORF8 | 27894-28259 | 121 | Interferes with host antiviral mechanisms [ | |
| N | Nucleocapsid (N) | 28274–29533 | 419 | Structural protein. Nucleocapsid phosphoprotein protects the viral RNA genome and is involved in packaging RNA into virus particles. | |
| ORF10 | ORF10 | 29558–29674 | 38 | Unknown. No transcripts identified [ | |
| 3′UTR | 29675–29903 |
Figure 1(A) SARS-CoV-2 structure of the SARS-CoV-2, comprising an ssRNA and 4 structural proteins interacts with the host cell through the ACE2 receptor (dark blue). Blue field represents inside the host cell, and yellow field outside the cell. (B) Changes in the spike during the virus-cell interaction. At the left, the inactive or ‘down’ conformation, reluctant to bind ACE2. Spike–ACE2 binding produces a conformational change in the S towards the ‘up’ configuration. This is followed by S1/S2 cleavage by host enzymes and activation of the entry to the cell. (C) Viral ssRNA enters the cell and produces pp1a and pp1ab, which will provide the RTC plus structural proteins (S, N, E, M) and accessory proteins (ORF). (D) N proteins are assembled with the new replicated viral ssRNA to form the vRNP. New viruses bud from the ERGIC and are released from the cell.
Figure 2On the top, a schematic view of the SARS-CoV-2 genome that spans almost 30 kb. Polyprotein pp1a produces nsp1–nsp10 and pp1ab generates nsp1–nsp16. In the middle, the detailed structure of the spike protein. SS: signal sequence, NTD: N-terminal-domain, RBD: receptor binding domain, RBM: receptor binding motif, SD: subdomain, FP: fusion peptide, HR: heptad repeat, CH: central helix, CD: connector domain, TD: transmembrane domain, CT: cytoplasmic tail. Cleavage of the spike protein in SD1/SD2 yields spike subunits S1 and S2, activating the virus entry in the host cell. At the bottom, a schematic representation of the mutations included in the VOCs until September 2021. Red dotted lines point out mutations of concern that are shared by different variants.
Summary of main variants. Name(s), lineage, type, and distinctive mutations of the different variants mapped against the Wuhan-Hu-1 reference sequence (MN908947). The concern status is defined by the WHO [89], the CDC [90], and the ECDC [91]; de-esc: de-escalated. Mutations in the spike gene are underlined. Ϯ: Mutations in the Delta variant are undefined, particularly in the ORF1 region, and they differ from one database to another. In bold letters are those mutations documented by all different sources [77,92,93].
| WHO Name [ | Other Names | Lineage (Pango) | First Documented | Status WHO (*CDC) | Status from September 2021 | Mutations: Amino Acid Modifications in Comparison to the Ancestral Wuhan-Hu-1 Sequence (NC_045512) | Impact of Mutations on Virus Functions |
|---|---|---|---|---|---|---|---|
| Alpha | VOC 202012/01, 20I/501Y.V1 (British variant) | B.1.1.7 | UK, Sept-2020 [ | VOC | WHO: VOC ECDC: de-esc CDC: VBM | PL:T183I, PL:A890D, PL:I1412T, nsp6:Δ106-108, RdRP:P323L, | Increased affinity of S protein for ACE2 receptor, provided mostly by N501Y [ |
| Beta | 20H/501Y.V2 (Southafrican variant) | B.1.351 | South Africa, Oct-2020 [ | VOC | WHO: VOC ECDC: VOC CDC: VBM | nsp2:T85I, PL:K837N, 3CL:K90R, nsp6:Δ106-108, RdRP:P323L, | D614G, N501Y, E484K, and K417N increase the binding affinity for the ACE receptors [ |
| Gamma | P.1, 20J/501Y.V3, (Brazilian variant) | P.1, B.1.1.28.1 | Brazil. Dec-2020 [ | VOC | WHO: VOC ECDC: VOC CDC: VBM | PL:S370L, PL:K977Q, nsp6:Δ106-108, RdRPP323L, nsp13:E341D, | D614G, N501Y, E484K, and K417T increase the binding affinity for the ACE receptors [ |
| Delta | VUI-21APR-02, (Indian variant) | B.1.617.2 | India. Dec-2020 [ | VOC | VOC | nsp2:P129L, PL:P822L, PL:H1274Y, nsp4:A446V, nsp6:L37F, nsp6:V149A, | Delta presents a unique mutation T478K. Along with L452R, it provides immune escape [ |
| Lineage B.1.617 | Ancestral B.1.617 | India [ | nsp1:Δ17, PL:Δ880, PL:A994D, RdRP:P323L, nsp13:Δ206, nsp13:Δ430, nsp14:Δ21, nsp15:A255V, nsp16:H186R, | Presents E484Q and L452R in the spike protein. Potential properties of immune escape and increased infectivity. | |||
| Epsilon | CAL.20C/ | B.1.427, B.1.429 | USA. Mar-2020 | VUM | WHO: VUM ECDC: de-esc CDC: VBM | nsp2:T85I, RdRP:P323L, nsp13:D260Y, | Expected to provide advantage at spreading [ |
| Zeta | VUI-202101/01 | P.2, B.1.1.28.2 | Brazil. Apr-2020 [ | VOI* | WHO: de-esc ECDC: de-esc CDC: VBM | Spike mutations: | Presents E484K in the spike protein. Potential properties of immune escape. |
| Eta | B.1.525 | Nigeria and UK. Dec-2020 | VOI | WHO: VUM ECDC: de-esc | PL:T1189I, nsp6:Δ106-108, RdRP:P323F, | Presents E484K, Δ69–70, Δ144–145 in the spike protein. Potential properties of immune escape. | |
| Theta | GR/1092K.V1 | P.3 | Philipines, Jan-2021 [Pango] | WHO: de-esc ECDC: de-esc | PL:D736G, PL:S1807F, nsp4:D217N, nsp4:L438P, nsp6:D112E, nsp7:L71F, RdRP:P323L, nsp13:L280F, nsp13:A358V, | Presents E484K and N501Y in the spike protein. Potential properties of immune escape and increased infectivity. | |
| Iota | B.1.526 | New York. Nov-2020 [ | VOI | WHO: VUM ECDC: de-esc CDC: VBM | nsp2:T85I, nsp4:L438P, nsp6:Δ106-108, RdRP:P323L, nsp13:Q88H, | Presents E484K in the spike protein. Potential properties of immune escape. | |
| Kappa | B.1.617.1 | India. Oct- 2020. | VOI | WHO: VUM ECDC: de-esc CDC: VBM | PL:T749I, nsp6:T77A, RdRP:P323L, nsp13:G206C, nsp13:M429I, nsp15:K259R, nsp15:S261A, | Presents E484Q, L452R, and P681R in the spike protein. Potential properties of immune escape and increased infectivity. | |
| Lambda | C.37 | Peru. Dec-2020. | VOI | WHO: VOI ECDC: VOI | nsp1:E102K, PL:A41V, PL:T428I, PL:D821N, PL1469S, nsp4:D217N, nsp4:D459N, 3CL:G15S, nsp6:L122S, nsp8:T148I, RdRP:P323L, nsp13:D105Y, | Presents L452R and Δ69–70 in the spike protein. Potential properties of immune escape. | |
| Mu | B.1.621 | Colombia. Jan-2021. | VOI | WHO: VOI ECDC: VOI CDC: VBM | PL:T237A, PL:T720I, nsp4:T492I, nsp6:Q160R, RdRP:P323L, nsp13:P419S, | Presents E484K and N501Y in the spike protein. Potential properties of immune escape and increased infectivity. | |
| B.1.617.3 | India. Feb-2021 [ | VUM, VOI* | ECDC: de-esc CDC: VBM | PL:A1T, PL:A1526V, PL:T1830I, 3CL:A194S, nsp6:A117V, RdRP:P323L, | Presents E484Q, L452R, and P681R in the spike protein. Potential properties of immune escape and increased infectivity. |