Literature DB >> 35266305

SARS-CoV-2 variants and spike mutations involved in second wave of COVID-19 pandemic in India.

Radhakrishna Muttineni1, Binitha R N2, Kalyani Putty3, Kavitha Marapakala4, Sandra K P5, Jaslin Panyam1, Aravind Vemula1, Shashi Mohan Singh1, Subin Balachandran2, Viroji Rao S T6, Anand Kumar Kondapi5.   

Abstract

Against the backdrop of the second wave of COVID-19 pandemic in India that started in March 2021, we have monitored the spike (S) protein mutations in all the reported (GISAID portal) whole-genome sequences of SARS-CoV-2 circulating in India from 1 January 2021 to 31 August 2021. In the 43,102 SARS-CoV-2 genomic sequences analysed, we have identified 24,260 amino acid mutations in the S protein, based on which 265 Pango lineages could be categorized. The dominant lineage in most of the 28 states of India and its 8 union territories was B.1.617.2 (the delta variant). However, the states Madhya Pradesh, Jammu & Kashmir, and Punjab had B.1.1.7 (alpha variant) as the major lineage, while the Himachal Pradesh state reported B.1.36 as the dominating lineage. A detailed analysis of various domains of S protein was carried out for detecting mutations having a prevalence of >1%; 70, 18, 7, 3, 9, 4, and 1 (N = 112) such mutations were observed in the N-terminal domain, receptor binding domain, C -terminal domain, fusion peptide region, heptapeptide repeat (HR)-1 domains, signal peptide domain, and transmembrane region, respectively. However, no mutations were recorded in the HR-2 and cytoplasmic domains of the S protein. Interestingly, 13.39% (N = 15) of these mutations were reported to increase the infectivity and pathogenicity of the virus; 2% (N = 3) were known to be vaccine breakthrough mutations, and 0.89% (N = 1) were known to escape neutralizing antibodies. The biological significance of 82% (N = 92) of the reported mutations is yet unknown. As SARS-CoV-2 variants are emerging rapidly, it is critical to continuously monitor local viral mutations to understand national trends of virus circulation. This can tremendously help in designing better preventive regimens in the country, and avoid vaccine breakthrough infections.
© 2022 Wiley-VCH GmbH.

Entities:  

Keywords:  B.1.617.2; COVID-19 second wave; India; SARS-CoV-2; spike protein variants; whole-genome

Mesh:

Substances:

Year:  2022        PMID: 35266305      PMCID: PMC9115369          DOI: 10.1111/tbed.14508

Source DB:  PubMed          Journal:  Transbound Emerg Dis        ISSN: 1865-1674            Impact factor:   4.521


INTRODUCTION

As on November 2021, severe acute respiratory syndrome coronavirus 2 (SARS‐CoV‐2) accounted for about 260 million COVID‐19 infections and about 5.2 million deaths worldwide. In India, more than 34 million cases with around half a million deaths were reported in this period (Coronavirus Disease [COVID‐19], n.d.). Viruses (especially RNA viruses) constantly change through mutations (Bessière & Volmer, 2021); a variant has one or more mutations that differentiate it from others in circulation. Owing to the great efforts of many researchers worldwide who are generating and sharing virus whole‐genome sequences publicly, multiple variants of SARS‐CoV‐2 have been documented globally throughout this pandemic. Currently, more than 4 million SARS‐CoV‐2 sequences are available via the Global Initiative on Sharing All Influenza Data (GISAID) (GISAID ‐ Initiative, n.d.). Sequence data greatly enables the identification of mutations that potentially change viral properties and detect emerging SARS‐CoV‐2 variants. The entry of SARS‐CoV‐2 into host cells is mediated by its spike (S) protein binding to the host cell‐surface receptor, angiotensin‐converting enzyme 2 (ACE2) (Zhao et al., 2020). The S protein is heavily glycosylated and induces a protective immune response (F. Li, 2016; Walls et al., 2020). The S protein is divided into two subunits S1 and S2. These two domains are biologically significant. S1 exhibits the function of receptor‐binding and S2 region is associated with the membrane fusion to facilitate cell entry (Xia, 2021). S1 region has four domains, signal peptide (1–13 aa), N‐terminal (14–305 aa), receptor binding (319–541 aa) and C‐ terminal‐ SD1 and SD2 (542–685 aa). The S2 region has five domains, fusion peptide (688–811 aa), heptapeptide repeats 1 and 2 (812–1142 aa), transmembrane (1213–1236 aa), cytoplasmic domain, and extra amino acids (1274–1943 aa) (Winger & Caspari, 2021). Mutations in S protein were always of major concern as they play a major role in the emergence of new SARS‐CoV‐2 variants (Harvey et al., 2021). The World Health Organization (WHO) classification of variant viruses as variants of concern (VOC) and variants of interest (VOI) is also based on mutations in S protein (Tracking SARS‐CoV‐2 Variants, n.d.) (Table 1).
TABLE 1

SARS‐CoV‐2 variant classifications as per WHO

S. No.Pango lineageWHO variant classificationSpike mutations
1B.1.525VOIA67V, 69del, 70del, 144del, E484K, D614G, Q677H, F888L
2B.1.526VOIL5F, T95I, D253G, E484K, D614G, A701V
3B.1.617.1VOIT95I, G142D, E154K, L452R, E484Q, D614G, P681R, Q1071H
4B.1.617.3VOIT19R, G142D, L452R, E484Q, D614G, P681R, D950N
5B.1.1.7VOC69del, 70del, 144del, N501Y, A570D, D614G, P681H, T716I, S982A, D1118H
6B.1.617.2VOCT19R, T95I, G142D, E156‐, F157‐, R158G, L452R, T478K, D614G, P681R, D950N
7B.1.351, B.1.351.2, B.1.351.3VOCD80A, D215G, 241del, 242del, 243del, K417N, E484K, N501Y, D614G, A701V
8P.1, P.1.1, P.1.2VOCL18F, T20N, P26S, D138Y, R190S, K417T, E484K, N501Y, D614G, H655Y, T1027I
SARS‐CoV‐2 variant classifications as per WHO In India, the second wave of COVID‐19 emerged in March 2021 (and started declining towards August 2021), with a new set of symptoms affecting the gastrointestinal system along with the previous (first wave) symptoms that primarily included respiratory problems (Kamble et al., 2021). Oxygen requirement, mortality rate and the disease‐spreading rate increased during the second wave even though vaccines like COVAXIN (inactivated whole virus vaccine) and COVISHIELD (adenovirus vectored vaccine) were available and in use since January 2021 (albeit in initial stages by the time second wave emerged in India). In India, an estimate of 62,76,35,244 people completed 2 doses of vaccination; however, vaccine breakthrough infections are not uncommon (Dash et al., 2021). With this perspective, we initiated this study wherein we analysed the S protein sequences of SARS‐CoV‐2 variants reported in India and submitted in the GISAID portal from 1 January 2021 to 31 August 2021. Mutations throughout the S protein domains were studied in detail and analysed (at a translational level) to understand whether the mutations reported affect the virus properties and antigenic consequences such as infectivity, transmissibility, and resistance to the available vaccines, based on the literature available on these mutations.

MATERIALS AND METHODS

Whole‐genome sequences (N = 43,102) of SARS‐CoV‐2 variants circulating in India were downloaded from the GISAID website. The GISAID website has an inbuilt algorithm that categorises the Pango lineages and designates them. The same was used in this study. All the available data from 1 January 2021 to 31 August 2021 were analysed in the study. State‐wise statistics and sequence analysis of all the variants were also performed. Each genomic sequence was scrutinised for the acquirement of the data with regard to its: date of collection, location, accession ID, Clade, Pango lineage, and S mutations (Table S1). Each genomic sequence was thoroughly analysed for the Pango lineages and the spike mutations, as they were the main objectives. To study the variants circulating in India, we have segregated the data in a state‐wise pattern. First, every variant observed was counted, and the sequences were noted down. Next, all mutations in the spike region were listed out and calculated, then the percentiles for the enumerated count of all the mutations against more than 40,000 were taken. Briefly, using the excel tool, duplicates were separated. Transpose formula was used to make the total in every single column. All the mutations were arranged in ascending order and segregated into their respective domains in the S protein. All the clinically significant and major mutations were listed out for further analysis. Only mutations with frequencies >1% are analysed in detail. These mutations were studied in detail in S protein region‐1 and S protein region‐2. Based on existing literature, the S protein region's mutations were analysed for increased infectivity, decreased infectivity, and decreased neutralising sensitivity. S protein structure PDBID‐6VXX was downloaded from PDB, and increasing infectivity mutations were mapped. The final statistical data and mutation mapping was formulated into the graphical models using BioRender (https://biorender.com).

RESULTS AND DISCUSSION

In 2021, there has been a massive increase in the COVID‐19 community transmission in several countries worldwide (Coronavirus Disease (COVID‐19), n.d.). India also faced a gigantic surge in infectivity and transmissibility of SARS‐CoV‐2. In this study, we have analysed in detail 43,102 genomic sequences of SARS‐CoV‐2 strains circulating in India reported to the GISAID portal from 1 January 2021 to 31 August 2021. We identified the percentage prevalence of Pango lineages, studied the S protein mutations and analysed the structural impact on S protein and clinical significance of the observed S mutations based on existing literature. We have identified a total of 24,260 mutations in the S protein (Table S2). All of these mutations were further categorised based on Pango nomenclature. A total of 265 Pango lineages could be categorised based on the 24,260 mutations identified (Table 2). The dominant lineage in most of the 28 states of India and its 8 union territories was B.1.617.2 (the delta variant). However, the states Madhya Pradesh, Jammu & Kashmir, and Punjab had B.1.1.7 (alpha variant) as the major lineage, while the state Himachal Pradesh had B.1.36 as the dominating lineage (Figure 1 and Table 2). Furthermore, Pango lineages identified in the country whose prevalence was >1% were also recorded (Figure 2). B.1.617.2‐Delta variant was the major dominating variant (45%), followed by B.1.617‐kappa (9.8 %), B.1.1.7 alpha (8.6 %), B.1 (7%), B.1.36 (4%), B.1.36.29 (2.8%), B.1.1 (2. 7%), B.1.1.216 (1.8%), B.1.1.306 (1.8%), AY12 (1.8%), and AY4 (1.2%). About 3.6% of the variants could not be assigned to any known Pango lineages (Figure 2). The total prevalence of Pango lineages among the different states and union territories of India is detailed in Table S3. It can be seen that two of the variants identified in the study period (B.1.617.2 [45%] and B.1.17 [8.6%]) are classified as VOC (Tracking SARS‐CoV‐2 Variants, n.d.).
TABLE 2

Pango lineages of SARS CoV‐2 variants in India

Pango lineageTotal no. obtained% Prevalence
B.1.617.219,61345.5036889239
B.1.617.142589.8788919308
B.1.1.737208.6306899912
B.130697.1203192427
B.1.3617274.0067746276
None15763.6564428565
B.1.36.2912202.8304951046
B.1.111842.7469722983
AY.127811.8119808826
B.1.1.2166571.5242912162
B.1.1.3066331.4686093453
AY.45401.2528420955
B.1.5402470.5730592548
B.1.5252410.5591387871
B.1.1.3262400.5568187091
B.1.617.32360.5475383973
B.1.36.82280.5289777737
B.1.3512260.5243376177
B.1.6181790.4152939539
AY.31420.3294510696
B.1.5601380.3201707577
B.1.6041380.3201707577
B.1.36.101370.3178506798
B.1.1.220910.2111270939
B.1.1.46900.2088070159
A750.1740058466
B.6750.1740058466
B.1.1.354730.1693656907
B.1.538680.1577653009
B.1.153670.155445223
B.1.533570.1322444434
AY.6530.1229641316
B500.1160038977
B.1.210500.1160038977
B.1.36.22500.1160038977
B.1.36.18490.1136838198
B.1.575460.1067235859
B.1.36.17430.099763352
B.6.6400.0928031182
AY.1370.0858428843
B.1.537320.0742424945
B.4.7310.0719224166
B.1.1.25290.0672822607
B.1.351.3290.0672822607
B.1.36.38280.0649621827
B.1.535260.0603220268
B.1.311250.0580019489
B.1.468250.0580019489
B.1.548230.053361793
B.1.1.44220.051041715
AY.11200.0464015591
B.1.1.101190.0440814811
B.1.195180.0417614032
A.23.1170.0394413252
B.1.177170.0394413252
B.1.36.16170.0394413252
B.1.2150.0348011693
B.1.369150.0348011693
B.1.469140.0324810914
B.1.1.318130.0301610134
B.4120.0278409355
B.1.1.201110.0255208575
B.1.333100.0232007795
B.1.459100.0232007795
B.1.462100.0232007795
P.2100.0232007795
B.1.1.17490.0208807016
B.1.36.3390.0208807016
B.1.36.3590.0208807016
B.1.42890.0208807016
B.1.16080.0185606236
B.1.22280.0185606236
B.1.46580.0185606236
A.2970.0162405457
B.1.1.27470.0162405457
B.1.1.41970.0162405457
B.1.1.43170.0162405457
B.1.36.1970.0162405457
B.1.52370.0162405457
B.1.55170.0162405457
B.1.14560.0139204677
B.1.52460.0139204677
B.1.63360.0139204677
AY.550.0116003898
B.1.1.19450.0116003898
B.1.1.28250.0116003898
B.1.1.550.0116003898
B.1.11150.0116003898
B.1.36.2450.0116003898
B.1.44150.0116003898
B.1.45650.0116003898
B.1.56950.0116003898
B.1.58250.0116003898
C.3650.0116003898
A.2140.0092803118
A.940.0092803118
AY.240.0092803118
B.1.1.1040.0092803118
B.1.1.29140.0092803118
B.1.20340.0092803118
B.1.22140.0092803118
B.1.36740.0092803118
B.1.39840.0092803118
B.1.47040.0092803118
B.1.56440.0092803118
A.2730.0069602339
AM.430.0069602339
B.1.1.30730.0069602339
B.1.1.39830.0069602339
B.1.11330.0069602339
B.1.14330.0069602339
B.1.177.1630.0069602339
B.1.214.230.0069602339
B.1.25830.0069602339
B.1.26030.0069602339
B.1.351.230.0069602339
B.1.39330.0069602339
B.1.43230.0069602339
B.1.466.130.0069602339
B.1.466.230.0069602339
B.1.52630.0069602339
B.1.9430.0069602339
A.220.0046401559
AE.220.0046401559
AY.1020.0046401559
AY.3.120.0046401559
B.1.1.13220.0046401559
B.1.1.19220.0046401559
B.1.1.23120.0046401559
B.1.1.25420.0046401559
B.1.1.31520.0046401559
B.1.1.32720.0046401559
B.1.1.35020.0046401559
B.1.1.35320.0046401559
B.1.1.36920.0046401559
B.1.1.40120.0046401559
B.1.1.40220.0046401559
B.1.1.41120.0046401559
B.1.1.46220.0046401559
B.1.1.52320.0046401559
B.1.1.820.0046401559
B.1.1.8320.0046401559
B.1.17020.0046401559
B.1.177.720.0046401559
B.1.20120.0046401559
B.1.24320.0046401559
B.1.258.2020.0046401559
B.1.30220.0046401559
B.1.34620.0046401559
B.1.36.2620.0046401559
B.1.36.3120.0046401559
B.1.36.720.0046401559
B.1.39520.0046401559
B.1.45320.0046401559
B.1.47620.0046401559
B.1.55720.0046401559
B.1.59620.0046401559
B.1.60920.0046401559
B.1.61920.0046401559
B.1.62920.0046401559
B.2920.0046401559
B.4020.0046401559
B.4910.002320078
C.1110.002320078
P.1.110.002320078
A.2.5.110.002320078
A.2210.002320078
A.2310.002320078
AE.410.002320078
AM.210.002320078
AY.710.002320078
B.1.1.110.002320078
B.1.1.11610.002320078
B.1.1.11710.002320078
B.1.1.12110.002320078
B.1.1.13510.002320078
B.1.1.14110.002320078
B.1.1.14810.002320078
B.1.1.15710.002320078
B.1.1.16110.002320078
B.1.1.16410.002320078
B.1.1.20010.002320078
B.1.1.21310.002320078
B.1.1.21410.002320078
B.1.1.22810.002320078
B.1.1.26210.002320078
B.1.1.26310.002320078
B.1.1.2710.002320078
B.1.1.2810.002320078
B.1.1.29410.002320078
B.1.1.30810.002320078
B.1.1.31010.002320078
B.1.1.31210.002320078
B.1.1.31710.002320078
B.1.1.3310.002320078
B.1.1.36410.002320078
B.1.1.36510.002320078
B.1.1.36610.002320078
B.1.1.37210.002320078
B.1.1.37410.002320078
B.1.1.37810.002320078
B.1.1.40610.002320078
B.1.1.41310.002320078
B.1.1.41610.002320078
B.1.1.43310.002320078
B.1.1.45210.002320078
B.1.1.48710.002320078
B.1.1.50010.002320078
B.1.1.52610.002320078
B.1.1.5710.002320078
B.1.1.71210.002320078
B.1.1.7410.002320078
B.1.1.7510.002320078
B.1.1.9710.002320078
B.1.11910.002320078
B.1.14910.002320078
B.1.15110.002320078
B.1.16410.002320078
B.1.177.1810.002320078
B.1.177.1910.002320078
B.1.177.410.002320078
B.1.177.8710.002320078
B.1.17910.002320078
B.1.18410.002320078
B.1.22510.002320078
B.1.23110.002320078
B.1.23210.002320078
B.1.23910.002320078
B.1.28910.002320078
B.1.34910.002320078
B.1.36.210.002320078
B.1.36.2810.002320078
B.1.36.3610.002320078
B.1.36.910.002320078
B.1.36210.002320078
B.1.37110.002320078
B.1.37810.002320078
B.1.38010.002320078
B.1.38210.002320078
B.1.39710.002320078
B.1.40910.002320078
B.1.41110.002320078
B.1.41310.002320078
B.1.42710.002320078
B.1.42910.002320078
B.1.43810.002320078
B.1.45010.002320078
B.1.46010.002320078
B.1.49610.002320078
B.1.52710.002320078
B.1.54510.002320078
B.1.56210.002320078
B.1.59410.002320078
B.1.60010.002320078
B.1.61710.002320078
B.1.62010.002320078
B.1.9310.002320078
B.1.9610.002320078
B.310.002320078
B.6.310.002320078
C.110.002320078
C.36.310.002320078
L.310.002320078
Q.110.002320078
R.110.002320078
TOTAL = 265 43102 100%
FIGURE 1

Dominant lineages in each of the states of India during January–August 2021. In the Indian map, the most prevalent lineages were shown. Green indicates B.1.617.2 (the delta variant), red indicates B.1.1.7 (alpha variant), and purple indicates B.1.36 lineage

FIGURE 2

Dominant Pango lineages in India during January–August 2021. Pango lineages whose prevalence was >1% were shown. The numbers in the outer circle refers to percentages of Pango lineages

Pango lineages of SARS CoV‐2 variants in India Dominant lineages in each of the states of India during January–August 2021. In the Indian map, the most prevalent lineages were shown. Green indicates B.1.617.2 (the delta variant), red indicates B.1.1.7 (alpha variant), and purple indicates B.1.36 lineage Dominant Pango lineages in India during January–August 2021. Pango lineages whose prevalence was >1% were shown. The numbers in the outer circle refers to percentages of Pango lineages

Clinical significance of the reported S mutations

Of the 24,260 mutations recorded in the S protein, mutations that possibly are clinically significant by playing a crucial role in altering the biological properties of the virus were analysed based on existing literature. We observed 70 mutations with >1% prevalence in the N‐terminal domain, 18 in the receptor binding domain (RBD), 8 in the C‐terminal domain (CTD), 4 in the signal peptide domain, 9 in HR‐1, 3 in fusion peptide region, and 1 in the transmembrane region of the S protein (Table S4). Mutations with a prevalence of >1% were not observed in the HR‐2 and cytoplasmic domains of S protein. All the clinically significant mutations (as per published literature) were recorded, and their percentiles were calculated to understand the significance of such mutations in the Indian context during the second wave of COVID‐19 in India. Of the 112 mutations recorded throughout the S protein with >1% prevalence in Indian populations, 15 (T19R, D63G, V70del, G142D, D377Y, L452R, T478K, E484Q, E484K, N501Y, A570D, D614G, Q677H, P681H, P681R) were known to be involved in increasing viral pathogenicity, 3 (E156G, F157del, R158del) were known to be vaccine breakthrough mutations, 1 mutation (Y144del) was known to be involved in the escape of the virus to neutralizing antibodies, and 1 (R385K) mutation was known to be involved in decreased virulence (Figure 3). The biological significance of 92 of the recorded mutations is not yet known. In India, 13 S protein mutations were correlated with the massive surge during the second wave of infections in April 2021; the most prevalent mutations included D614G, L452R, P681R, G142D, and Q1071H (Venkatakrishnan et al., 2021). It has been reported that B.1.617.1 and B.1.617.2 were the predominant circulating strains in the western part of India since January 2021, and that E484Q, D614G, and P681R were the common mutations of these strains (Cherian et al., 2021).
FIGURE 3

Spike mutations of >1% prevalence in India, January–August 2021. Percentage prevalence of mutations across the domains of S protein whose prevalence was >1% were shown. A specific colour was assigned for each respective category of amino acid mutations, increase of infectivity (pink), decrease the infectivity (yellow), vaccine breakthrough mutations(grey), escaping neutralizing antibodies (black), and those whose role is yet unknown are shown in turquoise. X‐axis refers to amino acid changes in S protein, Y‐axis refers to frequency of S protein mutations in percentages. SP: signal peptide sequence domain, RBD: receptor binding domain, CTD: C‐terminal domain, FP: fusion peptide domain, HR‐1: heptapeptide repeat domain, TM: transmembrane domain

Spike mutations of >1% prevalence in India, January–August 2021. Percentage prevalence of mutations across the domains of S protein whose prevalence was >1% were shown. A specific colour was assigned for each respective category of amino acid mutations, increase of infectivity (pink), decrease the infectivity (yellow), vaccine breakthrough mutations(grey), escaping neutralizing antibodies (black), and those whose role is yet unknown are shown in turquoise. X‐axis refers to amino acid changes in S protein, Y‐axis refers to frequency of S protein mutations in percentages. SP: signal peptide sequence domain, RBD: receptor binding domain, CTD: C‐terminal domain, FP: fusion peptide domain, HR‐1: heptapeptide repeat domain, TM: transmembrane domain

N‐terminal domain

N‐terminal domain of S protein in the Indian variants recorded 70 mutations with >1% prevalence, and most of these mutations were shown to be clinically significant. P77L mutation is dominant in the delta variants (Patchett et al., 2021), but the clinical significance is unknown. Y144del mutation was known to be involved in escaping neutralizing antibodies (R. Wang et al., 2021). E156G, F157del, G142D mutations were known to be involved in vaccine breakthrough infections (Mishra et al., 2021; Shen et al., 2021). In our study, these mutations are found at 17% prevalence in Indian population and are likely to be clinically significant as these mutations are involved in reduced sensitivity to existing vaccines (Liu et al., 2021). T19R mutation (found at 43.66% prevalence in the current study) is found in Delta plus variant, which is classified as a VOC (Jordan et al., 2021). D63G and V70del were reported to increase viral pathogenicity (X.‐J. Yang, [Link], [Link]). S26L, T77A, K81N, and V82A mutations observed in the Indian variants in our study are at significant prevalence percentages, but their role in the infectivity of SARS‐CoV‐2 has not been reported yet. Furthermore, L42F, P46L, S97I, F108L, L116F, D119V, F120L, P65S, and T183I are not reported yet as significant in terms of viral infectivity.

Receptor binding domain

Eighteen mutations were recorded in the receptor‐binding domain with >1% prevalence. The high percentile mutations in the receptor‐binding domain reported in our study are P323L, D377Y, T478K, L452R, A446V, N501Y, and E484K. L452R and T478K are mutations already reported in the Indian variants and are known to increase the infectivity of the virus with decreased binding to monoclonal antibodies (mAbs), which may affect their neutralization potential (Cherian et al., 2021). Mutations at L452R, D377Y reported in our study were known to increase viral infectivity in a worldwide scenario (Motozono et al., 2021). E484Q mutation was reported as a neutralization‐resistant mutation that can increase the viral infectivity (Verghese et al., 2021). The E484K substitution alone conferred resistance to several monoclonal antibodies (Ku et al., 2021) while N501Y mutation was reported to increase the interacting force between S protein's RBD and ACE2 receptor and was reported to increase viral infectivity (Santos & Passos, 2021). One mutation in the receptor‐binding domain R385K was reported to decrease the viral infectivity (Miljanovic et al., 2021).

C‐terminal domain

Seven mutations were recorded in the CTD (with >1% prevalence) and five of these mutations that is, A570D, D614G, Q677H, P681H, and P681R are known to increase virus infectivity. A570D increases virus infectivity by enhancing host receptor binding, and can also act as a deleterious immune escape mutation (X. Yang et al., 2020). D614G mutation was shown to increase virion S protein infectivity and density (Zhang et al., 2020), Q677H was shown to enhance viral infectivity and confer neutralizing antibody resistance (Zeng et al., 2021). P681H mutation was reported as a worldwide emerging mutation that increases the infectivity of SARS‐CoV‐2 (Maison et al., 2021). P681R was shown to have played a crucial role in the alpha to delta replacement and increase in infectivity and pathogenicity of the virus (Liu et al., 2021).

Fusion peptide domain

Three mutations with >1% prevalence, that is, T716I, T749I, A706S were recorded in the fusion peptide region; the role of these mutations in the pathogenicity of SARS CoV‐2 is yet unknown.

HR‐1 domain

Nine mutations were recorded in the HR‐1 region of the spike protein (with >1% prevalence). S982A mutation was reported in the antigenic sites of alpha, which might be involved in increasing viral infectivity (Y. Wang et al., 2021).

Transmembrane domain

Only one mutation, that is, P1228L was recorded in the transmembrane domain, and the clinical significance of this mutation is not reported yet.

Signal peptide domain

Four mutations were recorded in the signal peptide domain: S2P, D3L, D3Y, H11Q. The biological significance of all these mutations is not reported yet.

Structural significance of the reported mutations on the S protein

Among the 15 mutations that were reported to increase the infectivity, 4 mutations (T19R, D63G, V70 del, and G142D) were located on the N‐terminal domain of the spike protein, which is involved in the host cell attachment through diverse polysaccharide moieties. Most of the VOC have mutations in the N‐terminal domain to escape the neutralization (McCallum et al., 2021). T19R mutation is found on the surface patch targeted by most NTD‐neutralizing antibodies (Planas et al., 2021). D63G mutation is important as it is directly involved in the RNA binding (Dinesh et al., 2020). V70del is present in the prominent exterior loop of the spike and is required for the efficient cell entry and increases cleavage of S2 thus enhancing spike infectivity (Meng et al., 2021). SARS‐CoV‐2 variants with a mutation at G142D showed resistance to the mAbs (Suryadevara et al., 2021). Six mutations that increase infectivity were found in the RBD, which interacts with host receptor ACE2. L452R mutation increases spike stability and viral infectivity, thereby increasing viral infection (Motozono et al., 2021). Mutation T478K is located on the interface with ACE2. Amino acid change from threonine to lysine is known to be involved in enhancement of electrostatic potential of S protein and hence was predicted to affect the spike–ACE2 interaction (Giacomo et al., 2021). Mutation at E484 (either with Q or K) has shown higher binding affinity for the ACE2 cell receptor resulting in enhanced transmissibility (Augusto et al., 2022). Variant with N501Y substitution was found to increase the viral transmissibility; substitution with tyrosine allowing more interaction with ACE2 receptor may lead to higher binding affinity to the host cell receptor (Zhou et al., 2021). The CTD harbours A570D mutation, which is found to be more infectious in the pseudoviral assay. Structural analysis on A570D revealed that this mutation introduces the molecular switch for the opening and closing of RBD (X.‐J. Yang, [Link], [Link]). D614G mutation is more infectious and several studies indicated that this mutation is associated with increased viral load in the patients infected with COVID‐19 as D614G shifts S protein conformation towards an ACE2‐binding fusion state (Yurkovetskiy et al., 2020). It was demonstrated that Q677H mutation increases viral infectivity and syncytium formation (Zeng et al., 2021). Mutations P681R and P681H are located in the polybasic S1/S2 furin cleavage site and were reported to enhance the fusogenic activity of the spike protein (Tao et al., 2021). Three mutations in the N‐terminal domain, E156G, F157del, and R158del revealed less flexibility compared to the Wuhan strain. Docking studies of these mutants with monoclonal antibodies reported low binding affinity compared to the Wuhan strain; this may be a possible case of immune escape (Chaudhari et al., 2021). Y144 del has been found to abrogate binding to neutralizing antibodies (McCarthy et al., 2021). The initial contact between RBD of the S‐protein and ACE2 is through the peptidase domain (PD) (Adhikari et al., 2020). The possible binding mechanism is of pivotal importance and has vital implications for vaccine design. The presence of an unexpected furin cleavage site at the S1/S2 boundary of SARS‐CoV‐2 S, which is cleaved during biosynthesis, is a novel feature setting this virus apart from SARS‐CoV and SARSr‐CoVs (Walls et al., 2020). The boundary of the (uncoupled) N‐terminal region lies in physical proximity to the furin‐targeted motif RRAR, which is essential for pre‐activation of SARS‐CoV‐2 spike protein through proteolysis (Serapian et al., 2020). A second proteolytic cleavage at site S2′ releases the fusion peptide, which penetrates the host cell membrane, preparing it for fusion (Apellániz et al., 2014). S glycoproteins are densely decorated by heterogeneous N‐linked glycans protruding from the trimer surface. These oligosaccharides participate in S folding, affect priming by host proteases, and might modulate the antibody recognition (Walls et al., 2020). Glycan acts as a shield that protects epitope from antibody binding. Site‐specific glycan analysis of SARS‐CoV‐2 S protein was performed in which glycopeptides are generated using proteases and analysed by liquid chromatography–mass spectrometry; as a result, 22 N‐linked glycosylation sequons per protomer were determined (Watanabe et al., 2020). One study demonstrated that the simulated models of N‐glycan at position 165 and 234 act as RBD modulators on its binding with ACE2. On addition of N‐glycans, RBD is stabilized and the removal of N‐glycans showed a reduced ACE2 binding, which revealed conformational shift of RBD (Casalino et al., 2020). E484 is an immunodominant spike protein residue in RBD region with various substitutions (E484A, E484D, E484G, and E484K) (Harvey et al., 2021). E484K introduces a residue with a charge opposite to the wild‐type, which would significantly alter the electrostatic complementarity of antibody binding to this region (Andreano et al., 2021). Also, the same group in another study reported that mutation E484K can lead to a fourfold decrease in neutralization activity of convalescent plasma to SARS‐CoV‐2 (Andreano et al., 2021). ΔY144 deletion was found to alter the N3 NTD loop (140–156 residues) and can abolish the neutralizing effect of various antibodies (Harvey et al., 2021). An in‐silico study predicted that replacement of aspartic acid to glycine (D614G) leads to the loss of hydrogen bond interactions formed with valine at position 859, thereby eliminating the hydrogen bonding between S1 and S2 domain, leading to increased main‐chain flexibility enabling a more favourable orientation of Q613, possibly facilitating cleavage by TMPRSS2 by perturbing its affinity with the S1‐furin cleavage site (Raghav et al., 2020). One previous study performed an unbiased multi‐microsecond molecular dynamics of 7 glycosylated S‐protein variants derived from cryo‐EM structure 6VSB and applied a matrix of low coupling energy (MLCE)‐based approach to predict potential changes in immunogenic regions on each variant (Triveri et al., 2021). In this study, reference S structure used was the dominant D614G variant. According to the MLCE study, contiguous residues uncoupled from the S protein core shrink in number compared to D614G S; their experimental data also confirms that variant evades Abs binding to the shrunk or lost epitopes. The number of residues defining the epitope located in the long RBD loop (residues 417−503) is much lower in mutants 501Y.V2 (South African), B1.1.28 (gamma), and N439K. Also, they predict that N439K RBD forms a new interaction with the human ACE2 receptor (hACE2) and has an enhanced affinity for hACE2 (Triveri et al., 2021). Two different kinds of vaccines, COVAXIN ((a whole‐virion inactivated SARS‐CoV‐2 antigen [Strain: NIV‐2020‐7700]) and COVISHIELD (a recombinant, replication‐deficient chimpanzee adenovirus vector encoding the SARS‐CoV‐2 Spike (S) glycoprotein) have been in use in India since January 2021. Both these vaccines are based on the original variant of SARS‐CoV‐2. The second wave originated during March 2021 and lasted till August 2021. In this context, it is crucial to identify the various variants that were circulating and were responsible for the increased surge in the pandemic during 2021. Such knowledge can also help to understand vaccine breakthrough infections. For example, in our study, B.1.617.1 variant showed co‐occurrence of three critical mutations, L452R, E484Q, and P681R in the S protein. E484K, T95I, and Y144del were identified as vaccine breakthrough mutations (Hacisuleyman et al., 2021; Wu et al., 2021). Variants including A475V, L452R, V483A, and F490L were known to be involved in the putative antibody resistance (Q. Li et al., 2020). It is alarming that these mutations were also identified in the present study with significant prevalence rates: E484K (2.03%), T95I (11.5%), Y144del (5.86%), L452R (30.25%). To conclude, we have identified the total number of circulating SARS‐CoV‐2 lineages reported during the second wave of the COVID‐19 pandemic in India. Further, the S mutations associated with the rise in COVID‐19 cases were investigated in detail. Importantly, we have identified 15 mutations that were known to be involved in increasing viral pathogenicity (T19R, D63G, V70del, G142D, D377Y, L452R, T478K, E484Q, E484K, N501Y, A570D, D614G, Q677H, P681H, P681R); 3 that were known to be vaccine breakthrough mutations (E156G, F157del, R158del); and 1 mutation that was known to escape neutralizing antibodies (Y144del). Since new variants will continue to emerge, it is crucial to identify and monitor mutations in the S protein, and any functionally significant co‐occurring modifications. Going forward, experimental assessment of SARS‐CoV‐2 S protein mutations can be beneficial from a public health point of view to understand vaccine breakthrough infections, and to devise better preventive strategies by individual nations.

CONFLICT OF INTEREST

The authors declare no conflict of interest.

ETHICS STATEMENT

The authors confirm that the ethical policies of the journal, as noted on the journal's author guidelines page, have been adhered to. No ethical approval was required as this article only uses whole genome sequences deposited on GISAID portal that are publicly available with no original research data.

AUTHOR CONTRIBUTIONS

Radhakrishna Muttineni conceptualized the study. Binitha R.N., Jaslin Panyam, Aravind Vemula, Shashi Mohan Singh, Subin Balachandran, and Sandra K.P. collected data and did the preliminary analysis. Radhakrishna Muttineni, Binitha R.N., Kalyani Putty, Kavitha Marapakala, Viroji Rao S.T., and Anand Kumar Kondapi analysed the data and wrote the manuscript. Supplementary table 1: Details of 43102 SARS CoV‐2 whole genome sequences used in the study. Click here for additional data file. Supplementary table 2: Details of 24260 S protein mutations identified in the study Click here for additional data file. Supplementary table 3: Individual state‐wide distribution of SARS CoV‐2 pango lineages in India Click here for additional data file. Supplementary table 4: Spike mutations with >1% prevalence across the domains Click here for additional data file.
  46 in total

1.  Site-specific glycan analysis of the SARS-CoV-2 spike.

Authors:  Yasunori Watanabe; Joel D Allen; Daniel Wrapp; Jason S McLellan; Max Crispin
Journal:  Science       Date:  2020-05-04       Impact factor: 47.728

2.  Analysis of Indian SARS-CoV-2 Genomes Reveals Prevalence of D614G Mutation in Spike Protein Predicting an Increase in Interaction With TMPRSS2 and Virus Infectivity.

Authors:  Sunil Raghav; Arup Ghosh; Jyotirmayee Turuk; Sugandh Kumar; Atimukta Jha; Swati Madhulika; Manasi Priyadarshini; Viplov K Biswas; P Sushree Shyamli; Bharati Singh; Neha Singh; Deepika Singh; Ankita Datey; Kiran Avula; Shuchi Smita; Jyotsnamayee Sabat; Debdutta Bhattacharya; Jaya Singh Kshatri; Dileep Vasudevan; Amol Suryawanshi; Rupesh Dash; Shantibhushan Senapati; Tushar K Beuria; Rajeeb Swain; Soma Chattopadhyay; Gulam Hussain Syed; Anshuman Dixit; Punit Prasad; Sanghamitra Pati; Ajay Parida
Journal:  Front Microbiol       Date:  2020-11-23       Impact factor: 5.640

Review 3.  Domains and Functions of Spike Protein in Sars-Cov-2 in the Context of Vaccine Design.

Authors:  Xuhua Xia
Journal:  Viruses       Date:  2021-01-14       Impact factor: 5.048

4.  SARS-CoV-2 spike E156G/Δ157-158 mutations contribute to increased infectivity and immune escape.

Authors:  Tarun Mishra; Rishikesh Dalavi; Garima Joshi; Atul Kumar; Pankaj Pandey; Sanjeev Shukla; Ram K Mishra; Ajit Chande
Journal:  Life Sci Alliance       Date:  2022-03-16

5.  Intra- and intermolecular atomic-scale interactions in the receptor binding domain of SARS-CoV-2 spike protein: implication for ACE2 receptor binding.

Authors:  Puja Adhikari; Neng Li; Matthew Shin; Nicole F Steinmetz; Reidun Twarock; Rudolf Podgornik; Wai-Yim Ching
Journal:  Phys Chem Chem Phys       Date:  2020-08-05       Impact factor: 3.676

6.  The First Molecular Characterization of Serbian SARS-CoV-2 Isolates From a Unique Early Second Wave in Europe.

Authors:  Danijela Miljanovic; Ognjen Milicevic; Ana Loncar; Dzihan Abazovic; Dragana Despot; Ana Banko
Journal:  Front Microbiol       Date:  2021-06-18       Impact factor: 5.640

7.  Beyond Shielding: The Roles of Glycans in the SARS-CoV-2 Spike Protein.

Authors:  Lorenzo Casalino; Zied Gaieb; Jory A Goldsmith; Christy K Hjorth; Abigail C Dommer; Aoife M Harbison; Carl A Fogarty; Emilia P Barros; Bryn C Taylor; Jason S McLellan; Elisa Fadda; Rommie E Amaro
Journal:  ACS Cent Sci       Date:  2020-09-23       Impact factor: 14.553

8.  Analysis of SARS-CoV-2 variant mutations reveals neutralization escape mechanisms and the ability to use ACE2 receptors from additional species.

Authors:  Ruoke Wang; Qi Zhang; Jiwan Ge; Wenlin Ren; Rui Zhang; Jun Lan; Bin Ju; Bin Su; Fengting Yu; Peng Chen; Huiyu Liao; Yingmei Feng; Xuemei Li; Xuanling Shi; Zheng Zhang; Fujie Zhang; Qiang Ding; Tong Zhang; Xinquan Wang; Linqi Zhang
Journal:  Immunity       Date:  2021-06-08       Impact factor: 31.745

9.  The Antigenicity of Epidemic SARS-CoV-2 Variants in the United Kingdom.

Authors:  Jiajing Wu; Li Zhang; Yue Zhang; Haixin Wang; Ruxia Ding; Jianhui Nie; Qianqian Li; Shuo Liu; Yongxin Yu; Xiaoming Yang; Kai Duan; Xiaowang Qu; Youchun Wang; Weijin Huang
Journal:  Front Immunol       Date:  2021-06-17       Impact factor: 7.561

View more
  1 in total

1.  SARS-CoV-2 variants and spike mutations involved in second wave of COVID-19 pandemic in India.

Authors:  Radhakrishna Muttineni; Binitha R N; Kalyani Putty; Kavitha Marapakala; Sandra K P; Jaslin Panyam; Aravind Vemula; Shashi Mohan Singh; Subin Balachandran; Viroji Rao S T; Anand Kumar Kondapi
Journal:  Transbound Emerg Dis       Date:  2022-03-24       Impact factor: 4.521

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.