Literature DB >> 35328105

Coronavirus Genomes and Unique Mutations in Structural and Non-Structural Proteins in Pakistani SARS-CoV-2 Delta Variants during the Fourth Wave of the Pandemic.

Muhammad Zeeshan Anwar1, Madeeha Shahzad Lodhi1, Muhammad Tahir Khan1, Malik Ihsanullah Khan1, Sumaira Sharif1.   

Abstract

Genomic epidemiology of SARS-CoV-2 is imperative to explore the transmission, evolution, and also pathogenicity of viruses. The emergence of SARS-CoV-2 variants of concern posed a severe threat to the global public health efforts. To assess the potential consequence of these emerging variants on public health, continuous molecular epidemiology is of vital importance. The current study has been designed to investigate the major SARS-CoV-2 variants and emerging mutations in virus structural and non-structural proteins (NSP) during the fourth wave in September 2021 from the Punjab province of Pakistan. Twenty SARS-CoV-2 positive samples have been collected from major cities were subjected to next-generation sequencing. Among the 20 whole genomes (GenBank Accession SRR16294858-SRR16294877), 2 samples failed to be completely sequenced. These genome sequences harbored 207 non-synonymous mutations, among which 19 were unique to GISAID. The genome sequences were detected: Delta 21I, 21J variants (B.1.617.2). Mutation's spike_F157del, spike_P681R, spike_T478K, spike_T19R, spike_L452R, spike_D614G, spike_G142D, spike_E156G, and spike_R158del have been detected in all samples where K1086Q, E554K, and C1250W were unique in spike protein. These genomic sequences also harbored 129 non-synonymous mutations in NSP. The most common were NSP3_P1469S (N = 17), NSP3_A488S (N = 17), NSP3_P1228L (N = 17), NSP4_V167L (N = 17), NSP4_T492I (N = 17), NSP6_T77A (N = 17), NSP14_A394V (N = 17), NSP12_G671S (N = 18), and NSP13_P77L (N = 18). The mutation, F313Y in NSP12, detected in the current study, was found in a single isolate from Belgium. Numerous other unique mutations have been detected in the virus papain-like protease (NSP3), main protease (NSP5), and RNA-dependent RNA polymerase (NSP12). The most common non-synonymous mutations in the spike protein were subjected to stability analysis, exhibiting a stabilizing effect on structures. The presence of Delta variants may affect therapeutic efforts and vaccine efficacy. Continuous genomic epidemiology of SARS-CoV-2 in Pakistan may be useful for better management of SARS-CoV-2 infections.

Entities:  

Keywords:  NSP; Pakistan; SARS-CoV-2; genome; mutations; variants

Mesh:

Year:  2022        PMID: 35328105      PMCID: PMC8951394          DOI: 10.3390/genes13030552

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Background

The deadly SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus-2) posed a major effect on public health, disrupting the global healthcare system. SARS-CoV-2 has the ability to exhibit a high mutation frequency due to the presence of single-stranded RNA, posing major issues to public health. Molecular epidemiology is required to study the virus evolutionary stages in specific geographic locations and also to establish how the virus may affect the vaccine efficacy and drug response. The virus genome encodes 4 structural and 16 non-structural proteins (NSP) [1]. Variations have been reported, occurring very quickly, displacing the virus ancestral strains [2,3,4,5]. At population level, the virus rapid rates of transmission offered an advantage to its replication. The D614G mutation in spike (S) was one of the first discovered, improves viral infectivity, and shifts the spike (S) protein conformation toward binding a fusion-competent state [2,6,7]. Inside the virus S protein, all the variants of concern (VOC) harbored numerous mutations, in the receptor-binding domain (RBD) and in the N-terminus. These VOCs include α (B.1.1.7) (first reported in UK), β (B.1.351) (South Africa), and γ (P.1) in Brazil [4,5,6,7,8]. The Delta variant (B.1.617.2) in India, harbored numerous mutations in RBD SARS-CoV-2 isolates in October 2020 [9]. The latest variant, Omicron (B.1.1.529), was first reported from South Africa on 24 November 2021. Among the several highly contagious VOCs, a large number of mutations have been detected. The B.1.1.7 lineage has been thought to be 40–80% more contagious [10,11]. However, as compared to P.1 and B.1.351, Delta is exhibiting more transmissible potency than the earlier ones. The VOCs have greater potential in terms of pathogenicity, virulence, and transmission rate, and also exhibit lower antibody neutralization sensitivity [10,12]. Multiple mutations in the S protein and other critical genomic regions of SARS-CoV-2 are resulting in reduced vaccinal response and treatment efficacy. The E484K mutation, present in the receptor-binding ridge, has been detected in S protein in multiple lineages. This mutation has been shown to be decreasing the virus binding to polyclonal sera [13] and protects the virus against treatment with the monoclonal antibodies [14] (p. 19). The P681H mutations have been found in the B.1.1.7, B.1.1.318, and P.3, whereas mutation P681R has been found in the A.23.1 lineages and all B.1.617 variants. The P681H and P681R improve S protein fusion to the host cell [15,16]. The P681H and D614G mutations have been thought to be responsible for the B.1.1.7 increased transmissibility [17]. Among the different SARS-CoV-2 variants, B.1.617 lineage was discovered in India [18,19,20]. This lineage includes three primary subtypes (B1.617.1, B.1.617.2, and B.1.617.3), each of which has mutations in the S protein’s N-terminal domain (NTD) and receptor-binding domain (RBD), possesses the ability to boost the immune evasion capability. The Delta variant is thought to spread more quickly than other variants. The B.1.617.1 is characterized by mutations L452R, P681R, and E484Q in S, whereas the Delta variant is characterized by the presence of mutations L452R, P681R, and T478K in S. The L452R and T478K enhance S protein’s binding affinity with human angiotensin-converting enzyme 2 (ACE2) receptor [21,22]. To ensure a better public health policy, it is critical to continuously monitor and identify the rapidly evolving variations in circulating isolates of SARS-CoV-2 in populations. The current study was designed to investigate the major circulating VOC and also the emerging mutations in structural proteins and NSP among the SARS-COV-2 patients from major cities in the Punjab province of Pakistan. Twenty SARS-CoV-2 positive samples have been collected and whole-genome sequencing was performed through Ion Torrent next-generation technology. All the samples harbored numerous mutations in NSP and structural proteins, including S protein.

2. Materials and Methods

2.1. Ethical Approval

The ethical clearance for the current study was obtained from the Ethical Review Committee for Medical and Biomedical Research at The University of Lahore (IMBB/UOL/21/1379).

2.2. Area of Sample Collection

The nasopharyngeal swab method was used to collect samples from COVID-19 infected individuals, as recommended by the World Health Organization (WHO) [23]. All the samples were collected for whole genome sequencing during the fourth wave in September 2021 from major cities in Punjab province, including Lahore, Sialkot, Okara, and Pak Pattan.

2.3. Processing of Samples

The proper specimen collection procedure under laboratory testing for coronavirus disease (COVID-19) interim guidance by WHO was followed for SARS-CoV-2 samples collection and processing. The nasopharyngeal swabs were collected in 2 mL of vial transport medium (LinkGen, Taizhou, Jiangsu, China) and placed at 4 °C for further use. All these samples were confirmed with TaqPath™ COVID-19 CE-IVD RT-PCR Kit (Thermo Fisher, Waltham, MA, USA).

2.4. RNA Extraction, Quantification and cDNA SYNTHESIS

RNA extraction was performed from nasopharyngeal swabs using a standard volume of 200 ul by MagMAX™ Viral/Pathogen Nucleic Acid Isolation Kit (Thermo Fisher, Waltham, MA, USA) and (Applied Biosystem, Waltham, MA, USA). The extracted RNA was stored at −80 °C for downstream application. The real-time quantification and copy number determination of nucleic acid was performed using TaqPath™ 1-Step RT-qPCR Master Mix (Thermo Fisher, Waltham, MA, USA) and the relevant copy number was determined according to the CT and control reactions in quantification. Only samples with relevant CT of 18–28 values were chosen. Using a minimum input amount of 10 ng/uL of the RNA sample, RNA to cDNA reversed transcription was performed with the SuperScript™ VILO™ cDNA Synthesis Kit (Thermo Fisher, Waltham, MA, USA).

2.5. Library Preparation

The library was prepared manually using Ion AmpliSeq™ Library Kit (Thermo Fisher, Waltham, MA, USA). Amplification was performed with a 2-pool Ion AmpliSeq™ SARS-CoV-2 Research Panel (Thermo Fisher, Waltham, MA, USA). The library was partially digested and ligated with Ion Xpress™ Barcode Adapters 1–96 Kit (Thermo Fisher, Waltham, MA, USA). Library purification and quantification steps were carried out with GenDx-AMPure® XP beads (GENDX, Utrecht, The Netherlands) and TaqMan™ Quantitation Kit (Thermo Fisher, Waltham, MA, USA). The libraries were diluted to a final concentration of 40 pM, loaded on Thermo Fisher Scientific Ion Chef™ Instrument for PCR, and loaded on Ion S5 530 and 510 (Thermo Fisher, Waltham, MA, USA).

2.6. Whole Genome Sequencing and Data Analysis

Sequencing was accomplished on Ion GeneStudio™ S5 System at LabGenetix, Lahore, Pakistan. The fastQ base sequence file quality was assessed using the FastQC (v0.11.8). Trimmomatic tool (v0.39) was used to remove the low-quality reads (Q < 30) and index adapter sequences were utilized to improve sample multiplexing. Sequenced reads were aligned with the reference (NC 045512, using Burrows Wheeler Aligner (BWA, v0.6). The duplicated PCR reads were removed with Picard Tools (v2.21.6). Mapping problems due to the presence of small Indels were removed with a Genome Analysis Toolkit, “RealignerTargetCreator” and “InDelRealigner” (GATK v. 3.3.0) to analyze the mapped read. To improve the accuracy of variant calling, GATK tool “HaplotypeCaller” was applied for realignment of sequenced reads via local de-novo assembly of haplotypes in the regions showing variation. All the 20 whole genome sequences in fasta format were aligned with reference (NC_045512) using CoVsurver application (https://www.gisaid.org/epiflu-applications/covsurver-mutations-app/ (accessed on 28 September 2021)) on 15–20 September 2021, collected during the fourth wave of infection. The CoVsurver research tool has been developed with GISAID (Global initiative on sharing all influenza data), aiding researchers to identify and interpret the amino acids (aa) changes in coronavirus genomes. The server rapidly aligns the query genome sequences in fasta format with reference SARS-CoV-2 and screen coronavirus genomes for aa changes to identify any special epidemiological relevance. All mutations in structural proteins of SARS-CoV-2 were separated and arranged in the form of excel sheets. The statistical analysis was performed using EpiData Analysis [24] to analyze various non-synonymous mutations in virus structural and NSP.

2.7. Mutations Effect on Virus Structural Proteins

Some of the most common mutations were analyzed for their thermodynamic effect on S protein through DynaMut server [25]. The server implements mutation effect, which can be used to analyze the protein stability and structural flexibility upon point mutation. The server also measures the vibrational entropy changes and the impact of a mutation with graph-based signatures (p-value < 0.001) along with a good resolution of results. To compute the SARS-CoV-2 wild type and mutant protein stability and flexibility, the PDB file of proteins were retrieved from the Protein Data Bank [26] and uploaded to the DynaMut server and a point mutation was inserted at specific site. The impact in the form of total energies and vibrational entropy energies between wild type and mutants was recorded. The high-resolution structures of wilt type and mutant S proteins were retrieved for further processing. In DynaMut, changes upon point mutation on free energy of protein folding, combining the impacts of mutation stability of protein and also the dynamic properties were computed by ENCoM, Bio3D, and DUET computational servers, generating a more robust predictor of energies.

2.8. Phylogenetic Analysis

Genomic sequences in the current study were subjected to MAFT server [27] and Nextstrain [28] for phylogenetic analysis. These are public servers, containing a pipeline for analysis, and visualization, presenting a real-time view into the evolution of viral pathogens.

3. Results

3.1. SARS-CoV-2 Patient Information

Twenty nasopharyngeal swab samples were collected during the fourth wave of the pandemic in September 2021 from SARS-CoV-2 patients (Table 1). Among these samples, 13 were collected from male suspect and seven from female. All the SARS-CoV-2 patients developed clear signs and symptoms, including fever, cough, headache, fatigue, and also significant loss of smell and taste. Eleven patients were found in age category 1 (20–40), seven were in category 2 (41–60) and two were 65 years old.
Table 1

SARS-CoV-2 patients’ information.

Sample IDGender *Age (Years)Location
1F20Okara
2M33Sialkot
3F65Sialkot
4M26Sahiwal
5M31Nankana
6M26Shakot
7M60Pak Pattan
8F25Okara
9M24Okara
10M28Pak Pattan
11F43Okara
12M46Pak Pattan
13F42Pak Pattan
14M53Pak Pattan
15M30Okara
16M65Okara
17F21Lahore
18F42Lahore
19M42Lahore
20M32Okara

* F: female, M: male.

3.2. Whole Genome Sequences

Among the 20 samples (GenBank Accession No. SRR16294858-SRR16294877), samples 7 and 10 (Table 2) (Accession No. SRR16294864 and SRR162948667) were not completely sequenced. The genomic sequences were submitted to NCBI BioProject PRJNA770504 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA770504 (accessed on 28 September 2021)). The average length of sequenced genomes was 29,586 nucleotides (9691 amino acids). The sequence information, including length and clade, is shown in Table 2.
Table 2

Genome length and mutation’s pattern in SARS-CoV-2 sample.

SampleLength #(nt)Length #(aa)Muts #Muts %Unique Muts
129,8229705370.38%2
229,8119705340.35%0
329,8179705380.39%0
429,8239705310.32%0
529,8229705370.38%0
629,7979705330.34%0
829,7749705360.37%1
929,8279705320.33%0
1129,8249705310.32%1
1229,8289705350.36%0
1329,7029669690.71%0
1429,8289705280.29%1
1529,8269705340.35%0
1629,8269705310.32%0
1729,7379705370.38%2
1829,8219705390.40%0
1929,8279705360.37%2
2029,3839650981.02%10

# Mut: mutations, nt: nucleotide, aa: amino acid.

3.3. Unique Mutations in Structural Proteins

All the genomic sequences harbored 207 different non-synonymous mutations, including indel, among which 19 were unique to GISAID (S1). All the genomic samples were of GK clade. Mutations E554K (S1 domain of S), K1086Q, and C1250W (S2 domain of S) (Figure 1) were unique to S protein, and in GISAID, present in samples 1, 11, and 19. The remaining unique mutations in all samples are provided in the Supplementary File S1.
Figure 1

Domain organization of structural proteins. ORF: open reading frame, E: envelope, M: membrane, N: nucleocapsid, SP: signal peptide, NTD: N-terminal domain, RBD: receptor-binding domain, RBM: receptor-binding motif, FP: fusion peptide, HR: heptapeptide repeat sequence, TM: transmembrane, CP: cytoplasmic domain, SR: serine rich, CTD: C-terminal domain.

3.4. SARS-CoV-2 Variants

All the samples were detected as Delta variants. Mutation details in S and other structures are provided in Supplementary Files S1–S3. These sequences harbored diverse kinds of mutations in the N-terminal and RBD of S protein that may upsurge the immune evasion possibility of Delta variant.

3.5. Mutations in Spike and Other Structural Proteins

Among the 207 mutations, 26 were detected in S protein (Supplementary File S2). The most common non-synonymous and indel mutations, present in all complete genome samples were spike_T478K, spike_T19R, spike_L452R, spike_F157del, spike_E156G, spike_P681R, spike_D614G, spike_R158del, and spike_G142D (Table 3). Some of these mutations were present in receptor-binding domain (RBD) of S protein (Figure 1). Mutations, spike_D950N and spike_T95I were present in twelve samples each.
Table 3

Mutational frequency in structural proteins.

MutationNo. of Samples/Frequency
Spike_T478K18
Spike_T19R18
Spike_L452R18
Spike_F157del18
Spike_E156G18
Spike_P681R18
Spike_D614G18
Spike_R158del18
Spike_G142D18
N_D377Y18
N_D377Y18
N_R203M18
M_I82T18
N_G215C17
Spike_D950N12
Spike_T95I12
Similarly, nucleocapsid (N) also harbored nine different kinds of mutations (Supplementary File S2). Among these, the most common were N_D377Y, N_R203M, and N_D63G have been detected in all 18 samples (Table 3). The D63G was present in N-terminal domain (NTD) of N protein, which is also called RNA-binding domain (RBD) (Figure 1 and Figure 2). Two mutations, N_R203M and N_G215C, were present in SR-Linker region of N proteins and one D377Y in C-terminal domain (CTD). No mutation was detected in the envelope the protein where the membrane (M) protein harbored only two non-synonymous mutations (I82T (N = 18) and V70F (N = 1)) in the transmembrane domain.
Figure 2

Location of most common mutations in S protein. Yellow indicates the receptor-binding motif, present in the RBD region (raspberry color). The majority of these mutations were found in the loop regions.

Delta variant harbors L452R, P681R, and T478K in S protein. These mutations, along with others, present in all genomic samples, exhibited a stabilizing effect on the S protein structure (Figure 3 and Figure 4), and may demonstrate a good binding affinity towards human ACE2 SARS-CoV-2 receptor protein. Phylogenetically, all the isolates were 21I and 21J, subclades of Delta variants (Figure 5).
Figure 3

Mutation effect on S protein structure stability and flexibility. Mutants G142D, T478K, E156G, and L452R in S protein, exhibited stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.

Figure 4

Mutation effect on S protein stability and flexibility. Mutants P681R and T19R in S protein, exhibited a stabilizing effect. The blue region shows rigidification of structure behind mutations and red shows gain in flexibility.

Figure 5

SARS-CoV-2 genomic epidemiology from Pakistan. The graph shows an estimate of divergence over the time period within genomes. (A) Current study isolates (labelled Pakistan Delta isolates). (B) Position of current SARS-CoV-2 isolates in radial tree (UOL) was built using Nextstrain 28 (Nextclade (nextstrain.org). Sub-clades have been color coded in the tree, characterized by some specific mutations in structural proteins (https://covariants.org/variants) (accessed on 28 September 2021).

The spike_G142D mutant exhibited a little destabilizing affect (0.206 kcal/mole when compared with T478K (0.584 kcal/mol), E156G (0.049 kcal/mol), L452R (0.059 kcal/mol), P681R (0.503 kcal/mol), and T19R (0.403 kcal/mol).

3.6. Unique Mutations in Structural Proteins

The sequences harbored 129 mutations in NSP in which 13 were unique to GISAID (Table 4) (Supplementary File S3). The highest number of unique mutations were detected in NSP3 (N = 10), followed by NSP5 (V86L), NSP12 (F313Y), and NS3 (I118L). The remaining unique mutations in the samples are provided in Supplementary File S3.
Table 4

Unique mutations in NSP and NS3 SARS-CoV-2 isolates from Punjab province, Pakistan.

Sample IDUnique Mutations
20NSP3_V1388A
20NSP3_W1498S
20NSP3_S1495N
20NSP3_Y1535V
20NSP3_S1534R
20NSP3_K1497H
20NSP3_S1494G
20NSP3_D1499P
20NSP12_F313Y *
19NSP5_V86L *
14NS3_I118L
8NSP3_D339Y *
1NSP3_T1303P

* NSP3; Papain-like protease (PLpro), NSP5; Main protease (Mpro), NSP12; RNA-dependent RNA polymerase.

3.7. Mutations in NSP

The mutation details in NSP are shown in Table 5. In SARS-CoV-2, NSP3 is a large multidomain protein of 1945 amino acids containing a papain-like protease (PLpro) domain (aa746-1060). PLpro, an essential component of the virus replication transcription complex, is highly conserved, present between unique and a nucleic acid-binding domains. Among the unique mutation, NS3_I118L was detected in 14 isolates (Table 4).
Table 5

Mutations in NSP of SARS-CoV-2 circulating isolates in Punjab province.

MutationCountMutationCountMutationCountMutationCount
NSP12_G671S #18NSP3_E391D1NSP3_F1516del1NSP14_I231del1
NSP13_P77L18NSP3_M1529del1NSP3_I1514del1NSP14_V236del1
NSP3_P1469S17NSP3_S1494G1NSP3_L1511del1NSP14_V136I1
NSP3_A488S17NSP3_I1528del1NSP3_L1505del1NSP14_W247del1
NSP3_P1228L17NSP3_F1503del1*NSP3_H920Y1NSP14_P203L1
NSP4_T492I17NSP3_V1522del1NSP3_L1525del1NSP14_G248S1
NSP4_V167L17NSP3_F1510del1NSP3_W1498S1NSP14_S221del1
NSP6_T77A17NSP3_F1519del1NSP3_L1531del1NSP14_Q246del1
NSP14_A394V17NSP3_A1512del1NSP3_L1500del1NSP14_S230del1
NSP12_P323L16NSP3_A1526del1NSP4_A446V1NSP14_I242del1
NSP14_M72I6NSP3_G1524del1NSP4_I377M1NSP14_Y235del1
NSP3_S1370F3NSP3_K1497H1NSP5_S254F1NSP14_C216del1
NSP3_S1285F2NSP3_D339Y1NSP5_V86L1NSP14_T223del1
NSP5_P184T2NSP3_A1507del1NSP6_M92V1NSP14_D243del1
NSP1_H83Y1NSP3_Y1521del1NSP6_T181I1NSP14_T219del1
NSP2_L271F1NSP3_Y1535V1NSP6_V149A1NSP14_M241del1
NSP2_E373K1NSP3_A1527del1NSP10_A20S1NSP14_P239del1
NSP2_D315N1NSP3_Q1530del1NSP12_A16V1NSP14_W227del1
NSP2_S591N1NSP3_S1534R1NSP12_A185V1NSP14_A225del1
NSP2_A306V1NSP3_F1533del1#NSP12_L829F1NSP14_F240del1
NSP2_K81N1NSP3_T1517del1NSP12_S318T1NSP14_Q245del1
NSP3_S1495N1NSP3_R1518del1NSP12_F317Y1NSP14_C226del1
NSP3_L1523del1NSP3_Y1513del1#NSP12_M380I1NSP14_H228del1
NSP3_S1699F1NSP3_W1509del1NSP12_Q357H1NSP14_T215del1
NSP3_E1508del1NSP3_D1499P1NSP12_F313Y1NSP14_G232del1
NSP3_L862F *1NSP3_S211G1NSP14_D222del1NSP14_A220del1
NSP3_P822L *1NSP3_V1506del1NSP14_F233del1NSP14_R213del1
NSP3_A465V1NSP3_F1520del1NSP14_V244del1NSP14_N238del1
NSP3_T1501del1NSP3_V1388A1NSP14_F217del1NSP14_S218del1
NSP3_F1532del1NSP3_A644S1NSP14_D234del1NSP14_Y224del1
NSP3_G1504del1NSP3_L1515del1NSP14_R212del1NSP14_Y237del1
NSP3_A1502del1NSP3_T1303P1NSP14_H229del1NSP14_A214del1

del: deletion, RdRp: RNA-dependent RNA-polymerase, NSP: non-structural protein. * Mutations in PLpro domain. # Mutations in RdRp domain of NSP12.

The highest frequency of mutations was detected in virus NSP3 (Table 2) in which P1469S (N = 17), A488S (N = 17), and P1228L (N = 17) were the most common. The majority of these mutations were detected in the C-terminal domain (CTD) of NSP3. Deletions with a single frequency in amino acid regions 1500 to 1533 were also observed at CTD. However, the PLpro domain (746–1060) seems highly conserved (Figure 6). Mutations with a single frequency at position L862F, P822L, and H920Y were detected in the PLpro domain of NSP3 (Figure 6D). NSP2, which modulates the host cell survival signaling pathway, also harbored six non-synonymous mutations with a very low frequency. NSP4 harbors four mutations in which V167L in NTD and T492I in CTD were present in 17 isolates.
Figure 6

Structure of NSP12 and NSp3 (PLpro). (A) SARS-CoV-2 RdRp (PDB ID: 6M71) contains an N-terminal β-hairpin (residues 31–50). NiRAN (residues 50–249), interface domain (residues 251–365). The NiRAN domain is comprised of three helices and five β-strands, associated with RdRp domain (residues 366–920). (B) Complex of NSP12, NSP7, and NSP8. (C) Organization of NSP3 protein. (D) Structure of PLpro with mutations labeled NSP3_P822L (P77L), NSP3_L862F (L117F), and H920Y (H175y). PLpro has a small N-terminal ubiquitin-like (Ubl) domain and catalytic domain (thumb–palm–fingers).

NSP5 (main protease), harbored three non-synonymous mutations in a very low frequency. Four mutations were detected in NSP6 in which T77A was detected with the highest frequency (N = 17). Virus NSP12, which is also called RdRp (RNA-dependent RNA polymerase), plays an essential role in replication. Mutations NSP12_G671S (N = 18) and NSP12_P323L (N = 16) were the most common, present in the interface and RdRp domain (Figure 6). A single mutation P77L in NSP13 was detected in all genomic isolates (N = 18). Forty-one mutations were detected in NSP14, including five non-synonymous mutations in which NSP14_A394V (N = 17) NSP14_M72I (N = 6) were the most common.

4. Discussion

The recent emergence of different SARS-CoV-2 variants posed severe health issues. The α is one of the identified variants that first emerged in the UK and became one of the predominant lineages worldwide. Pakistan reported its first case with α variant in December 2020 [29], triggered S protein, and rapidly spread, leading to the third wave in Pakistan [30]. Molecular epidemiology is essential for better management of viral diagnosis and treatment. However, limited genomic data are available on other genetic lineages from all provinces of Pakistan. Further, frequency of mutations in all target’s proteins, including NSP, has not been investigated properly. Surveillance of the population mainly depends on characterizing the differences between reported, emerging, and non-VOC lineages in order to design efficient diagnostic methods, proper therapeutics, and the designing of effective vaccines. Among the different variants, the Delta variant of SARS-CoV-2 is more deadly and highly transmissible with severe disease signs and symptoms. This variant has been detected in 77 countries as of 24 June 2021 [31,32]. The UK has faced drastic effects from public health measures due to the Delta variant. To prevent the rapid transmission of the virus in the population, there is an urgent need to design more integrated molecular diagnostic systems with the implication for strict rules of quarantines for international travelers. In the current study, all the genomic samples were detected as Delta variants when analyzed on GISAID CoV-Surver on 15 to 20 September 2021, collected during the fourth wave of infection. Mutations T478K, T19R, L452R, F157del, E156G, P681R, D614G, R158del, and G142D were the most common, present in all genomics isolates. The T478K mutation present in RBD of S protein is involved in interaction with human ACE2 [33,34]. The T478K is unique to the SARS-CoV2 Delta variant, present in the epitope region of potent neutralizing monoclonal antibodies [35]. This point mutation exhibited a structural stabilizing impact on S protein (Figure 3). A previous study during the third wave from Pakistan reported that the Delta variant is prevalent in 45% of cases followed by β (46%) [36]. Very little information was provided on mutations in structural proteins while NSP mutations were not provided in detail. In the current study, comprehensive details of mutations in the virus, all targeting proteins, including accessory proteins, have been provided for better understanding of the variations in circulating isolates. Among the structural proteins, the S protein harbored two mutations in the RBD region (L452R and T478K), four in the NTD (T19R, G142D, Δ156–157 and R158del), and one at the furin-cleavage site (P681R) and S2 region (D950N), detected in 12 virus genome sequences (Table 3). This cluster was observed in Indian isolates in October 2020 [37], exhibiting higher pathogenicity than other variants [38]. The T478K mutation at the RBD region of S protein, fell near the E484K that facilitates antibody escape [39]. The L452R is an antibody-escaping mutation and the virus with this mutant is resistant to convalescent plasma and monoclonal antibody therapy [40]. The furin-cleavage site plays an important role in viral pathogenesis [41] and mutation analysis revealed that L452R and P681R increase the ACE2 binding and transmissibility. Consistent with previous study [42] in which variants harbor different mutations, it exhibited increased binding between S protein RBD region and ACE2 receptor. Similar to this previous study, mutants T19R, E156G, L452R, T478K, and P681R in S protein (Figure 3 and Figure 4) exhibited a stabilizing effect on protein structure, facilitating the binding affinity for more stable interactions with human ACE2. This stability effect may increase the virus transmission in populations. Mutations Arg158 and Phe-157/del were also detected in all genomic isolates (Table 3), present in NTD of S protein. In a more recent study, E156G, Arg158, and Phe-157/del were found, causing rigidity and reduced flexibility, thus providing fitness advantage and immune escape [43]. The Delta S protein P681R mutation has an important role in the variant replacement of the α-to-Delta. The Delta S P681R mutation present at the furin-cleavage site exhibited stabilizing effects (Figure 4), separating the S1 and S2 regions of S protein. Mechanistically, the P681R mutation of Delta improved the full-length cleavage of S1 and S2, facilitated the virus cell surface entry and increased infection 16. These mutations must be regularly monitored in ongoing pandemics for surveillance and virus severity [44]. Considering the circulation of the Delta virus with specific mutations and a high transmissible rate, the Pakistani national health authority needs to take timely measures against it. Further, molecular epidemiological studies with insight into genomic analysis of the virus are needed to screen the most prevalent variants and specific mutations that may affect the diagnosis and vaccine potency, to adopt potential measures in the future. Emerging mutations in NSP may affect the virus transmission and pathogenicity. Recently, a synonymous mutation (F106F) in NSP3 along with other signature mutations exhibited a virus fitness effect [45]. In CoV-2 NSP3, the PLpro domain is a large protein and an essential member of replication transcription complex [46,47]. PLpro domain has a catalytic domain (Figure 1) for cleavage activity. Mutations in this domain may affect a catalytic process of PLpro. In the current study, three mutations were detected in this domain with many unique to GISAID. The effect of these mutations may be investigated in future studies for better designing of inhibitors against PLpro. The NSP6_L37F mutation, which was very common in the earlier infections, were associated with asymptomatic cases. NSP6 reduces autophagic ability, important for viral infections and promotes cell death [48]. Mutation NSP6_T77A in the current study (Table 2) seems emerging in the current wave. However, its effect on virus severity is needed to be explored for better management of COVID-19. Previously, mutation P323L in RdRp was the most common in Pakistani isolates [49]. This mutation is still present in the Delta variants (Table 2), which has a stabilizing effect, while interacting with viral RNA, which lies in the interface domain of RdRp. This domain is the antiviral (filibuvir, Simeprevir) binding site [43], which may show weak interaction if mutations emerge. Mutation G671S in NSP12 detected in the current study has been observed as emerging, present in all 18 isolates, increasing the stability of the protein [50]. However, further investigations are needed to see its effect on virus pathogenicity. A single mutation, NSP13_P77L, which is characteristic only in Delta variants, was detected in all genomic sequences in the current study, and may have a destabilizing effect [51]. NSP14_A394V is the most common mutation in NSP14 (Table 2), and has a role in one of the positive section sites of virus [52]. Vaccine efficacy may be reduced in previous VOCs; however, BNT162b2 retained its potency against β VOC [53]. Omicron harbored a number of known and unique mutation patterns (Supplementary Files) as compared to other VoCs; therefore, the vaccines which are effective against Omicron infections are not clear. Vaccines have been found to be effective against other VOCs. Observational investigation in Qatar and Kaiser Permanente [54,55] found more than 90% vaccine efficacy against the Delta-variant. Data from New York indicates a good efficacy for individuals 65 years and older, exhibiting different levels of efficacy for different vaccines [56]. In conclusion, Delta variants with some unique mutations, are circulating during the fourth wave of the pandemic, in major cities in the Punjab province of Pakistan. The structural proteins and NSP harbored numerous mutations, present in different functional domains. Mutations spike_K1086Q, spike_E554K, and spike_C1250W were unique to GISAID. The most common mutations in the S protein exhibited a stabilizing effect, facilitating the binding affinity for S protein RBD with human ACE2. Geographic specific vaccines and drugs may be designed for better management of COVID-19 in the future. The geo-climate distribution of the mutations may decipher higher uniqueness and disease severity in underdeveloped countries, including Pakistan. The effect of these mutations on virus pathogenicity may be experimentally verified to access the effect on virus severity and vaccine efficacy. Continuous molecular epidemiology is required to screen geographic-specific mutations for better understanding of the virus pathogenicity, diagnosis, and treatment of COVID-19.
  50 in total

1.  Spike mutation D614G alters SARS-CoV-2 fitness.

Authors:  Jessica A Plante; Yang Liu; Jianying Liu; Hongjie Xia; Bryan A Johnson; Kumari G Lokugamage; Xianwen Zhang; Antonio E Muruato; Jing Zou; Camila R Fontes-Garfias; Divya Mirchandani; Dionna Scharton; John P Bilello; Zhiqiang Ku; Zhiqiang An; Birte Kalveram; Alexander N Freiberg; Vineet D Menachery; Xuping Xie; Kenneth S Plante; Scott C Weaver; Pei-Yong Shi
Journal:  Nature       Date:  2020-10-26       Impact factor: 49.962

2.  Prospective mapping of viral mutations that escape antibodies used to treat COVID-19.

Authors:  Tyler N Starr; Allison J Greaney; Amin Addetia; William W Hannon; Manish C Choudhary; Adam S Dingens; Jonathan Z Li; Jesse D Bloom
Journal:  Science       Date:  2021-01-25       Impact factor: 47.728

3.  New SARS-CoV-2 Variants - Clinical, Public Health, and Vaccine Implications.

Authors:  Salim S Abdool Karim; Tulio de Oliveira
Journal:  N Engl J Med       Date:  2021-03-24       Impact factor: 91.245

Review 4.  The variant gambit: COVID-19's next move.

Authors:  Jessica A Plante; Brooke M Mitchell; Kenneth S Plante; Kari Debbink; Scott C Weaver; Vineet D Menachery
Journal:  Cell Host Microbe       Date:  2021-03-01       Impact factor: 31.316

5.  Waning of BNT162b2 Vaccine Protection against SARS-CoV-2 Infection in Qatar.

Authors:  Hiam Chemaitelly; Patrick Tang; Mohammad R Hasan; Sawsan AlMukdad; Hadi M Yassine; Fatiha M Benslimane; Hebah A Al Khatib; Peter Coyle; Houssein H Ayoub; Zaina Al Kanaani; Einas Al Kuwari; Andrew Jeremijenko; Anvar H Kaleeckal; Ali N Latif; Riyazuddin M Shaik; Hanan F Abdul Rahim; Gheyath K Nasrallah; Mohamed G Al Kuwari; Hamad E Al Romaihi; Adeel A Butt; Mohamed H Al-Thani; Abdullatif Al Khal; Roberto Bertollini; Laith J Abu-Raddad
Journal:  N Engl J Med       Date:  2021-10-06       Impact factor: 91.245

6.  Effect of Bamlanivimab as Monotherapy or in Combination With Etesevimab on Viral Load in Patients With Mild to Moderate COVID-19: A Randomized Clinical Trial.

Authors:  Robert L Gottlieb; Ajay Nirula; Peter Chen; Joseph Boscia; Barry Heller; Jason Morris; Gregory Huhn; Jose Cardona; Bharat Mocherla; Valentina Stosor; Imad Shawa; Princy Kumar; Andrew C Adams; Jacob Van Naarden; Kenneth L Custer; Michael Durante; Gerard Oakley; Andrew E Schade; Timothy R Holzer; Philip J Ebert; Richard E Higgs; Nicole L Kallewaard; Janelle Sabo; Dipak R Patel; Paul Klekotka; Lei Shen; Daniel M Skovronsky
Journal:  JAMA       Date:  2021-02-16       Impact factor: 56.272

7.  SARS-CoV-2 variant B.1.617 is resistant to bamlanivimab and evades antibodies induced by infection and vaccination.

Authors:  Markus Hoffmann; Heike Hofmann-Winkler; Nadine Krüger; Amy Kempf; Inga Nehlmeier; Luise Graichen; Prerna Arora; Anzhalika Sidarovich; Anna-Sophie Moldenhauer; Martin S Winkler; Sebastian Schulz; Hans-Martin Jäck; Metodi V Stankov; Georg M N Behrens; Stefan Pöhlmann
Journal:  Cell Rep       Date:  2021-06-29       Impact factor: 9.423

8.  DynaMut: predicting the impact of mutations on protein conformation, flexibility and stability.

Authors:  Carlos Hm Rodrigues; Douglas Ev Pires; David B Ascher
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

9.  Transmission of SARS-CoV-2 on mink farms between humans and mink and back to humans.

Authors:  Bas B Oude Munnink; Reina S Sikkema; David F Nieuwenhuijse; Robert Jan Molenaar; Emmanuelle Munger; Richard Molenkamp; Arco van der Spek; Paulien Tolsma; Ariene Rietveld; Miranda Brouwer; Noortje Bouwmeester-Vincken; Frank Harders; Renate Hakze-van der Honing; Marjolein C A Wegdam-Blans; Ruth J Bouwstra; Corine GeurtsvanKessel; Annemiek A van der Eijk; Francisca C Velkers; Lidwien A M Smit; Arjan Stegeman; Wim H M van der Poel; Marion P G Koopmans
Journal:  Science       Date:  2020-11-10       Impact factor: 47.728

10.  Tracking Changes in SARS-CoV-2 Spike: Evidence that D614G Increases Infectivity of the COVID-19 Virus.

Authors:  Bette Korber; Will M Fischer; Sandrasegaram Gnanakaran; Hyejin Yoon; James Theiler; Werner Abfalterer; Nick Hengartner; Elena E Giorgi; Tanmoy Bhattacharya; Brian Foley; Kathryn M Hastie; Matthew D Parker; David G Partridge; Cariad M Evans; Timothy M Freeman; Thushan I de Silva; Charlene McDanal; Lautaro G Perez; Haili Tang; Alex Moon-Walker; Sean P Whelan; Celia C LaBranche; Erica O Saphire; David C Montefiori
Journal:  Cell       Date:  2020-07-03       Impact factor: 66.850

View more
  2 in total

Review 1.  Molecular characteristics, immune evasion, and impact of SARS-CoV-2 variants.

Authors:  Cong Sun; Chu Xie; Guo-Long Bu; Lan-Yi Zhong; Mu-Sheng Zeng
Journal:  Signal Transduct Target Ther       Date:  2022-06-28

2.  Burnout in health care workers during the fourth wave of COVID-19: A cross sectional study from Pakistan.

Authors:  Shoaib Ahmad; Sadia Yaqoob; Sifwa Safdar; Huzaifa Ahmad Cheema; Zarmina Islam; Nida Iqbal; Zoaib Habib Tharwani; Sarya Swed; Mohammad Soban Ijaz; Majeeb Ur Rehman; Abia Shahid; Ufaq Tahir; Shkaib Ahmad; Wajeeha Bilal; Mohammad Yasir Essar; Saleem Iqbal; Zafar Ali Choudry
Journal:  Ann Med Surg (Lond)       Date:  2022-08-07
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.