Literature DB >> 34514073

Exploration of antigenic determinants in spike glycoprotein of SARS-CoV2 and identification of five salient potential epitopes.

Aditya Agrawal1, Rajat Varshney2, Mamta Pathak1, Shailesh Kumar Patel1, Vishal Rai1, Sourabh Sulabh3, Rohini Gupta4, Khushal Singh Solanki1, Ritu Varshney5, Ramadevi Nimmanapalli2.   

Abstract

Emerging pathogens have been an eternal threat to mankind. In a series of pandemics caused by notorious coronaviruses, a newly emerged SARS-CoV2 virus is creating panic among the world population. The unavailability of reliable theranostics insists the exploration of antigenic determinants in spike glycoprotein of SARS-CoV2. The four novel inserts ('70VSGTNGT76', '150KSWM153', 247SYLTPG252 and 674QTQTNSPRR682) in SARS-CoV2 spike protein were unraveled via multiple sequence alignment of spike proteins of SARS-CoV2, SARS-CoV, and MERS-CoV. The three-dimension (3D) modeling of the spike protein of the SARS-CoV2 and their interaction with the ACE2 receptor was delineated with the help of SWISS-MODEL and 3DLigandSite web servers. The predicted 3D model of SARS-CoV2 was further verified by SAVES, RAMPAGE, and ProSA-web tools. The potential B-cell immunogenic epitopes of SARS-CoV2 were predicted out by using various software viz. IEDB B-cell epitopes prediction tool, BepiPred linear epitope prediction tool, Emini Surface Accessibility Prediction tool, and Kolaskar-Tongaonkar antigenicity web tool. The five epitopes (i.e. '71SGTNGTKRFDN81, 247SYLTPG252, 634RVYST638, 675QTQTNSPRRARSV687, and 1054QSAPH1058) were selected as potent antigenic determinants. The quantum of information generated by this study will prove beneficial for the development of effective therapeutics, diagnostics, and multi-epitopic vaccines to combat this ongoing menace. © Indian Virological Society 2021.

Entities:  

Keywords:  ACE2 receptor; Epitopes; SARS-CoV2; Spike protein; Vaccines

Year:  2021        PMID: 34514073      PMCID: PMC8422955          DOI: 10.1007/s13337-021-00737-9

Source DB:  PubMed          Journal:  Virusdisease        ISSN: 2347-3584


Introduction

The first case of SARS-CoV2 had been documented on 31 December 2019 by the health commission of Hubei province, China [34]. This has been the third straight decade in which a coronavirus has traversed species barriers to cause disease in the human population [26]. The symptoms include pneumonia in affected individuals along with fever, sneezing, dyspnea, myalgia, and chest pain. The novel virus has been associated with pneumonia in affected individuals and clinically exhibited by fever, sneezing, dyspnea, myalgia, and chest pain [2, 15, 37]. The SARS-CoV2 is a member of Coronaviridae, a family of single-stranded positive-sense RNA viruses [9]. It belongs to the subgenus Sarbecovirus, under the genus betacoronavirus of this family [41]. The genome of the virus was reported to be 29.891 kb in size and arranged in the order: 5′-orf1/ab (replicase genes)-Structural genes (S, E, M, N)-3’ [7]. The S gene encodes the spike glycoprotein of coronaviruses, which is responsible for binding to the host cell receptor to initiate infection [21]. The spike glycoprotein is made up of two subunits, viz. S1and S2. N-terminal domain (NTD) and receptor-binding domain (RBD) makes up the S1 subunit whereas fusion peptide (FP), heptad repeat (HR) 1&2, transmembrane domain (TM), and a cytoplasmic domain (CP) constitutes the S2 subunit [7]. The role of the S1 subunit as receptor binding and the S2 subunit role in facilitating membrane fusion have been witnessed [6]. The SARS-CoV2 utilizes the same receptor [i.e. the angiotensin-converting enzyme 2 (ACE2)] as that of severe acute respiratory syndrome coronavirus (SARS-CoV) [24, 31] but bind with higher affinity in comparison to SARS-CoV. Being the major structural protein, neutralizing antibodies are directed against the spike glycoprotein and thus they have been described as potent targets for the development of vaccines against both Middle East respiratory syndrome coronavirus (MERS-CoV) and SARS-CoV [12, 40]. Similarly, they can be targeted for the development of novel vaccines against SARS-CoV2 as well. Bioinformatics tools for the prediction of epitopes in a viral protein can have versatile applications such as the development of immunoassays, synthetic antiviral peptides, and vaccines [28]. Prediction of antigenic determinants for the development of peptide-based vaccines for other diseases has been done earlier [27, 34]. The same can be applied in the case of SARS-CoV2, especially for the improvement in the existing vaccines which is the need of the hour in the wake of this global emergency. Therefore, this study was undertaken to predict and analyze antigenic determinants present on the spike glycoprotein of SARS-CoV2 using different bioinformatics tools.

Material and methods

Retrieval, alignment and conservation analysis of sequences of coronaviruses and ACE2 receptor

Using NCBI protein database (http://www.ncbi.nlm.nih.gov/protein/), the spike glycoprotein amino acid sequences of SARS-CoV2 (1273 AA; NCBI reference sequence number: NC_045512.2), SARS-CoV (1255 AA; GenBank accession number: BAC81404.1) and MERS-CoV (1353 AA; GenBank accession number: AMO03401.1) and along with ACE2 receptor sequences of Homo sapiens (805 AA; NCBI reference sequence number: NP_001358344.1) were retrieved in FASTA format. To identify the similarity between the spike proteins of these coronaviruses, Clustal Omega multiple sequence alignment tool (http://www.ebi.ac.uk/Tools/msa/clustalo/) [23] was employed.

The 3D modeling of SARS-CoV2 Spike protein and delineation of its interaction with ACE2 receptor

The 3D structure assessments from retrieved amino acid sequences of the spike protein of SARS-CoV2 and ACE2 receptor were performed using protein structure homology modelling namely SWISS-MODEL web server (https://swissmodel.expasy.org/interactive) [5]. The ligand-binding position in the 3D structure of the ACE-2 receptor was determined by the 3DLigandSite online web server (http://www.sbg.bio.ic.ac.uk/3dligandsite) [32].

Analysis and validation of model quality

The predicted 3D model of SARS-CoV2 was analysed by ERRAT and VERIFY3D tools of SAVES (Structure Analysis and Verification Server) (http://services.mbi.ucla.edu/SAVES/) and ProSA-web tool (https://prosa.services.came.sbg.ac.at) based on their physio-chemical properties. Inappropriate amino acid residues in models were determined by ERRAT based on atomic interaction [10] whereas the quality and compatibility of models were evaluated using electron density of atomic interaction [8]. Also, the estimation of model quality was done by ProSA-web tool (https://prosa.services.came.sbg.ac.at) which utilizes the energy plot generated by atomic interaction to design the residues z-score [33]. Model excellence was verified by determining the amino acid residues of different regions by RAMPAGE web tool (http://mordred.bioc.cam.ac.uk/~rapper/rampage.php) based on their phi-psi torsion angles [22].

Predictions of continuous B cell epitopes

To predict the surface antigenic epitopes in the spike protein of SARS-CoV2, the Immune Epitope Database (IEDB) webserver (http://tools.iedb.org/bcell/) [11] possessing a variety of immunoinformatics prediction tools which recognize the immunogenic amino acid sequences based on their physio-chemical properties were utilized. BepiPred linear epitope prediction tool (http://tools.iedb.org/bcell/) [20] was applied to find out the list of continuous B cell epitopes from the retrieved sequence of SARS-CoV2 spike protein. The surface assessable nature and antigenicity of the predicted peptide sequences were determined by Emini Surface Accessibility Prediction tool (http://tools.iedb.org/bcell/results) [13] and Kolaskar-Tongaonkar antigenicity web tool (http://tools.immuneepitope.org/bcell/) [19] respectively. To check the maximum retention period during HPLC, the hydrophilic or hydrophobic behaviour of the predicted peptide sequences of SARS-CoV2 spike protein was measured on the hydrophilicity scale using Parker Hydrophilicity Prediction method (http://tools.immuneepitope.org/bcell/) [25]. The predicted surface immunogenic sequences of the SARS-CoV2 spike protein were docked with the 3D model of ACE2 receptor by HPEPDOCK web server (http://huanglab.phys.hust.edu.cn/hpepdock/) [39], which utilizes hierarchical algorithm.

Results

Comparison of the spike protein of SARS-CoV2 with SARS-CoV and MERS- CoV

The multiple sequence alignment (MSA) between the spike protein of SARS-CoV2, SARS-CoV, and MERS- CoV revealed 77.38% and 31.98% similarity of the virus with SARS-CoV and MERS-CoV respectively. The spike protein of SARS-CoV2 was found to be highly similar to SARS-CoV, however, four regions of novel inserts were found. Furthermore, these novel inserts viz. ‘70VSGTNGT76’, ‘150KSWM153’, 247SYLTPG252 and 674QTQTNSPRR682 were found unique for SARS-CoV2 (Fig. 1).
Fig. 1

Multiple sequence alignment between spike proteins of SARS-CoV2 (1273 AA), SARS-CoV (1255AA) and MERS-CoV (1353 AA) with highlighted four novel inserts in 2019-nCoV

Multiple sequence alignment between spike proteins of SARS-CoV2 (1273 AA), SARS-CoV (1255AA) and MERS-CoV (1353 AA) with highlighted four novel inserts in 2019-nCoV The 3D homology structure of SARS-CoV2 spike protein created by the Swiss model revealed 76.47% sequence identity with SARS-CoV using 6acc.1A [PDB title: SARS-CoV spike glycoprotein and ACE2 complex] as a template. Besides, the final model was prepared with the overall coverage of 0.93, the sequence similarity of 0.54 along with global quality estimation values of −3.63, −1.99, and −2.69 for QMEAN, solvation, and torsion respectively (Fig. 2). Similarly, ACE2 receptor homology model was designed with an overall coverage of 0.74 and a sequence similarity of 0.62 using 2ajf.1 [PDB title: Structure of SARS coronavirus spike receptor-binding domain complexed with its receptor] as a template (Fig. 3). The ligands binding position in the ACE2 receptor was observed using the 3DLigandSite web server and found that the probability of spike protein (ligand) of SARS-CoV2 to bind/interact with ACE2 receptor at these “374HIS, 378HIS, 402GLU, and 518ARG” positions are quite high (Fig. 3).
Fig. 2

Prediction of three-dimensional (3D) structure of SARS-CoV2 spike protein using reference sequence (1273 amino acids) by homology modelling software Swiss Model

Fig. 3

Representation of 3D structure a and ligand binding position b in ACE2 receptor (Homo Sapiens) using NCBI reference sequence (805 amino acids) by homology modelling server (Swiss Model) and 3DligandSite server respectively

Prediction of three-dimensional (3D) structure of SARS-CoV2 spike protein using reference sequence (1273 amino acids) by homology modelling software Swiss Model Representation of 3D structure a and ligand binding position b in ACE2 receptor (Homo Sapiens) using NCBI reference sequence (805 amino acids) by homology modelling server (Swiss Model) and 3DligandSite server respectively

Model Quality analysis and validation of SARS-CoV2 spike protein

The quality factor values of 46.94 and 46.57 were obtained on the analysis of 3D model of SARS-CoV2 spike protein in ERRAT (Fig. 4). The VERIFY3D model analysis tool revealed 57.03% residues had an average of >  = 0.2 3D-1D score in comparison to the accepted value of 80% (Fig. 4). The value of z-score analysed by the ProSA-web server was found to be -11.85, which indicates a fair model quality along with largely negative energy of the residues (Fig. 5). Ramachandran plots generated by RAMPAGE web server revealed 83.2% amino acid residues lie in the most favoured regions while 16.1% and 0.7% of the residues fit in allowed and disallowed regions respectively (Fig. 6).
Fig. 4

Quality analysis and verification of predicted 3D model of SARS-CoV2 spike protein through ERRAT a and VERIFY3D b (The gap difference between the raw score and average score was due to atomic interaction)

Fig. 5

Model quality analysis for the predicted 3D Model of SARS-CoV2 spike protein through ProSA-web server a z-scores of all protein chains in PDB determined by X-ray crystallography (light blue) or NMR spectroscopy (dark blue) with respect to their length. The Z-scores are represented as large dots. b Energy plot of 2019-nCoV spike protein (Color figure online)

Fig. 6

Ramachandran plots of the predicted 3D model of SARS-CoV2 spike protein

Quality analysis and verification of predicted 3D model of SARS-CoV2 spike protein through ERRAT a and VERIFY3D b (The gap difference between the raw score and average score was due to atomic interaction) Model quality analysis for the predicted 3D Model of SARS-CoV2 spike protein through ProSA-web server a z-scores of all protein chains in PDB determined by X-ray crystallography (light blue) or NMR spectroscopy (dark blue) with respect to their length. The Z-scores are represented as large dots. b Energy plot of 2019-nCoV spike protein (Color figure online) Ramachandran plots of the predicted 3D model of SARS-CoV2 spike protein

Continuous B cell epitope prediction

The surface antigenic epitopes of SARS-CoV2 spike protein were analysed by numerous prediction tools and a total of 13 continuous epitopes of different lengths were selected through the IEDB server and concluding scores of the predicted sequences are mentioned in Table 1. In BepiPred tool, the peptide sequence 805ILPDPSKPSKRS816 achieved the highest score of 2.291 against the threshold value of 0.350 among all the predicted amino acid sequences (Fig. 7). The most accessible surface peptide was 675QTQTNSPRRARSV687 with a value of 8.837 in Emini Surface Accessibility scale (Fig. 7) whereas the most antigenic peptide region was 634RVYST638 with the highest value of 1.068 in Kolaskar-Tongaonkar antigenicity scale in comparison to their threshold value of 1.000 and 1.041 respectively (Fig. 7). On Parker hydrophilic scale, the epitope 772VEQDKNTQE780 achieved a score of 5.756 against the average score of 1.238 (Fig. 7). However, among all the predicted sequences, only two epitopes (i.e. 634RVYST638 and 1054QSAPH1058) passed all the threshold levels at each antigenic scale while three epitopic sequences (i.e. 71SGTNGTKRFDN81, 247SYLTPG252 and 675QTQTNSPRRARSV687) were selected that contain novel inserts.
Table 1

List of predicted immunogenic epitopes of B cell for 2019-nCoV spike protein, their position, number of residues and scores on different IEDB scales (BepiPred, Emini accessibility prediction scale, Kolaskar and Tongaonkar antigenicity prediction scale and Parker hydrophillicity plot scale)

S. NoPeptide sequence (Epitope)PositionNo. of residuesBepiPred score (0.350)Emini accessibility Score (1.000)Kolaskar and Tongaonkar antigenicity Score (1.041)Parker hydrophillicity plot Score (1.238)
1RTQLPPAYTNS21–32121.1852.8681.0161.675
2SGTNGTKRFDN#71–81111.2623.4640.8994.818
3NKSWME149–15460.6872.0220.8812.133
4GKQGNF181–18660.9341.2290.9273.483
5SYLTPG#247–25261.3090.9951.0451.400
6YQAGSTPCNGV473–483111.3580.4441.0493.282
7TVCGPKKSTN523–532101.1211.1701.0204.080
8RVYST*634–63850.5541.4261.0682.060
9QTQTNSPRRARSV#675–687131.6858.8370.9834.269
10VEQDKNTQE772–78091.3236.9230.9955.756
11ILPDPSKPSKRS805–816122.2914.6901.0192.850
12QSAPH*1054–105851.2131.5971.0523.760
13KNHTSPDVDLG1157–1167111.3781.9131.0033.764

*Peptides revealed a higher score than threshold in all tools.

#Peptide epitopes possess novel inserts in spike protein.

Fig. 7

Immunogenic B cell epitopes prediction for SARS-CoV2 spike protein based on physiochemical properties of residues on different IEDB scales (BepiPred, Emini accessibility prediction, Kolaskar and Tongaonkar antigenicity prediction method and Parker hydrophilicity plot). Immunogenic sequences showing peaks above the threshold (red line) will be expected as a fraction of B cell epitope (Color figure online)

List of predicted immunogenic epitopes of B cell for 2019-nCoV spike protein, their position, number of residues and scores on different IEDB scales (BepiPred, Emini accessibility prediction scale, Kolaskar and Tongaonkar antigenicity prediction scale and Parker hydrophillicity plot scale) *Peptides revealed a higher score than threshold in all tools. #Peptide epitopes possess novel inserts in spike protein. Immunogenic B cell epitopes prediction for SARS-CoV2 spike protein based on physiochemical properties of residues on different IEDB scales (BepiPred, Emini accessibility prediction, Kolaskar and Tongaonkar antigenicity prediction method and Parker hydrophilicity plot). Immunogenic sequences showing peaks above the threshold (red line) will be expected as a fraction of B cell epitope (Color figure online)

Interaction of ACE2 receptor with SARS-CoV2

The interaction of the ACE2 receptor with the domain of spiked protein of SARS-CoV2 was assessed via the HPEPDOCK server by using the 3D homology model of the ACE2 receptor. The exposed peptide sequences (i.e. ‘71SGTNGTKRFDN81, 247SYLTPG252, 634RVYST638, 675QTQTNSPRRARSV687, and 1054QSAPH1058) of the domain of spike protein served as ligands to interact with the ACE2 receptor. During interaction (i.e. docking), numerous ligand binding sites of the ACE2 receptor were unearthed by using the aforesaid server; amongst which the docking site with the highest binding energy was chosen. The peptide epitope, 634RVYST638 of the domain of spike protein of SARS-CoV2 binds more efficiently with the highest docking score of −209.636 in comparison to epitope 71SGTNGTKRFDN81 whose docking score was −194.568. The epitopic peptides viz. 675QTQTNSPRRARSV687, 247SYLTPG252, and 1054QSAPH1058 of the domain of SARS-CoV2 were also found to dock successfully with a score of −190.391, −189.671, and −159.815 respectively (Fig. 8).
Fig. 8

Depiction of docked predicted immunogenic peptides on 3D model of ACE2 receptor using HPEPDOCK Server

Depiction of docked predicted immunogenic peptides on 3D model of ACE2 receptor using HPEPDOCK Server

Discussion

Three well-known trans-species switches of coronavirus (SARS-CoV1 in 2002, MERS in 2012 and SARS-CoV2 in 2019) affected human population [26]. The subtle genetic and spike protein changes account for alteration in host range, tissue tropism and pathogenicity. Hence, the comparative study of spike protein of SARS-CoV2, SARS-CoV and MERS with MSA has been conducted. The MSA analysis revealed only 77.38% similarity in the spike protein with SARS-CoV, which gives a clue about its probable emergence and change in host preference. The change in surface proteins like spike protein might be crucial for interspecies receptor recognition on probable intermediate hosts and reservoirs facilitating the viral multiplication and assembly [4]. As per reports, SARS-CoV2 has been associated with several cross-species transmission based on homologous recombination in spike protein between SARS-CoV2 and bat coronavirus [17]. The presence of novel inserts in the spike protein of SARS-CoV2 may render the vaccine of SARS-CoV and MERS-CoV futile for combating the current SARS-CoV2 pandemic. Hence, the immunogenic profile of SARS-CoV2 needs to be explored. The bio-informative tools implicated in the previous epitopic prediction of the spike protein of SARS-CoV and MERS-CoV [16, 28] were also included in our study to unravel immunogenic surface epitopes of SARS-CoV 2 spike protein which in turn may facilitate the development of decoy ligands/neutralizing antibodies to limit viral infection. In this study, we found four novel inserts in the SARS-CoV2 spike protein after comparing it with SARS-CoV. Consequently, the comparative analysis pointed towards the divergent immunogenic profile of SARS-CoV2 with SARS-CoV A total of 13 surface epitopes of SARS-CoV2 spike protein were predicted out using different bioinformatic tools among which 5 were found potent epitopes which well supported the previous study [30]. To neutralize the virus, the presence of surface accessible potent immunogenic epitopes of virus spike protein is the prerequisite for the fabrication of neutralizing antibodies. Moreover, the binding of neutralizing antibodies with homologous epitopes will disrupt virus spike protein interaction with the ACE2 receptor. Hence, the Emini Surface Accessibility Prediction tool and Kolaskar-Tongaonkar antigenicity web tool were taken into account in the present study for assessment of the surface assessable nature and antigenicity of the predicted peptide sequences apart from merely ruling out B-cell epitopes from the retrieved sequence of SARS-CoV2 spike protein via BepiPred linear epitope prediction tool. Various bio-informative tools viz. MODPEP [36], HPEPDOCK, PEPFOLD, etc. can be used to assess protein-peptide interaction. However, the HPEPDOCK software has been used in the present study as it was proven to be significantly better than the other peptide docking algorithms for the prediction of near-native conformations [38]. As reported recently, the ACE2 receptor is the probable target for the SARS-CoV2 facilitating its entry inside the host cells [24, 31]. In this context, successful docking between the novel inserts of RBD of SARS-CoV2 and ACE2 receptors of humans were delineated using 3D homology modelling. The peptide epitope (with the highest docking score) present on spike protein of SARS-CoV2 binds more efficiently with ACE-2 receptor in comparison to low docking score epitope. The more binding energy (negative) value consequence the compact binding. Although the structure of full-length ACE2 in complex with SARS-CoV2 has been elucidated recently [35], however, these novel predicted peptides may be explored in rationale vaccine design, immunotherapy, and various diagnostic platforms like ELISA, LFA, Dot-blot assay, etc. to detect antibodies against SARS-CoV2. The Immunoinformatics approach for designing multi-epitopic vaccines based on surface immunogens has been used earlier for various diseases such as Ebola, Zika [1, 3, 27]. Previous studies reported the development of SARS-CoV vaccines using spike protein as a target [12, 18, 29]. Blocking of the receptors by an antiviral peptide specifically prevents the entry of SARS-CoV in Vero cells [14] has been reported earlier. Therefore, the same can be achieved for SARS-CoV2 using the predicted peptides; however, in-vitro and in-vivo studies need to be performed for further validation.

Conclusion

Keeping into consideration the threat caused by the virus to humankind and its massive spread throughout the world, it is essential to develop highly specific prophylactic and therapeutic measures viz. vaccines, antiviral peptides, and small biological molecules to curb this ongoing menace. As the spike protein of SARS-CoV2 has a role in binding with cellular receptors, it gives a clue about the possible usage of the same as a target. This study reveals the prediction of five potent epitopes (i.e. ‘71SGTNGTKRFDN81, 247SYLTPG252, 634RVYST638, 675QTQTNSPRRARSV687, and 1054QSAPH1058) homologous to spike protein of the virus and their successful docking with the 3D model of ACE2 receptor. Experimental validations of these in silico predicted epitopes would be required for its consideration as an antigen-based diagnostic tool and as antiviral peptides. The use of virus antigen-based peptides along with their combination cocktails may serve as a boon in the wake of this current havoc.
  2 in total

1.  3DLigandSite: structure-based prediction of protein-ligand binding sites.

Authors:  Jake E McGreig; Hannah Uri; Magdalena Antczak; Michael J E Sternberg; Martin Michaelis; Mark N Wass
Journal:  Nucleic Acids Res       Date:  2022-04-12       Impact factor: 19.160

2.  Unique peptide signatures of SARS-CοV-2 virus against human proteome reveal variants' immune escape and infectiveness.

Authors:  Vasileios Pierros; Evangelos Kontopodis; Dimitrios J Stravopodis; George Th Tsangaris
Journal:  Heliyon       Date:  2022-04-04
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.