Literature DB >> 32022276

Immunoinformatics-aided identification of T cell and B cell epitopes in the surface glycoprotein of 2019-nCoV.

Vargab Baruah1, Sujoy Bose1.   

Abstract

The 2019 novel coronavirus (2019-nCoV) outbreak has caused a large number of deaths with thousands of confirmed cases worldwide, especially in East Asia. This study took an immunoinformatics approach to identify significant cytotoxic T lymphocyte (CTL) and B cell epitopes in the 2019-nCoV surface glycoprotein. Also, interactions between identified CTL epitopes and their corresponding major histocompatibility complex (MHC) class I supertype representatives prevalent in China were studied by molecular dynamics simulations. We identified five CTL epitopes, three sequential B cell epitopes and five discontinuous B cell epitopes in the viral surface glycoprotein. Also, during simulations, the CTL epitopes were observed to be binding MHC class I peptide-binding grooves via multiple contacts, with continuous hydrogen bonds and salt bridge anchors, indicating their potential in generating immune responses. Some of these identified epitopes can be potential candidates for the development of 2019-nCoV vaccines.
© 2020 Wiley Periodicals, Inc.

Entities:  

Keywords:  2019-nCoV; COVID-19; SARS-CoV-2; coronavirus; epitope prediction; immunoinformatics

Mesh:

Substances:

Year:  2020        PMID: 32022276      PMCID: PMC7166505          DOI: 10.1002/jmv.25698

Source DB:  PubMed          Journal:  J Med Virol        ISSN: 0146-6615            Impact factor:   2.327


INTRODUCTION

The 2019 novel coronavirus (2019‐nCoV; Family Coronaviridae), also known as the Wuhan coronavirus, has created a global emergency the likes of which had not been seen since the 2003 severe acute respiratory syndrome (SARS) outbreak. As of 27th January 2020, the death toll from the coronavirus stood at 80, with more than 2500 confirmed cases in China, as well as in 11 other countries. The virus, first identified in Wuhan, Hubei, China, causes a contagious, respiratory infection. The incubation period is reportedly 2 to 10 days, and studies are underway to determine whether the transmission is possible from asymptomatic individuals. Similar to other coronaviruses, 2019‐nCoV reportedly possesses a surface glycoprotein. In more established coronavirus pathogenesis, this protein has been shown to bind host cellular receptors and to mediate membrane fusion. More importantly, it has been described as a potential vaccine target in both SARS‐coronavirus (SARS‐CoV) and Middle East respiratory syndrome coronavirus (MERS‐CoV). , This study aimed to identify significant cytotoxic T cell (CTL) and B cell epitopes within the 2019‐nCoV surface glycoprotein using bioinformatic analysis. We also studied the potential interaction of the putative CTL epitopes with their corresponding representative major histocompatibility complex (MCH) class I supertypes most frequently observed in the Chinese population. We investigated hydrogen bonding, salt‐bridges, and residue‐residue contacts between CTL binding epitopes and the MHC class I molecules by molecular dynamics (MD) simulations.

METHODS

Amino acid sequence of the 1273 AA, viral surface glycoprotein was retrieved in FASTA format from NCBI Protein Database (YP_009724390.1). Because of the novel nature of 2019‐nCoV, we compared the glycoprotein's alignment with the spike glycoprotein of SARS‐Cov (UniProt: P59594) to ensure sequence similarity in EMBOSS Water pairwise sequence aligner. NetCTL 1.2 server (http://www.cbs.dtu.dk/services/NetCTL) was used to identify CTL epitopes within the glycoprotein sequence. NetCTL uses NetMHC server's artificial neural networks (ANNs) to predict MHC binding, NetChop‐3.0 to predict C‐terminal cleavages and a weight matrix to calculate TAP transport efficiency to generate predictions. The scores of the three prediction methods are merged into a single score, which can be translated as sensitivity and specificity for the epitope‐MHC‐I binding event. We predicted MHC‐peptide‐binding for the following MHC class I supertypes—A2, A3, B7, B44, and B62. Together, these supertypes account for the majority of human leukocyte antigen A (HLA‐A) (76.5%) and HLA‐B (67.1%) distribution in the Chinese population. Epitopes with a combined score of >0.75 (sensitivity = 0.80, specificity = 0.97) were further confirmed in CTLPred (http://crdd.osdd.net/raghava/ctlpred), where a consensus scoring approach was taken with thresholds of 0.51 (ANN) and 0.36 (support vector machines). The three highest‐scoring epitopes for each supertype were selected for analyses of binding affinity to their corresponding representative supertypes in NetMHCpan 4.0 server which predicted epitope‐MHC‐I binding using ANN trained on quantitative binding data and mass spectroscopy‐derived MHC eluted ligands (http://www.cbs.dtu.dk/services/NetMHCpan). Default threshold % ranks of 0.5 for strong binders, and 2 for weak binders were used. Binding CTL epitopes were selected for homology modeling in PEP‐FOLD3, and structures with the lowest coarse‐grained energy were selected for protein‐protein docking with corresponding HLA chains using ZDOCK 3.0.2. , Complexes that scored the highest, based on interface atomic contact energy, shape complementarity, and electrostatics were selected for MD simulations in NAMD. Structures were solvated inside a TIP3P water box, and neutralized with Cl− and/or Na+, as required. Energy minimization was carried out in a series of conjugate gradients to ensure appropriate structure geometry. Equilibration with restrained solute atoms was performed in an NPT ensemble (310 K, 1 bar) with periodic boundary conditions, and constant temperature and pressure dynamics via Langevin dynamics and Nose‐Hoover Langevin piston respectively. Finally, free production runs, without positional restraints were carried out for 2 ns. Hydrogen bonds between epitopes and HLA chains were mapped in VMD with a two‐way donor‐acceptor distance of 3 Å and an angle of 20° as thresholds. Residue‐residue contacts between epitopes and HLA chains were visualized in a contact map matrix, and salt bridge‐forming residues were identified using the oxygennitrogen cut‐off distance of 3.2 Å. Sequential B cell epitopes within the surface glycoprotein were identified in BepiPred 2.0 (http://www.cbs.dtu.dk/services/BepiPred), and residues scoring above the default threshold of 0.5 were further confirmed in ABCPred, with default threshold of 0.51 and an overlapping filter (http://crdd.osdd.net/raghava/abcpred). , BepiPred 2.0 uses a random forest algorithm trained on epitope and non‐epitope amino acids validated from crystal structures, whereas ABCPred employs ANN using fixed‐length patterns trained on 700 B cell epitopes and 700 random non‐epitopes. To identify discontinuous B cell epitopes, we used Ellipro using the surface glycoprotein structure modeled in RaptorX, and the top five scoring epitopes were selected as significant. , The modeled structure was validated by the Ramachandran plot generated in RAMPAGE. Antigenicity of all identified epitopes was predicted using VaxiJen 2.0. Conservation of the sequential epitopes within spike proteins across bat coronavirus (Bt‐CoV, UniProt: A3EX94), SARS‐CoV, and MERS‐CoV (UniProt: K9N5Q8) were studied by multiple sequence alignment in Clustal Omega and visualized in NCBI Multiple Sequence Alignment Viewer.

RESULTS

The surface glycoprotein of 2019‐nCoV was found to have 76.3% identity and 87.3% similarity with the spike glycoprotein of SARS‐CoV. We identified five CTL epitopes in the surface glycoprotein, with all five epitopes classified as strong binders to their corresponding MHC class I supertype representative (Table 1). During MDs simulations, we observed the CTL epitopes to bind with the peptide‐binding groove of MHC class I molecules, which is typically used for antigen presentation. The relevant residues of the epitopes and HLA chains in contact were mapped in a contact map matrix (Figure 1A). We observed salt bridge‐forming residues positioned over the viral epitopes in three of the five complexes, as well as continuous hydrogen bonding between epitopes and HLA chains maintained throughout the simulation in all complexes (Figure 1B,C). Mutations in salt bridge‐forming amino acids have been shown to weaken TCR recognition of MHC class I‐antigen complexes. Also, the continuous formation of a set of hydrogen bonds between the MHC molecules and epitopes indicates an accommodating nature of the HLA chains to bind epitopes within the groove between the two helices. Our results are therefore optimistic of ideal antigen presentation to CTLs.
Table 1

CTL epitopes identified in 2019‐nCoV surface glycoprotein and their corresponding MHC class I supertype representative

EpitopeEpitope score—ANN/SVMAntigenicity (score)HLA (% rank)
YLQPRTFLL0.83/0.64Y (0.45)HLA‐A*02:01 (0.01)
GVYFASTEK0.58/0.98Y (0.71)HLA‐A*03:01 (0.00)
EPVLKGVKL0.73/0.61Y (1.23)HLA‐B*07:02 (0.28)
VVNQNAQAL0.77/0.78Y (0.47)HLA‐B*07:02 (0.78)
WTAGAAAYY0.82/0.54Y (0.63)HLA‐B*15:01 (0.37)

Abbreviations: 2019‐nCoV, 2019 novel coronavirus; ANN/SVM, artificial neural network/support vector machine; CTL, cytotoxic T lymphocyte; HLA, human leukocyte antigen; MHC, major histocompatibility complex.

Figure 1

Interactions between cytotoxic T lymphocyte binding epitopes and human leukocyte antigen (HLA) chains of corresponding major histocompatibility complex class I supertype representative during molecular dynamics simulations. A, Residue‐residue contacts (X: epitope, Y: HLA) were mapped in contact map matrices. B, Salt bridge‐forming residues were detected in three complexes. C, Continuous hydrogen bonds were observed between all epitopes and corresponding HLA chains throughout the simulations

CTL epitopes identified in 2019‐nCoV surface glycoprotein and their corresponding MHC class I supertype representative Abbreviations: 2019‐nCoV, 2019 novel coronavirus; ANN/SVM, artificial neural network/support vector machine; CTL, cytotoxic T lymphocyte; HLA, human leukocyte antigen; MHC, major histocompatibility complex. Interactions between cytotoxic T lymphocyte binding epitopes and human leukocyte antigen (HLA) chains of corresponding major histocompatibility complex class I supertype representative during molecular dynamics simulations. A, Residue‐residue contacts (X: epitope, Y: HLA) were mapped in contact map matrices. B, Salt bridge‐forming residues were detected in three complexes. C, Continuous hydrogen bonds were observed between all epitopes and corresponding HLA chains throughout the simulations We also identified three sequential B cell epitopes, and following modeling of 98% residues of the surface glycoprotein (P = 1.03e−15, uGDT = 946), we further identified five discontinuous B cell epitopes in the structure (Tables 2 and 3). Ramachandran plot indicated 94.2% (1194 AA) residues in the favored region, 5% (63 AA) residues in the allowed region, and 0.8% (10 AA) residues in the outlier region (Figure 2).
Table 2

Sequential B cell epitopes identified in 2019‐nCoV surface glycoprotein

EpitopeEpitope probabilityAntigenicity (score)IFN‐γ epitope (score)
CVNLTTRTQLPPAYTN0.74Y (1.38)N (−0.92)
NVTWFHAIHVSGTNGT0.55Y (0.84)N (−0.30)
SFSTFKCYGVSPTKLNDL0.69Y (1.06)N (−0.16)

Abbreviations: 2019‐nCoV, 2019 novel coronavirus; IFN‐γ, interferon γ.

Table 3

Discontinuous B cell epitopes identified in 2019‐nCoV surface glycoprotein

Sl. noEpitope:positionScore
1H1271, Y1272, T12730.998
2E1258, D1259, D1260, S1261, E1262, P1263, V1264, L1265, K1266, G1267, V1268, K1269, L12700.985
3G838, D839, C840, L841, G842, D843, I844, A845, A846, R847, D848, L849, I850, C851, A852, Q853, K854, F8550.866
4Y1215, I1216, L1218, G1219, F1220, I1221, A1222, G1223, L1224, I1225, A1226, I1227, V1228, M1229, V1230, T1231, I1232, M1233, L1234, C1235, C1236, M1237, T1238, S1239, C1240, C1241, S1242, C1243, L1244, K1245, G1246, C1247, C1248, S1249, C1250, G1251, S1252, C1253, C1254, K1255, F1256, D12570.832
5M1, F2, V3, F4, L5, V6, L7, L8, P9, L10, V11, S12, S13, Q14, C15, V16, N17, L180.786

Abbreviations: 2019‐nCoV, 2019 novel coronavirus; Bt‐CoV, bat coronavirus; CTL, cytotoxic T lymphocyte; MERS‐CoV, Middle East respiratory syndrome coronavirus; SARS‐CoV, severe acute respiratory syndrome coronavirus.

Figure 2

Discontinuous B cell epitopes in 2019‐nCoV surface glycoprotein and structure validation by Ramachandran plot. Left—five discontinuous B cell epitopes identified in the modeled structure, detailed in Table 3. Right—ramachandran plot indicated a high proportion of residues to be located within the favored and allowed regions. 2019‐nCoV, 2019 novel coronavirus

Sequential B cell epitopes identified in 2019‐nCoV surface glycoprotein Abbreviations: 2019‐nCoV, 2019 novel coronavirus; IFN‐γ, interferon γ. Discontinuous B cell epitopes identified in 2019‐nCoV surface glycoprotein Abbreviations: 2019‐nCoV, 2019 novel coronavirus; Bt‐CoV, bat coronavirus; CTL, cytotoxic T lymphocyte; MERS‐CoV, Middle East respiratory syndrome coronavirus; SARS‐CoV, severe acute respiratory syndrome coronavirus. Discontinuous B cell epitopes in 2019‐nCoV surface glycoprotein and structure validation by Ramachandran plot. Left—five discontinuous B cell epitopes identified in the modeled structure, detailed in Table 3. Right—ramachandran plot indicated a high proportion of residues to be located within the favored and allowed regions. 2019‐nCoV, 2019 novel coronavirus With the exception of CTL epitope—VVNQNAQAL, which was found to have 100% identity with SARS‐CoV, all identified sequential CTL and B cell epitopes were at least partially unique to 2019‐nCoV, compared with Bt‐CoV, SARS‐CoV, and MERS‐CoV. CTL epitope—EPVLKGVKL did not match any corresponding sequences of the other coronaviruses (Figure 3).
Figure 3

Visualization of frequency‐based differences in residues of sequential CTL and B cell epitopes across bat coronavirus, MERS‐coronavirus and SARS‐coronavirus. CTL, cytotoxic T lymphocyte; MERS, Middle East respiratory syndrome; SARS, severe acute respiratory syndrome

Visualization of frequency‐based differences in residues of sequential CTL and B cell epitopes across bat coronavirus, MERS‐coronavirus and SARS‐coronavirus. CTL, cytotoxic T lymphocyte; MERS, Middle East respiratory syndrome; SARS, severe acute respiratory syndrome

DISCUSSION

Rapid identification of immune epitopes is of crucial importance at the time of an impending pandemic. In this study, we attempted to identify significant CTL and B cell epitopes in the surface glycoprotein of 2019‐nCoV. CTLs can become activated after they detect infected cells presenting the offending viral antigens as part of surface antigen‐MHC‐I complexes. Successful presentation depends on the effective harboring of antigens by MHC‐I molecules via hydrogen bonds and salt bridge interactions that allow both high affinity with broader specificity. After docking and structure refinement, we analyzed interactions between individual CTL epitopes and HLA chains of their corresponding MHC class I supertype representative by MDs simulations. We observed multiple, maintained hydrogen bond interactions, in addition to the presence of salt‐bridges in three complexes. Collectively, these interactions indicate a potentially successful presentation of the epitopes by MHC class I molecules, thereby activating CTLs and generating immune responses. We compared the epitopes obtained in our study with epitopes predicted in the spike protein of MERS‐CoV and SARS‐CoV. , We found one overlapping CTL epitope between MERS‐CoV and 2019‐nCoV with one gap and one mismatch (YLQPRTFLL/YKLQPLTFLL), and no comparable epitopes with SARS‐CoV. In addition to the activation of CTLs, successful immunogens need to generate a persistent humoral immunity and in our study, we identified three sequential and five discontinuous B cell epitopes in the surface glycoprotein. There are a few limitations of our study. Epitope detection in glycoproteins is often complicated by the presence of covalently attached glycans. Modeling viral glycoproteins with the associated glycans is especially challenging due to the lack of related structures, although attempts have been made in hepatitis C and rabies. , Second, similar to other RNA virus, coronavirus replication is error‐prone, with a reported mutation rate of 4 × 10−4 substitutions per site per year in SARS‐CoV. This necessitates urgency in antiviral vaccine development before the identified epitopes are potentially rendered obsolete.
  26 in total

1.  Severe acute respiratory syndrome coronavirus sequence characteristics and evolutionary rate estimate from maximum likelihood analysis.

Authors:  Marco Salemi; Walter M Fitch; Massimo Ciccozzi; Maria Jose Ruiz-Alvarez; Giovanni Rezza; Martha J Lewis
Journal:  J Virol       Date:  2004-02       Impact factor: 5.103

2.  Prediction of CTL epitopes using QM, SVM and ANN techniques.

Authors:  Manoj Bhasin; G P S Raghava
Journal:  Vaccine       Date:  2004-08-13       Impact factor: 3.641

3.  Template-based protein structure modeling using the RaptorX web server.

Authors:  Morten Källberg; Haipeng Wang; Sheng Wang; Jian Peng; Zhiyong Wang; Hui Lu; Jinbo Xu
Journal:  Nat Protoc       Date:  2012-07-19       Impact factor: 13.491

4.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

5.  VaxiJen: a server for prediction of protective antigens, tumour antigens and subunit vaccines.

Authors:  Irini A Doytchinova; Darren R Flower
Journal:  BMC Bioinformatics       Date:  2007-01-05       Impact factor: 3.169

6.  BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes.

Authors:  Martin Closter Jespersen; Bjoern Peters; Morten Nielsen; Paolo Marcatili
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

7.  Salt bridge-forming residues positioned over viral peptides presented by MHC class I impacts T-cell recognition in a binding-dependent manner.

Authors:  Wei Ji; Ling Niu; Weiyu Peng; Yongli Zhang; Hao Cheng; Feng Gao; Yi Shi; Jianxun Qi; George F Gao; William J Liu
Journal:  Mol Immunol       Date:  2019-06-18       Impact factor: 4.407

8.  Epitope-based peptide vaccine design and target site depiction against Middle East Respiratory Syndrome Coronavirus: an immune-informatics study.

Authors:  Muhammad Tahir Ul Qamar; Saman Saleem; Usman Ali Ashfaq; Amna Bari; Farooq Anwar; Safar Alqahtani
Journal:  J Transl Med       Date:  2019-11-08       Impact factor: 5.531

9.  Predicted 3D Model of the Rabies Virus Glycoprotein Trimer.

Authors:  Bastida-González Fernando; Celaya-Trejo Yersin; Correa-Basurto José; Zárate-Segura Paola
Journal:  Biomed Res Int       Date:  2016-05-04       Impact factor: 3.411

Review 10.  Computational Modeling of Hepatitis C Virus Envelope Glycoprotein Structure and Recognition.

Authors:  Johnathan D Guest; Brian G Pierce
Journal:  Front Immunol       Date:  2018-05-28       Impact factor: 7.561

View more
  98 in total

1.  Harnessing innate immunity to eliminate SARS-CoV-2 and ameliorate COVID-19 disease.

Authors:  Rachel M Golonka; Piu Saha; Beng San Yeoh; Saurabh Chattopadhyay; Andrew T Gewirtz; Bina Joe; Matam Vijay-Kumar
Journal:  Physiol Genomics       Date:  2020-04-10       Impact factor: 3.107

Review 2.  Clinical, molecular, and epidemiological characterization of the SARS-CoV-2 virus and the Coronavirus Disease 2019 (COVID-19), a comprehensive literature review.

Authors:  Esteban Ortiz-Prado; Katherine Simbaña-Rivera; Lenin Gómez-Barreno; Mario Rubio-Neira; Linda P Guaman; Nikolaos C Kyriakidis; Claire Muslin; Ana María Gómez Jaramillo; Carlos Barba-Ostria; Doménica Cevallos-Robalino; Hugo Sanches-SanMiguel; Luis Unigarro; Rasa Zalakeviciute; Naomi Gadian; Andrés López-Cortés
Journal:  Diagn Microbiol Infect Dis       Date:  2020-05-30       Impact factor: 2.803

3.  Multisystem Imaging Manifestations of COVID-19, Part 2: From Cardiac Complications to Pediatric Manifestations.

Authors:  Margarita V Revzin; Sarah Raza; Neil C Srivastava; Robin Warshawsky; Catherine D'Agostino; Ajay Malhotra; Anna S Bader; Ritesh D Patel; Kan Chen; Christopher Kyriakakos; John S Pellerito
Journal:  Radiographics       Date:  2020 Nov-Dec       Impact factor: 5.333

Review 4.  Coronavirus Disease 2019-COVID-19.

Authors:  Kuldeep Dhama; Sharun Khan; Ruchi Tiwari; Shubhankar Sircar; Sudipta Bhat; Yashpal Singh Malik; Karam Pal Singh; Wanpen Chaicumpa; D Katterine Bonilla-Aldana; Alfonso J Rodriguez-Morales
Journal:  Clin Microbiol Rev       Date:  2020-06-24       Impact factor: 26.132

5.  A study of differential circRNA and lncRNA expressions in COVID-19-infected peripheral blood.

Authors:  Yingping Wu; Tiejun Zhao; Riqiang Deng; Xiaoping Xia; Bin Li; Xunzhang Wang
Journal:  Sci Rep       Date:  2021-04-12       Impact factor: 4.379

Review 6.  A Scoping Insight on Potential Prophylactics, Vaccines and Therapeutic Weaponry for the Ongoing Novel Coronavirus (COVID-19) Pandemic- A Comprehensive Review.

Authors:  Priyanka Dash; Subhashree Mohapatra; Sayantan Ghosh; Bismita Nayak
Journal:  Front Pharmacol       Date:  2021-02-26       Impact factor: 5.810

7.  SARS-CoV-2-specific T cell responses and correlations with COVID-19 patient predisposition.

Authors:  Arne Sattler; Stefan Angermair; Helena Stockmann; Katrin Moira Heim; Dmytro Khadzhynov; Sascha Treskatsch; Fabian Halleck; Martin E Kreis; Katja Kotsch
Journal:  J Clin Invest       Date:  2020-12-01       Impact factor: 14.808

8.  [Biochemical analysis between common type and critical type of COVID-19 and clinical value of neutrophil/lymphocyte ratio].

Authors:  Hongbing Li; Maojun Zhao; Yingsheng Xu
Journal:  Nan Fang Yi Ke Da Xue Xue Bao       Date:  2020-07-30

9.  Quasispecies of SARS-CoV-2 revealed by single nucleotide polymorphisms (SNPs) analysis.

Authors:  Rongsui Gao; Wenhong Zu; Yang Liu; Junhua Li; Zeyao Li; Yanling Wen; Haiyan Wang; Jing Yuan; Lin Cheng; Shengyuan Zhang; Yu Zhang; Shuye Zhang; Weilong Liu; Xun Lan; Lei Liu; Feng Li; Zheng Zhang
Journal:  Virulence       Date:  2021-12       Impact factor: 5.882

Review 10.  Bioinformatic HLA Studies in the Context of SARS-CoV-2 Pandemic and Review on Association of HLA Alleles with Preexisting Medical Conditions.

Authors:  Mina Mobini Kesheh; Sara Shavandi; Parastoo Hosseini; Rezvan Kakavand-Ghalehnoei; Hossein Keyvani
Journal:  Biomed Res Int       Date:  2021-05-28       Impact factor: 3.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.