Literature DB >> 32864374

Coronavirus epitope prediction from highly conserved region of spike protein.

Valentina Yurina1.   

Abstract

PURPOSE: The aim of this research was to predict the epitope for coronavirus family spike protein. Coronavirus family is highly evolved viruses which cause several outbreaks in the past decades. Therefore, it is crucial to design a global vaccine candidate to prevent the coronavirus outbreak in the future.
MATERIALS AND METHODS: The spike protein amino acid sequences from nine coronavirus family were searched in the Uniprot database. The spike protein sequences were aligned using Clustal method. The highly conservatives amino acids were analyzed its B cell linear and continuous epitopes and T cell epitopes.
RESULTS: From the alignment results it was found that there is a highly conserved region in the extracellular domain of spike protein. With prediction methods from this highly conserved region, B cell and T cell epitopes from spike protein were derived.
CONCLUSION: From several different prediction results, B cell epitope and T cell epitope were identified in the highly conserved region thus it is promising to be developed as a coronavirus vaccine candidate. © Korean Vaccine Society.

Entities:  

Keywords:  Coronavirus; Epitopes; Respiratory tract infections; Tools

Year:  2020        PMID: 32864374      PMCID: PMC7445319          DOI: 10.7774/cevr.2020.9.2.169

Source DB:  PubMed          Journal:  Clin Exp Vaccine Res        ISSN: 2287-3651


Introduction

Coronavirus is a large family of viruses that cause mild to moderate upper respiratory infections. However, some types of coronavirus can also cause more serious illnesses, such as Middle East respiratory syndrome coronavirus (MERS-CoV), severe acute respiratory syndrome coronavirus (SARS-CoV), and coronavirus disease 2019 (COVID-19) [1]. Up to now, seven coronaviruses (HCoVs) have been identified, namely HCoV-229E, HCoV-OC43, HCoV-NL63, HCoV-HKU1, SARS-CoV, MERS-CoV, and COVID-19. COVID-19 is a member of the coronaviridae family, which by the early of May 2020 has infected more than 3.5 million people and caused almost 250.000 deaths worldwide. The spread of COVID-19 is expanding globally within less than 3 months and causing many losses in various sectors [1]. Severe acute respiratory syndrome (SARS) is an acute respiratory disorder caused by a coronavirus (SARS-CoV). During the global outbreak in 2002/2003, this catastrophic disease resulted in 8,400 cases and 900 deaths according to a report by the World Health Organization [2]. MERS-CoV is an emerging virus that is involved in cases of acute respiratory infections in the Arabian Peninsula, Tunisia, Morocco, France, Italy, Germany, and England. The novel coronavirus, which has been contagious in Saudi Arabia since March 2012, has never before been found in the world and has characteristics that are different from the SARS coronavirus that infected 32 countries in the world in 2003 [3]. All types of coronaviruses cause clinical symptoms that can include fever, coughing, acute respiratory distress, pneumonia, fatigue, headaches, dyspnea, lymphopenia, and infrequently cause gastrointestinal symptoms such as diarrhea. Severe COVID-19 infection can be characterized by turbidity in both lung subpleural areas, acute respiratory distress syndrome, and acute cardiac injury. In critical patients occur both local and systemic immune responses, which lead to intense inflammation [14]. Vaccination is still the most effective preventive for virus infection. One of the latest vaccine technology developments are peptide-based vaccines or epitope vaccines. Epitope based vaccine is synthesized based on in silico analyzes through an immunoinformatics approach. In silico studies reduce costs and time needed in developing vaccines and construct vaccines with higher efficacy and safety than conventional vaccines [567]. Looking at the global pandemic COVID-19, MERS, and SARS caused by coronavirus, it is considered necessary to develop an effective vaccine against all types of coronavirus. Alignment of nine strains of the coronavirus has now been carried out and a highly conserved region of the S2 spike protein has been found. Highly conserved regions can be potential vaccine candidates because they can recognize various strains of the coronavirus. Spike protein is a surface protein in coronavirus that plays a role in binding with receptors and facilitating membrane fusion. The spike S1 protein plays a role in binding virions to the cell membrane through its interaction with the receptors so that it initiates the infection process. S2 protein facilitates fusion between virions and cell membranes [89].

Materials and Methods

Data collection

Spike protein sequences from nine coronavirus strains were collected from protein data bank (https://www.uniprot.org/) (Table 1).
Table 1

Coronavirus ID number

No.IDOrganism name
1.Q6Q1S2Human coronavirus NL63 (HCoV-NL63)
2.P36334Human coronavirus OC43 (HCoV-OC43)
3.P0DTC2Severe acute respiratory syndrome coronavirus 2 (2019-nCoV, SARS-CoV-2)
4.P59594Human SARS coronavirus (severe acute respiratory syndrome coronavirus, SARS-CoV)
5.P15423Human coronavirus 229E (HCoV-229E)
6.K9N5Q8Middle East respiratory syndrome-related coronavirus
7.Q0ZME7Human coronavirus HKU1 (isolate N5) (HCoV-HKU1)
8.Q5MQD0Human coronavirus HKU1 (isolate N1) (HCoV-HKU1)
9.Q14EB0Human coronavirus HKU1 (isolate N2) (HCoV-HKU1)

Alignment and epitopes prediction

Nine spike protein sequences were aligned using COBALT (constraint-based multiple alignment tools) which is available at https://www.ncbi.nlm.nih.gov/tools/cobalt/cobalt.cgi. Highly conservatives' sequences were chosen and analyzed its B cells epitope using several tools (Emini Surface Accessibility Prediction, Chou and Fasman Beta-turn Prediction, Parker hydrophilicity prediction, Kolaskar Tongaonkar Antigenicity for linear epitopes) and DiscoTope for continuous epitopes. While the T cell epitopes were predicted using NetCTL, Immune Epitope Database (IEDB)-major histocompatibility complex (MHC) I, IEDB-MHC II, and MotifScan tools.

Results

Highly conserved region from coronavirus spike protein

Spike protein sequences from nine strains of coronavirus which infected human were collected. Alignment result showed a highly conserved region in amino acid number 945–1100 from severe acute respiratory syndrome coronavirus 2 (2019-nCoV, SARS-CoV-2) spike protein (Fig. 1). This region was used to predict the T and B cells epitopes.
Fig. 1

Highly conserve region form coronavirus spike protein, amino acid number 945–1,100 was used to predict epitopes.

T cells and B epitopes

Several tools to predict T cells epitopes identified epitopes that presented by MHC class I and II (Table 2). While, the B cells linear epitopes prediction was presented in Table 3, the continuous B cells epitopes is demonstrated in Fig. 2. In summary, all of the epitopes identified in highly conserved region is revealed in Fig. 3.
Table 2

T cells epitopes prediction result

No.StartStopPeptideMethodsMHC classPrediction based onTool
ASTP
110381046RVDFCGKGYANNINetCTL
210161024AEIRASANLANNIIEDB-MHC I
310151029AAEIRASANLAATKMANNIIIEDB-MHC II
410381046RVDFCGKGYSMI and IIMotifScan
510411050FCGKGYHLMQSARI and IIMHCPred

MHC, major histocompatibility complex; A, quantitative binding affinity; S, supertypes; T, TAP binding; P, proteasomal cleavage; ANN, artificial neural network; IEDB, Immune Epitope Database; SM, sequence motif; QSAR, quantitative structure-activity relationship model.

Table 3

Linear B cells epitope predicted from highly conserved region

No.StartStopPeptideToolMethod
1.987996EAEVQIDRLBepiprepMachine learning-decision trees
10341045LGQSKRVDFCGK
2.10671075VPAQEKNFTEmini Surface Accessibility PredictionPropensity scale
10851090GKAHFP
3.10531059PQSAPHGChou and Fasman Beta-turn PredictionPropensity scale
4.960966NTLVKQLKolaskar Tongaonkar AntigenicityPropensity scale
972978ISSVLND
10041011LQTYVTQQ
10791084PAICHD
5.10711147QEKNFTTAPAICHDGKAHFPREGVFVSNGTHWFVTQRNFYEPQIITT DNTFVSGNCDVVIGIVNNTVYDPLQPELDSEllipro
Fig. 2

Continuous B cells epitope predicted from highly conserved region and its residues (180 residues).

Fig. 3

Selected highly conserved region for epitopes prediction is presented in yellow, T cell epitopes showed in underlined font, and B cell linear epitopes showed in red, numbers indicated the amino acid.

Discussion

Vaccination is one of the most effective approaches to prevent viral infections. However, the development of vaccines requires a long time and high costs since it is required for the screening of large arrays of potential epitope candidates. Using the in-silico predictions method, it can dramatically reduce the cost for vaccine development. The immune system recognizes antigens through the mechanism of humoral and cellular immune systems, each of which is mediated by B cells and T cells. Both types of immune cells recognize the antigen not as a whole but only in a portion of the pathogenic components called antigens. The introduction of B cell antigens and T cells requires a different process [10]. We predict epitopes from spike glycoprotein (S protein) since this protein has been studied as the most antigenic part of the virus [11]. Prior to epitope prediction, sequencing of S protein sequences of nine strains of the coronavirus was carried out. From this alignment, it is obtained that the highly conserved region is from amino acid residue number 945–1100. From the highly conserved region, epitope prediction is carried out; both B cell epitope and T cell epitope. Epitope prediction is performed in the highly conserved area with the intention that the vaccine can be used for a variety of coronavirus strains, including it is expected that if a new type of virus strain develops in the future, the area this is conserved and vaccination remains effective. Our findings provide a sequence from highly conserved region of S2 protein which can help guide new experimental efforts to develop coronavirus vaccine candidate. B cell epitope prediction is performed to predict both linear and continuous epitopes. From the prediction of linear epitopes in the highly conserved region it was found that the area contained several potential epitopes. Prediction of continuous epitopes has similar results with the presence of epitopes that is recognized by B cells in the spike protein. T cell epitopes prediction in highly conserved region also has similar results. The conclusion of these predictions is the presence of epitopes in the highly conserved region so that they can be developed as vaccine candidates. The results of this study can be a reference for the next stage of coronavirus vaccine development. A delivery strategy that can be useful in the development of the coronavirus vaccine is by the mucosal pathway using live bacteria vector as a career. Live bacteria become an important career because they can induce the mucosal immune system in addition to the systemic immune system [12], the mucosal immune system is very important to defense against viral infections that attack the respiratory tract.
  10 in total

Review 1.  Computer-aided drug discovery and development (CADDD): in silico-chemico-biological approach.

Authors:  I M Kapetanovic
Journal:  Chem Biol Interact       Date:  2006-12-16       Impact factor: 5.192

2.  Recent advances in B-cell epitope prediction methods.

Authors:  Yasser El-Manzalawy; Vasant Honavar
Journal:  Immunome Res       Date:  2010-11-03

3.  Comparative epidemiology of Middle East respiratory syndrome coronavirus (MERS-CoV) in Saudi Arabia and South Korea.

Authors:  Xin Chen; Abrar Ahmad Chughtai; Amalie Dyda; Chandini Raina MacIntyre
Journal:  Emerg Microbes Infect       Date:  2017-06-07       Impact factor: 7.163

Review 4.  Fundamentals and Methods for T- and B-Cell Epitope Prediction.

Authors:  Jose L Sanchez-Trincado; Marta Gomez-Perosanz; Pedro A Reche
Journal:  J Immunol Res       Date:  2017-12-28       Impact factor: 4.818

Review 5.  Live Bacterial Vectors-A Promising DNA Vaccine Delivery System.

Authors:  Valentina Yurina
Journal:  Med Sci (Basel)       Date:  2018-03-23

6.  Preliminary Identification of Potential Vaccine Targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies.

Authors:  Syed Faraz Ahmed; Ahmed A Quadeer; Matthew R McKay
Journal:  Viruses       Date:  2020-02-25       Impact factor: 5.048

7.  Epidemiologic clues to SARS origin in China.

Authors:  Rui-Heng Xu; Jian-Feng He; Meiron R Evans; Guo-Wen Peng; Hume E Field; De-Wen Yu; Chin-Kei Lee; Hui-Min Luo; Wei-Sheng Lin; Peng Lin; Ling-Hui Li; Wen-Jia Liang; Jin-Yan Lin; Alan Schnur
Journal:  Emerg Infect Dis       Date:  2004-06       Impact factor: 6.883

8.  Proof of principle for epitope-focused vaccine design.

Authors:  Bruno E Correia; John T Bates; Rebecca J Loomis; Gretchen Baneyx; Chris Carrico; Joseph G Jardine; Peter Rupert; Colin Correnti; Oleksandr Kalyuzhniy; Vinayak Vittal; Mary J Connell; Eric Stevens; Alexandria Schroeter; Man Chen; Skye Macpherson; Andreia M Serra; Yumiko Adachi; Margaret A Holmes; Yuxing Li; Rachel E Klevit; Barney S Graham; Richard T Wyatt; David Baker; Roland K Strong; James E Crowe; Philip R Johnson; William R Schief
Journal:  Nature       Date:  2014-02-05       Impact factor: 49.962

Review 9.  The epidemiology and pathogenesis of coronavirus disease (COVID-19) outbreak.

Authors:  Hussin A Rothan; Siddappa N Byrareddy
Journal:  J Autoimmun       Date:  2020-02-26       Impact factor: 7.094

10.  Pathological findings of COVID-19 associated with acute respiratory distress syndrome.

Authors:  Zhe Xu; Lei Shi; Yijin Wang; Jiyuan Zhang; Lei Huang; Chao Zhang; Shuhong Liu; Peng Zhao; Hongxia Liu; Li Zhu; Yanhong Tai; Changqing Bai; Tingting Gao; Jinwen Song; Peng Xia; Jinghui Dong; Jingmin Zhao; Fu-Sheng Wang
Journal:  Lancet Respir Med       Date:  2020-02-18       Impact factor: 30.700

  10 in total
  4 in total

Review 1.  Adaptive Immune Responses and Immunity to SARS-CoV-2.

Authors:  Dragan Primorac; Kristijan Vrdoljak; Petar Brlek; Eduard Pavelić; Vilim Molnar; Vid Matišić; Ivana Erceg Ivkošić; Marijo Parčina
Journal:  Front Immunol       Date:  2022-05-04       Impact factor: 8.786

Review 2.  Predicting epitopes for vaccine development using bioinformatics tools.

Authors:  Valentina Yurina; Oktavia Rahayu Adianingsih
Journal:  Ther Adv Vaccines Immunother       Date:  2022-05-21

3.  Molecular docking and dynamic simulation of conserved B cell epitope of SARS-CoV-2 glycoprotein Indonesian isolates: an immunoinformatic approach.

Authors:  Fedik Abdul Rantam; Viol Dhea Kharisma; Christrijogo Sumartono; Jusak Nugraha; Andi Yasmin Wijaya; Helen Susilowati; Suryo Kuncorojakti; Alexander Patera Nugraha
Journal:  F1000Res       Date:  2021-08-16

4.  CAVES: A Novel Tool for Comparative Analysis of Variant Epitope Sequences.

Authors:  Katherine Li; Connor Lowey; Paul Sandstrom; Hezhao Ji
Journal:  Viruses       Date:  2022-05-26       Impact factor: 5.818

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.