| Literature DB >> 33147821 |
Ahmed Rakib1, Saad Ahmed Sami1, Md Ashiqul Islam1,2, Shahriar Ahmed1, Farhana Binta Faiz1, Bibi Humayra Khanam1, Kay Kay Shain Marma1, Maksuda Rahman1, Mir Muhammad Nasir Uddin1, Firzan Nainu3, Talha Bin Emran4, Jesus Simal-Gandara5.
Abstract
With an increasing fatality rate, severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2) has emerged as a promising threat to human health worldwide. Recently, the World Health Organization (WHO) has announced the infectious disease caused by SARS-CoV-2, which is known as coronavirus disease-2019 (COVID-2019), as a global pandemic. Additionally, the positive cases are still following an upward trend worldwide and as a corollary, there is a need for a potential vaccine to impede the progression of the disease. Lately, it has been documented that the nucleocapsid (N) protein of SARS-CoV-2 is responsible for viral replication and interferes with host immune responses. We comparatively analyzed the sequences of N protein of SARS-CoV-2 for the identification of core attributes and analyzed the ancestry through phylogenetic analysis. Subsequently, we predicted the most immunogenic epitope for the T-cell and B-cell. Importantly, our investigation mainly focused on major histocompatibility complex (MHC) class I potential peptides and NTASWFTAL interacted with most human leukocyte antigen (HLA) that are encoded by MHC class I molecules. Further, molecular docking analysis unveiled that NTASWFTAL possessed a greater affinity towards HLA and also available in a greater range of the population. Our study provides a consolidated base for vaccine design and we hope that this computational analysis will pave the way for designing novel vaccine candidates.Entities:
Keywords: COVID-19; SARS-CoV-2; bioinformatics; epitope; immunoinformatics; nucleocapsid protein; vaccine
Mesh:
Substances:
Year: 2020 PMID: 33147821 PMCID: PMC7663370 DOI: 10.3390/molecules25215088
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Workflow of the methodologies used in peptide vaccine design by utilizing SARS-CoV-2 nucleocapsid (N) protein.
Toxicity prediction of selected epitopes.
| Epitopes | Toxicity Prediction | SVM Score | Hydrophobicity | Hydrophilicity | Molecular Weight |
|---|---|---|---|---|---|
| LSPRWYFYY | Non-Toxin | −1.08 | −0.06 | −1.26 | 1294.59 |
| GTTLPKGFY | Non-Toxin | −1.13 | −0.01 | −0.49 | 983.26 |
| DLSPRWYFY | Non-Toxin | −1.18 | −0.14 | −0.67 | 1246.5 |
| SSPDDQIGY | Non-Toxin | −0.33 | −0.2 | 0.3 | 981.1 |
| LLNKHIDAY | Non-Toxin | −0.81 | −0.09 | −0.28 | 1086.39 |
| GTDYKHWPQ | Non-Toxin | −0.29 | −0.29 | −0.04 | 1131.34 |
| SPDDQIGYY | Non-Toxin | −0.54 | −0.17 | 0.01 | 1057.2 |
| NTASWFTAL | Non-Toxin | −1 | 0.08 | -1 | 1010.23 |
Figure 2(A) 3D structure of modeled SARS-CoV-2 N protein; (B) composition of the secondary structure from amino acid residues of SARS-CoV-2 N protein; (C) predicted secondary structure of SARS-CoV-2 N protein; (D) Z-score of the SARS-CoV-2 N protein predicted by PROSA server and (E) Ramachandran plot analysis of the SARS-CoV-2 N protein.
The potential CD8+ T-cell epitopes along with their interacting MHC class I alleles and total processing score, epitopes conservancy_hits and pMHC-I immunogenicity score.
| Epitopes | NetCTL | Epitope | MHC-I Interaction with | pMHC-I |
|---|---|---|---|---|
| LSPRWYFYY | 2.3408 | 100 | HLA-A*29:02 (1.32), | 0.35734 |
| NTASWFTAL | 0.9521 | 100 | HLA-A*68:02 (1.11), | 0.22775 |
| DLSPRWYFY | 1.4994 | 100 | HLA-A*29:02 (0.99) | 0.25933 |
| SPDDQIGYY | 1.1404 | 100 | HLA-B*35:01 (0.52) | 0.06844 |
| SSPDDQIGY | 0.6895 | 100 | 0.0634 |
Analysis of the population coverage for the proposed epitope against SARS-CoV-2.
| Population | Coverage (%) a | Average Hit b | PC90 c |
|---|---|---|---|
| Central Africa | 35.31 | 0.40 | 0.15 |
| East Africa | 39.25 | 0.45 | 0.16 |
| East Asia | 57.56 | 0.71 | 0.24 |
| Europe | 42.95 | 0.50 | 0.18 |
| North Africa | 42.15 | 0.49 | 0.17 |
| North America | 45.32 | 0.53 | 0.18 |
| Northeast Asia | 48.11 | 0.55 | 0.19 |
| Oceania | 31.43 | 0.34 | 0.15 |
| South Africa | 33.91 | 0.38 | 0.15 |
| South America | 38.66 | 0.44 | 0.16 |
| South Asia | 36.53 | 0.41 | 0.16 |
| Southeast Asia | 49.45 | 0.57 | 0.20 |
| Southwest Asia | 28.53 | 0.32 | 0.14 |
| West Africa | 56.22 | 0.67 | 0.23 |
| West Indies | 12.89 | 0.13 | 0.11 |
Notes: a Projected population coverage. b Average number of epitope hits/HLA combinations recognized by the population. c Minimum number of epitope hits/HLA combinations recognized by 90% of the population.
Figure 3(A) Three-dimensional representation of the predicted epitope, NTASWFTAL, and (B) three-dimensional representation of the HLA-A*68:02 molecule.
Results of the molecular docking analysis amongst HLA-A*68:02 and the predicted epitope, NTASWFTAL, and 9-mer peptide from envelope glycoprotein gp160 from HIV type 1 (positive control).
| Epitopes | Docking Score (kcal/mol) |
|---|---|
| NTASWFTAL | −9.4 |
| Positive Control | −8.2 |
Figure 43D representation of molecular docking studies representing the binding affinity of the predicted epitope, NTASWFTAL to the groove of the HLA-A*68:02. The interacting A chain residues are displayed as red ball and stick, interacting B chain residues are displayed as cyan ball and stick, hydrogen bonds are displayed as green dotted lines, alkyl/pi-alkyl bonds are displayed as pink dotted lines and carbon-hydrogen bonds are displayed as white dotted lines.
Figure 53D representation of molecular docking studies representing the binding affinity of the 9-mer peptide from envelope glycoprotein gp160 form HIV type 1 (positive control) to the groove of the HLA-A*68:02. The interacting A chain residues are displayed as red ball and stick, interacting B chain residues are displayed as cyan ball and stick, hydrogen bonds are displayed as green dotted lines, alkyl/pi-alkyl bonds are displayed as pink dotted lines and salt bridges are displayed as gold dotted lines.
List of predicted B cell epitopes from BepiPred linear epitope prediction analysis.
| Start | End | Peptide | Length |
|---|---|---|---|
| 361 | 390 | KTFPPTEPKKDKKKKADETQALPQRQKKQQ | 30 |
| 338 | 347 | KLDDKDPNFK | 10 |
| 323 | 331 | EVTPSGTWL | 9 |
| 273 | 287 | AFGRRGPEQTQGNFG | 15 |
| 232 | 269 | SKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYN | 38 |
| 164 | 216 | GTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGD | 53 |
| 137 | 154 | GALNTPKDHIGTRNPANN | 18 |
| 115 | 127 | TGPEAGLPYGANK | 13 |
| 93 | 104 | RIRGGDGKMKDL | 12 |
| 58 | 85 | QHGKEDLKFPRGQGVPINTNSSPDDQIG | 28 |
| 1 | 51 | MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQGLPNNTAS | 51 |
List of predicted B-cell epitopes from Kolaskar and Tongaonkar antigenicity prediction method.
| Start | End | Peptide | Length |
|---|---|---|---|
| 52 | 59 | WFTALTQH | 8 |
| 69 | 75 | GQGVPIN | 7 |
| 83 | 89 | QIGYYRR | 7 |
| 106 | 115 | PRWYFYYLGT | 10 |
| 130 | 136 | IIWVATE | 7 |
| 154 | 166 | NAAIVLQLPQGTT | 13 |
| 217 | 227 | AALALLLLDRL | 11 |
| 243 | 249 | GQTVTKK | 7 |
| 267 | 273 | AYNVTQA | 7 |
| 299 | 315 | KHWPQIAQFAPSASAFF | 17 |
| 333 | 339 | YTGAIKL | 7 |
| 347 | 363 | KDQVILLNKHIDAYKTF | 17 |
| 379 | 385 | TQALPQR | 7 |
| 389 | 401 | QQTVTLLPAADLD | 13 |
| 403 | 411 | FSKQLQQSM | 9 |
List of predicted B-cell epitopes from Emini surface accessibility prediction method.
| Start | End | Peptide | Length |
|---|---|---|---|
| 4 | 11 | NGPQNQRN | 8 |
| 36 | 42 | RSKQRRP | 7 |
| 185 | 197 | RSSSRSRNSSRNS | 13 |
| 254 | 264 | ASKKPRQKRTA | 11 |
| 277 | 282 | RGPEQT | 6 |
| 295 | 300 | GTDYKH | 6 |
| 340 | 346 | DDKDPNF | 7 |
| 365 | 377 | PTEPKKDKKKKAD | 13 |
| 384 | 390 | QRQKKQQ | 7 |
Figure 6Combined B-cell linear epitope prediction using (A) Bepipred linear epitope prediction, (B) Chou and Fasman beta-turn prediction and (C) Emini surface accessibility prediction methods.
Figure 7Combined B-cell linear epitope prediction using (A) Karplus and Schulz flexibility prediction, (B) Kolaskar and Tongaonkar antigenicity and (C) Parker hydrophilicity prediction methods.