| Literature DB >> 33254432 |
Alexander Kwarteng1, Ebenezer Asiedu2, Samuel Amoah Sakyi3, Samuel Opoku Asiedu2.
Abstract
The occurrence of the SARS-CoV2 infection has become a worldwide threat and the urgent need to discover therapeutic interventions remains paramount. The primary roles of the coronavirus nucleocapsid (N) protein are to interact with the viral genome and pack them into ribonucleoprotein complex. It also plays critical roles at many stages of the viral life cycle. Herein, we explore the N protein of SARS-CoV2 to identify promising epitope-based vaccine candidates and target the N-terminal domain of SARS-CoV2 N-protein for potential inhibitors using an integrative bioinformatics approach. We identified B-cell epitopes and T-cell epitopes that are non-toxic, non-allergenic, capable of inducing IFN-γ and structurally stable with high global population coverage of response. The 404SKQLQQSMSSADS416 and 92RRIRGGDGKMKDL104 sequences of N-protein were identified to induce B-cell immunity. We also identified 79SSPDDQIGY87 and 305AQFAPSASAFFGMSR319 as potential T-cell epitopes that form stable structures with human leucocyte antigens. We have also identified zidovudine triphosphate, an anti-HIV agent, as a potential inhibitor of the N-terminal domain of SARS-CoV2 N-protein based on docking and simulation analysis and should be considered for experimental validations. The findings of this study can help fast-track the discovery of therapeutic options to combat COVID-19.Entities:
Keywords: COVID-19; Epitope; Immuno-informatics; Molecular dynamics simulation; Vaccine; Zidovudine
Mesh:
Substances:
Year: 2020 PMID: 33254432 PMCID: PMC7574726 DOI: 10.1016/j.biopha.2020.110914
Source DB: PubMed Journal: Biomed Pharmacother ISSN: 0753-3322 Impact factor: 6.529
Fig. 1Structural models of N-protein and HLAs. (A) N-terminal domain of SARS-CoV2 N-protein (modeled based on PDB ID: 6M3M) (B) C-terminal domain of SARS-CoV2 N-protein based on PDB ID: 7C22 (C) HLA-A*03:01 based on PDB ID: 6O9B (D) HLA-B*35:01 based on PDB ID: 6AVG (E) HLA-DRB1*09:01 based on PDB ID: 1A6A. The β-sheets (blue), α-helix (red) and loops (green) are also shown.
Evaluation and characterization of epitopes. The predicted HLA allele that binds well to the MHC-restricted epitopes are also provided.
| Candidate | Peptide | Length | Toxicity | Allergenicity | RMSD (nm) | ||
| 1 | AGLPYGANK | 9 | Non-toxin | Probable non-allergen | 0.47 | ||
| 2 | |||||||
| 3 | |||||||
| 4 | LPQGTTLPKGF | 11 | Non-toxin | Probable non-allergen | 0.39 | ||
| 1 | KTFPPTEPK | 9 | Non-toxin | Probable non-allergen | Positive | 0.52 | HLA-A*11:01, HLA-A*03:01, HLA-A*30:01, HLA-A*31:01, HLA-A*68:01, HLA-A*32:01, HLA-A*30:02, HLA-A*33:01, HLA-B*57:01, HLA-B*58:01, HLA-A*26:01, HLA-A*01:01, HLA-B*15:01, HLA-B*07:02 |
| 2 | |||||||
| 3 | SSPDDQIGYY | 10 | Non-toxin | Probable non-allergen | Positive | 0.35 | HLA-A*01:01, HLA-A*30:02, HLA-B*35:01, HLA-A*26:01, HLA-B*53:01, HLA-B*57:01, HLA-B*58:01, HLA-B*15:01, HLA-A*11:01, HLA-A*68:01, HLA-B*07:02, HLA-B*44:03, HLA-A*03:01, HLA-B*44:02, HLA-A*32:01, HLA-B*51:01 |
| 4 | VTPSGTWLTY | 10 | Non-toxin | Probable non-allergen | Positive | 0.3 | HLA-A*01:01, HLA-A*30:02, HLA-B*35:01, HLA-A*26:01, HLA-B*53:01, HLA-B*15:01, HLA-B*58:01, HLA-B*57:01, HLA-A*11:01, HLA-A*32:01, HLA-B*07:02, HLA-A*03:01, HLA-A*68:01, HLA-A*24:02, HLA-B*51:01, HLA-A*23:01, HLA-A*30:01, HLA-A*33:01, HLA-A*31:01, HLA-B*44:03, HLA-B*44:02 |
| 1 | |||||||
| 2 | |||||||
| 3 | |||||||
| 4 | QIAQFAPSASAFFGM | 15 | Non-toxin | Probable non-allergen | Positive | 0.37 | HLA-DPA1*01:03/DPB1*02:01, HLA-DQA1*01:02/DQB1*06:02, HLA-DPA1*02:01/DPB1*01:01, HLA-DPA1*02:01/DPB1*01:01, HLA-DPA1*03:01/DPB1*04:02, HLA-DQA1*03:01/DQB1*03:02, HLA-DRB5*01:01, HLA-DPA1*03:01/DPB1*04:02, HLA-DRB5*01:01 |
| 5 | WPQIAQFAPSASAFF | 15 | Non-toxin | Probable non-allergen | Positive | 0.29 | HLA-DRB4*01:01, HLA-DPA1*02:01/DPB1*14:01, HLA-DRB4*01:01, HLA-DRB1*11:01, HLA-DRB1*04:01, HLA-DPA1*01:03/DPB1*02:01, HLA-DPA1*02:01/DPB1*01:01, HLA-DRB5*01:01 |
RMSD was calculated after least square fitting of Cα atoms. The epitope candidates considered for further analysis are in bold.
Fig. 2Population coverage of MHC restricted epitope response. (A) Global population coverage of MHC I-restricted epitope (B) Global population of MHC II-restricted epitopes (C) Population coverage of MHC I and MHC II epitopes response in particular ethnicities across the world.
Fig. 3Multiple sequence alignment of SARS-CoV2 N-protein with other related coronaviruses. The sequences in the rectangular shape are peptide sequence of epitopes.
Fig. 4Predicted binding conformations of MHC-restricted epitopes to HLA proteins. (A) HLA-DRB1*09:01-AQFAPSASAFFGMSR complex (B) HLA-DRB1*09:01-IAQFAPSASAFFGMS complex (C) HLA-DRB1*09:01-PQIAQFAPSASAFFG complex (D) HLA-A*03:01-SSPDDQIGY complex (E) HLA-B*35:01-SSPDDQIGY complex. The 3D structures of the epitopes (red) and the surfaces of the HLA proteins (green) are shown.
Fig. 5Molecular dynamics simulation analysis of MHCI-restricted epitope complexes and MHC II-epitope complexes. The MHCI-restricted epitope (78SSPDDQIGY86) is complexed to either HLA-A*03:01 or HLA-B*35:01 and the MHCII-restricted epitopes (305AQFAPSASAFFGMSR319, 304IAQFAPSASAFFGMS318 and 302PQIAQFAPSASAFFG316) are complexed to HLA-DRB1*09:01. (A, B) RMSD. (C, D) number of hydrogen bonds. (E, F) Evolution of the minimum distance between epitope and HLA.
Fig. 6Simulation of immune responses to SARS-CoV2 N-protein. (A) Virus-specific antibody responses along with the antigen counts are shown. (B) The various cytokines released against the antigen. Insert plot shows danger signals and IL-2 levels. (C) The different forms of B-cells including the memory B-cells are shown. (D) T-helper cell counts and kinetics of memory T-cells (E) T-cytotoxic cell levels (F) Evolution of natural killer cells.
Fig. 7Binding site analysis of AMP. (A) Structural superposition of SARS-CoV2 N-protein N-terminal domain model (green) to crystal structure of hCoV N-protein N-terminal domain (red) in complex to AMP (blue). (B) Superposition of active sites showing polar interactions (yellow dashes) between AMP and active site residues in SARS-CoV2 (green) and hCoV (red).
List of ligands used for the virtual screening. Their binding energies based on autodock analysis are also shown.
| Ligand | IUPAC nomenclature | M.W(g/mol) | Chemical representation | Binding energy (kcal/mol) |
|---|---|---|---|---|
| Adenosine 5′-monophosphate | [(2R,3S,4R,5R)-5-(6-aminopurin-9-yl)-3,4-dihydroxyoxolan-2-yl]methyl dihydrogen phosphate | 347.22 | −4.94 | |
| Amprenavir | [(3S)-oxolan-3-yl] N-[(2S,3R)-4-[(4-aminophenyl)sulfonyl-(2-methylpropyl)amino]-3-hydroxy-1-phenylbutan-2-yl]carbamate | 505.6 | −3.24 | |
| Entecavir | 2-amino-9-[(1S,3R,4S)-4-hydroxy-3-(hydroxymethyl)-2-methylidenecyclopentyl]-1H-purin-6-one | 277.28 | −3.28 | |
| Indinavir | (2S)-1-[(2S,4R)-4-benzyl-2-hydroxy-5-[[(1S,2R)-2-hydroxy-2,3-dihydro-1H-inden-1-yl]amino]-5-oxopentyl]-N-tert-butyl-4-(pyridin-3-ylmethyl)piperazine-2-carboxamide | 613.8 | −2.85 | |
| Pleconaril | 3-[3,5-dimethyl-4-[3-(3-methyl-1,2-oxazol-5-yl)propoxy]phenyl]-5-(trifluoromethyl)-1,2,4-oxadiazole | 381.3 | −5.44 | |
| Raltegravir | N-[2-[4-[(4-fluorophenyl)methylcarbamoyl]-5-hydroxy-1-methyl-6-oxopyrimidin-2-yl]propan-2-yl]-5-methyl-1,3,4-oxadiazole-2-carboxamide | 444.4 | −5.5 | |
| Trifluridine | 1-[(2R,4S,5R)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]-5-(trifluoromethyl)pyrimidine-2,4-dione | 296.2 | −4.26 | |
| Umifenovir | ethyl 6-bromo-4-[(dimethylamino)methyl]-5-hydroxy-1-methyl-2-(phenylsulfanylmethyl)indole-3-carboxylate | 477.4 | −3.62 | |
| Vicriviroc | (4,6-dimethylpyrimidin-5-yl)-[4-[(3S)-4-[(1R)-2-methoxy-1-[4-(trifluoromethyl)phenyl]ethyl]-3-methylpiperazin-1-yl]-4-methylpiperidin-1-yl]methanone | 533.6 | −4.3 | |
| Zalcitabine | 4-amino-1-[(2R,5S)-5-(hydroxymethyl)oxolan-2-yl]pyrimidin-2-one | 211.22 | −3.48 | |
| Penciclovir | 2-amino-9-[4-hydroxy-3-(hydroxymethyl)butyl]-1H-purin-6-one | 253.26 | −2.87 | |
| Brivudine | 5-[(E)-2-bromoethenyl]-1-[(2R,4S,5R)-4-hydroxy-5-(hydroxymethyl)oxolan-2-yl]pyrimidine-2,4-dione | 333.13 | −4.87 | |
| Famciclovir | [2-(acetyloxymethyl)-4-(2-aminopurin-9-yl)butyl] acetate | 321.33 | −3.61 | |
| Cidofovir | [(2S)-1-(4-amino-2-oxopyrimidin-1-yl)-3-hydroxypropan-2-yl]oxymethylphosphonic acid | 279.19 | −2.83 | |
| Ganciclovir | 2-amino-9-(1,3-dihydroxypropan-2-yloxymethyl)-1H-purin-6-one | 255.23 | −3.58 | |
| Valgancyclovir | [2-[(2-amino-6-oxo-1H-purin-9-yl)methoxy]-3-hydroxypropyl] (2S)-2-amino-3-methylbutanoate | 354.36 | −2.18 | |
| Acyclovir | 2-amino-9-(2-hydroxyethoxymethyl)-1H-purin-6-one | 225.2 | −3.17 | |
| Valacyclovir | 2-[(2-amino-6-oxo-1H-purin-9-yl)methoxy]ethyl (2S)-2-amino-3-methylbutanoate | 324.34 | −2.26 | |
| Ribavirin | 1-[(2R,3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1,2,4-triazole-3-carboxamide | 244.2 | −3.02 | |
| Remdesivir | 2-ethylbutyl (2S)-2-[[[(2R,3S,4R,5R)-5-(4-aminopyrrolo[2,1-f][1,2,4]triazin-7-yl)-5-cyano-3,4-dihydroxyoxolan-2-yl]methoxy-phenoxyphosphoryl]amino]propanoate | 602.6 | −3.02 | |
| Zidovudine triphosphate | [[(2S,3S,5R)-3-azido-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate | 469.2 | −7.83 | |
| Lamivudine triphosphate | [[(2R,5S)-5-(4-amino-2-oxopyrimidin-1-yl)-1,3-oxathiolan-2-yl]methoxy-hydroxyphosphoryl] phosphono hydrogen phosphate | 469.2 | −3.09 |
MM/PBSA calculations of energy terms and binding energy governing the protein-ligand interactions.
| Energy terms (kJ/mol) | |||||
|---|---|---|---|---|---|
| Ligand | Van der Waal | Electrostatics | Polar solvation | Non-polar solvation (SASA) | Binding energy (kJ/mol) |
| AMP | −0.001 | −106.428 | −1.855 | 0.051 | −108.234 |
| RAL | −0.002 | 0.175 | −8.177 | 0.068 | −7.936 |
| ZTP | −15.851 | −2012.207 | 721.787 | −6.565 | −1312.837 |
| PLE | −43.228 | −3.938 | 23.653 | −6.002 | −29.514 |
Fig. 8Molecular dynamics analysis of selected antiviral drugs complexed with the N-terminal domain (NTD) of SARS-CoV2 N-protein. (A) RMSD of complexes after least square fitting to reference structure. (B) Number of hydrogen bonds formed between the ligands and protein. (C) The minimum distance between the ligands and the protein are shown (D) Radius of gyration (Rg) of the protein-ligand complexes.
Fig. 9The interaction between ZTP and the N-terminal domain (NTD). (A) RMSF analysis of the NTD. RMSF of the N-protein is shown alongside the RMSF of the protein in complex with ZTP or AMP. Residual fluctuations for the residues of the β-hairpin of the SARS-CoV2 N-protein NTD and loop regions are demonstrated. (B) The active site of ZTP (red) and the polar interaction (yellow dashes) with active site residues (blue) are shown. The structure of the β-hairpin is also demonstrated (cyan).