| Literature DB >> 32920121 |
Sandeep Kumar Kushwaha1, Veerbhan Kesarwani2, Samraggi Choudhury3, Sonu Gandhi4, Shailesh Sharma5.
Abstract
Genomics-led researches are engaged in tracing virus expression pattern, and induced immune responses in human to develop effective vaccine against COVID-19. In this study, targeted expression profiling and differential gene expression analysis of major histocompatibility complexes and innate immune system genes were performed through SARS-CoV-2 infected RNA-seq data of human cell line, and virus transcriptome was generated for T-and B-cell epitope prediction. Docking studies of epitopes with MHC and B-cell receptors were performed to identify potential T-and B-cell epitopes. Transcriptome analysis revealed the specific multiple allele expressions in cell line, genes for elicited induce immune response, and virus gene expression. Proposed T- and B-cell epitopes have high potential to elicit equivalent immune responses caused by SARS-CoV-2 infection which can be useful to provide links between elicited immune response and virus gene expression. This study will facilitate in vitro and in vivo vaccine related research studies in disease control.Entities:
Keywords: COVID-19; Epitopes; Peptide; SAR-CoV-2; T- and B-cell; Vaccine
Year: 2020 PMID: 32920121 PMCID: PMC7500163 DOI: 10.1016/j.ygeno.2020.09.019
Source DB: PubMed Journal: Genomics ISSN: 0888-7543 Impact factor: 5.736
List of top 10 up and down regulated genes in SARS-CoV-2 infected NHBE cell lines.
| Genes | P value | Fold change | TPM (Mock) | TPM (Infection) | Description |
|---|---|---|---|---|---|
| ENSG00000121858 | 2.95E-07 | −3.252 | 73.471 | 9.201 | Tumor necrosis factor (ligand) superfamily |
| ENSG00000075624 | 0.0018 | −3.034 | 5489.099 | 1796.564 | Actin |
| ENSG00000142192 | 0.0002 | −2.832 | 210.277 | 15.893 | Amyloid beta (A4) precursor protein |
| ENSG00000162772 | 4.19E-06 | −2.763 | 24.919 | 5.715 | Activating transcription factor 3 |
| ENSG00000141367 | 0.0005 | −2.461 | 63.660 | 14.057 | Clathrin |
| ENSG00000078142 | 0.0009 | −2.191 | 13.849 | 5.136 | Phosphatidylinositol 3-kinase |
| ENSG00000125538 | 0.0013 | −2.052 | 149.285 | 53.990 | Interleukin 1 |
| ENSG00000108599 | 0.0003 | −2.031 | 30.403 | 8.495 | A kinase (PRKA) anchor protein 10 |
| ENSG00000179750 | 0.0005 | −1.986 | 25.141 | 6.919 | Apolipoprotein B mRNA editing enzyme |
| ENSG00000015171 | 0.0021 | −1.737 | 80.428 | 21.587 | Zinc finger |
| ENSG00000157601 | 1.64E-08 | 3.607 | 11.958 | 120.264 | Myxovirus (influenza virus) resistance 1 |
| ENSG00000105372 | 0.0029 | 3.165 | 1576.726 | 3727.784 | Ribosomal protein S19 |
| ENSG00000100979 | 2.38E-05 | 3.098 | 3.235 | 52.695 | Phospholipid transfer protein |
| ENSG00000170581 | 4.10E-06 | 2.912 | 6.547 | 53.179 | Signal transducer and activator of transcription 2 |
| ENSG00000160014 | 7.21E-05 | 2.904 | 78.825 | 338.738 | Calmodulin 3 (phosphorylase kinase |
| ENSG00000165949 | 8.43E-07 | 2.840 | 12.392 | 128.770 | Interferon |
| ENSG00000080503 | 0.0005 | 2.765 | 18.355 | 41.172 | SWI/SNF related |
| ENSG00000090339 | 1.15E-06 | 2.726 | 27.981 | 174.443 | Intercellular adhesion molecule 1 |
| ENSG00000145901 | 0.0014 | 2.523 | 56.837 | 254.358 | TNFAIP3 interacting protein 1 |
| ENSG00000160710 | 0.0008 | 2.450 | 6.969 | 26.304 | Adenosine deaminase |
| ENSG00000135018 | 1.53E-05 | 2.335 | 32.631 | 158.313 | Ubiquilin 1 |
Fig. 1Schematic representation of the pathways expressed during humoral immune response of SARS-CoV-2 infection. A) MAPK/cAMP pathway; B) Interleukin/MHC pathway; C) NF-kappa B pathway; D) PI3K-AKT signal transduction pathway. Camp acts an intracellular regulator of cellular homeostasis and also negatively regulates proinflammatory signalling, cytokines such as TNF-α and IL-1β leads to decrease in cAMP in the cells, but mechanism behind this action is poorly known so far. In presence of TNF- α, ERK 1/2 is phosphorylated which is present in the MAPK signalling cascade, NF-Kappa B p65 gets translocated in the nucleus all these occurs when there are lower levels of cAMP [55]. The MAPK pathway and PI3K-AKT pathway are two independent signalling pathways which plays important role in cell survival, proliferation and motility. These pathways also cross talk extensively to either positively or negatively regulate each other [56].
SARS-CoV-2 transcriptome annotation through BLAST similarity search at e-value threshold 1e-10 along with their expression values.
| Transcripts ids | Functional class | Uniport annotation | Expression (TPM) |
|---|---|---|---|
| TRINITY_DN10_c0_g1_i1_p1 | H | ORF1ab polyprotein Tax = BtRs-BetaCoV/YN2013 TaxID = 1,503,303 | 3661.09 |
| TRINITY_DN0_c0_g1_i1_p3 | M | Membrane protein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 91,746.3 |
| TRINITY_DN0_c0_g1_i2_p5 | M | Membrane protein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p5 | M | Membrane protein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p5 | M | Membrane protein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 12,432 |
| TRINITY_DN0_c0_g1_i1_p1 | N | Nucleoprotein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 91,746.3 |
| TRINITY_DN0_c0_g1_i2_p3 | N | Nucleoprotein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p3 | N | Nucleoprotein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p3 | N | Nucleoprotein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 12,432 |
| TRINITY_DN0_c0_g1_i7_p1 | N | Nucleoprotein Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 838,657 |
| TRINITY_DN0_c0_g1_i2_p2 | NendoU | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p2 | NendoU | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p2 | NendoU | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 12,432 |
| TRINITY_DN0_c0_g1_i3_p1 | NSP1 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 10,488.1 |
| TRINITY_DN0_c0_g1_i5_p1 | NSP1 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 3765.1 |
| TRINITY_DN20_c0_g1_i1_p1 | NSP2 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 1786.89 |
| TRINITY_DN4_c0_g1_i1_p1 | NSP2 | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 3107.15 |
| TRINITY_DN13_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 1009.35 |
| TRINITY_DN17_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 1416.36 |
| TRINITY_DN19_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 1759.06 |
| TRINITY_DN2_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 1602.68 |
| TRINITY_DN5_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 553.586 |
| TRINITY_DN6_c0_g1_i1_p1 | NSP3 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 1813.9 |
| TRINITY_DN7_c0_g1_i1_p1 | NSP6 | Non-structural polyprotein 1ab Tax = Betacoronavirus TaxID = 694,002 | 697.846 |
| TRINITY_DN1_c0_g1_i1_p1 | NSP8 | Non-structural polyprotein 1ab Tax = Bat SARS-like coronavirus | 1941.65 |
| TRINITY_DN0_c0_g1_i1_p2 | ORF3a | Uncharacterized protein Tax = Human SARS coronavirus TaxID = 694,009 | 91,746.3 |
| TRINITY_DN0_c0_g1_i2_p4 | ORF3a | Uncharacterized protein Tax = Human SARS coronavirus TaxID = 694,009 | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p4 | ORF3a | Uncharacterized protein Tax = Human SARS coronavirus TaxID = 694,009 | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p4 | ORF3a | Uncharacterized protein Tax = Human SARS coronavirus TaxID = 694,009 | 12,432 |
| TRINITY_DN0_c0_g1_i1_p4 | ORF7a | SARS_X4 domain-containing protein Tax = Bat SARS-like coronavirus | 91,746.3 |
| TRINITY_DN0_c0_g1_i2_p6 | ORF7a | SARS_X4 domain-containing protein Tax = Bat SARS-like coronavirus | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p6 | ORF7a | SARS_X4 domain-containing protein Tax = Bat SARS-like coronavirus | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p6 | ORF7a | SARS_X4 domain-containing protein Tax = Bat SARS-like coronavirus | 12,432 |
| TRINITY_DN0_c0_g1_i1_p5 | ORF8 | Uncharacterized protein, Bat SARS-like coronavirus TaxID = 1,508,227 | 91,746.3 |
| TRINITY_DN0_c0_g1_i2_p7 | ORF8 | Uncharacterized protein, Bat SARS-like coronavirus TaxID = 1,508,227 | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p7 | ORF8 | Uncharacterized protein, Bat SARS-like coronavirus TaxID = 1,508,227 | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p7 | ORF8 | Uncharacterized protein, Bat SARS-like coronavirus TaxID = 1,508,227 | 12,432 |
| TRINITY_DN15_c0_g1_i1_p1 | Proteinase | UPI0001D192D5 related cluster, TaxID = RepID = UPI0001D192D5 | 1706.65 |
| TRINITY_DN9_c0_g1_i1_p1 | Proteinase | UPI000181CE36 related cluster, TaxID = RepID = UPI000181CE36 | 811.289 |
| TRINITY_DN11_c0_g1_i1_p1 | RdRp | ORF1ab polyprotein | 1168.03 |
| TRINITY_DN12_c0_g1_i1_p1 | RdRp | RNA-dependent RNA polymerase Tax = Human SARS coronavirus | 1833.88 |
| TRINITY_DN0_c0_g1_i2_p1 | SG | Spike protein n = 1 Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 1716.75 |
| TRINITY_DN0_c0_g1_i4_p1 | SG | Spike protein n = 1 Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 3188.07 |
| TRINITY_DN0_c0_g1_i6_p1 | SG | Spike protein n = 1 Tax = Bat SARS-like coronavirus TaxID = 1,508,227 | 12,432 |
H: Helicase, M: Membrane protein, N: Nucleoprotein, NendoU: Uridylate-specific endoribonuclease, NSP: Non-structural protein, ORF: open reading frame, SG: Surface glycoprotein, RdRp: RNA-dependent RNA polymerase, TPM: Transcript per million.
MHC class II alleles and epitopes along with their core 9-mer epitopes.
| Anno | MHC class I epitope sequences | Immscore | Antiscore | MHC class II alleles | MHC class II epitope sequences |
|---|---|---|---|---|---|
| SG | QYIKWPWYI | 0.41673 | 1.4953 | HLA-DPA1*01:03/DPB1*02:01 | QYIKWPWYIWLGFIA |
| WPWYIWLGF | HLA-DPA1*01:03/DPB1*04:01 | YIKWPWYIWLGFIAG | |||
| SG | VVFLHVTYV | 0.1278 | 1.5122 | HLA-DRB1*01:01; | QSAPHGVVFLHVTYV |
| HLA-DRB1*04:05; | SAPHGVVFLHVTYVP | ||||
| HLA-DRB1*07:01; | APHGVVFLHVTYVPA | ||||
| HLA-DRB3*02:02; | PHGVVFLHVTYVPAQ | ||||
| SG | FLHVTYVPA | 0.11472 | 1.3346 | HLA-DPA1*01:03/DPB1*02:01; | APHGVVFLHVTYVPA |
| HLA-DPA1*01:03/DPB1*04:01; | PHGVVFLHVTYVPAQ | ||||
| HLA-DPA1*02:01/DPB1*14:01; | HGVVFLHVTYVPAQE | ||||
| HLA-DRB1*01:01; | GVVFLHVTYVPAQEK | ||||
| HLA-DRB1*04:05; | VVFLHVTYVPAQEKN | ||||
| HLA-DRB1*07:01; | VFLHVTYVPAQEKNF | ||||
| HLA-DRB3*02:02; | FLHVTYVPAQEKNFT |
Immscore: Immunogencity score, Antiscore: Antigencity score, SG: Surface glycoprotein.
B-cell epitopes identified form protein sequences of assembled transcriptome and modelled protein structure of various coding protein of SAR-CoV-2 genome.
| B-cell Epitopes | Anno | Len | Start | End | Immscore | Antiscore | Toxicity |
|---|---|---|---|---|---|---|---|
| SEQLDFIDTKRGV | NSP2 | 13 | 36 | 48 | 0.13632 | 1.7773 | NT |
| KGTLEPEYF | H | 9 | 414 | 422 | 0.17084 | 1.3504 | NT |
| HCGETSWQTGDFV | NSP2 | 13 | 145 | 157 | 0.24817 | 1.1314 | NT |
| KTVGELGDVRE | NSP3 | 11 | 902 | 912 | 0.25316 | 0.9231 | NT |
| LTGTGVLTESNK | SG | 12 | 546 | 557 | 0.10111 | 0.8122 | NT |
| TGVVGEGSEGLN | NSP2 | 12 | 256 | 267 | 0.17371 | 0.7539 | NT |
| QTTETAHSC | H | 9 | 548 | 556 | 0.11862 | 0.7078 | NT |
| MEVTPSGTWLT | N | 11 | 322 | 332 | 0.13056 | 0.5982 | NT |
| SDARTAPHG | NSP1 | 9 | 74 | 82 | 0.14895 | 0.5706 | NT |
| LKATEETFK | H | 9 | 138 | 146 | 0.34467 | 0.5278 | NT |
| NENGTITDA | SG | 9 | 280 | 288 | 0.2408 | 0.5257 | NT |
| KGHFDGQQGEVPVS | NendoU | 14 | 12 | 25 | 0.14142 | 0.5183 | NT |
| LQAGNATEVPANS | NSP10 | 13 | 1 | 11 | 0.2729 | 0.4491 | NT |
| VQIPTTCANDPVGFT | NSP10 | 15 | 97 | 111 | 0.2336 | 0.4488 | NT |
Immscore: Immunogenicity score, Antiscore: Antigenicity score, NT: Non-toxic, H: Helicase, M: Membrane protein, N: Nucleoprotein, NendoU: Uridylate-specific endoribonuclease, NSP: Non-structural protein, ORF: open reading frame, SG: Surface glycoprotein, RdRp: RNA-dependent RNA polymerase, 2′-O-MT: 2′-O-methyltransferase, ExoN: Guanine-N7 methyltransferase.
Docking analysis of T-cell epitope (VVFLHVTYV) and MHC protein structures.
| MHC alleles (PDB code) | Expression (TPM) | Protein chain | Interaction energy | Total energy | Hydrogen bond (Peptide – Receptor) |
|---|---|---|---|---|---|
| HLA-DRB3*02:02 (2Q6W) | NoExp | B | −96.73 | −1560.1 | THR7-ASP152, LEU4-SER120 |
| HLA-A*02:03 (3OX8) | 46.1625 | A | −63.72 | −1937.21 | THR7-ASP77, THR7-TYR84 |
| HLA-DQA1*05:01/DQB1*03:01 (4D8P) | NoExp | A | −69.82 | −890.23 | VAL1-SER8, TYR8-THR93, VAL1-VAL6, PHE3-SER8, TYR8-THR83, TYR8-ASP142 |
| HLA-DRB1*04:01 (5JLZ) | NoExp | B | −57.25 | −1377.11 | THR7-GLU187, HIS5-GLU187, PHE3-VAL101 |
| HLA-DRB1*01:01 (5V4N) | NoExp | C | −63.35 | −1430.47 | VAL1-HIS360, THR7-GLU411 |
| HLA-A*23:01 (5WWJ) | 78.8768 | C | −95.26 | −1611.15 | TYR8-CYS264, TYR8-CYS388, VAL9-TYR327, VAL9-ASN331 |
| HLA-A*24:02 (5XOV) | 2597.88 | A | −82.36 | −2186.76 | THR90-HIS5, HIS191-HIS5, THR190-THR7, THR190-THR7 |
| HLA-A*11:01 (6ID4) | NoExp | A | −7.12 | −1452.3 | LEU4-TYR27, LEU4-ARG6, TYR8-SER4, SER4-VAL9 |
| HLA-B*44:02 (3DX6) | NoExp | A | −83.81 | −2384.29 | THR7-ASP114,VAL1-SER167, VAL1-GLU55 |
| HLA-DPA1*01:03/DPB1*02:01 (3LQZ) | NoExp | A | −75.51 | −3195.72 | HIS5-GLU134, HIS5-HIS144 |
| HLA-A*02:06 (3OXR) | NoExp | C | −47.82 | −2993.06 | VAL1-HGLN54, PHE3-GLN54, HIS5-GLU55, HIS5-TRP51 |
| HLA-B*57:01 (3X11) | NoExp | A | −31.02 | −2183.55 | TYR8-GLN155, VAL9-TYR99, VAL1-SER116, PHE3-SER8 |
| HLA-A*68:02 (4HWZ) | 61.582 | A | −56.06 | −2054.65 | PHE3-ARG144, VAL1-ASP77, VAL9-THR73, HIS5-THR73 |
| HLA-B*44:03 (4JQX) | 38.0344 | A | −39.35 | −1709.95 | THR7-ASN77, VAL9-SER69, TYR6-GLN89 |
| HLA-B*35:01 (4PRA) | 53.433 | A | −22.74 | −1780.49 | VAL1-ARG6, VAL9-SER116, THR7-ASP144, TYR8-TYR74 |
| HLA-A*02:01(4U6Y) | 27.0879 | A | −29.1 | −1507.37 | VAL1-GLU63, VAL1-TYR99, HIS5-HIS114, THR7-HIS144 |
| HLA-DRB1*11:01(5NI9) | NoExp | B | −90.52 | −1566.14 | VAL6-TYR30, TYR8-TYR48 |
| HLA-B*58:01 (5VWH) | NoExp | A | −76.23 | −2288.87 | THR7-ARG14, HIS5-ARG21 |
| HLA-A*30:02 (6J1V) | NoExp | A | −80.19 | −2377.92 | VAL6-ARG202 |
| HLA-A*30:01 (6J1W) | NoExp | A | −46.93 | −1869.26 | PHE3-ARG273, HIS5-THR271 |
| HLA-A*03:01 (6O9B) | 23.4843 | A | −62.78 | −2263.52 | TYR8-TYR-123, TYR8-ARG114, HIS5-ARG114 |
| HLA-A*68:01 (6PBH) | 61.582 | A | −88.24 | −2076.8 | TYR6-TYR99, ALA9-THR143 |
TPM (Transcript per million).
Fig. 2Docking of between peptide and protein receptor. A) T-cell peptide (VVFLHVTYV) and HLA allele (HLA-A*24:02) protein structure (5XOV); B) B-cell peptide (KTVGELGDVRE) and protein structure of B-cell receptor (5DRW); I) Epitope structure; II and II) Cartoon structure of docked molecule (hydrogen bonds in red colour); d) Molecular level description of interaction through LIGPlot software (hydrogen bonds in green). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Molecular docking and interaction analysis of B-cell epitope to B-cell protein receptor.
| Peptide | PDB code | Protein chain | Interaction energy | Total energy | Hydrogen bonds (Peptide – Receptor) |
|---|---|---|---|---|---|
| SEQLDFIDTKRGV | 5DRW | B | −94.17 | −1656.88 | THR9-PRO49, THR9-SER48, THR9-ARG51, GLU7-LYS55 |
| 1K1F | A | −25.53 | −362.98 | THR9-ARG50, TRP7-ARG51, GLU4ARG51,CYS2-ASN39 | |
| HCGETSWQTGDFV | 5DRW | B | −85.9 | −1579.94 | SER6-GLN205, THR9-GLN205 |
| 1K1F | A | −29.14 | −366.08 | GLN8-ARG22 | |
| KGTLEPEYF | 5DRW | B | −91.21 | −1767.94 | THR3-SER20, GLU7-SER10 |
| 1K1F | A | −35.76 | −629.66 | GLU5-GLN51 | |
| KTVGELGDVRE | 5DRW | B | −37.47 | −1490.12 | THR2-PRO125, VAL3-SER127, GLU11-ASN144, GLY7-SER168, GLY7-SER182 |
| 1K1F | A | −54.36 | −686.76 | THR5-ARG25 | |
| LTGTGVLTESNK | 5DRW | B | −40.46 | −1235.01 | THR4-TYR37, THR4-ASN39, SER10-GLN43 |
| 1K1F | A | −110.1 | −652.47 | GLU9-SER18, GLU9-SER41 |
Fig. 3Genome-wide transcriptome analysis of SARS-CoV-2, T- and B-cell epitopes identification for multi-peptide based vaccine development. Docked structures of B-cell receptor and MHC alleles are given at right-side and their T-and B-cell epitopes are highlighted in yellow colour. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)