| Literature DB >> 35362839 |
Dzhemal Moten1, Desislava Kolchakova2, Krasimir Todorov3, Tsvetelina Mladenova3, Balik Dzhambazov2.
Abstract
Allergic diseases are a socially significant problem of global importance. The number of people suffering from pollen allergies has increased dramatically in recent decades. Pollen allergies affect up to 30% of the world population. Pollen of the common ragweed (Ambrosia artemisiifolia L.) is one of the most aggressive allergens in the world. We have used a series of immunoinformatics approaches to design an effective epitope-based vaccine, which might induce a competent immunity against a major allergen Amb a 11. CD8+ and CD4+ T-cell epitopes and their corresponding MHC restricted alleles were identified by prediction tools provided by immune epitope database (IEDB). Among T-cell epitopes, MHC class I peptide (GLMEPAFTYV) and MHC class II peptide (LVCFSFSLVLILGLV) were identified as most suitable. From all predicted B-cell epitopes, only one epitope (GKLVKFSEQQLVDC) containing sequence from the conserved region was chosen for next processing. Selected epitopes have been validated by molecular docking analysis. These epitopes showed a very strong binding affinity to MHC I molecule and MHC II molecule with binding energy scores - 729.3 and - 725.0 kcal/mole respectively. Performed experimental validation showed that only the MHC class II peptide (LVCFSFSLVLILGLV) can stimulate T cells from ragweed allergic patients and IgE antibodies specific to the ragweed pollen do not recognize this epitope. Therefore, this peptide could be potentially used as a vaccine against the major allergen Amb a 11. The B-cell epitope GKLVKFSEQQLVDC forms a stable complex with the IgE molecule (energy weighted score - 695,0 kcal/mole). Tested sera from patients with ragweed allergy showed that the ragweed specific IgE antibodies can bind to the identified B-cell epitope. Population coverage analysis was performed for CD8+ and CD4+ T-cell epitopes. It was predicted that CD4+ T-cell epitope (LVCFSFSLVLILGLV) covers 90.56% of the population of Europe and 99.36% of the world population. CD8+ T-cell epitope (GLMEPAFTYV) has a population coverage of 77.37% for Europe and 71.35% for all the world.Entities:
Keywords: Amb a 11; Ambrosia artemisiifolia; Immunoinformatics; Peptide-based vaccine; Pollen allergy
Mesh:
Substances:
Year: 2022 PMID: 35362839 PMCID: PMC8972712 DOI: 10.1007/s10930-022-10050-z
Source DB: PubMed Journal: Protein J ISSN: 1572-3887 Impact factor: 4.000
List of homologous allergens of Amb a 11
| No. | Allergen | Sequence ID in SwissProt/NCBI/PIR | Sequence length | E score |
|---|---|---|---|---|
| 1 | Act d 1 | P00785 | 380 | 2.2e−41 |
| 2 | Act d 1 | AAA32629 | 380 | 4.4e−41 |
| 3 | Ana c 2 | BAA21849 | 351 | 6.3e−41 |
| 4 | Car p papain | AAB02650 | 345 | 7.0e−39 |
| 5 | Gly m 1 | AAB09252 | 379 | 1.0e−35 |
| 6 | Gly m 1 | P22895 | 379 | 1.5e−35 |
| 7 | Der f 1.0109 | ABL84750 | 321 | 5.2e−25 |
| 8 | Der f 1.0110 | ABL84751 | 321 | 5.2e−25 |
| 9 | Der f 1.0101 | BAC53948 | 321 | 5.2e−25 |
| 10 | Der f 1.0108 | ABL84749 | 321 | 1.1e−24 |
| 11 | Der f 1 | P16311 | 321 | 1.6e−24 |
| 12 | Eur m 1.0102 | AAC82352 | 321 | 1.9e−24 |
| 13 | Eur m 1.0101 | P25780 | 321 | 2.9e−24 |
| 14 | Eur m 1.0101 | AAC82351 | 321 | 2.9e−24 |
| 15 | Der f 1 | CAA46316 | 212 | 6.9e−23 |
| 16 | Der p 1.0124 | CAQ68250 | 302 | 3.6e−22 |
| 17 | Der p 1.0113 | ABA39435 | 302 | 5.5e−22 |
| 18 | Der p 1 | P08176 | 320 | 7.8e−22 |
| 19 | Eur m 1.0101 | CAA42677 | 211 | 7.9e−22 |
| 20 | Blo t 1.0201 | AAQ24541 | 333 | 5.0e−16 |
| 21 | Blo t 1.0101 | AAK58415 | 221 | 6.8e−14 |
| 22 | Der p 1 | AAA28296 | 96 | 4.4e−08 |
Conserved sequences of homologous pollen allergens with their positions
| Conserved sequences of pollen allergen | Position of the selected sequences |
|---|---|
| QGGCGSCWAF | 164–173 |
| SEQELVDC | 194–201 |
| YPYV | 239–242 |
| IVGYG | 326–330 |
| YWIVRNSWDTTWG | 338–350 |
Fig. 1Phylogenetic tree based on amino acid sequences of the pollen allergen Amb a 11 and the identified 22 homologous allergens. The reconstruction has been performed by using ML analysis applying the GTR + I + G evolutionary model. The numbers above branches indicate the bootstrap support from 1000 replicates
Most potential CD8 + T-cell epitopes with interacting MHC-I alleles, their positions, length of epitope, Ann IC50, Smm IC50, and percentile rank
| No. | T-cell epitopes | Interacting MHC-I alleles | Start position | End position | Length of epitope | Percentile rank | Ann IC50 | Smm IC50 |
|---|---|---|---|---|---|---|---|---|
| 1 | LQFYSEGVY | HLA-B*15:01 | 271 | 279 | 9 | 0.1 | 7.97 | 9.44 |
| HLA-A*30:02 | 0.22 | 22 | 93.64 | |||||
| 2 | NVFKYNVRR | HLA-A*68:01 | 58 | 66 | 9 | 0.12 | 5.59 | 7.47 |
| HLA-A*33:01 | 0.44 | 41.77 | 106.67 | |||||
| 3 | HLA-A*02:01 | 196 | 205 | 10 | 0.12 | 5.24 | 2.23 | |
| HLA-A*02:03 | 0.12 | 3.32 | 2.31 | |||||
| HLA-A*02:06 | 0.17 | 4.1 | 19.01 | |||||
| HLA-A*68:02 | 0.2 | 19.67 | 32.95 | |||||
| HLA-B*15:01 | 0.72 | 21.51 | 175.15 | |||||
| HLA-A*03:01 | 1.24 | 22.66 | 281.39 | |||||
| HLA-A*31:01 | 2.15 | 23.4 | 1755.17 | |||||
| 4 | MTNLEFVNTY | HLA-A*26:01 | 87 | 96 | 10 | 0.15 | 169.65 | 287.53 |
| HLA-A*01:01 | 0.28 | 121.96 | 416.68 | |||||
| 5 | TYANSKISHF | HLA-A*23:01 | 95 | 104 | 10 | 0.16 | 33.13 | 18.82 |
| HLA-A*24:02 | 0.2 | 65.19 | 68.98 | |||||
| 6 | RFNVFKYNVR | HLA-A*31:01 | 56 | 65 | 10 | 0.16 | 4.51 | 21.55 |
| 7 | GENEKGIKFW | HLA-B*44:02 | 297 | 306 | 10 | 0.17 | 29.91 | 85.68 |
| HLA-B*44:03 | 0.28 | 51.23 | 269.2 | |||||
| 8 | MEINKLVCF | HLA-B*44:02 | 1 | 9 | 9 | 0.18 | 42.26 | 248.66 |
| HLA-B*44:03 | 0.18 | 95.9 | 110.53 | |||||
| 9 | ESEEGFMGMY | HLA-A*26:01 | 31 | 40 | 10 | 0.19 | 86.03 | 542.86 |
| HLA-A*01:01 | 0.2 | 37.78 | 351.4 | |||||
| 10 | FADMTNLEF | HLA-B*53:01 | 84 | 92 | 9 | 0.2 | 20.49 | 39.53 |
| HLA-A*01:01 | 0.22 | 50.8 | 313.73 | |||||
| HLA-B*35:01 | 0.4 | 8.44 | 15.76 |
Most potential CD4 + T-cell epitopes with interacting MHC-II alleles, their positions, length of epitope, Smm IC50, and percentile rank
| No. | T-cell epitopes | Interacting MHC-II alleles | Start position | End position | Length | Percentile rank | Smm IC50 |
|---|---|---|---|---|---|---|---|
| 1 | CGSCWAFAAVVALEG | HLA-DQA1*04:01/DQB1*04:02 | 152 | 166 | 15 | 0.06 | 112.00 |
| HLA-DQA1*01:02/DQB1*06:02 | 1.05 | 123.00 | |||||
| HLA-DQA1*05:01/DQB1*03:01 | 1.45 | 34.00 | |||||
| 2 | GCGSCWAFAAVVALE | HLA-DQA1*04:01/DQB1*04:02 | 151 | 165 | 15 | 0.06 | 110.00 |
| HLA-DQA1*01:02/DQB1*06:02 | 1.15 | 133.00 | |||||
| 3 | GSCWAFAAVVALEGI | HLA-DQA1*04:01/DQB1*04:02 | 153 | 167 | 15 | 0.06 | 112.00 |
| HLA-DQA1*01:02/DQB1*06:02 | 1.05 | 121.00 | |||||
| HLA-DQA1*05:01/DQB1*03:01 | 1.65 | 37.00 | |||||
| 4 | KLVCFSFSLVLILGL | HLA-DPA1*03:01/DPB1*04:02 | 5 | 19 | 15 | 0.11 | 17.00 |
| HLA-DPA1*01/DPB1*04:01 | 0.51 | 80.00 | |||||
| 5 | HLA-DPA1*03:01/DPB1*04:02 | 6 | 20 | 15 | 0.11 | 17.00 | |
| HLA-DPA1*01/DPB1*04:01 | 0.51 | 80.00 | |||||
| HLA-DPA1*01:03/DPB1*02:01 | 0.66 | 54.00 | |||||
| HLA-DPA1*02:01/ | 0.87 | 102.00 | |||||
| DPB1*01:01 | 1.30 | 284.00 | |||||
| HLA-DQA1*01:02/ | |||||||
| DQB1*06:02 | |||||||
| 6 | VCFSFSLVLILGLVE | HLA-DPA1*03:01/DPB1*04:02 | 7 | 21 | 15 | 0.12 | 18.00 |
| HLA-DPA1*01/DPB1*04:01 | 0.60 | 95.00 | |||||
| HLA-DPA1*02:01/DPB1*01:01 | 0.87 | 43.00 | |||||
| 7 | INKLVCFSFSLVLIL | HLA-DPA1*03:01/DPB1*04:02 | 3 | 17 | 15 | 0.18 | 23.00 |
| HLA-DPA1*01/DPB1*04:01 | 0.35 | 62.00 | |||||
| HLA-DPA1*01:03/DPB1*02:01 | 0.39 | 36.00 | |||||
| 8 | NKLVCFSFSLVLILG | HLA-DPA1*03:01/DPB1*04:02 | 4 | 18 | 15 | 0.18 | 23.00 |
| HLA-DPA1*01/DPB1*04:01 | 0.35 | 62.00 | |||||
| HLA-DPA1*01:03/DPB1*02:01 | 0.37 | 35.00 | |||||
| 9 | CFSFSLVLILGLVES | HLA-DPA1*03:01/DPB1*04:02 | 8 | 22 | 15 | 0.26 | 32.00 |
| HLA-DPA1*02:01/DPB1*01:01 | 1.25 | 65.00 | |||||
| 10 | FSFSLVLILGLVESF | HLA-DPA1*03:01/DPB1*04:02 | 9 | 23 | 15 | 0.27 | 33.00 |
| HLA-DQA1*05:01/DQB1*02:01 | 0.85 | 428.00 |
Population coverage for predicted T-cell epitopes
| Predicted CD8 + T-cell epitopes | Interacting MHC I alleles | Predicted CD4 + T-cell epitopes | Interacting MHC II alleles | Population coverage | |
|---|---|---|---|---|---|
| World | Europe | ||||
| LQFYSEGVY | HLA-B*15:01 | CGSCWAFAAVVALEG | HLA-DQA1*04:01/ | CD8 + : | CD8 + : |
| HLA-A*30:02 | DQB1*04:02 | 10.67% | 11.42% | ||
| HLA-DQA1*01:02/ | CD4 + : | CD4 + : | |||
| DQB1*06:02 | 80.52% | 79.36% | |||
| HLA-DQA1*05:01/ | |||||
| DQB1*03:01 | |||||
| NVFKYNVRR | HLA-A*68:01 | GCGSCWAFAAVVALE | HLA-DQA1*04:01/ | CD8 + : | CD8 + : |
| HLA-A*33:01 | DQB1*04:02 | 7.56% | 7.64% | ||
| HLA-DQA1*01:02/ | CD4 + : | CD4 + : | |||
| DQB1*06:02 | 47.25% | 45.43% | |||
| HLA-A*02:01 | GSCWAFAAVVALEGI | HLA-DQA1*04:01/ | CD8 + : | CD8 + : | |
| HLA-A*02:03 | DQB1*04:02 | 71.35% | 77.37% | ||
| HLA-A*02:06 | HLA-DQA1*01:02/ | CD4 + : | CD4 + : | ||
| HLA-A*68:02 | DQB1*06:02 | 68.82% | 79.36% | ||
| HLA-B*15:01 | HLA-DQA1*05:01/ | ||||
| HLA-A*03:01 | DQB1*03:01 | ||||
| HLA-A*31:01 | |||||
| MTNLEFVNTY | HLA-A*26:01 | KLVCFSFSLVLILGL | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| HLA-A*01:01 | DPB1*04:02 | 22.62% | 32.31% | ||
| HLA-DPA1*01/ | CD4 + : | CD4 + : | |||
| DPB1*04:01 | 68.91% | 86.60% | |||
| TYANSKISHF | HLA-A*23:01 | HLA-DQA1*03:01/ | CD8 + : | CD8 + : | |
| HLA-A*24:02 | DQB1*02:01 | 26.18% | 21.31% | ||
| HLA-DPA1*03:01/ | CD4 + : | CD4 + : | |||
| DPB1*04:02 | 90.56% | 99.36% | |||
| HLA-DPA1*01/ | |||||
| DPB1*04:01 | |||||
| HLA-DPA1*01:03/ | |||||
| DPB1*02:01 | |||||
| HLA-DPA1*02:01/ | |||||
| DPB1*01:01 | |||||
| RFNVFKYNVR | HLA-A*31:01 | VCFSFSLVLILGLVE | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| DPB1*04:02 | 5.36% | 4.72% | |||
| HLA-DPA1*01/ | CD4 + : | CD4 + : | |||
| DPB1*04:01 | 67.19% | 79.13% | |||
| HLA-DPA1*02:01/ | |||||
| DPB1*01:01 | |||||
| GENEKGIKFW | HLA-B*44:02 | INKLVCFSFSLVLIL | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| HLA-B*44:03 | DPB1*04:02 | 13.63% | 20.13% | ||
| HLA-DPA1*01/ | CD4 + : | CD4 + : | |||
| DPB1*04:01 | 84.60% | 99.36% | |||
| HLA-DPA1*01:03/ | |||||
| DPB1*02:01 | |||||
| MEINKLVCF | HLA-B*44:02 | NKLVCFSFSLVLILG | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| HLA-B*44:03 | DPB1*04:02 | 13.63% | 20.13% | ||
| HLA-DPA1*01/ | CD4 + : | CD4 + : | |||
| DPB1*04:01 | 84.60% | 99.36% | |||
| HLA-DPA1*01:03/ | |||||
| DPB1*02:01 | |||||
| ESEEGFMGMY | HLA-A*26:01 | CFSFSLVLILGLVES | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| HLA-A*01:01 | DPB1*04:02 | 22.62% | 32.31% | ||
| HLA-DPA1*02:01/ | CD4 + : | CD4 + : | |||
| DPB1*01:01 | 54.34% | 41.00% | |||
| FADMTNLEF | HLA-B*53:01 | FSFSLVLILGLVESF | HLA-DPA1*03:01/ | CD8 + : | CD8 + : |
| HLA-A*01:01 | DPB1*04:02 | 26.30% | 33.82% | ||
| HLA-B*35:01 | HLA-DQA1*05:01/ | CD4 + : | CD4 + : | ||
| DQB1*02:01 | 63.48% | 62.03% | |||
Surface accessibility (threshold = 1.000), hydrophilicity (threshold = 1.916), flexibility (threshold = 1.004), beta turn (threshold = 1.016) and antigenicity (threshold = 1.013) prediction score for each residue of B-cell epitope
| B-cell epitope | Emini surface accessibility | Parker hydrophilicity | Karplus and Schulz flexibility | Chou and Fashman beta turn | Kolaskar and Tongaonkar antigenicity |
|---|---|---|---|---|---|
| G | 0.949 | 1.714 | 1.052 | 1.063 | 1.053 |
| K | 0.863 | 1.943 | 1.045 | 1.094 | 1.021 |
| L | 0.998 | 1.829 | 1.017 | 1.089 | 1.052 |
| V | 1.039 | 1.614 | 0.998 | 1.057 | 1.067 |
| K | 1.062 | 1.719 | 1.0 | 1.084 | 1.064 |
| F | 1.379 | 1.957 | 1.013 | 1.036 | 1.076 |
| S | 2.285 | 2.729 | 1.041 | 1.091 | 1.042 |
| 1.942 | 1.943 | 1.059 | 1.104 | 1.023 | |
| 1.808 | 2.6 | 1.056 | 1.131 | 1.088 | |
| 1.007 | 3.343 | 1.032 | 1.054 | 1.056 | |
| 1.312 | 2.614 | 1.097 | 1.192 | 1.113 | |
| 1.3 | 2.929 | 1.066 | 1.023 | 1.115 | |
| 1.172 | 1.471 | 1.047 | 1.069 | 1.088 | |
| 1.386 | 1.357 | 1.05 | 1.066 | 1.073 |
Fig. 23D structures of the predicted T and B-cell epitopes. A CD4 + T-cell epitope; B CD8 + T-cell epitope; C B-cell epitope
Fig. 3A Peptide-allele docked complex of the predicted CD4 + T-cell epitope “LVCFSFSLVLILGLV” with respective frequent allele (energy weighted score − 725.0 kcal/mole); B Molecular docking of CD8 + T-cell epitope “GLMEPAFTYV” of Amb a 11 allergenic protein docked in HLA-A0201 (energy weighted score − 729.3 kcal/mole); C B-cell epitope “GKLVKFSEQQLVDC” is bound with immunoglobulin E “4poz” with lowest energy weighted score − 695.0 kcal/mole
Fig. 4T-cell response to identified T- and B-cell epitopes in patients with ragweed pollen allergy (n = 10) and healthy individuals (n = 10). IL-2 production in the supernatants was assayed by ELISA after stimulation of the cells with the synthetic peptides for 72 h. T-cell epitope 1: GLMEPAFTYV; T-cell epitope 2: LVCFSFSLVLILGLV; B-cell epitope: GKLVKFSEQQLVDC. **p < 0.01 as determined by Kruskal–Wallis test. Results are presented as means ± SD (n = 10)
Fig. 5Serum levels of specific IgE antibodies to identified T- and B-cell epitopes in patients with ragweed pollen allergy (n = 10) and healthy individuals (n = 10). T-cell epitope 1: GLMEPAFTYV; T-cell epitope 2: LVCFSFSLVLILGLV; B-cell epitope: GKLVKFSEQQLVDC. ***p < 0.001 as determined by Kruskal–Wallis test