| Literature DB >> 35877438 |
Piotr Minkiewicz1, Christopher P Mattison2, Małgorzata Darewicz1.
Abstract
The aim of the study presented here was to determine if there is a correlation between the presence of specific protein domains within tree nut allergens or tree nut allergen epitopes and the frequency of bioactive fragments and the predicted susceptibility to enzymatic digestion in allergenic proteins from tree nuts of cashew (Anacardium occidentale), pecan (Carya illinoinensis), English walnut (Juglans regia) and pistachio (Pistacia vera) plants. These bioactive peptides are distributed along the length of the protein and are not enriched in IgE epitope sequences. Classification of proteins as bioactive peptide precursors based on the presence of specific protein domains may be a promising approach. Proteins possessing a vicilin, N-terminal family domain, or napin domain contain a relatively low occurrence of bioactive fragments. In contrast, proteins possessing the cupin 1 domain without the vicilin N-terminal family domain contain a relatively high total frequency of bioactive fragments and predicted release of bioactive fragments by the joint action of pepsin, trypsin, and chymotrypsin. This approach could be utilized in food science to simplify the selection of protein domains enriched for bioactive peptides.Entities:
Keywords: albumin; bioactive; bioinformatics; cupin; epitope; food allergy; legumin; peptide; tree nut; vicilin
Year: 2022 PMID: 35877438 PMCID: PMC9317212 DOI: 10.3390/cimb44070214
Source DB: PubMed Journal: Curr Issues Mol Biol ISSN: 1467-3037 Impact factor: 2.976
List of allergenic proteins submitted to in silico analysis.
| No | Species | Protein | Allergen | UniProt Access. No | InterPro Domain IDs 1 | Set of Domains |
|---|---|---|---|---|---|---|
| 1 |
| 7S vicilin | Ana o 1.0101 | Q8L5L5 | IPR006045, IPR014710, IPR011051 | 3a |
| 2 |
| 7S vicilin | Ana o 1.0102 | Q8L5L6 | IPR006045, IPR014710, IPR011051 | 3a |
| 3 |
| 11S legumin | Ana o 2.0101 | Q8GZP6 | IPR022379, IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
| 4 |
| 2S albumin | Ana o 3.0101 | Q8H2B8 | IPR036312, IPR016140, IPR000617 | 1 |
| 5 |
| 2S albumin | Car i 1.0101 | Q84XA9 | IPR036312, IPR016140, IPR000617 | 1 |
| 6 |
| 7S viclin | Car i 2.0101 | B3STU4 | IPR006045, IPR014710, IPR011051, IPR006792 | 2 |
| 7 |
| 11S legumin | Car i 4.0101 | B5KVH4 | IPR022379, IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
| 8 |
| 2S albumin | Jug r 1.0101 | P93198 | IPR036312, IPR016140, IPR000617 | 1 |
| 9 |
| 7S vicilin | Jug r 2.0101 | Q9SEW4 | IPR006045, IPR014710, IPR011051, IPR006792 | 2 |
| 10 |
| non-specific lipid transfer protein type 1 (nsLTP1) | Jug r 3 | C5H617 | IPR036312, IPR016140, IPR000528 | - |
| 11 |
| 11S legumin | Jug r 4.0101 | Q2TPW5 | IPR022379, IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
| 12 |
| 2S albumin | Pis v 1.0101 | B7P072 | IPR036312, IPR016140, IPR000617 | 1 |
| 13 |
| 11S legumin | Pis v 2.0101 | B7P073 | IPR022379, IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
| 14 |
| 11S legumin | Pis v 2.0201 | B7P074 | IPR022379, IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
| 15 |
| 7S vicilin | Pis v 3.0101 | B4X640 | IPR006045, IPR014710, IPR011051 | 3a |
| 16 |
| manganese superoxide dismutase | Pis v 4.0101 | B2BDZ8 | IPR001189, IPR019833, IPR019832, IPR019831, IPR036324, IPR036314 | - |
| 17 |
| 11S legumin | Pis v 5.0101 | B7SLJ1 | IPR006044, IPR006045, IPR014710, IPR011051 | 3b |
1 IDs of domains according to the InterPro database.
Figure 1Sequences of allergens from Anacardium occidentale. Location of epitopes is indicated using red font. Green background indicates epitopes with highest ∑A (range 1.6000–2.1000), blue background indicates epitopes with lowest ∑A (range 0.1000–0.5999).
Figure 2Sequences of allergens from Carya illinoinensis. Location of epitopes is indicated using red font. Green background indicates epitopes with highest ∑A (range 1.6000–2.1000), blue background indicates epitopes with lowest ∑A (range 0.1000–0.5999). Bold font indicates epitopes and their fragments occurring both in Carya illinoinensis and Juglans regia proteins.
Figure 3Sequences of allergens from Juglans regia. Location of epitopes is indicated using red font. Green background indicates epitopes with highest ∑A (range 1.6000–2.1000), blue background indicates epitopes with lowest ∑A (range 0.1000–0.5999). Bold font indicates epitopes and their fragments occurring both in Carya illinoinensis and Juglans regia proteins.
Figure 4Sequences of allergens from Pistacia vera. Location of epitopes is indicated using red font. Green background indicates epitopes with highest ∑A (range 1.6000–2.1000).
Interspecies share of linear epitopes between proteins from C. illinoinensis and J. regia.
| Epitopes (ID According to the Immune Epitope Database) 1 | Allergen Containing Epitopes, Annotated in Immune Epitope Database | Allergen Containing Epitopes, Found in the UniProt Database Using BLAST |
|---|---|---|
| 157220, 157381, 157603, 157808, 157835 | Car i 4.0101 | Jug r 4.0101 |
| 174135 | Jug r 1.0101 | Car i 1.0101 |
| 157811, 158509, 241161, 241344, 241355, 241557 | Jug r 2.0101 | Car i 2.0101 |
| 114508, 114569, 114610, 114627, 114695 | Jug r 4.0101 | Car i 4.0101 |
1 Sequences of epitopes retrieved from IEDB are presented in Table S2 (Supplementary Materials).
Figure 5Scores describing entire proteins as potential precursors of bioactive peptides. (a) domain sets present in proteins: 1 (blue)—Set 1; 2 (green)—set 2; 3 (yellow)—set 3 (3a + 3b); (b) ∑A; (c) DHt [%]; (d) ∑AE. Proteins are sorted from highest to lowest ∑A value (column (b)).
Figure 6Scores concerning potential release of bioactive peptides from epitopes. (a) domain sets present in proteins: 1 (blue)—Set 1; 2 (green)—set 2; 3 (yellow)—set 3 (3a + 3b); (b) ∑A; (c) DHt [%]; (d) ∑AE. Epitopes are sorted from highest to lowest ∑A value (column (b)).
Figure 7Scores attributed to particular set of domains. ∑A—total frequency of bioactive fragments occurrence in a protein sequence (Equation (1)); DHt—theoretical degree of hydrolysis (Equation (2)) expressed in %; ∑AE—total frequency of release of bioactive fragments by proteolytic enzymes (Equation (3)), SD—standard deviation.
Figure 8Differences in scores between groups of proteins containing particular sets of domains, expressed as matrices of Euclidean distances: (a) ∑A; (b) DHt [%]; (c) ∑AE Differences subjected to t-test are labeled. Symbol “+” indicates differences that are statistically significant at p < 0.05. Symbol “−“ indicates a lack of statistically significant difference.
Summary of domains present in allergenic proteins.
| ID | Name | Score for Entire Proteins | Score for Epitopes | ||||
|---|---|---|---|---|---|---|---|
| ∑A | DHt | ∑AE | ∑A | DHt | ∑AE | ||
| IPR000528 | Plant lipid transfer protein/Par allergen | + (s) | − (s) | + (s) | nd | nd | nd |
| IPR000617 | Napin/Bra allergen | − | − | − | − | 0 | − |
| IPR001189 | Manganese/iron superoxide dismutase | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
| IPR006044 | 11-S seed storage protein, plant | 0 | 0 | 0 | 0 | 0 | 0 |
| IPR006045 | Cupin 1 | + | 0 | + | + | 0 | + |
| IPR006792 | Vicilin, N-terminal | − | + | − | − | 0 | 0 |
| IPR011051 | RmlC-like cupin domain superfamily | + | 0 | + | + | 0 | + |
| IPR014710 | RmlC-like jelly roll fold | + | 0 | + | + | 0 | + |
| IPR016140 | Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain | 0 | 0 | 0 | − | 0 | − |
| IPR019831 | Manganese/iron superoxide dismutase, N-terminal | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
| IPR019832 | Manganese/iron superoxide dismutase, C-terminal | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
| IPR019833 | Manganese/iron superoxide dismutase, binding site | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
| IPR022379 | 11-S seed storage protein, conserved site | 0 | 0 | 0 | 0 | 0 | 0 |
| IPR036312 | Bifunctional inhibitor/plant lipid transfer protein/seed storage helical domain superfamily | 0 | 0 | 0 | − | 0 | − |
| IPR036314 | Manganese/iron superoxide dismutase, C-terminal domain superfamily | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
| IPR036324 | Manganese/iron superoxide dismutase, N-terminal domain superfamily | + (s) | 0 (s) | 0 (s) | nd | nd | nd |
“+”—presence of domain associated with high score; “−“—presence of domain associated with low score; “0”—presence of domain has no defined influence on score; “nd”—no data; (s)—score for single protein.
Figure 9Cupin domain proteins associated with relatively high ∑A scores. Ana o 1.0101 or (a) and Ana o 2.0101 (b) ribbon models indicating the relative placement of segments within the two allergens having high ∑A scores. Loop sections are colored red, cupin domains yellow, and segments with high ∑A scores in green.