| Literature DB >> 32708775 |
Maciej Kochanowski1, Mirosław Różycki1, Joanna Dąbrowska1, Aneta Bełcik1, Jacek Karamon1, Jacek Sroka1, Tomasz Cencek1.
Abstract
Anisakis simplex third-stage larvae are the main source of hidden allergens in marine fish products. Some Anisakis allergens are thermostable and, even highly processed, could cause hypersensitivity reactions. However, Anisakis proteome has not been studied under autoclaving conditions of 121 °C for 60 min, which is an important process in the food industry. The aim of the study was the identification and characterization of allergens, potential allergens, and other proteins of heat-treated A. simplex larvae. Liquid chromatography-tandem mass spectrometry (LC-MS/MS) was used to identify 470 proteins, including allergens-Ani s 1, Ani s 2, Ani s 3, Ani s 4, Ani s 5-and 13 potential allergens that were mainly homologs of Anisakis spp., Ascaris spp., and Acari allergens. Ani s 2, Ani s 3, Ani s 5, and three possible allergens were found among the top 25 most abundant proteins. The computational analysis allowed us to detect allergen epitopes, assign protein families, and domains as well as to annotate the localization of proteins. The predicted 3D models of proteins revealed similarities between potential allergens and homologous allergens. Despite the partial degradation of heated A. simplex antigens, their immunoreactivity with anti-A. simplex IgG antibodies was confirmed using a Western blot. In conclusion, identified epitopes of allergenic peptides highlighted that the occurrence of Anisakis proteins in thermally processed fish products could be a potential allergic hazard. Further studies are necessary to confirm the IgE immunoreactivity and thermostability of identified proteins.Entities:
Keywords: Anisakis simplex; allergen; autoclaving; bioinformatics; foodborne parasite; hidden allergen; mass spectrometry; potential allergen; proteome
Mesh:
Substances:
Year: 2020 PMID: 32708775 PMCID: PMC7407331 DOI: 10.3390/biom10071066
Source DB: PubMed Journal: Biomolecules ISSN: 2218-273X
Figure 1Colloidal Coomassie-stained SDS-PAGE analysis (color inversion mode) of the three following crude (CR) antigens of A. simplex: native, heated for 60 min at 100 °C, and autoclaved for 60 min at 121 °C (a). Western blot analysis of anti-A. simplex rabbit IgG antibodies reactivity against following CR antigens of A. simplex: native, heated 60 min at 100 °C, and autoclaved for 60 min at 121 °C (b). Pos.—membrane incubated with hyperimmune serum from a rabbit immunized with A. simplex CR antigen; Neg.—membrane incubated with rabbit preimmune serum. Molecular weight (Mw) estimations are presented in kilodaltons (kDa), as performed by Bio-1D software. The integrated density (IntDen) was calculated by ImageJ software.
Figure 2The three-dimensional scatter plot analysis of A. simplex proteins (n = 470) identified using LC-MS/MS. The proteins were distributed based on molecular weight (Mw), isoelectric point (pI), and the relative abundance of proteins. The Mw and pI values were calculated using the ExPASy Compute pI/Mw tool, while the relative abundance was estimated based on the average exponentially modified protein abundance index (emPAI) values (mean of three biological replicates) calculated by Mascot. The relative abundance is shown in log2 scale. Red cubes are allergens (n = 5), and cyan cubes are potential allergens (n = 13). Cubes (allergens and potential allergens) are signed with UniProt ID and have a vertical line for better pI and Mw visualization. Yellow spheres are the other identified proteins. A scatter plot was constructed using Teraplot software (ver. 1.4.06; Kylebank Software Ltd., Ayr, UK).
The top 25 most abundant proteins of Anisakis simplex identified by liquid chromatography-tandem mass spectrometry (LC-MS/MS).
| No | UniProt ID | Protein Name 1 | InterPro Family 2 | InterPro Domain 2 | Mw 3 (Da) | pI 3 | Mascot Scor 4 | Coverage 4 (%) | emPAI Value 4 |
|---|---|---|---|---|---|---|---|---|---|
| 1 | A0A0M3K6N3 | Myosin, essential light chain (inferred by orthology to a Caenorhabditis elegans protein) | EF-hand domain pair | EF-hand domain | 16,418 | 4.89 | 4418 | 90 | 201.76 |
| 2 | A0A0M3KCE6 | Tropomyosin (inferred by orthology to a C. elegans protein) | Tropomyosin | -5 | 13,701 | 4.54 | 3023 | 63 | 109.71 |
| 3 | A0A0M3J349 | DUF4440 domain-containing protein | Nuclear transport factor 2 (NTF2)-like domain superfamily (IPR032710) | Domain of unknown function DUF4440 | 13,671 | 6.29 | 785 | 72 | 81.05 |
| 4 | A0A0M3J4D2 | Uncharacterized protein | - | - | 23,930 | 8.72 | 1897 | 71 | 67.66 |
| 5 | A0A0M3K7H9 | Uncharacterized protein | - | - | 11,337 | 9.89 | 922 | 70 | 63.02 |
| 6 | Q9NAS5 | Tropomyosin (allergen Ani s 3) | Tropomyosin | - | 33,203 | 4.68 | 3241 | 72 | 62.69 |
| 7 | A0A0M3KA05 | SXP/RAL-2 family protein 2 isoform 1 | - | Domain of unknown function DUF4440 | 14,517 | 4.52 | 5725 | 71 | 45.10 |
| 8 | A0A0M3K8L6 | Uncharacterized protein | DVA-1 superfamily | Polyprotein allergen, nematode | 42,337 | 7.07 | 3504 | 62 | 42.49 |
| 9 | A0A0M3KF53 | Uncharacterized protein | - | - | 26,477 | 4.98 | 2630 | 70 | 38.24 |
| 10 | A0A0M3KIS5 | IF rod domain-containing protein | Intermediate filament, rod domain, coil 1B | Intermediate filament, rod domain | 27,871 | 5.17 | 1999 | 73 | 33.77 |
| 11 | A0A0M3JCV3 | KASH_CCD domain-containing protein | - | KASH5-like coiled-coil domain | 17,230 | 5.4 | 1885 | 68 | 32.84 |
| 12 | A0A0M3J6C9 | Uncharacterized protein | - | - | 26,754 | 7.78 | 1755 | 54 | 30.87 |
| 13 | A0A0M3JGD2 | Uncharacterized protein | - | - | 10,889 | 4.86 | 771 | 52 | 23.42 |
| 14 | A0A158PP35 | Paramyosin (inferred by orthology to a C. elegans protein) | - | Myosin tail | 101,686 | 5.34 | 11203 | 67 | 22.63 |
| 15 | A0A0M3JX08 | Protein lethal(2)essential for life (inferred by orthology to a Drosophila melanogaster protein) | HSP20-like chaperone | Alpha-crystallin/Hsp20 domain | 21,063 | 5.98 | 1284 | 80 | 21.20 |
| 16 | A0A0M3JAH1 | Intermediate filament protein ifb-1 (inferred by orthology to a C. elegans protein) | Lamin tail domain superfamily | Intermediate filament, rod domain | 24,324 | 5.33 | 991 | 65 | 20.01 |
| 17 | A0A0M3JTQ4 | Myosin regulatory light chain 2 (inferred by orthology to a D. melanogaster protein) | EF-hand domain pair | EF-hand domain | 18,715 | 5.05 | 2597 | 81 | 19.51 |
| 18 | A0A0M3KKR0 | Uncharacterized protein | - | - | 5009 | 4.79 | 1077 | 56 | 18.10 |
| 19 | A0A0M3J766 | Uncharacterized protein | - | - | 13,471 | 5.82 | 522 | 48 | 16.84 |
| 20 | Q9NJA9 | Paramyosin (allergen Ani s 2) | - | Myosin tail | 100,461 | 5.21 | 9369 | 69 | 16.04 |
| 21 | A1IKL2 | SXP/RAL-2 family protein Ani s 5 (allergen Ani s 5) | - | Domain of unknown function DUF4440 | 14,738 | 4.85 | 1765 | 47 | 14.65 |
| 22 | A0A0M3JID9 | IF rod domain-containing protein | Intermediate filament, rod domain, coil 1B | Intermediate filament, rod domain | 10,555 | 6.29 | 649 | 43 | 14.45 |
| 23 | A0A0M3JGV0 | Uncharacterized protein | - | - | 13,766 | 4.73 | 844 | 42 | 14.31 |
| 24 | A0A0M3IYQ5 | Glutamate dehydrogenase, mitochondrial (inferred by orthology to a D. melanogaster protein) | NAD(P)-binding domain superfamily | - | 11,634 | 8.89 | 650 | 61 | 14.09 |
| 25 | A0A0M3J159 | Small heat shock protein (SHSP) domain-containing protein | HSP20-like chaperone | Alpha-crystallin/Hsp20 domain | 13,230 | 6.29 | 1036 | 54 | 13.78 |
1 Protein name according to the Universal Protein Resource (UniProt); 2 InterPro families and InterPro domains were predicted using OmicsBox software; 3 the theoretical molecular weight (Mw) in daltons (Da) and theoretical isoelectric point (pI) were estimated using the ExPASy Compute pI/Mw tool; 4 Mascote score, coverage (in %), and the exponentially modified protein abundance index (emPAI) were calculated as the average of three biological replicates using values derived from Mascot; 5 not detected.
Figure 3Results of the InterPro analysis of all detected proteins performed using OmicsBox software. Most abundant InterPro families (a); most abundant InterPro domains (b).
Figure 4Gene ontology (GO) analysis of cellular components of all identified proteins (a), and the top 25 most abundant proteins (b). The number of annotations was limited for better readability in the following manner: a minimum of 38 matching sequences in GO analysis of all proteins, and a minimum of 4 matching sequences in the case of the top 25 most abundant proteins. Annotations were performed using OmicsBox software.
Allergens of A. simplex identified using LC-MS/MS.
| No | UniProt ID | Allergen Name 1 | Protein Name | InterPro Family | InterPro Domain | AllFam 2 | Mw (Da) | pI | Mascot Score | Coverage (%) | emPAI Value |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Q7Z1K3 | Ani s 1 | Major allergen Ani s 1 (21 kDa allergen) | Pancreatic trypsin inhibitor Kunitz domain superfamily | Pancreatic trypsin inhibitor Kunitz domain | Animal Kunitz serine protease inhibitor (AF003) | 18,869 | 7.48 | 481 | 21 | 2.53 |
| 2 | Q9NJA9 | Ani s 2 | Paramyosin | - | Myosin tail | Myosin heavy chain | 100,461 | 5.21 | 9369 | 69 | 16.04 |
| 3 | Q9NAS5 | Ani s 3 | Tropomyosin | Tropomyosin | - | Tropomyosin | 33,203 | 4.68 | 3241 | 72 | 62.69 |
| 4 | Q14QT4 | Ani s 4 | Ani s 4 allergen | - | Cystatin domain | Cystatin | 10,291 | 5.54 | 196 | 27 | 3.67 |
| 5 | A1IKL2 | Ani s 5 | SXP/RAL-2 family protein Ani s 5 | - | Domain of unknown function DUF148 | SXP/RAL-2 family | 14,738 | 4.85 | 1765 | 47 | 14.65 |
1 Allergen name approved by the World Health Organization and International Union of Immunological Societies (WHO/IUIS) Allergen Nomenclature Sub-committee; 2 classification of allergen family was conducted according to ‘AllFam - The Database of Allergen Families’ (ver. 2017.03.07).
Potential allergens of A. simplex identified by LC-MS/MS and AllergenOnline.org analysis.
| Possible Allergens 1 | Homologous Allergens 1 | AllergenOnline.org Result 1 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| No | UniProt ID | Protein Name | InterPro Family | InterPro Domain | Mw (Da) | pI | emPAI Value | Genbank ID | ALLERGEN name | Protein Name (Taxonomy) | AllFam | Identity 1 | Similarity 1 |
| 1 | A0A0M3KCE6 | Tropomyosin (inferred by orthology to a C. elegans protein) | Tropomyosin | - | 13,701 | 4.54 | 109.71 | ACN32322.1 | Asc l 3 | Tropomyosin (Ascaris lumbricoides) | Tropomyosin | 0.946 | 0.973 |
| 2 | A0A0M3KA05 | SXP/RAL-2 family protein 2 isoform 1 | - | Domain of unknown function DUF148 | 14,517 | 4.52 | 45.10 | BAF75681.1 | Ani s 8 | SXP/RAL-2 family protein 2 isoform 1 (Anisakis simplex) | SXP/RAL-2 family | 0.987 | 1 |
| 3 | A0A158PP35 | Paramyosin (inferred by orthology to a C. elegans protein) | - | Myosin tail | 101,686 | 5.34 | 22.63 | Q9NJA9.1 | Ani s 2 | Paramyosin (Anisakis simplex) | Myosin heavy chain | 0.854 | 0.958 |
| 4 | A0A0M3J4T5 | DUF148 domain-containing protein | - | Domain of unknown function DUF148 | 16,646 | 5.56 | 7.10 | BAF43534.1 | Ani s 5 | SXP/RAL-2 family protein (Anisakis simplex) | SXP/RAL-2 family | 0.971 | 0.993 |
| 5 | A0A0M3J693 | Tropomyosin (inferred by orthology to a C. elegans protein) | Tropomyosin | - | 11,304 | 4.64 | 7.79 | ACN32322.1 | Asc l 3 | Tropomyosin (Ascaris lumbricoides) | Tropomyosin | 0.824 | 0.953 |
| 6 | A0A0M3K9V2 | Heat shock 70 kDa protein cognate 1 (inferred by orthology to a D. melanogaster protein) | Heat shock protein 70 kD, peptide-binding domain superfamily | - | 67,585 | 5.45 | 4.84 | AOD75395.1 | Tyr p 28 | Heat shock-like protein (Tyrophagus putrescentiae) | Heat shock protein (Hsp70) | 0.737 | 0.86 |
| 7 | A0A0M3KIW1 | Uncharacterized protein | Heat shock protein 70 kD, peptide-binding domain superfamily | - | 24,599 | 8.49 | 2.31 | AOD75395.1 | Tyr p 28 | Heat shock-like protein (Tyrophagus putrescentiae) | Heat shock protein (Hsp70) | 0.851 | 0.95 |
| 8 | A0A0M3J6G4 | Peptidyl-prolyl cis-trans isomerase | Cyclophilin-like domain superfamily | Cyclophilin-type peptidyl-prolyl cis-trans isomerase domain | 12,614 | 9.04 | 1.99 | AVV30163.1 | Ole e 15 | Cyclophilin 0101 (Olea europaea) | - | 0.789 | 0.916 |
| 9 | A0A0M3JU57 | Troponin-like protein | EF-hand domain pair | EF-hand domain | 19,444 | 4.15 | 1.39 | CAB58171.1 | Ani s troponin C | Troponin-like protein (Anisakis simplex) | EF hand family (AF007) | 0.994 | 1 |
| 10 | A0A0M3K5H6 | 78 kDa glucose-regulated protein | Heat shock protein 70 kD, peptide-binding domain superfamily | Endoplasmic reticulum chaperone BIP, nucleotide-binding domain | 75,409 | 5.07 | 0.77 | ABF18258.1 | Aed a 8 | Heat shock cognate 70 (Aedes aegypti) | Heat shock protein (Hsp70) | 0.777 | 0.93 |
| 11 | A0A0M3J8H0 | Uncharacterized protein | - | - | 9756 | 5.52 | 0.51 | ABV55106.1 | Ani s 9 | Ani s 9 allergen precursor (Anisakis simplex) | SXP/RAL-2 family | 1 | 1 |
| 12 | A0A0M3K821 | Tubulin alpha chain | - | - | 47,784 | 4.83 | 0.42 | AIO08861.1 | Der f 33 | Der f 33 allergen (Dermatophagoides farinae) | Tubulin/FtsZ family (AF025) | 0.735 | 0.866 |
| 13 | A0A0M3KAH2 | Tubulin alpha chain | - | - | 48,222 | 5 | 0.51 | AUX14773.1 | Der f 33-like | Der f 33-like protein (Dermatophagoides pteronyssinus) | - | 0.743 | 0.891 |
1 Potential allergen and homologous allergens with identity and similarity values (1 is equal to 100%) were calculated using the full-length FASTA searching algorithm by AllergenOnline.org.
List of known B-cell/T-cell epitopes that were bioinformatically identified in Ani s 1 and Ani s 5 peptides derived from the autoclaved antigen of A. simplex.
| Peptide 1 | Epitope 3 | |||||
|---|---|---|---|---|---|---|
| Sequence 1 | Position1 | IEDB Epitope ID 3 | Sequence 3 | Position3 | B-Cell/T-Cell 3 | Reference 3 |
|
| ||||||
|
| 188‒194 | 137143 | EDAKCERGK | 179‒194 | B-cell | [ |
| 421049 | GK | 186‒194 | T-cell | [ | ||
|
| 137‒143 | 421391 | MGLCCPTK | 129‒142 | B-cell | [ |
| 421053 | GLCCPTK | 130‒142 | B-cell | [ | ||
| 420891 | 138‒149 | T-cell | [ | |||
| 420774 | CCPTK | 132‒143 | T-cell | [ | ||
|
| 160‒170 | 137278 | R | 159‒173 | B-cell | [ |
| 421086 | 162‒173 | T-cell | [ | |||
| 421087 | 162‒181 | B-cell | [ | |||
| 421233 | KMDR | 156‒167 | T-cell | [ | ||
| 421406 | 165‒181 | B-cell | [ | |||
| 421613 | R | 159‒169 | B-cell | [ | ||
|
| 125‒136 | 420792 | CPNGYQCK | 117‒130 | B-cell | [ |
| 421053 | 130‒142 | B-cell | [ | |||
| 421091 | GYQCK | 120‒131 | T-cell | [ | ||
| 421092 | GYQCK | 120‒136 | B-cell | [ | ||
| 421384 | 126‒137 | T-cell | [ | |||
| 421391 | 129‒142 | B-cell | [ | |||
|
| 157‒170 | 137278 | 159‒173 | B-cell | [ | |
| 421086 | 162‒173 | T-cell | [ | |||
| 421087 | 162‒181 | B-cell | [ | |||
| 421233 | K | 156‒167 | T-cell | [ | ||
| 421406 | 165‒181 | B-cell | [ | |||
| 421613 |
| 159‒169 | B-cell | [ | ||
|
| 56‒63 | 420822 | DK | 54‒65 | T-cell | [ |
| 421875 | WHDDK | 51‒64 | B-cell | [ | ||
|
| ||||||
| ADAELSK | 105‒111 | 230144 | KK | 103‒113 | B-cell | [ |
| FETFKK | 73‒78 | - | - | - | - | - |
| TDPEIEK | 50‒56 | - | - | - | - | - |
| AKEAELAK | 82‒89 | - | - | - | - | - |
| KADAELSK | 104‒111 | 230144 | K | 103‒113 | B-cell | [ |
| AFFELLK | 38‒44 | - | - | - | - | - |
| DETKTDPEIEK | 46‒56 | 230147 |
| 49‒54 | B-cell | [ |
|
| 140‒152 | 230136 | IQAIYKTLPQSVK | 127‒146 | B-cell | [ |
|
| 133‒144 | 230136 | IQAIYK | 127‒146 | B-cell | [ |
| 230137 | IQKAQKIQAIYK | 121‒140 | B-cell | [ | ||
| EAELAKAHEEAVAK | 84‒97 | - | - | - | - | - |
| DLDAWVDTLGGDYK | 57‒70 | 230149 |
| 58‒66 | B-cell | [ |
| 230150 | 58‒77 | B-cell | [ | |||
| DLDAWVDTLGGDYKAK | 57‒72 | 230149 |
| 58‒66 | B-cell | [ |
| 230150 | 58‒77 | B-cell | [ | |||
| TDPEIEKDLDAWVDTLGGDYK | 50‒70 | 230149 |
| 58‒66 | B-cell | [ |
| 230150 | 58‒77 | B-cell | [ | |||
1Peptides were identified using LC-MS/MS and retrieved from the Mascot search results; 2sequences of unique peptides are bolded; 3epitope mapping data were retrieved from the Immune Epitope Database and Analysis Resource (IEDB); 4UniProt ID and allergen name; 5the epitope sequences found in the peptides are underlined.
List of in silico predicted epitopes (MHC II, T-cell, and B-cell) in Ani s 2, Ani s 3, and Ani s 4 peptides derived from the autoclaved antigen of A. simplex.
| Peptide 1 | Epitope MHC II 3 | Epitope T-Cell 3 | Epitope B-Cell 3 | ||||
|---|---|---|---|---|---|---|---|
| Sequence 1 | Position 1 | Sequence 3 | Position 3 | Sequence 3 | Position 3 | Sequence 3 | Position 3 |
|
| |||||||
|
| 562‒568 | - | - | - | - | - | - |
|
| 465‒471 | - | - | - | - | - | - |
| SLEEQVK | 712‒718 | - | - | - | - | - | - |
| AALAELQK | 402‒409 | - | - | 404‒419 | - | - | |
|
| 617‒624 | - | - | CK | 615‒621 | - | - |
| ELEDAEGR | 829‒836 | - | - | - | - | R | 828‒837 |
| ELHAADER | 674‒681 | - | - | LDEVTK | 668‒679 | - | - |
|
| 487‒495 | - | - | RAQR | 483‒491 | - | - |
|
| 271‒280 | - | - | - | - | - | - |
|
| 760‒768 | - | - | - | - | RIRDLEVALDEETRR | 745‒774 |
| LTAALADAEAR | 523‒533 |
| 523‒528 | - | - | - | - |
|
| 361‒370 | - | - | - | - | - | - |
| EEEMEALRK | 506‒514 | - | - | - | - | - | - |
| VQLDNLQHVK | 222‒231 | - | - | - | - | - | - |
| LHELDLENAR | 449‒458 | - | - | - | - | - | - |
|
| 43‒52 | - | - | - | - | 45‒63 | |
|
| 787‒797 | - | - | 793‒800 | - | - | |
|
| 837‒848 | - | - | - | - | - | - |
|
| 178‒188 | - | - | - | - | - | - |
| KLHELDLENAR | 448‒458 | - | - | - | - | - | - |
| ISALSAELEECK | 605‒616 |
| 605‒610 | - | - | - | - |
|
| 232‒242 | - | - | - | - | - | - |
|
| 628‒639 | 635‒645 | - | - | - | - | |
|
| 776‒786 | - | - | - | - |
| 776‒781 |
|
| 287‒298 | - | - | - | - | - | - |
| ADLSVQLIALTDR | 63‒75 | - | - | - | - | - | - |
|
| 465‒477 | - | - | - | - | - | - |
| KISALSAELEECK | 604‒616 |
| 605‒610 | - | - | - | - |
|
| 627‒639 | - | - | - | - | - | - |
|
| 746‒758 | - | - | - | - | R | 745‒774 |
|
| 640‒654 | AHVR | 635‒645 |
| 640‒646 | - | - |
| LQSEVEVLIVDLEK | 347‒360 | - | - | - | - | - | - |
|
| 774‒786 | - | - | - | - |
| 776-781 |
|
| 590‒603 | - | - | LEDTQR | 584‒594 | - | - |
| ERADLSVQLIALTDR | 61‒75 | - | - | - | - | - | - |
|
| 692‒705 | - | - | ANRALADAAR | 682‒698 | - | - |
|
| 808‒822 | - | - | 812‒833 | - | - | |
|
| 384‒398 | - | - |
| 386‒392 | - | - |
|
| 323‒336 | - | - | - | - | - | - |
|
| 189‒203 | - | - | SNK | 186‒201 |
| 192-197 |
|
| 174‒188 | - | - | HVAEK | 169‒179 | - | - |
|
| 590‒604 | - | - | LEDTQR | 584‒594 | - | - |
|
| 206‒221 | - | - |
| 212‒218 | - | - |
|
| 640‒656 | AHVR | 635‒645 |
| 640‒646 | - | - |
|
| 345‒360 | - | - | - | - | - | - |
|
| 657‒673 |
| 661‒666 | 668‒679 | - | - | |
|
| 150‒166 |
| 155‒160 |
| 153‒166 | - | - |
| LLEESQLENEDAMNVLR | 103‒119 | - | - |
| 112‒116 | - | - |
| YQAEIAELEMTVDNLNR | 545‒561 | - | - |
| 553‒560 | - | - |
| QLQVQIQEAEAAALLGGKR | 719‒737 | - | - | - | - | - | - |
|
| 178‒194 | - | - | 186‒201 | - | - | |
|
| 281‒298 | - | - | - | - | - | - |
|
| 655‒673 |
| 661‒666 | 668‒679 | - | - | |
|
| 515‒533 |
| 523‒528 | - | - | - | - |
|
| 204‒221 | - | - |
| 212‒218 | - | - |
|
| 253‒270 |
| 259‒264 | - | - | - | - |
| SKFDAEVALHHEEVEDLR | 299‒316 | - | - | 311‒321 | - | - | |
|
| 253‒270 |
| 259‒264 | - | - | - | - |
| LLEESQLENEDAMNVLRK | 103‒120 | - | - |
| 112‒116 | - | - |
| KLLEESQLENEDAMNVLR | 102‒119 | - | - |
| 112‒116 | - | - |
|
| 148‒166 | - | - |
| 153‒166 | - | - |
| QAEYEEQIEIMLQKVSQLEK | 323‒342 | - | - | - | - | - | - |
|
| 534‒540 | - | - | - | - | - | - |
|
| 410‒416 | - | - | - | - | - | - |
|
| 94‒101 | - | - | - | - | ESNRKR | 88‒98 |
|
| 748‒759 | - | - | - | - | RIR | 745‒774 |
|
| 76‒91 | - | - | - | - | 88‒98 | |
|
| 572‒589 |
| 577‒582 | 584‒594 | - | - | |
|
| 15‒36 |
| 23‒29 | 20‒37 | - | - | |
|
| 605‒624 |
| 605‒610 |
| 615‒621 | - | - |
| VSQLEK | 337‒342 | - | - | - | - | - | - |
|
| 802‒807 | - | - | - | - | - | - |
|
| 711‒718 | - | - | DALR | 707‒717 | - | - |
| KYQAEIAELEMTVDNLNR | 544‒561 | - | - |
| 553‒560 | - | - |
| Q9NAS5 (Ani s 3) | |||||||
|
| 239‒244 | - | - | - | - | - | - |
|
| 22‒28 | - | - | 23‒64 | - | - | |
|
| 128‒133 | - | - | - | - | R | 127‒139 |
|
| 39‒44 | - | - | - | - | - | - |
|
| 232‒238 | - | - | - | - | - | - |
| SFQDEER | 134‒140 | - | - | - | - | RKVMENR | 127‒139 |
| SLEVSEEK | 206‒213 | - | - | - | - | - | - |
| ANTVESQLK | 141‒149 | - | - | - | - | - | - |
|
| 31‒38 | - | - | - | - | 33‒46 | |
|
| 13‒21 | - | - | - | - | - | - |
|
| 256‒264 | - | - | EFAERSVQKLQKEVDR | 240‒262 | DR | 254‒261 |
| IVELEEELR | 190‒198 | - | - | K | 189‒195 |
| 193‒197 |
|
| 239‒248 | - | - | 240‒262 | - | - | |
| EDSYEEQIR | 218‒226 | - | - | 223‒229 | LQR | 215‒226 | |
|
| 50‒59 | - | - | - | - | - | - |
|
| 39‒48 | - | - | - | - | TDKLER | 33‒46 |
| EAQMLAEEADR | 150‒160 | - | - | - | - | - | - |
| MTLLEEELER | 92‒101 | - | - | - | - | - | - |
| SLEVSEEKALQR | 206‒217 | - | - | - | - | - | - |
| EAQMLAEEADRK | 150‒161 | - | - | - | - | - | - |
|
| 78‒90 | - | - | - | - | - | - |
|
| 16‒28 | - | - | 23‒64 | - | - | |
| RMTLLEEELER | 91‒101 | - | - | - | - | - | - |
| LEEATHTADESER | 113‒125 | - | - | 113‒131 | - | - | |
|
| 77‒90 | - | - | - | - | - | - |
|
| 78‒91 | - | - | - | - | - | - |
|
| 252‒264 | - | - | EFAERSVQKLQK | 240‒262 |
| 254‒261 |
| ALQREDSYEEQIR | 214‒226 | - | - | 223‒229 |
| 215‒226 | |
|
| 60‒74 | - | - | - | - | - | - |
|
| 269-284 | - | - | - | - | - | - |
|
| 267‒284 | - | - | - | - | - | - |
|
| 141‒160 | - | - | - | - | - | - |
|
| 49‒59 | - | - | - | - | - | - |
|
| 50‒74 | - | - | DAAEEKVRQMTDKLERIEEELRDTQKK | 23‒64 | - | - |
| Q14QT4 (Ani s 4) | |||||||
|
| 69‒76 | - | - | - | - | - | - |
|
| 100‒108 | - | - | WQK | 97‒114 | - | - |
|
| 49‒63 | - | - | ELAGKSIAK | 40‒53 | - | - |
| - | - | 56‒66 | - | - | |||
|
| 40‒48 | - | - | 40‒53 | - | - | |
|
| 101‒108 | - | - | WQKK | 97‒114 | - | - |
1 Peptides were identified using LC-MS/MS and retrieved from the Mascot search results; 2 sequences of unique peptides are bolded; 3 major histocompatibility complex class II (MHC II) molecules, T-cell, and B-cell epitopes were predicted using DNA STAR Protean 3D software; 4 UniProt ID and allergen name; 5 the epitope sequences found in the peptides are underlined.
Figure 5The native tertiary structures of identified allergens. Prediction of unknown 3D structures of allergens: Ani s 1 (77% of residues modeled at > 90% confidence, model dimensions in angstroms (Å): X:64.335 Y:49.060 Z:49.310) (a), Ani s 2 (99% of residues modeled at > 90% confidence, model dimensions in Å: X:160.179 Y:213.187 Z:160.224) (b), Ani s 3 (99% of residues modeled at > 90% confidence, model dimensions in Å: X:368.887 Y:109.731 Z:146.232) (c), and Ani s 4 (86% of residues modeled at > 90% confidence, model dimensions in Å: X:31.951 Y:46.659 Z:32.835) (d) were performed using the Phyre2 server. The allergen Ani 5 structure was visualized based on the structure from the Protein Data Bank PDB ID: 2MAR (e). PyMOL software was used to visualize all structures. Colors of the allergen models are according to secondary structure: helices highlighted in red, sheets in yellow, and loops in green.
Figure 6The native tertiary structures of homologous allergens, as identified using AllergenOnline.org for potential allergens of A. simplex. Prediction of unknown 3D structures of allergens: Ani s 8 (85% of residues modeled at > 90% confidence, model dimensions in angstroms (Å): X: 62.875 Y: 60.531 Z: 35.486) (a), Ani s 9 (87% of residues modeled at > 90% confidence, model dimensions in Å: X:54.278 Y: 58.349 Z:28.680) (b), Ani s troponin C (99% of residues modeled at > 90% confidence, model dimensions in Å: X:50.390 Y: 40.831 Z:35.415) (c), Asc l 3 (99% of residues modeled at > 90% confidence, model dimensions in Å: X:361.607 Y:132.976 Z:152.396) (d), Tyr p 28 (97% of residues modeled at > 90% confidence, model dimensions in Å: X:81.072 Y:91.096 Z:63.286) (e), Der f 33 (89% of residues modeled at > 90% confidence, model dimensions in Å: X:63.736 Y:59.230 Z:58.888) (f), Der f 33-like (90% of residues modeled at > 90% confidence, model dimensions in Å: X:57.763 Y:55.796 Z:56.582) (g), Aed a 8 (90% of residues modeled at > 90% confidence, model dimensions in Å: X:57.763 Y:55.796 Z:56.582) (h), Ole e 15 (100% of residues modeled at > 90% confidence, model dimensions in Å: X:38.017 Y:36.365 Z:47.475) (i) were performed using the Phyre2 server. Allergen 3D structures of Ani s 2 and Ani s 5, which are also homologs of potential allergens, are shown in Figure 5. PyMOL software was used to visualize all structures. Colors of the allergen models are according to secondary structure: helices highlighted in red, sheets in yellow, and loops in green.
Figure 7Gene ontology analysis of cellular components of allergens (a) and potential allergens (b). The number of annotations was limited for better readability in the case of potential allergens (minimum of three sequences). Annotations were performed using OmicsBox software.