| Literature DB >> 23891546 |
Fabio Dall'antonia1, Tea Pavkov-Keller2, Klaus Zangger3, Walter Keller4.
Abstract
The structure determination of major allergens is a prerequisite for analyzing surface exposed areas of the allergen and for mapping conformational epitopes. These may be determined by experimental methods including crystallographic and NMR-based approaches or predicted by computational methods. In this review we summarize the existing structural information on allergens and their classification in protein fold families. The currently available allergen-antibody complexes are described and the experimentally obtained epitopes compared. Furthermore we discuss established methods for linear and conformational epitope mapping, putting special emphasis on a recently developed approach, which uses the structural similarity of proteins in combination with the experimental cross-reactivity data for epitope prediction.Entities:
Keywords: Allergen structure; IgE epitope; NMR; Protein family; Structure based epitope prediction; X-ray
Mesh:
Substances:
Year: 2013 PMID: 23891546 PMCID: PMC3969231 DOI: 10.1016/j.ymeth.2013.07.024
Source DB: PubMed Journal: Methods ISSN: 1046-2023 Impact factor: 3.608
Fig. 1MBP-fusions act as crystallization chaperons. (A) Der p 7 (PDB: 3h4z) and (B) Ara h 2 (3OB4) are shown as maltose-binding protein (MBP)-fusion proteins. Allergen structures are shown in ribbon representation and are colored according their secondary structure composition (α-helices in cyan, β-sheets in magenta). MBP is additionally shown as surface representation (gray). Dashed lines indicate the disordered regions that are missing in the crystal structures.
Fig. 2Information on chemical exchange (Rex) on the millisecond-microsecond time scale (A) and order parameter S2 (B) as derived from 15N relaxation measurements are shown as a function of residue number of Blo t 5. In (C) these parameters are mapped onto the sausage structure of Blo t 5 where the thickness is related to 1 − S2 and residues with Rex contributions are represented as red spheres. Reproduced with permission from [29].
Fig. 3Representative structures of important allergen families. Allergen structures are presented according to their secondary structure composition (α-helices in cyan, β-sheets in magenta). A: Bet v 1 (PDB: 1BV1), B:Ara h 1 (3S7E), C: Bos d 2 (1BJ7), D: Phl p 7 (1K9U, 2LVK), E: Der p 1 (3F5 V), F: Ara h 5 (4ESP), G: Pru av 1 (2AHN), H: Phl p 6 (1NLX), I: Der p 2 (1KTJ), J: Phlp1 (1N10) and Phl p 4 (3TSH). Ca2+ atoms are shown in yellow and FAD cofactor (K) in ball-and-stick representation. (D) Comparison of Phl p 7 dimer (as observed in the X-ray structure) and monomer (as present in the NMR structure). (J) Allergen Phl p 1 belonging to the expansin family consists of 2 domains and forms a dimer. The N-terminal domains are distinguished as grey surface representation.
Structures of allergens deposited in the PDB determined by X-ray crystallography or NMR grouped by Pfam families.
| Allergen | Species common name | Species scientific name | X-ray structure | NMR | References |
|---|---|---|---|---|---|
| Api g 1 | Celery | 2BK0 | |||
| Bet v 1 | White birch | 1LLT, 1BV1, 1FM4, 1QMR, 1FSK, 4A80–8, 4A8G, 4A8U, 4A8 V | 1BTV, 1B6F | ||
| Dau c 1 | Carrot | 2WQL | |||
| Fra a 1 | Strawberry | 2LPX | - | ||
| Gly m 4 | Soybean | 2K7H | |||
| Pru av 1 | Sweet cherry | 1E09, 1H2O | |||
| Vig r 6.0101 | Mung bean | 2FLH, 3C0 V | |||
| LLR18A/ LLR18B | Yellow lupine | 1ICX, 1IFV | |||
| LIPR-10.2.B | Yellow lupine | 2QIM | |||
| Ara h 2 | Peanut | 3OB4 | |||
| Ara h 6 | Peanut | 1W2Q | |||
| Ber e 1 | Brazilian nut | 2LVF | |||
| Bra n 1 | Rapeseed | 1PNB | |||
| Hor v 1 | Barley | 1JTB, 1BE2 | |||
| Pru p 3 | Peach | 2ALG, 2B5S | |||
| Ric c 3 | Castor bean | 1PSY | |||
| Zea m 14 | Corn | 1MZL, 1MZM, 1FK0–7 | 1AFH | ||
| Ara h 1 | Peanut | 3S7E, 3S7I, 3SMH | |||
| Ara h 3 | Peanut | 3C3 V | |||
| Gly m 6.0101 | Soybean | 1FXZ | |||
| Gly m 6.0501 | Soybean | 1OD5, 2D5F, 2D5H | |||
| Gly m conglycinin | Soybean | 1IPK, 1IPJ | |||
| Pru du 6 | Almond | 3FZ3 | |||
| Arg R 1 | Pigeon tick | 2X46, 2X45 | - | ||
| Bla g 4 | German cockroach | 3EBK | |||
| Bos d 2 | Cow | 1BJ7 | |||
| Bos d 5 | Cow | 1BSO | |||
| Can f 2 | Dog | 3L4R | |||
| Der f 13 | House dust mite | 2A0A | |||
| Equ c 1 | Horse | 1EW3 | |||
| Mus m 1 | Mouse | 1MUP, 1JV4 | 1DF3 | ||
| Per a 4 | American cockroach | 3EBW | |||
| Rat n 1 | Rat (urine) | 2A2U, 2A2G | |||
| Bet v 4 | White birch | 1H4B | |||
| Che a 3 | Lamb’s Quarters, White goosefoot | 2OPO | |||
| Cyp c 1 | Carp | 4CPV, 5CPV and other∗ | |||
| Phl p 7 | Timothy-grass | 1K9U | 2LVK, 2LVJ, 2LVI | {Verdino, 2002 #13 | |
| Act d 1 | Kiwi fruit | 2ACT, 1AEC | |||
| Car p 1 | Papaya | 1PPN, 1KHQ | |||
| Der p 1 | House dust mite | 1XKG, 2AS8, 3F5 V, 3RVX, 3RVW | |||
| Der f 1 | House dust mite | 3D6S, 3RVV | |||
| Ara h 5 | Peanut | 4ESP | |||
| Ara t 8 | Mouse-ear crest | 1A0 K | |||
| Bet v 2 | White birch | 1CQA | |||
| Hev b 8 | Latex | 1G5U | - | ||
| Thaumatin (PF00314): Thaumatin-like protein | |||||
| Act d 2 | Kiwi fruit | 4BCT | - | ||
| Mal d 2 | Apple | 3ZS3 | - | ||
| Mus a 4 | Banana | 1Z3Q | |||
| Pru av 2 | Sweet cherry | 2AHN | |||
| Phl p 1 (C-term domain) | Timothy-grass | 1N10 | |||
| Phl p 2 | Timothy-grass | 1WHO, 1WHP, 2VXQ | 1BMW | ||
| Phl p 3 | Timothy-grass | 3FT1, 3FT9 | 2JNZ | ||
| Zea m 1 (C-term domain) | Corn | 2HCZ | |||
| Phl p 1 (N-term domain) | Timothy-grass | 1N10 | |||
| Zea m 1 (N-termn domain) | Corn | 2HCZ | |||
| Phl p 5b | Timothy-grass | 1L3P | |||
| Phl p 6 | Timothy-grass | 1NLX | - | ||
| Blo t 5 | Storage mite | 2JMH, 2JRK | |||
| Der p 5 | House dust mite | 3MQ1 | |||
| Blo t 21 | Storage mite | 2LM9 | |||
| Sol i 3 | Fire ant | 2VZN | |||
| Ves v 5 | Common wasp | 1QNX | |||
| Hev b 6.02 | Latex | 1WKX, 1Q9B | 1HEV, 1T0 W | ||
| Tri a 18 | Wheat | 1WGC, 2CWG, 7WGA, 9WGA, 2UVO, 4AML, 2X3T | |||
| Der f 2 | House dust mite | 1XWV, 2F08 | 1AHK, 1AHM, 1WRF | ||
| Der p 2 | House dust mite | 1KTJ | 1A9 V | ||
| Hev b 2 | Latex | 3EM5, 3F55 | - | ||
| Mus a 5 | Banana | 2CYG | |||
| Api m 2 | Honey bee | 1FCQ, 1FCU, 1FCV, 2J88 | |||
| Ves v 2 | Common wasp | 2ATM | |||
| Asp f 11 | Mold | 2C3B | |||
| Mala s 6 | Mold | 2CFE | |||
| Cyn d 4 | Bermuda grass | 4DNS | |||
| Phl p 4 | Timothy-grass | 3TSH, 3TSJ | |||
Many parvalbumin structures have been solved (37 PDB entries currently) comprising proteins from 9 different species (from fish to human). Allergenicity has been determined for most fish parvalbumins and the structures of the carp parvalbumins are listed as representatives for this group.
Fig. 4Structural aspects of allergen epitopes and Ab interfaces in complexes. PDB codes are HEL (hen egg-white lysozyme): 1NDG, Bet v 1: 1FSK, Api m 2 (bee venom hyaluronidase): 2J88, BLG (β-lactoglobulin): 2R56, Bla g 2: 2NR6, Phl p 2: 2VXQ, Der f 1: 3RVV. Epitopes in side view (panels A) and top view (B) are represented as surfaces with atom-type colors (carbon grey, nitrogen blue and oxygen red) and the interacting CDRs are shown as secondary structure cartoons with sheets in yellow and loops in green, except when not binding (red). Panels C show the entire allergens as cartoons without Ab in the same top-view orientation. Epitope residues are highlighted in cyan, and with side chain stick representation, nitrogen and oxygen colors as before. Model sizes are not on the same scale. Convexity ratios in top-down order are 0.35, 0.48, 0.69, 0.37, 0.42, 0.43 and 0.39.
Structures of complexes between allergens and monoclonal antibodies with characteristics of interaction.
| Allergen | mAb Fab | PDB-id | Buried interface area | CDR | Number of ... | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Functional epit. res. | Structural epit. res. | Linear segments | Hydrogen bonds | Ionic interactions | Apolar contacts | ||||||
| HEL | HyHEL8 (IgG) | 1NDG | 1.9 Å | 1827 Å2 | L1, L2, L3H1, H2, H3 | 8 | 21 | 4 | 13 | 2 | 38 |
| Bet v 1 | BV16 (IgG) | 1FSK | 2.9 Å | 1649 Å2 | L1, L3H1, H2, H3 | 7 | 17 | 1 | 11 | 3 | 34 |
| Api m 2 (hyaluronidase) | 21E11 (IgG) | 2J88 | 2.6 Å | 1529 Å2 | L1, L2, L3H1, H2, H3 | 8 | 10 | 1 | 13 | 2 | 27 |
| BLG | D1 (IgE Fv) | 2R56 | 2.8 Å | 1750 Å2 | L1, L2, L3H1, H2, H3 | 9 | 21 | 5 | 13 | 3 | 28 |
| Bla g 2 | 7C11 (IgG) | 2NR6 | 2.8 Å | 1773 Å2 | L1, L2, L3H1, H2, H3 | 12 | 16 | 3 | 20 | 4 | 23 |
| Phl p 2 | HuMab2 (IgE Fv) | 2VXQ | 1.9 Å | 1616 Å2 | L1, L3H1, H2, H3 | 9 | 21 | 4 | 14 | 1 | 26 |
| Der f 1 | 4C1 (IgG) | 3RVV | 1.9 Å | 1385 Å2 | L1, L2, L3H1, H2, H3 | 5 | 15 | 2 | 7 | 2 | 26 |
| Der p 1 | 4C1 (IgG) | 3RVW | 2.0 Å | 1371 Å2 | L1, L2, L3H1, H2, H3 | 6 | 15 | 2 | 11 | 2 | 21 |
mAb: name of the monoclonal antibody and its immunoglobulin type. Fab = antigen-binding fragment. Fv: variable part of the antibody respectively Fab.
d: crystallographic resolution of the structure.
Δ(SASA) = [SASA(Ab) + SASA(allergen)] – SASA(complex); SASA = solvent accessible surface area as calculated with GETAREA [263] using a 1.4 Å probe radius.
CDR: complementarity determinant region; counted as involved if forming at least one specific interaction. L = light chain loops, H = heavy chain loops.
Linear segments constituting the epitopes are defined here as continuous sequence stretches of at least three residues (with single position gaps allowed).
Ionic interactions include both short-distance contacts with H-bond contribution (salt bridges) and weaker electrostatic-only interactions. Both ionic and apolar interactions were determined at a distance cutoff of 4.0 Å and counted once per allergen–Ab residue pair. Hydrogen bonds were determined at a cutoff of 3.25 Å.
Fig. 5Overlay of TROSY-HSQC spectra of Blo t 5 in the absence (red) and presence (grey) of Fab. Peaks belonging to residues of the interaction region between residues 40 and 60 are indicated by arrows. Folded peaks are in dotted squares. Reproduced with permission from [29].
Fig. 6Workflow of a typical IgE epitope localization project with SPADE as prediction tool. In this procedural description starting from a target allergen with available 3D coordinates, the structure determination of homologous allergens is not considered part of the project. It is instead assumed that SPADE is applied in the presence of other pre-determined structures, which can be revealed or confirmed with literature and/or database searches. The actual computational part of the project can/should be supported by immunological data, as quantitative cross-reactivity (CR) values increase the prediction accuracy. Once epitopes and amino acid key residues for IgE-binding are located, the prediction serves as a starting point for follow-up experiments such as point mutations with tests for reduced IgE binding.
Fig. 7Distribution of physicochemical features in sets of epitope vs. non-epitope surface residues, represented by probability density graphs (Kernel density estimation with R), based on the seven unbound allergen structures HEL (PDB: 1VDQ), Bet v 1 (1BVQ), Hyaluronidase (1FCQ), BLG (1BEB), Bla g 2 (1YG9), Phl p 2 (1WHO) and Der f 1 (3D6S). A: Normalized B-factors of C-α atoms (all residues, separate normalization per structure) were calculated as B-norm = (B–)/σ(B). B: Relative solvent accessibility S [%] was calculated as S = SASA(obs.) / SASA (max.) where SASA(obs.) is the solvent-accessible surface area of a residue Xi calculated with GETAREA and SASA(max.) is the corresponding area in a G-X-G tripeptide, averaged over 30 conformations [174]. C: Atomic lipophilicity parameters were taken from Ghose et al. [175], mapped to the surface according to the procedure of Heiden et al. [176] and averaged per residue (for surface vertices only). D: The electrostatic potential was calculated with APBS [177], mapped to the surface vertices and averaged per residue. Absolute quantities of [kBT/e] were used for the statistical analysis. A-D: Two-sided two-sample Kolmogorov–Smirnov tests were performed with R. The statistical significance of differences was defined with a confidence threshold of 0.99, so that it requires log10(p) ⩽ −2. Dashed lines indicate mean values.
Structures of allergens deposited in the PDB determined by X-ray crystallography or NMR with only one structure from the Pfam family determined (or Pfam family not known).
| Allergen | Function/fold | Species common name | Species scientific name | X-ray structure | NMR | References |
|---|---|---|---|---|---|---|
| Aed a 2 | Yellow fewer mosquito | 3DXL, 3DY9, 3DYE, 3DZT | ||||
| Alt a 1 | Mold | 3V0R | ||||
| Amb t 5 | Giant ragweed | 1BBG, 2BBG, 3BBG | - | |||
| Api m 1 | Phospholipase A2 | Honey bee | 1POC | |||
| Api m 4 | Melittin | Honey bee | 2MLT | 1BH1 | ||
| Art v 1 | Gamma-thionin (plant defensin) | Mugwort | 2KPY | |||
| Asc s 1 | ABA-1 polyprotein | Roundworm of pigs | 2XV9 | |||
| Asp f 1 | Ribonuclease | Mold | 1AQZ | |||
| Asp f 6 | Fe, Mn superoxide dismutase | Mold | 1KKC | |||
| Asp o 21 | TAKA-amylase A | Mold | 7TAA | |||
| Bla g 2 | Aspartic protease | German cockroach | 2NR6, 1YG9, 3LIZ | |||
| Bos d 4 | Alpha-lactalbumin | Cattle | 2G4 N, 1F6R, 1F6S, 1HFZ | |||
| Chi t 1 | Globin | Midge | 1ECO | |||
| Cla h 8 | Mannitol dehydrogenase | Mold | 3GDF, 3GDG | |||
| Der p 7 | Extracellular solute-binding protein | European house dust mite | 3H4Z | |||
| Der f 7 | House dust mite | 3UV1 | ||||
| Equ c 3 | Serum-albumin | Horse | 3V08 | - | ||
| Fel d 1 | Uteroglobin | Cat (saliva) | 2EJN, 1PUO, 1ZKR | |||
| Gal d 3 | Transferrin | Chicken | 1NNT, 1OVT, 1AIV, 1RYX, 1N04, 1IEJ, 2D3I | |||
| Gal d 4 | Lysozyme | Chicken | >50 entries | |||
| Gly m lectin | Legume lectin | Soybean | 1SBF | |||
| Jun a 1 | Pectate lyase | Mountain cedar | 1PXZ | |||
| Mala s 1 | Maltose-binding protein | Mold | 2P9 W | |||
| Mala s 13 | Thioredoxin | Mold | 2J23 | |||
| Ole e 6 | Olive | 1SS3 | ||||
| Ole e 9 | Beta-1,3-glucanase | Olive | 2JON | |||
| Ovalbumin | Serine-protease inhibitor (serpin) | Chicken | 1UHG, 1JTI, 1OVA | |||
| Per a 4 | Lipocalin-like | American cockroach | 3EBW | |||
| Sol i 2 | Fire ant | 2YGU |