| Literature DB >> 36011375 |
Wei Zhou1, Kaylah Bias1, Dylan Lenczewski-Jowers1, Jiliah Henderson1, Victor Cupp1, Anthony Ananga1,2, Joel Winyo Ochieng3, Violeta Tsolova1,2.
Abstract
Fruit allergens are proteins from fruits or pollen that cause allergy in humans, an increasing food safety concern worldwide. With the globalization of food trade and changing lifestyles and dietary habits, characterization and identification of these allergens are urgently needed to inform public awareness, diagnosis and treatment of allergies, drug design, as well as food standards and regulations. This study conducted a phylogenetic reconstruction and protein clustering among 60 fruit and pollen allergens from 19 species, and analyzed the clusters, in silico, for cross-reactivity (IgE), 3D protein structure prediction, transmembrane and signal peptides, and conserved domains and motifs. Herein, we wanted to predict the likelihood of their interaction with antibodies, as well as cross-reactivity between the many allergens derived from the same protein families, as the potential for cross-reactivity complicates the management of fruit allergies. Phylogenetic analysis classified the allergens into four clusters. The first cluster (n = 9) comprising pollen allergens showed a high risk of cross-reactivity between eight allergens, with Bet v1 conserved domain, but lacked a transmembrane helix and signal peptide. The second (n = 10) cluster similarly suggested a high risk of cross-reactivity among allergens, with Prolifin conserved domain. However, the group lacked a transmembrane helix and signal peptide. The third (n = 13) and fourth (n = 29) clusters comprised allergens with significant sequence diversity, predicted low risk of cross-reactivity, and showed both a transmembrane helix and signal peptide. These results are critical for treatment and drug design that mostly use transmembrane proteins as targets. The prediction of high risk of cross-reactivity indicates that it may be possible to design a generic drug that will be effective against the wide range of allergens. Therefore, in the past, we may have avoided the array of fruit species if one was allergic to any one member of the cluster.Entities:
Keywords: 3D structure; conserved domains; cross-reactive; drug design; fruit allergen
Mesh:
Substances:
Year: 2022 PMID: 36011375 PMCID: PMC9408803 DOI: 10.3390/genes13081464
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Phylogenetic and motif analyses. All of the 60 allergens were classified into four clusters, which were marked as I, II, III, IV. The logos of hidden Markov models and p-value of conserved motif for each allergen are attached.
Figure 2A-RISC index values, sequence alignment, and 3D structure of Cluster I.
Transmembrane and signal peptide analysis results.
| Cluster | Protein | Length | Number of Predicted TMHs | Exp Number of AAs in TMHs | Exp Number, First 60 AAs | Total Prob of N-in | Signal Length | Signal Peptide | Signal Peptide Probability | Signal Anchor Probability |
|---|---|---|---|---|---|---|---|---|---|---|
| Cluster I | CAM31909.1_kiwi_D8 | 157 | 0 | 0 | 0 | 0.14 | 70 | no | 0 | 0 |
| Cluster I | AAB97141.1_apricot_Ar1 | 160 | 0 | 0 | 0 | 0.06 | 70 | no | 0 | 0 |
| Cluster I | XP_004296886.1_rasberry_I1 | 159 | 0 | 0 | 0 | 0.04 | 70 | no | 0 | 0 |
| Cluster I | ABG54495.1_raspberry_I1 | 137 | 0 | 0 | 0 | 0.06 | 70 | no | 0 | 0 |
| Cluster I | CAJ29538.1_strawberry_A1 | 160 | 0 | 0 | 0 | 0.13 | 70 | no | 0 | 0 |
| Cluster I | ABB78006.1_peach_P1 | 160 | 0 | 0 | 0 | 0.04 | 70 | no | 0 | 0 |
| Cluster I | AAC02632.1_cherry_A1 | 160 | 0 | 0 | 0 | 0.04 | 70 | no | 0 | 0 |
| Cluster I | CAA58646.1_apple_D1 | 159 | 0 | 0 | 0 | 0.06 | 70 | no | 0 | 0 |
| Cluster I | AAC13315.1_pear_C1 | 159 | 0 | 0 | 0 | 0.04 | 70 | no | 0 | 0 |
| Cluster II | AAK54835.1_pineapple_C1 | 131 | 0 | 0 | 0 | 0.3 | 70 | no | 0 | 0 |
| Cluster II | sp|C0HL99.1_kiwi_D9 | 109 | 0 | 0 | 0 | 0.41 | 70 | no | 0 | 0 |
| Cluster II | XP_004287490.1_stawberry_A4 | 131 | 0 | 0 | 0 | 0.35 | 70 | no | 0 | 0 |
| Cluster II | AAW69549.1_melon_M2 | 131 | 0 | 0 | 0 | 0.25 | 70 | no | 0 | 0 |
| Cluster II | AAK54834.1_banana_A1 | 131 | 0 | 0 | 0 | 0.3 | 70 | no | 0.01 | 0 |
| Cluster II | AAD29414.1_apple_D4 | 61 | 0 | 0 | 0 | 0.25 | 70 | no | 0 | 0 |
| Cluster II | AAD29411.1_cherry_A4 | 131 | 0 | 0 | 0 | 0.3 | 70 | no | 0 | 0 |
| Cluster II | CAD37201.1_peach_P4 | 131 | 0 | 0 | 0 | 0.3 | 70 | no | 0 | 0 |
| Cluster II | CAD10390.1_date-palm_D2 | 131 | 0 | 0 | 0 | 0.28 | 70 | no | 0.01 | 0 |
| Cluster II | AAD29410.1_pear_C4 | 131 | 0 | 0 | 0 | 0.3 | 70 | no | 0 | 0 |
| Cluster III | GFZ19186.1_kiwi_D10 | 115 | 0 | 1 | 1 | 0.22 | 70 | yes | 1 | 0 |
| Cluster III | XP_009396870.1_banana_A3 | 119 | 0 | 13 | 13 | 0.37 | 70 | yes | 1 | 0 |
| Cluster III | XP_007206159.1_peach_P3 | 117 | 1 | 17 | 17 | 0.8 | 70 | yes | 1 | 0 |
| Cluster III | ADR66947.1_apricot_Ar3 | 117 | 1 | 17 | 17 | 0.8 | 70 | yes | 1 | 0 |
| Cluster III | PQP98495.1_plum_D3 | 117 | 1 | 17 | 17 | 0.79 | 70 | yes | 1 | 0 |
| Cluster III | AAF26449.1_cherry_A3 | 117 | 1 | 17 | 17 | 0.78 | 70 | yes | 1 | 0 |
| Cluster III | AAF26451.1_pear_C3 | 115 | 1 | 18 | 18 | 0.52 | 70 | yes | 1 | 0 |
| Cluster III | AAT80633.1_apple_D3 | 115 | 0 | 16 | 16 | 0.57 | 70 | yes | 1 | 0 |
| Cluster III | CAC86258.1_strawberry_A3 | 117 | 1 | 21 | 21 | 0.8 | 70 | yes | 1 | 0 |
| Cluster III | ABG54494.1_raspberry_I3 | 117 | 1 | 20 | 20 | 0.62 | 70 | yes | 1 | 0 |
| Cluster III | AAO33394.1_grape_V1 | 119 | 0 | 15 | 15 | 0.53 | 70 | yes | 1 | 0 |
| Cluster III | AHB19227.1_pomegranate_G1 | 120 | 0 | 11 | 11 | 0.44 | 70 | yes | 1 | 0 |
| Cluster IV | PSR89527.1_kiwi_D7 | 559 | 0 | 10 | 10 | 0.46 | 70 | yes | 0.89 | 0.1 |
| Cluster IV | XP_007199020.1_peach_P9 | 161 | 1 | 17 | 17 | 0.89 | 70 | yes | 0.99 | 0 |
| Cluster IV | M5X16_peach_P10 | 401 | 0 | 13 | 13 | 0.54 | 70 | yes | 0.92 | 0.1 |
| Cluster IV | BAC54964.1_kiwi_D6 | 185 | 1 | 18 | 18 | 0.83 | 70 | yes | 0.94 | 0.1 |
| Cluster IV | ABB77213.1_kiwi_D12 | 462 | 0 | 0 | 0 | 0.01 | 70 | yes | 1 | 0 |
| Cluster IV | AAC24001.1_pear_C5 | 308 | 0 | 0 | 0 | 0.05 | 70 | no | 0.01 | 0 |
| Cluster IV | P84527.2_greenkiwi_D5 | 213 | 0 | 8 | 8 | 0.25 | 70 | yes | 1 | 0 |
| Cluster IV | AAD32205.1_apricot_Ar5 | 168 | 0 | 0 | 0 | 0.1 | 70 | no | 0.01 | 0 |
| Cluster IV | BAA06905.1_melon_M1 | 731 | 0 | 0 | 0 | 0 | 70 | yes | 0.97 | 0 |
| Cluster IV | P85524.1_kiwi_D11_P | 150 | 0 | 0 | 0 | 0.21 | 70 | no | 0 | 0 |
| Cluster IV | AAR92223.1_greenkiwi_D4 | 116 | 1 | 16 | 16 | 0.29 | 70 | yes | 1 | 0 |
| Cluster IV | ACV85695.1_papaya_P1 | 494 | 1 | 18 | 18 | 0.92 | 70 | yes | 0.52 | 0.2 |
| Cluster IV | XP_008455060.1_melon_M3 | 116 | 1 | 21 | 21 | 0.85 | 70 | yes | 0.95 | 0 |
| Cluster IV | CAC81811.1_banana_A2 | 318 | 0 | 0 | 0 | 0.02 | 70 | yes | 1 | 0 |
| Cluster IV | CAB01591.1_pers_A1 | 326 | 0 | 15 | 14 | 0.66 | 70 | yes | 1 | 0 |
| Cluster IV | AAB82772.2_banana_A5 | 340 | 0 | 14 | 13 | 0.66 | 70 | yes | 1 | 0 |
| Cluster IV | ALQ56981.1_coconut_N1_P | 490 | 1 | 20 | 19 | 0.97 | 70 | yes | 0.99 | 0 |
| Cluster IV | XP_021820299.1_cherry_Av7 | 88 | 0 | 11 | 11 | 0.31 | 70 | yes | 1 | 0 |
| Cluster IV | XP_016648029.1_peach_P7 | 88 | 0 | 11 | 11 | 0.31 | 70 | yes | 1 | 0 |
| Cluster IV | CAI38795.2_greenkiwi_D2 | 225 | 1 | 19 | 19 | 0.85 | 70 | yes | 1 | 0 |
| Cluster IV | XP_009406737.1_banana_A4 | 226 | 1 | 22 | 22 | 0.82 | 70 | yes | 1 | 0 |
| Cluster IV | AAB38064.1_cherry_A2 | 245 | 1 | 21 | 21 | 0.92 | 70 | yes | 1 | 0 |
| Cluster IV | AAC36740.1_apple_D2 | 245 | 0 | 0 | 0 | 0.1 | 70 | yes | 1 | 0 |
| Cluster IV | ACE80959.1_peach_P2 | 246 | 1 | 18 | 18 | 0.77 | 70 | yes | 1 | 0 |
| Cluster IV | BAA21849.1_pineapple_C2 | 351 | 1 | 16 | 16 | 0.88 | 70 | yes | 1 | 0 |
| Cluster IV | CAA66378.1_papaya_P2 | 352 | 0 | 13 | 13 | 0.61 | 70 | yes | 0.99 | 0 |
| Cluster IV | CAA34486.1_greenkiwi_D1 | 380 | 1 | 17 | 16 | 0.65 | 70 | yes | 0.95 | 0.1 |
| Cluster IV | AAX40948.1_date_M1_1 | 330 | 0 | 4 | 4 | 0.21 | 70 | yes | 0.99 | 0 |
| Cluster IV | G1UH28_pomegranate_G14 | 299 | 0 | 16 | 14 | 0.57 | 70 | yes | 1 | 0 |
Note: Length: The length of the protein sequence. Number of predicted TMHs: The number of predicted transmembrane helices. Exp number of AAs in TMHs: The expected number of amino acids in transmembrane helices. If this number is larger than 18, it is very likely to be a transmembrane protein (or have a signal peptide). Exp number, first 60 AAs: The expected number of amino acids in transmembrane helices in the first 60 amino acids of the protein. If this number is more than a few, you should be warned that a predicted transmembrane helix in the N-term could be a signal peptide. Total prob of N-in: The total probability that the N-term is on the cytoplasmic side of the membrane. POSSIBLE N-term signal sequence: A warning that is produced when “Exp number, first 60 AAs” is larger than 10.
Figure 3A-RISC index values, sequence alignment, and 3D structure of Cluster II.
Figure 4A-RISC index values, sequence alignment, and 3D structure of Cluster III.
Figure 5A-RISC index values, sequence alignment, and 3D structure of Cluster IV.