| Literature DB >> 30377651 |
Majeed Jamakhani1, S S Lele1, Bhagwan Rekadwad2.
Abstract
This paper describes data on allergies caused by food (vegetable) and their negative impact on the nutritional balance of the human body. Allergic responses to vegetables such as tomatoes, capsicum and spinach are next to fish, eggs and nuts. Epitopes such as NP24 (allergens) are one of the salt-induced allergenic proteins found in the thaumatin-like protein (TLP) family. The mechanism of allergenicity of TLP found in Solanum lycopersicum (Tomato) fruit is poorly studied. Here we demonstrated allergenicity conferred by the NP24 protein found in Tomato. The data on the cross-reactivity of NP24 protein was generated using Allergen Online and Allermatch tools. Tomato allergenic protein epitope shows a significant identity of with allergens reported in Capsicum, Olive, Kiwi, Tobacco and Banana allergens. Hence, the datasets of sequences, comparative analysis and homology epitope mapping over three dimensional (3D) structures revealed that NP24 has higher cross-reactivity to Capsicum and Tobacco proteins. Thus, this data probably act as limelight for planning wet lab experiments.Entities:
Keywords: Epitope mapping; Fruit and vegetable allergies; IgE antibody; NP24 protein; Oral allergy syndrome
Year: 2018 PMID: 30377651 PMCID: PMC6203245 DOI: 10.1016/j.dib.2018.09.074
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Full-length (FL) alignment method lists allergen matches with > 50% identity, 80-mer match method lists allergen matches with > 35% identity, and 8-mer match method lists allergens with at least one exact match.
| Hits % | Hits % | Hits % | |||
|---|---|---|---|---|---|
| 1 | gi|146737976|gb|ABQ42566.1| thaumatin-like protein | 68.8 | 65.9 | 9 | |
| 2 | gi|269996497|gb|ACZ57583.1| allergenic thaumatin [ | 64.1 | 68.8 | 12 | |
| 3 | gi|349503011|gb|AEP84104.1| acidic thaumatin-like | 65 | 64.1 | 19 | |
| 4 | gi|139002766|dbj|BAF51970.1| thaumatin-like protei | 52.8 | 65 | 5 | |
| 5 | gi|9087177|sp|P81295.1|PRR3_JUNAS RecName: Full=Pa | 54.1 | 52.8 | 3 | |
| 6 | gi|38456224|gb|AAR21072.1| PR5 allergen Jun r 3.2 | 53.6 | 54.1 | 2 | |
| 7 | gi|51316532|sp|Q9LD79.2|PRR3_JUNVI RecName: Full=P | 58.2 | 53.6 | 0 | |
| 8 | gi|38456222|gb|AAR21071.1| PR5 allergen Jun r 3.1 | 53.1 | 58.2 | 2 | |
| 9 | gi|38456228|gb|AAR21074.1| PR5 allergen Cup s 3.2 | 52.6 | 53.1 | 2 | |
| 10 | gi|38456230|gb|AAR21075.1| PR5 allergen Cup s 3.3 | 52.6 | 52.6 | 2 | |
| 11 | gi|135917|sp|P27357.1|TLP_WHEAT RecName: Full=Thau | 45.9 | 52.6 | 1 | |
| 12 | gi|9929163|emb|CAC05258.1| Cup a 3 protein [Hesper | 52.2 | 45.9 | 2 | |
| 13 | gi|190613911|gb|ACE80959.1| putative allergen Pru | 42 | 52.2 | 0 | |
| 14 | gi|25091405|sp|P83332.1|TLP1_PRUPE RecName: Full=T | 41.6 | 42 | 0 | |
| 15 | gi|190613909|gb|ACE80958.1| putative allergen Pru | 41.6 | 41.6 | 0 | |
| 16 | gi|190613907|gb|ACE80957.1| putative allergen Pru | 41.6 | 41.6 | 0 | |
| 17 | gi|60418848|gb|AAX19851.1| thaumatin-like protein | 42 | 41.6 | 0 | |
| 18 | gi|218059715|emb|CAT99611.1| thaumatin-like protei | 42.9 | 42 | 0 | |
| 19 | gi|60418842|gb|AAX19848.1| thaumatin-like protein | 42 | 42.9 | 0 | |
| 20 | gi|392507603|gb|AFM77001.1| pathogenesis related p | 46.2 | 42 | 0 | |
| 21 | gi|30316292|sp|Q9FSG7.1|TP1A_MALDO RecName: Full=T | 42 | 46.2 | 0 | |
| 22 | gi|218059718|emb|CAT99612.1| thaumatin-like protei | 42.9 | 42 | 0 | |
| 23 | gi|190613905|gb|ACE80956.1| putative allergen Pru | 41.6 | 42.9 | 0 | |
| 24 | gi|359744030|gb|AEV57471.1| thaumatin-like protein | 40.7 | 41.6 | 0 | |
| 25 | gi|1144346|gb|AAB38064.1| thaumatin-like protein p | 41.4 | 40.7 | 0 | |
| 26 | gi|25091406|sp|P83335.1|TLP2_PRUPE RecName: Full=T | 42 | 41.4 | 1 | |
| 27 | gi|190613903|gb|ACE80955.1| putative allergen Pru | 42 | 42 | 1 | |
| 28 | gi|190613941|gb|ACE80974.1| putative allergen Pru | 41.6 | 42 | 1 |
Allergic assessment of NP24 protein using Allermatch Tool: a] Results of FASTA alignment of input sequence against UniProt and WHO-IUIS database. (Number indicates to percentage identity). b] (i) Percent identical amino acids in the aligned 80-aa sliding window, (ii) the number of hits the input sequence had with this allergen, and (iii) the percentage of windows analyzed for this input sequence hitting this allergen iv) Results of a FASTA alignment of the complete input sequence against this database sequence. The first number is the percentage of identity. The second number is the length of sequence over which FASTA aligned c] the percentage of exact hits the input sequence is found to hit this allergen sequence.
| II | III | IV | |||||
|---|---|---|---|---|---|---|---|
| Q9ARG0 | 88.98 | 95 | 168 | 100 | 88.99 | 49.17 | |
| P81370 | 69.26 | 73.75 | 168 | 100 | 69.27 | 8.68 | |
| O22322 | 62.96 | 73.75 | 144 | 85.71 | 62.96 | 6.2 | |
| P81295 | 54.16 | 63.75 | 168 | 100 | 54.15 | 3.72 | |
| Q9LD79 | 59.77 | 62.5 | 67 | 39.88 | 59.77 | 1.65 | |
| Q69CS2 | 52.63 | 61.25 | 168 | 100 | 52.63 | 2.48 | |
| Q69CS3 | 52.63 | 61.25 | 168 | 100 | 52.63 | 2.48 | |
| B6CQT7 | 42.66 | 50 | 151 | 89.88 | 42.67 | 0.83 | |
| B6CQT5 | 42.22 | 48.78 | 152 | 90.48 | 42.22 | 0.83 | |
| Q9FSG7 | 42.66 | 48.75 | 155 | 92.26 | 42.67 | 0.83 | |
| P50694 | 41.77 | 46.67 | 136 | 80.95 | 41.78 | 1.65 | |
| B6CQT3 | 42.34 | 46.34 | 147 | 87.5 | 42.34 | 2.89 | |
Fig. 1(a) Protein NP24 B-cell epitopes predicted showed in yellow peaks. (b) Starting and ending positions of 12 predicted epitopes showed: Analysis of NP24 protein by Algpred tool revealed the presence of two IgE epitopes viz., ‘IgE epitope 1’, stretching from position 58–70 (PRGTKMARIWGRT) and ‘IgE epitope 2’, stretching from position 73–85(NFNAAGRGTCQTG) as shown in Fig. 2. Both IgE Epitope surfaces overlap with the thaumatin family signatures. Further, B-cell epitope residues with the high score (FNAAGRGTCQTGDC) and medium scores (PRGTK) were found in IgE epitopes 1 and 2, which indicate their higher accessibility for antibody recognition.
Fig. 2Phylogenetic analysis of NP24 protein using MEGA6 software. The evolutionary history was inferred using the Neighbor Joining method. The evolutionary distances were computed using the Jukes-Cantor method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates).
Distribution of Epitopes and their conservancy among sequences.
| 1 | 1 | AAGTASARFWGRT | 13 | 28.57% (4/14) | 38.46% | 100.00% |
| 2 | 2 | TFDASGKGSCQTG | 13 | 14.29% (2/14) | 30.77% | 100.00% |
| 3 | 3 | ADINAVCPSELK | 12 | 21.43% (3/14) | 25.00% | 100.00% |
| 4 | 4 | VDGGCNSACNVFKT | 14 | 7.14% (1/14) | 28.57% | 100.00% |
| 5 | 5 | KISEFYGRKP | 10 | 7.14% (1/14) | 30.00% | 100.00% |
| 6 | 6 | YSCGENSFMD | 10 | 7.14% (1/14) | 20.00% | 100.00% |
| 7 | 7 | YYNSLGFNIK | 10 | 7.14% (1/14) | 30.00% | 100.00% |
Epitope regions of NP24 predicted by using bioinformatics tools.
| DNA Star | Bcell | 6–10, 17–40,46–80, 103–106, 138–166, 173–207 | |||
| T cell-AMPHI | 22–28, 38–48, 64–85, 87–94, 98–105, 132–148, 164–176,178–179 | ||||
| Antigenicity- jameson-wolf | 5–9, 21–30, 36–41, 56–67, 78–81, 112–119, 129–138, 140–142, 171–177, 181–187 | ||||
| MHC-II epitopes | 16–21 | ||||
| ABC prediction | B cell | 38–53, 13–28, 131–146, 175–190, 4–19, 125–140, 119–134, 73–88,58–73, 1–8, 194–207,155–170,67–82,47–62 | |||
| IEDB | Bipred | 15–28, 38–42, 54–67, 75–83, 110–120, 130–133, 135–152, 156–167, 177–199 | |||
| Algpred | |||||
| AAGTASARFWGRT | PRGTKMARIWGRT | 58 | 61.53846 | ||
| TFDASGKGSCQTG | NFNAAGRGTCQTG | 73 | 61.53846 | ||
Fig. 3IgE epitope of Protein NP 24 by Algpred: Starting position in sequence for epitope1 and Epitope 2 showed as 58 & 73 respectively. Algpred tool was used to analyze the allergenicity of our protein NP24 and its homologous proteins, results are shown in Table 3. Analysis of proteins NP24 suggest that there are 7 different types of IgE epitopes found in the Homologous sequences namely Epitope 1 (AAGTASARFWGRT), Epitope 2 (TFDASGKGSCQTG), Epitope 3 (ADINAVCPSELK), Epitope 4 (VDGGCNSACNVFKT), Epitope 5(KISEFYGRKP), Epitope 6 (YSCGENSFMD) and Epitope 7 (YYNSLGFNIK). Few interesting findings of these predictions are as follows; 1) IgE epitopes ranges from 0 to 4 that is some protein sequences have 0 epitopes and some have maximum 4 epitopes. 2) IgE epitopes present in Protein NP 24 showed 42.85% presence in other homologous sequences, i.e. epitope 1 and epitope 2. 3) 13/14 (92.85%) sequences displayed either one or 4 IgE epitopes. 4) 1/14 (7.2%) displayed zero IgE epitope.
IgE epitopes of Homologous with NP 24 protein and their matched respective sequences.
| NOT available | AAGTASARFWGRT | PPGTAMARIWGRT | 58 | 7.53 | 23,984.78 | |
| TFDASGKGSCQTG | NFDGSGRGSCQTG | 73 | ||||
| 4bct | AAGTASARFWGRT | GAGTKGARVWPRT | 60 | 7.91 | 21,614.23 | |
| ADINAVCPSELK | ADINGQCPNELR | 148 | ||||
| TFDASGKGSCQTG | NFDGAGRGKCQTG | 75 | ||||
| VDGGCNSACNVFKT | APGGCNNPCTVFKT | 160 | ||||
| 1z3q | AAGTASARFWGRT | NAGTTGGRIWGRT | 62 | 6.68 | 20,429.05 | |
| TFDASGKGSCQTG | SFDGSGRGRCQTG | 77 | ||||
| 1kur | AAGTASARFWGRT | AAGTASARFWGRT | 62 | 4.79 | 20,994.34 | |
| ADINAVCPSELK | ADINAVCPSELK | 146 | ||||
| TFDASGKGSCQTG | TFDASGKGSCQTG | 77 | ||||
| VDGGCNSACNVFKT | VDGGCNSACNVFKT | 158 | ||||
| NOT available | ADINAVCPSELK | ANVNLVCPSELQ | 154 | 4.81 | 25650.06 | |
| NOT available | AAGTASARFWGRT | AAGTASARFWGRT | 62 | 5.32 | 21103.51 | |
| ADINAVCPSELK | ADINAVCPSELK | 146 | ||||
| TFDASGKGSCQTG | TFDASGKGSCRSG | 77 | ||||
| VDGGCNSACNVFKT | VDGGCNSACNVLQT | 158 | ||||
| NOT available | AAGTASARFWGRT | AAGTASARFWGRT | 62 | 5.32 | 21,103.51 | |
| ADINAVCPSELK | ADINAVCPSELK | 146 | ||||
| TFDASGKGSCQTG | TFDASGKGSCRSG | 77 | ||||
| VDGGCNSACNVFKT | VDGGCNSACNVLQT | 158 | ||||
| NOT available | Not available | NOT available | NOT available | 7.77 | 9517.51 | |
| 2AHN | ADINAVCPSELK | ANVNAVCPSELQ | 157 | 4.84 | 25,706.97 | |
| NOT available | ADINAVCPSELK | ADINKVCPAELQ | 158 | 5.13 | 25,840.16 | |
| NOT available | ADINAVCPSELK | ADINKVCPAPLQ | 158 | 8.31 | 23,337.33 | |
| 3zs3 | AAGTASARFWGRT | APSPWSGRFWGRT | 67 | 4.72 | 23,210.91 | |
| 3vor, 4aud | KISEFYGRKP | KISEFYGRKP | 41 | 4.75 | 16,979.93 | |
| YSCGENSFMD | YSCGENSFMD | 87 | ||||
| YYNSLGFNIK | YYNSLGFNIK | 54 | ||||
| Query sequence | AAGTASARFWGRT | PRGTKMARIWGRT | 58 | 8.28 | 26,646.21 | |
| TFDASGKGSCQTG | NFNAAGRGTCQTG | 73 | ||||
Fig. 4(a) Cleft numbering 1–10 was assigned based on its volume, highest being 1 indicated in red color, next to purple. (b) Protein NP24 showing IgE epitopes was predicted by AlgPred; the structure was shown as a cartoon. IgE epitopes 1 and 2 were shown in cyan and green stick shapes.
Fig. 5NP24 protein Model Quality: Left Plot shows overall model quality which lies within the range of scores typically found for native proteins of similar size while right plot shows local model quality by plotting energies as a function of amino acid sequence position. In general, positive values correspond to problematic parts of the input structure.
Fig. 6Dali structural alignment between crystallographic structure and predicted structure of NP24.
Fig. 7Different interactions of modeled NP24 protein.
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | Department of Food Engineering and Technology, Institute of Chemical Technology, Matunga (E), Mumbai, India |
| Data accessibility | |
| Related research article | Y. Kondo, A. Urisu, R. Tokuda, Identification and Characterization of the Allergens in the Tomato fruit by Immunoblotting, Int. Arch. Allergy Immunol. 126 (2001) 294–299. |
| R. Pressey, Two isoforms of NP24: a thaumatin-like protein in Tomato fruit, Phytochemistry. 44, 1241–1245. |