Literature DB >> 30377651

In silico assessment data of allergenicity and cross-reactivity of NP24 epitopes from Solanum lycopersicum (Tomato) fruit.

Majeed Jamakhani1, S S Lele1, Bhagwan Rekadwad2.   

Abstract

This paper describes data on allergies caused by food (vegetable) and their negative impact on the nutritional balance of the human body. Allergic responses to vegetables such as tomatoes, capsicum and spinach are next to fish, eggs and nuts. Epitopes such as NP24 (allergens) are one of the salt-induced allergenic proteins found in the thaumatin-like protein (TLP) family. The mechanism of allergenicity of TLP found in Solanum lycopersicum (Tomato) fruit is poorly studied. Here we demonstrated allergenicity conferred by the NP24 protein found in Tomato. The data on the cross-reactivity of NP24 protein was generated using Allergen Online and Allermatch tools. Tomato allergenic protein epitope shows a significant identity of with allergens reported in Capsicum, Olive, Kiwi, Tobacco and Banana allergens. Hence, the datasets of sequences, comparative analysis and homology epitope mapping over three dimensional (3D) structures revealed that NP24 has higher cross-reactivity to Capsicum and Tobacco proteins. Thus, this data probably act as limelight for planning wet lab experiments.

Entities:  

Keywords:  Epitope mapping; Fruit and vegetable allergies; IgE antibody; NP24 protein; Oral allergy syndrome

Year:  2018        PMID: 30377651      PMCID: PMC6203245          DOI: 10.1016/j.dib.2018.09.074

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications table Value of the data This data depicts a method for the detection of severe allergic reactions caused by an allergic protein found in plants. This result in the release of histamine. It would be an indicator of IgE antibodies bound by tomato allergens. Data gives in-depth information on allergic NP24 protein found tomato which shows top identity with bell pepper, kiwi and olive TLPs. This method will be useful for prediction of IgE epitopes in food, vegetables and pollen TLPs laid in epitopes (either region 1 and 2). This work would act as a limelight for planning wet lab experiments.

Data

This paper describes the data on plant proteins (PR1 to PR 17) especially belonging to vegetables. Allergic plant protein from Solanum lycopersicum (Tomato) such as Thaumatin protein belonged to PR-5 group. It shows functional diversity in allergenicity and kinase function. Tomato fruit NP24 protein has seven different types of IgE epitopes. Mapping of IgE epitopes of NP 24 protein shows that NP 24 protein is allergic and shows cross-reactivity as well. Nutritious food is required for growth, development and maintenance of health. Most commonly used food such as fishes, eggs, fruits and nuts have wide acceptability and fall under most nutritious foods. On contrary, intake of such food can cause allergy or create allergic situations in some individuals. Overreaction of body immune defence system is the cause for allergies. A study of the prevalence of sensitization to foods in Europe was carried out with 4522 individuals living in 13 countries for IgE test against 24 foods. The survey reported that individuals from most of the countries the high prevalence of vegetables, fruits and nuts than to eggs, milk and kinds of seafood. Allergic sensitivity to nuts was 7% whereas 0.2 and 0.4% to fish and eggs respectively. Similarly, Tomato is one of the most commonly consumed vegetables across the next to apple and wheat. “Solanum lycopersicum fruits that are called as vegetables by “Nutritionists”. Food allergy from Tomato fruit showed the significant allergic prevalence of 3.3% [1]. Prevalence of clinical oral food challenge (OFC proven) for food allergy test in preschool children in developed countries was reported to be as high as 10%. Unlike in developed countries, it is 7% in Asian countries such as China and India [2]. However, urbanization increased consumption of processed food and stressful lifestyle has resulted in reduced immunity and increased allergies to foods especially in children [3], [4]. The hypersensitive response is one of the most efficient mechanisms for conferring immunity a from phytopathogens which include fungi, bacteria and viruses. The pathogen-related proteins (PR proteins) are defensive molecules which protect plants. These PR proteins are belonging to the family of "stress-inducible" proteins. These were first discovered in Nicotiana tabacum (Tobacco) plants causing hyper sensitively to infection from Tobacco Mosaic Virus [5], [6]. Later, many PR proteins have been detected in other plant species [7], [8], [9], [10], [11]. The pathogenesis-related protein families are broadly classified into 17 groups. Amongst them, thaumatin-like proteins (PR-5) is the fifth group of the PR protein family having molecular weights ranged from 20 to 26 kDa. They were named as thaumatin-like proteins because the amino acid sequence is homologous to thaumatin-a sweet-tasting protein derived from Thaumatococcus daniellii [12]. Thaumatin and Thaumatin-like proteins also identified in animals [13] and fungi [14], [15]. The thaumatin family protein has eight disulphide residues [16], [17]. Despite the lack of atopic individuals (in humans), pollens and food TLPs are identified as inhalants and ingestant respectively [18]. Reports show that 39.2% of children are monosensitized to grass pollen and has allergy to Tomato fruits IgE antibodies [19]. In 1988 Ortolani et al., confirmed that association between Tomato fruit oral allergy syndrome (OAS) and grass pollen allergy is statistically significant [20]. It has also been reported that sometimes anaphylaxis arises within few hours as soon as Tomato fruit consumed [21]. Several allergens in Tomato fruit have been described such as Sol-l1, Sol-l2, Sol-l3, Sol-l chitinase, Sol-l-Glucanase, Sol-l-peroxidase and Sola-TLP (NP24). But, the clinical relevance of each of these allergens is yet not clear. Only a limited number of TLPs have been identified from plant pollens and foods. Among fruits, only hybrid forms of TLPs are shown a hypertensive reaction with IgE [22], [23], [24]. So far, out of 15 allergen TLPs, only 7 TLPs have been crystallized and their 3D structures were elucidated [25]. However, comparative analysis of the structural features of allergenic TLPs in the context of prediction of IgE epitopes have not been reported so far. The salt-induced TLPs is Protein NP24 (molecular weight 24 kDa) containing 247 amino acids is found in Tomato fruit tissues [26]. Previously NP24 was first isolated, purified and crystallized from Tomato fruit fruits [27]. Later studies reported that there are two isoforms (I and II) of the thaumatin-like protein NP24 present [19]. Isoform-I was expressed mainly in the outer pericarp of healthy Tomato fruit fruits and low in green Tomato fruits which subsequently increases during ripening of fruit. On the other hand, Isoform-II is relatively high in green Tomato fruits. It׳s concentration rise as the fruit turns pink and subsequently decreases as the fruit turns red. Fully ripened Tomato fruit (mature fruit) will have mainly isoform-I and the half-ripened fruit will have both isoforms (I & II) in significant quantity. Detection allergy in individuals either in vivo or in vitro using molecular biology techniques is very difficult task time-consuming task and cost ineffective as well [28]. Possible development of severe allergic risks during the test and lack of sensitivity of allergic reaction are the major drawbacks of in vivo and in vitro methods. These shortcomings make the computational method as a good approach for the identification of epitopes and allergenicity. From the above discussion, it may be inferred that vegetable allergies, especially by consumption of Tomato fruit are much more prevalent than what one would expect. Hence the present work was undertaken mainly for computational analysis of NP24 protein from Tomato fruits in order to identify allergic components such as IgE epitopes, their position and possible cross-reactivity with other food and pollen TLPs.

Experimental design, materials, and methods

The protein sequence data of the NP24/Thaumatin-like protein

The NP24 protein sequences were retrieved from the National Center for Biotechnology Information (https://www.ncbi.nlm.nih.gov/). Search result yielded around 5, 359 hits from different organisms. Further narrowing down to Solanum lycopersicum yielded 44 results. We have used the complete gene sequence which has 247 amino acids (NCBI accession number P12670).

Prediction of cross reactivity of NP24 protein

The cross-reactivity of Protein NP 24 sequence determined using freewares such as Allergen online (www.allergome.org) and Allermatch (http://allermatch.org/) tools. Both online tools give comparative data on cross-reactivity and IgE binding properties of clinically important NP24 protein.

Prediction of 3-dimensional (3D) confirmation of NP24 protein

The HHpred method was used to predict the 3D structure of homologous sequences. HHpred utilizes Hidden Markov Model – Hidden Markov Model (HMM-HMM) algorithm/ (http://toolkit.tuebingen.mpg.de/) for identification of 3D structure. This modeling process involves various steps such as (a) Database search and E value: Here homologous sequences detected greater than 90% and very less E value are considered, (b) To check similarity between secondary structure of sequences, (c) Identification of possible conservative motifs for designing structure, (d) Aligning the target sequence with the template structure, and (e) Realignment of sequences.

Data on prediction and characterization of antigenic determinants of NP24 protein

Antigenic determinants are the part of an antigen. These are identified by the antibody, B cells or T cells, react with them and produce hypersensitivity reactions. Prediction of linear B cell with accuracy is a challenging process to design immunotherapy. BepiPred prediction method adopted for identification and locating B cell epitopes by a combination of Hidden-Markov model and Parker & Levitt propensity scale algorithm. Algpred tool (http://www.imtech.res.in/raghava/algpred/submission.html) used for the prediction of antigenic determinant binding to IgE. To predict allergens, initially, BLAST was used to identify sequences and aligned using Allergen Representative Peptides. Multiple Em for Motif Elicitation (MEME) tool was used for discovering motifs in a group of related protein sequences followed by statistical analysis. Phylogenetic tree analysis was done using MEGA6.

IEDB homology mapping of NP24 protein

Mapping gives the best representation of antigenic determinant points over the 3D structure. It also provides the information about the position of epitopes i.e. whether epitope regions lies inside the structure or at the surface level and how this epitope distributed over the similar sequences as that of the query sequence. IEDB epitope homology mapping tool (http://tools.immuneepitope.org/tools/bcell/iedb) detects PDB that are homologous to the epitope source sequence.

Epitope Conservancy Analysis (ECA)

ECA tool computes the degree of conservancy of an epitope within a given protein sequence set at a given identity level. Obtained results were represented as summary view (for all epitope sequences) and a detail view (for individual epitope). The summary view for each epitope shows degree of conservancy (percentage of protein sequence matches a specified identity level) and the matching minimum/maximum identity levels within the protein sequence set. The detail view of an epitope shows the positions and the matching protein sub-sequences for all sequences in the protein dataset.

Datasets of cross-reactivity of NP24 protein

Allergen Online, Allermatch and FARRP tools database gave significant information regarding cross-reactivity. In 2000 Alberse hypothesized that greater than 70% identity by comparing a query sequence with homologous known allergen sequences showed the significant cross-reactivity. While those with less than 50% identities are unlikely to be cross-reactive. This suggests that alignment of the query with greater than 50% identity with allergen sequence were cross-reactive in full-length alignment. In 80-mer sliding window method, similarity search was performed for every 80-amino acids segment of the query sequence. The cutoff value was greater than 35% (as per FAO/WHO 2001 expert panel recommendation) which indicates the possible cross-reactivity of allergens. However, in the 8-amino acid exact match (8-mer) method, any exact match for the query was considered to identify the protein as a potential cross-reactive allergen. Cross-reactivity analysis using all three methods in both FARRP & Allermatch gives in-depth cross-reactivity information. Table 1 shows the results of allergen online database. In full-length alignment method for allergic food, Kiwi, Olive and Sapota showed significant identity to NP24 (above 65%), whereas allergic pollen like Japanese Cedar and White Cedar showed an identity of approximately 50%. In-80-slide window result for protein NP24, the number of 80-mer sequences was found to be 168 among that 29 sequences showed matching of 80 amino acid stretches with the allergens deposited in databases. In the 8-mer, the total number of 8-mers were 240. Of which, 15 sequences with at least one 8-mer match corresponding to allergens found in the database.
Table 1

Full-length (FL) alignment method lists allergen matches with > 50% identity, 80-mer match method lists allergen matches with > 35% identity, and 8-mer match method lists allergens with at least one exact match.

HitDescriptionSpeciesFull lengthSliding 80-mer8-mer
Hits %Hits %Hits %
1gi|146737976|gb|ABQ42566.1| thaumatin-like proteinActinidia deliciosa68.865.99
2gi|269996497|gb|ACZ57583.1| allergenic thaumatin [Olea europaea64.168.812
3gi|349503011|gb|AEP84104.1| acidic thaumatin-likeManilkara zapota6564.119
4gi|139002766|dbj|BAF51970.1| thaumatin-like proteiCryptomeria japonica52.8655
5gi|9087177|sp|P81295.1|PRR3_JUNAS RecName: Full=PaJuniperus ashei54.152.83
6gi|38456224|gb|AAR21072.1| PR5 allergen Jun r 3.2Juniperus rigida53.654.12
7gi|51316532|sp|Q9LD79.2|PRR3_JUNVI RecName: Full=PJuniperus virginiana58.253.60
8gi|38456222|gb|AAR21071.1| PR5 allergen Jun r 3.1Juniperus rigida53.158.22
9gi|38456228|gb|AAR21074.1| PR5 allergen Cup s 3.2Cupressus sempervirens52.653.12
10gi|38456230|gb|AAR21075.1| PR5 allergen Cup s 3.3Cupressus sempervirens52.652.62
11gi|135917|sp|P27357.1|TLP_WHEAT RecName: Full=ThauTriticum aestivum45.952.61
12gi|9929163|emb|CAC05258.1| Cup a 3 protein [HesperCupressus arizonica52.245.92
13gi|190613911|gb|ACE80959.1| putative allergen PruPrunus dulcis, Prunus persica4252.20
14gi|25091405|sp|P83332.1|TLP1_PRUPE RecName: Full=TPrunus persica41.6420
15gi|190613909|gb|ACE80958.1| putative allergen PruPrunus dulcis, Prunus persica41.641.60
16gi|190613907|gb|ACE80957.1| putative allergen PruPrunus dulcis, Prunus persica41.641.60
17gi|60418848|gb|AAX19851.1| thaumatin-like proteinMalus, domestica4241.60
18gi|218059715|emb|CAT99611.1| thaumatin-like proteiMalus, domestica42.9420
19gi|60418842|gb|AAX19848.1| thaumatin-like proteinMalus, domestica4242.90
20gi|392507603|gb|AFM77001.1| pathogenesis related pMalus, domestica46.2420
21gi|30316292|sp|Q9FSG7.1|TP1A_MALDO RecName: Full=TMalus, domestica4246.20
22gi|218059718|emb|CAT99612.1| thaumatin-like proteiMalus, domestica42.9420
23gi|190613905|gb|ACE80956.1| putative allergen PruPrunusdulcis, Prunus persica41.642.90
24gi|359744030|gb|AEV57471.1| thaumatin-like proteinPrunus persica40.741.60
25gi|1144346|gb|AAB38064.1| thaumatin-like protein pPrunus avium41.440.70
26gi|25091406|sp|P83335.1|TLP2_PRUPE RecName: Full=TPrunus persica4241.41
27gi|190613903|gb|ACE80955.1| putative allergen PruPrunus dulcis, Prunus persica42421
28gi|190613941|gb|ACE80974.1| putative allergen PruPrunus dulcis41.6421
Full-length (FL) alignment method lists allergen matches with > 50% identity, 80-mer match method lists allergen matches with > 35% identity, and 8-mer match method lists allergens with at least one exact match. In full-length alignment method, interestingly food alignment window for protein NP24 shows more than 50% identity to TLPs Capsicum, Kiwi fruit, Banana, Apple, White Cedar and Cupressus sempervirens. In 80-slide window and exact match method result for protein NP 24 showed the similar result as of full length (Table 2).
Table 2

Allergic assessment of NP24 protein using Allermatch Tool: a] Results of FASTA alignment of input sequence against UniProt and WHO-IUIS database. (Number indicates to percentage identity). b] (i) Percent identical amino acids in the aligned 80-aa sliding window, (ii) the number of hits the input sequence had with this allergen, and (iii) the percentage of windows analyzed for this input sequence hitting this allergen iv) Results of a FASTA alignment of the complete input sequence against this database sequence. The first number is the percentage of identity. The second number is the length of sequence over which FASTA aligned c] the percentage of exact hits the input sequence is found to hit this allergen sequence.

Protein IDSpecies NameFull lengtha(%)80 merwindowb(%)
6 amino acid matchc(%)
IIIIIIIV
Q9ARG0Capsicum annuum (bell peper)88.989516810088.9949.17
P81370Actinidiadeliciosa (kiwi)69.2673.7516810069.278.68
O22322Musa acuminata (banana)62.9673.7514485.7162.966.2
P81295Juniperusashei (white cedar)54.1663.7516810054.153.72
Q9LD79Juniperusvirginiana (red cedar)59.7762.56739.8859.771.65
Q69CS2Cupressussempervirens (cypress)52.6361.2516810052.632.48
Q69CS3Cupressussempervirens (cypress)52.6361.2516810052.632.48
B6CQT7Prunuspersica (peach)42.665015189.8842.670.83
B6CQT5Prunuspersica (peach)42.2248.7815290.4842.220.83
Q9FSG7Malusdomestica (Apple)42.6648.7515592.2642.670.83
P50694Prunusavium (cherry)41.7746.6713680.9541.781.65
B6CQT3Prunuspersica (peach)42.3446.3414787.542.342.89
Allergic assessment of NP24 protein using Allermatch Tool: a] Results of FASTA alignment of input sequence against UniProt and WHO-IUIS database. (Number indicates to percentage identity). b] (i) Percent identical amino acids in the aligned 80-aa sliding window, (ii) the number of hits the input sequence had with this allergen, and (iii) the percentage of windows analyzed for this input sequence hitting this allergen iv) Results of a FASTA alignment of the complete input sequence against this database sequence. The first number is the percentage of identity. The second number is the length of sequence over which FASTA aligned c] the percentage of exact hits the input sequence is found to hit this allergen sequence. IgE epitopes of Homologous with NP 24 protein and their matched respective sequences.

Recognition and assessing antigenic determinants of NP24 protein

NP24 protein sequence was retrieved from the SwissProt database (accession number P12670) and analyzed for B cell epitopes using IEDB tool. Prediction scores for each residue of NP24 obtained by IEDB BepiPred (Fig. 1). The residues with scores above the cutoff value (>35%) was predicted as epitope and highlighted in yellow color. Total 12 epitopes were identified. Amongst 12 epitopes, epitope number 10, 6, 2 and 11 have shown high scores followed epitope 8, 7 and 4 (with intermediate scores) which followed by epitope 1, 3, 5, 9 and 12.
Fig. 1

(a) Protein NP24 B-cell epitopes predicted showed in yellow peaks. (b) Starting and ending positions of 12 predicted epitopes showed: Analysis of NP24 protein by Algpred tool revealed the presence of two IgE epitopes viz., ‘IgE epitope 1’, stretching from position 58–70 (PRGTKMARIWGRT) and ‘IgE epitope 2’, stretching from position 73–85(NFNAAGRGTCQTG) as shown in Fig. 2. Both IgE Epitope surfaces overlap with the thaumatin family signatures. Further, B-cell epitope residues with the high score (FNAAGRGTCQTGDC) and medium scores (PRGTK) were found in IgE epitopes 1 and 2, which indicate their higher accessibility for antibody recognition.

(a) Protein NP24 B-cell epitopes predicted showed in yellow peaks. (b) Starting and ending positions of 12 predicted epitopes showed: Analysis of NP24 protein by Algpred tool revealed the presence of two IgE epitopes viz., ‘IgE epitope 1’, stretching from position 58–70 (PRGTKMARIWGRT) and ‘IgE epitope 2’, stretching from position 73–85(NFNAAGRGTCQTG) as shown in Fig. 2. Both IgE Epitope surfaces overlap with the thaumatin family signatures. Further, B-cell epitope residues with the high score (FNAAGRGTCQTGDC) and medium scores (PRGTK) were found in IgE epitopes 1 and 2, which indicate their higher accessibility for antibody recognition.
Fig. 2

Phylogenetic analysis of NP24 protein using MEGA6 software. The evolutionary history was inferred using the Neighbor Joining method. The evolutionary distances were computed using the Jukes-Cantor method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates).

Epitope conservancy analysis and distribution of epitopes

Epitope conservancy analysis tool calculates the degree of conservancy of an epitope within a given protein sequence set at a different degree of sequence identity. The degree of conservation is defined as “the fraction of protein sequences containing the epitope at a given identity level”. Epitome conservancy analysis was performed on 7 identified epitopes from Algpred tool. It was observed that IgE Epitopes 1, 3 & 2 showed the highest degree of the conservancy. Detailed analysis is depicted in Table 4 shows epitope sequence, starting position, ending position and percentage of identity of the query sequence. Epitope regions of NP24 predicted by different tools as discussed earlier. Table 5 shows positions 50 to 73 of the NP24 sequence.
Table 4

Distribution of Epitopes and their conservancy among sequences.

Epitope No.Epitope nameSequence of epitopeLength of epitope sequencePercentage of protein sequence matches at identity ≥ 100%Minimum identityMaximum identity
11AAGTASARFWGRT1328.57% (4/14)38.46%100.00%
22TFDASGKGSCQTG1314.29% (2/14)30.77%100.00%
33ADINAVCPSELK1221.43% (3/14)25.00%100.00%
44VDGGCNSACNVFKT147.14% (1/14)28.57%100.00%
55KISEFYGRKP107.14% (1/14)30.00%100.00%
66YSCGENSFMD107.14% (1/14)20.00%100.00%
77YYNSLGFNIK107.14% (1/14)30.00%100.00%
Table 5

Epitope regions of NP24 predicted by using bioinformatics tools.

ToolsEpitope TypesEpitope Regions obtained
DNA StarBcell6–10, 17–40,46–80, 103–106, 138–166, 173–207
T cell-AMPHI22–28, 38–48, 64–85, 87–94, 98–105, 132–148, 164–176,178–179
Antigenicity- jameson-wolf5–9, 21–30, 36–41, 56–67, 78–81, 112–119, 129–138, 140–142, 171–177, 181–187
MHC-II epitopes16–21
ABC predictionB cell38–53, 13–28, 131–146, 175–190, 4–19, 125–140, 119–134, 73–88,58–73, 1–8, 194–207,155–170,67–82,47–62
IEDBBipred15–28, 38–42, 54–67, 75–83, 110–120, 130–133, 135–152, 156–167, 177–199
AlgpredIgE epitopeSequence-matchedPositionPID
AAGTASARFWGRTPRGTKMARIWGRT5861.53846
TFDASGKGSCQTGNFNAAGRGTCQTG7361.53846
Distribution of Epitopes and their conservancy among sequences. Epitope regions of NP24 predicted by using bioinformatics tools.

Phylogenetic analysis of NP24 protein

The phylogenetic tree was inferred using MEGA6 tool to understand the evolutionary pattern of homologous sequences which share IgE epitopes with NP24 (Fig. 2). Homologous sequences obtained from epitope conservancy were alignment by using MUSCEL. The phylogenetic tree was constructed by MEGA 6.0 tool. Protein NP24 is closely related to osmotic like protein of Capsicum annum (Fig. 3) sharing the Epitope 1 & 2. This infers that the person with Tomato fruit allergy might have an allergy to Capsicum.
Fig. 3

IgE epitope of Protein NP 24 by Algpred: Starting position in sequence for epitope1 and Epitope 2 showed as 58 & 73 respectively. Algpred tool was used to analyze the allergenicity of our protein NP24 and its homologous proteins, results are shown in Table 3. Analysis of proteins NP24 suggest that there are 7 different types of IgE epitopes found in the Homologous sequences namely Epitope 1 (AAGTASARFWGRT), Epitope 2 (TFDASGKGSCQTG), Epitope 3 (ADINAVCPSELK), Epitope 4 (VDGGCNSACNVFKT), Epitope 5(KISEFYGRKP), Epitope 6 (YSCGENSFMD) and Epitope 7 (YYNSLGFNIK). Few interesting findings of these predictions are as follows; 1) IgE epitopes ranges from 0 to 4 that is some protein sequences have 0 epitopes and some have maximum 4 epitopes. 2) IgE epitopes present in Protein NP 24 showed 42.85% presence in other homologous sequences, i.e. epitope 1 and epitope 2. 3) 13/14 (92.85%) sequences displayed either one or 4 IgE epitopes. 4) 1/14 (7.2%) displayed zero IgE epitope.

Phylogenetic analysis of NP24 protein using MEGA6 software. The evolutionary history was inferred using the Neighbor Joining method. The evolutionary distances were computed using the Jukes-Cantor method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates). IgE epitope of Protein NP 24 by Algpred: Starting position in sequence for epitope1 and Epitope 2 showed as 58 & 73 respectively. Algpred tool was used to analyze the allergenicity of our protein NP24 and its homologous proteins, results are shown in Table 3. Analysis of proteins NP24 suggest that there are 7 different types of IgE epitopes found in the Homologous sequences namely Epitope 1 (AAGTASARFWGRT), Epitope 2 (TFDASGKGSCQTG), Epitope 3 (ADINAVCPSELK), Epitope 4 (VDGGCNSACNVFKT), Epitope 5(KISEFYGRKP), Epitope 6 (YSCGENSFMD) and Epitope 7 (YYNSLGFNIK). Few interesting findings of these predictions are as follows; 1) IgE epitopes ranges from 0 to 4 that is some protein sequences have 0 epitopes and some have maximum 4 epitopes. 2) IgE epitopes present in Protein NP 24 showed 42.85% presence in other homologous sequences, i.e. epitope 1 and epitope 2. 3) 13/14 (92.85%) sequences displayed either one or 4 IgE epitopes. 4) 1/14 (7.2%) displayed zero IgE epitope.
Table 3

IgE epitopes of Homologous with NP 24 protein and their matched respective sequences.

Species NamePDB structuresIgE epitopes
PIMW
IgE epitope sequenceMatched sequencePosition
Capsicum annuum (Q9ARG0)NOT availableAAGTASARFWGRTPPGTAMARIWGRT587.5323,984.78
TFDASGKGSCQTGNFDGSGRGSCQTG73
Actinidia deliciosa ( Kiwi) (P81370)4bctAAGTASARFWGRTGAGTKGARVWPRT607.9121,614.23
ADINAVCPSELKADINGQCPNELR148
TFDASGKGSCQTGNFDGAGRGKCQTG75
VDGGCNSACNVFKTAPGGCNNPCTVFKT160
Musa acuminate (Banana) (O22322)1z3qAAGTASARFWGRTNAGTTGGRIWGRT626.6820,429.05
TFDASGKGSCQTGSFDGSGRGRCQTG77
Juniperusashei (Mountain cedar) (P81295)1kurAAGTASARFWGRTAAGTASARFWGRT624.7920,994.34
ADINAVCPSELKADINAVCPSELK146
TFDASGKGSCQTGTFDASGKGSCQTG77
VDGGCNSACNVFKTVDGGCNSACNVFKT158
Prunus persica (Peach) (B6CQT3)NOT availableADINAVCPSELKANVNLVCPSELQ1544.8125650.06
Cupressussem pervirens (Q69CS2)NOT availableAAGTASARFWGRTAAGTASARFWGRT625.3221103.51
ADINAVCPSELKADINAVCPSELK146
TFDASGKGSCQTGTFDASGKGSCRSG77
VDGGCNSACNVFKTVDGGCNSACNVLQT158
Cupressussem pervirens (Q69CS3)NOT availableAAGTASARFWGRTAAGTASARFWGRT625.3221,103.51
ADINAVCPSELKADINAVCPSELK146
TFDASGKGSCQTGTFDASGKGSCRSG77
VDGGCNSACNVFKTVDGGCNSACNVLQT158
Juniperus virginiana (Q9LD79)NOT availableNot availableNOT availableNOT available7.779517.51
Prunus avium (P50694)2AHNADINAVCPSELKANVNAVCPSELQ1574.8425,706.97
Prunus persica (B6CQT5)NOT availableADINAVCPSELKADINKVCPAELQ1585.1325,840.16
Prunus persica (B6CQT7)NOT availableADINAVCPSELKADINKVCPAPLQ1588.3123,337.33
Malus domestica (Q9FSG7)3zs3AAGTASARFWGRTAPSPWSGRFWGRT674.7223,210.91
Alternaria alternate (P79085)3vor, 4audKISEFYGRKPKISEFYGRKP414.7516,979.93
YSCGENSFMDYSCGENSFMD87
YYNSLGFNIKYYNSLGFNIK54
Solanum lycopersicum (P12670)Query sequenceAAGTASARFWGRTPRGTKMARIWGRT588.2826,646.21
TFDASGKGSCQTGNFNAAGRGTCQTG73

Data on computational modeling of NP24 protein

Crystal structure of NP24-I (PDB ID: 2i0w) was created using X-ray diffraction, at a resolution of 2.5 Å and deposited in PDB in 2006. An in silico modeling of NP24-I was compared with the crystal structure of NP 24 using DALI tool. Homology method was used to develop the model of Protein NP24. Total 11 crystallographic structures of TLPs were available in RCSB PDB database. In order to construct the structure of NP24, one can choose any one of these or a combination of these structures as templates. On the basis of alignment score and coverage of sequences, 6 structures were selected namely- cherry allergen Pru av 2(2AHN), Pathogenesis-related protein 5d from Nicotiana tabacum (1AUN), kiwi-fruit allergen Act d 2 (4BCT), Zeamatin (1DU5), Thaumatin I (2VHK) & thaumatin-like xylanase inhibitor TLXI (3G7M). The further 3D model was generated using Modeller of HHpred (Fig. 4).
Fig. 4

(a) Cleft numbering 1–10 was assigned based on its volume, highest being 1 indicated in red color, next to purple. (b) Protein NP24 showing IgE epitopes was predicted by AlgPred; the structure was shown as a cartoon. IgE epitopes 1 and 2 were shown in cyan and green stick shapes.

(a) Cleft numbering 1–10 was assigned based on its volume, highest being 1 indicated in red color, next to purple. (b) Protein NP24 showing IgE epitopes was predicted by AlgPred; the structure was shown as a cartoon. IgE epitopes 1 and 2 were shown in cyan and green stick shapes. To understand the model quality and recognition of errors in the model, PROSA tool was used. PROSA result shows overall model quality and local model quality. Quality has been assessed using Z-score. Obtained Z-score -4.43 lies within the range (Fig. 5). Furthermore, we have aligned the model structure with that of the crystal structure of NP24-I (PDB ID: 2i0w) using the DaliLite tool. Both structures have a greater identity with Z-score equal to 35.4 and RMS of C-alpha value equal to 0.6 (Fig. 6). Structural analysis of modeled protein shows that NP24-I has five helices, 19 strands, 34 turns, 128 hydrogen bonds and eight disulphide bridges which stabilized the entire structure. All interactions including disulphide bridges are shown in Fig. 7. It was observed that there were 10 clefts. Out of 10 clefts, four have higher volumes which are more significant. It was also observed that different residues present in protein NP24 according to their properties of an amino acid. The first cleft is the biggest cleft with the volume 2582 Å contains one cysteine molecule, three positive and two negatively charged atoms with more number of aliphatic residues. Whereas the second cleft with 1766 2582 Å volume contains two cysteine molecules which show a more stabilized structure. An important feature of the 2nd cleft is the presence of a higher number of neutral and aromatic residues. IgE epitopes present in protein NP24 predicted by AlgPred tool show stick-like shape.
Fig. 5

NP24 protein Model Quality: Left Plot shows overall model quality which lies within the range of scores typically found for native proteins of similar size while right plot shows local model quality by plotting energies as a function of amino acid sequence position. In general, positive values correspond to problematic parts of the input structure.

Fig. 6

Dali structural alignment between crystallographic structure and predicted structure of NP24.

Fig. 7

Different interactions of modeled NP24 protein.

NP24 protein Model Quality: Left Plot shows overall model quality which lies within the range of scores typically found for native proteins of similar size while right plot shows local model quality by plotting energies as a function of amino acid sequence position. In general, positive values correspond to problematic parts of the input structure. Dali structural alignment between crystallographic structure and predicted structure of NP24. Different interactions of modeled NP24 protein.

IgE Epitope mapping using IEDB׳s homology mapping tool

Predicted IgE epitopes in NP24 protein were mapped on 3D structures and the sequence of some TLPs. It shows source sequences appeared in the regions that are similar to the epitope sequence. The epitope location annotated by IEDB were highlighted by green and orange color which indicate perfect sequence matches while other matches (identity > 80%, overlap >= 80% and no more than one gap) indicated in light grey color. Epitope mapping resulted in 19 hits which had epitope matches to the structures. These 19 hits have sequence similarity greater than 39%. This study also shows number of residues of the epitope are exactly present in the homologous structures of proteins. Pathogenesis-related (PR) group of protein is one of the specific plant proteins group especially found in vegetables which has the number of types from ranged from PR1 to PR 17 [29], [30]. Protein sequences which show maximum similarity to Thaumatin protein (TP) belonged to PR-5 group. Most of the PR families show functional diversity in allergenicity and kinase function despite their sequence similarity [31]. Current understanding of the cross-reactivity of food allergens from Tomato fruit is very rare. NP24 protein alignment and analysis using 80-mer window methods (FARRP and Allermatch) shows that Kiwi, Bell Pepper, Banana, Wheat and Peach show > 60% identity with Thaumatin-like proteins (TLPs) from plant food (vegetables). The pollen TLPs considerable similarity numbers of pollen TLP were very few [32]. The number of 8-mer matches with NP24 protein by FARRP and Allermatch was found to be high for Bell Pepper, Olive and Kiwi. AlgPred epitope analysis of NP24 protein indicates the presence of seven (7) different types of IgE epitopes in the homologous sequences. Among TLPs, some protein sequences very few have only four epitopes and others are devoid of epitopes. IgE epitopes 1 and 2 are the predominant of food TLPs. On contrary, IgE epitopes 1, 2 and 3 frequently in most pollen TLPs. Secondary structure analysis shows that structure of NP24 protein is much more similar to other TLPs. The secondary structure of NP24 shows high percentages of amino acids such as glycine 28/247 (11.33%), threonine 25/247 (10.12%) and proline 21/247 (8.5%). Mapping of NP24 protein epitopes: The two predicted IgE epitopes (1 and 2) of NP24 protein have been mapped. It was observed that some residues in IgE epitopes one (1) and two (2) were buried. But, most of the residues were readily accessible and specific IgE of protein NP24 produce allergic reactions. The protein NP24 is a commonly found component in Tomato fruits and Spinach leaves. It shows the close match with allergenic TLPs of Capsicum, Kiwi, White Cedar, Cupressus sempervirens and Banana suggesting cross allergic reaction. Two unique IgE epitope of NP24 protein were identified viz. Epitope 1 (AAGTASARFWGRT), Epitope 2 (TFDASGKGSCQTG) at positions 58–70 & 73–85 position respectively. Amongst seven IgE epitopes, the epitope number 1, 2 & 3 showed a greater degree of conservancy within the homologous sequence to NP24 protein. Phylogenetic analysis of protein NP24 with other TLPs revealed that Capsicum shows highest allergic cross-reactivity with Tomato fruit NP24 protein.
Subject areaBiotechnology
More specific subject areaProteomics, Bioinformatics
Type of dataTables, images figures
How data was acquiredIn silico analysis
Data formatBoth, raw and analyzed
Experimental factorsAs specified and recommended by tools and algorithm
Experimental featuresStructural and functional analysis of protein NP24
Data source locationDepartment of Food Engineering and Technology, Institute of Chemical Technology, Matunga (E), Mumbai, India
Data accessibilityPrimary data is available in NCBI repository portal. Analysed data is available within this article.
Related research articleY. Kondo, A. Urisu, R. Tokuda, Identification and Characterization of the Allergens in the Tomato fruit by Immunoblotting, Int. Arch. Allergy Immunol. 126 (2001) 294–299.
R. Pressey, Two isoforms of NP24: a thaumatin-like protein in Tomato fruit, Phytochemistry. 44, 1241–1245.
  27 in total

Review 1.  Thaumatin-like proteins -- a new family of pollen and fruit allergens.

Authors:  Heimo Breiteneder
Journal:  Allergy       Date:  2004-05       Impact factor: 13.146

Review 2.  Allergic disease in urban and rural populations: increasing prevalence with increasing urbanization.

Authors:  N Nicolaou; N Siddique; A Custovic
Journal:  Allergy       Date:  2005-11       Impact factor: 13.146

3.  Prevalence and distribution of sensitization to foods in the European Community Respiratory Health Survey: a EuroPrevall analysis.

Authors:  P Burney; C Summers; S Chinn; R Hooper; R van Ree; J Lidholm
Journal:  Allergy       Date:  2010-02-22       Impact factor: 13.146

4.  Lentinula edodes tlg1 encodes a thaumatin-like protein that is involved in lentinan degradation and fruiting body senescence.

Authors:  Yuichi Sakamoto; Hisayuki Watanabe; Masaru Nagai; Keiko Nakade; Machiko Takahashi; Toshitsugu Sato
Journal:  Plant Physiol       Date:  2006-04-28       Impact factor: 8.340

5.  Polyacrylamide disc electrophoresis of the soluble leaf proteins from Nicotiana tabacum var. "Samsun" and "Samsun NN". II. Changes in protein constitution after infection with tobacco mosaic virus.

Authors:  L C van Loon; A van Kammen
Journal:  Virology       Date:  1970-02       Impact factor: 3.616

6.  Identification and characterization of the allergens in the tomato fruit by immunoblotting.

Authors:  Y Kondo; A Urisu; R Tokuda
Journal:  Int Arch Allergy Immunol       Date:  2001-12       Impact factor: 2.749

7.  In silico analyses of structural and allergenicity features of sapodilla (Manilkara zapota) acidic thaumatin-like protein in comparison with allergenic plant TLPs.

Authors:  Hassan G Ashok Kumar; Yeldur P Venkatesh
Journal:  Mol Immunol       Date:  2013-10-01       Impact factor: 4.407

8.  Sensitivity to tomato and peanut allergens in children monosensitized to grass pollen.

Authors:  M de Martino; E Novembre; G Cozza; A de Marco; P Bonazza; A Vierucci
Journal:  Allergy       Date:  1988-04       Impact factor: 13.146

9.  Study on influence of age, gender and genetic variants on lactose intolerance and its impact on milk intake in adult Asian Indians.

Authors:  Shruti V Baadkar; Manjari S Mukherjee; Smita S Lele
Journal:  Ann Hum Biol       Date:  2014-04-15       Impact factor: 1.533

10.  A global survey of changing patterns of food allergy burden in children.

Authors:  Susan L Prescott; Ruby Pawankar; Katrina J Allen; Dianne E Campbell; John Kh Sinn; Alessandro Fiocchi; Motohiro Ebisawa; Hugh A Sampson; Kirsten Beyer; Bee-Wah Lee
Journal:  World Allergy Organ J       Date:  2013-12-04       Impact factor: 4.084

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.