| Literature DB >> 28962466 |
Fang Chao Zhu1,2, Rui Zong Jia1, Lin Xu1, Hua Kong1, Yun Ling Guo1, Qi Xing Huang1, Yun Judy Zhu1,3, An Ping Guo1.
Abstract
The proteins in the food are the source of common allergic components to certain patients. Current lists of plant endogenous allergens were based on the medical/clinical reports as well as laboratory results. Plant genome sequences made it possible to predict and characterize the genome-wide of putative endogenous allergens in rice (Oryza sativa L.). In this work, we identified and characterized 122 candidate rice allergens including the 22 allergens in present databases. Conserved domain analysis also revealed 37 domains among rice allergens including one novel domain (histidine kinase-, DNA gyrase B-, and HSP90-like ATPase, PF13589) adding to the allergen protein database. Phylogenetic analysis of the allergens revealed the diversity among the Prolamin superfamily and DnaK protein family, respectively. Additionally, some allergens proteins clustered on the rice chromosome might suggest the molecular function during the evolution.Entities:
Keywords: Allergen; Conserved domain; Cross-reacting endogenous allergens; Phylogenetic; Rice
Year: 2015 PMID: 28962466 PMCID: PMC5598104 DOI: 10.1016/j.toxrep.2015.07.017
Source DB: PubMed Journal: Toxicol Rep ISSN: 2214-7500
The list of rice genome-wide putative allergens.
Full-length sequences alignments were performed between rice genome and the allergens from Allergen Nomenclature and FARRP databases. 122 putative rice allergens were finally identified, their most similar allergen databases entries and predicted conserved domains were listed.
aEntries in bold indicates the rice allergens that recorded by allergen databases.
bID indicates identity in percentage (%) of the most similar allergen database entries aligned with BLAST-P.
cAN indicates IUIS Allergen Nomenclature designation described by allergen database, where ND means official nomenclature of allergen remains unassigned.
dPresents the de-nove conserved domains caculated by MEME program (v 4.9.1, www.meme.nbcr.net).
Fig. 1Conserved protein domains among the putative rice allergens. All rice allergen sequences were searched in the Pfam-A database for the matching family. The number of allergens that contained specific protein domain was counted. The letters in the brackets stand for the Pfam accession numbers.
Fig. 2GO classification of putative rice allergens. GO analysis was performed using WEGO tool in order to reveal the biological functions most frequently found among identified rice allergens. Of the 122 allergens, 116 contained GO annotations which were classified into three categories: cellular component, molecular function, and biological process.
Fig. 3Sequence conservation of Prolamin superfamily (A) and heat shock proteins (B). Prolamin superfamily comprises alpha-amylase/trypsin inhibitors and nonspecific lipid transfer proteins. Reference allergenic sequences of each family were adopted from the systematic allergen nomenclature database. Bootstrapped (1000 replicates) neighbor-joining trees were constructed using the PAUP. Numbers less than 50% (clustering 50 out of 100 times) were omitted due to possible collapse of the branch. Branch lengths proportional to genetic distance are indicated in the scale bar.
Fig. 4Distribution of putative rice allergen genes across chromosomes. The hollow triangle for DnaK family protein, the hollow circle for metabolic enzyme, the hollow diamond for nonspecific lipid-transfer protein, the hollow pentacle for protease inhibitor, the hollow hexagon for chlorophyll binding protein, the hollow heart for pollen Ole e I allergen and extensin family protein, the solid triangle for expansin, the solid circle for profilin, the solid diamond for chitinase, the solid pentacle for thaumatin-like protein, the solid hexagon for tubulin and the solid heart for other proteins.
Fig. 5qPCR results revealed some putative allergic protein transcripts expressed different in rice at post-tillering phase. ** Indicated extremely significant (P = 0.01), * indicated significant (P = 0.05).