Hemalatha Rajkumar1, Ramesh Kumar Ramagoni1, Vijayendra Chary Anchoju1, Raju Naik Vankudavath2, Arshi Uz Zaman Syed3.
Abstract
Allium cepa (onion) is a diploid plant with one of the largest nuclear genomes among all diploids. Onion is an example of an under-researched crop with a complex heterozygous genome. No allergenic protein data or genomic data are available for onion. This study was conducted to establish a transcriptome catalogue of onion bulb that will enable the study of onion genes involved in medicinal use and allergies. The transcriptome dataset generated from onion bulb using Illumina HiSeq 2000 technology comprised 99,074,309 high-quality raw reads (~20 Gb). Based on sequence homology, onion genes were categorized into 49 different functional groups. Most of the genes, however, were classified under 'unknown' in all three gene ontology categories. Of the categorized genes, 61.2% showed metabolic functions, followed by categories such as binding, cellular processes, catalytic activity and cell part. BLASTx top-hit analysis identified 2,511 homologous allergenic sequences with 37-100% similarity to 46 different types of allergens in the database. From the 46 contigs or allergens, 521 B-cell linear epitopes were identified using the BepiPred linear epitope prediction tool. This is the first comprehensive insight into the transcriptome of onion bulb tissue using NGS technology, and it can be used to map IgE epitopes and to predict the structures and functions of various proteins.
Year: 2015 PMID: 26284934 PMCID: PMC4564285 DOI: 10.1371/journal.pone.0135387
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of raw reads of Allium cepa bulb transcripts.
| Metric | Value |
|---|---|
| Raw reads | 99,074,309 (~20 Gb) |
| Clean PE reads | 83,046,820 |
| Clean SE forward reads | 1,384,023 |
| Clean SE reverse reads | 93 |
| G+C% | 43% |
Gb, gigabases; PE, paired end; SE, single end; G+C, guanine+cytosine; %, percentage
Statistics of non-redundant set of Allium cepa bulb transcripts obtained from final stage of assembly.
| Metric | Value |
|---|---|
| Total number of transcripts | 293,475 |
| G+C% | 38% |
| Total transcriptome length | 280,882,036 bp |
| Average transcript size | 957.1 bp |
| Transcript N50 | 1,594 bp |
| Max. transcript size | 12,638 bp |
G+C, guanine+cytosine; %, percentage; Max, maximum; bp, base pairs
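The assembly statistics above can be related to each other with a short calculation. The following is a minimal Python sketch, not the authors' pipeline: `n50` and `mean_length` are hypothetical helper names, and `toy_lengths` is made-up illustrative data, since the individual transcript lengths are not given here. Only the total length and transcript count are taken from the table.

```python
def n50(lengths):
    """Return the N50: the length L such that sequences of length >= L
    together cover at least half of the total assembled bases."""
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must not be empty")


def mean_length(total_bp, n_sequences):
    """Average sequence size in base pairs."""
    return total_bp / n_sequences


# Hypothetical toy lengths, for illustration only (not from the study).
toy_lengths = [100, 200, 300, 400, 500]
toy_n50 = n50(toy_lengths)

# Values taken from the table: total transcriptome length and transcript count.
reported_mean = round(mean_length(280_882_036, 293_475), 1)

print(toy_n50)        # 400
print(reported_mean)  # 957.1
```

The mean computed from the reported total length and transcript count agrees with the average transcript size listed in the table. The N50 itself cannot be recomputed without the per-transcript lengths.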
Fig 1. Annotations and BLAST top hits of onion transcripts with other species.
This figure shows the species distribution of BLASTx top hits for onion transcripts. Most of the onion sequences were homologous to Vitis vinifera (14%), followed by Oryza sativa (8%) and Theobroma cacao (5.4%).
Fig 2. Onion allergen species distribution identified with BLASTx.
The data represent the number of transcripts and the percentage of species distribution for onion bulb. For each species matching onion, the total number of hits is given along with the relative percentage of homology. A comma (,) separates the number of transcripts from the % of species distribution. The highest number of onion transcripts showed homology to Cryptomeria japonica allergens (9%), followed by Blattella germanica (7%) and Corylus avellana (6%).
Fig 3. Onion allergens identified through transcriptome analysis.
This radar graph depicts onion allergens identified through transcriptome analysis. The highest number of transcripts were homologous to Cla h 4 (115), followed by putative luminal binding protein of Corylus avellana (73), CPA63 pollen allergen of Cryptomeria japonica (73), and Asp f 12 of fungi (72). (Selected transcript sequence size: ≥80 amino acids.)
The E-values and homology (%) of Allium cepa transcripts with matched allergen sequences.
| Sequence Name | Allergen | Biochemical Function | Species | Common name | Type | Homology (%) | Seq. Len | Align. Len | E-Value | Bit-Score |
|---|---|---|---|---|---|---|---|---|---|---|
| Contig116485 | N/A | Enolase 1, 2-phospho-D-glycerate hydro-lyase | N/A | N/A | N/A | 75 | 1889 | 438 | 0 | 525.013 |
| Contig165540 | Asp f 23 | 60S ribosomal protein L3 | Aspergillus fumigatus | A. fumigatus | Fungi | 76 | 1596 | 417 | 4.93E-169 | 483.411 |
| Contig124037 | Asp f 12 | 65 kDa IgE-binding protein; Heat shock protein hsp1 | Aspergillus fumigatus | Fungi | Fungi | 67 | 3117 | 680 | 1.46E-159 | 487.263 |
| Contig147925 | Alt a 11, Alt a 6, Alt a 5 | Enolase; 2-phosphoglycerate dehydratase | Alternaria alternata | Fungi | Fungi | 69 | 1980 | 431 | 9.71E-158 | 460.685 |
| Contig147926 | N/A | AF284645_1 enolase | Aspergillus fumigatus | Fungi | Fungi | 69 | 1850 | 413 | 3.42E-149 | 437.187 |
| Contig134658 | Cuc m 1 | Cucumisin, serine protease | Cucumis melo | Muskmelon | Plant | 56 | 2397 | 735 | 9.67E-135 | 416.001 |
| Contig121132 | N/A | Xylosidase | Aspergillus niger | A. niger | Fungi | 53 | 2776 | 762 | 2.29E-133 | 418.313 |
| Contig122480 | N/A | Actinidin | Actinidia deliciosa | Fuzzy Kiwifruit | Plant | 70 | 2423 | 360 | 2.09E-125 | 379.793 |
| Contig192886 | Cup a 1 | Putative allergen Cup a 1 | Hesperocyparis arizonica or Cupressus arizonica | Arizona cypress | Plant | 63 | 1515 | 355 | 1.13E-111 | 334.339 |
| Contig149472 | N/A | Chitinase Ib | Castanea sativa | Sweet chestnut | Plant | 74 | 1181 | 306 | 2.04E-111 | 327.791 |
| Contig181997 | N/A | Amb a 1-like protein | Artemisia vulgaris | Mugwort | Plant | 63 | 1798 | 370 | 3.06E-110 | 334.724 |
| Contig147924 | Cla h 6 | Enolase; 2-phospho-D-glycerate hydro-lyase; 2-phosphoglycerate dehydratase | Cladosporium herbarum | Fungi | Fungi | 69 | 1739 | 303 | 1.22E-108 | 331.643 |
| Contig185872 | Cry j 1 | Sugi basic protein | Cryptomeria japonica | Japanese cedar | Plant | 61 | 1653 | 361 | 9.33E-108 | 326.25 |
| Contig213065 | N/A | Unnamed protein product | Actinidia deliciosa | Fuzzy Kiwifruit | Plant | 67 | 1172 | 319 | 1.39E-107 | 320.087 |
| Contig239367 | N/A | 11S globulin | Bertholletia excelsa | Brazil nut | Plant | 61 | 1679 | 449 | 1.21E-104 | 321.627 |
| Contig105897 | Cha o 1 | Major pollen allergen Cha o 1 | Chamaecyparis obtusa | Japanese cypress | Plant | 65 | 1915 | 355 | 3.22E-102 | 314.309 |
| Contig87206 | Act d 1 | Actinidain | Actinidia deliciosa | Kiwi | Plant | 66 | 1513 | 329 | 6.81E-102 | 309.686 |
| Contig165488 | Ana c 2 | Fruit bromelain | Ananas comosus | Pineapple | Plant | 65 | 1227 | 333 | 7.57E-100 | 300.056 |
| Contig82715 | Bet v 6.0102 | Allergenic isoflavone reductase-like protein Bet v 6.0102 | Betula pendula | Silver birch | Plant | 68 | 1364 | 306 | 4.57E-99 | 298.13 |
| Contig134058 | CJP-6 | Isoflavone reductase-like protein CJP-6 | Cryptomeria japonica | Japanese cedar | Plant | 64 | 1279 | 312 | 7.41E-97 | 291.197 |
| Contig88117 | N/A | Triosephosphate isomerase | Crangon crangon | Common shrimp | Crustacean | 70 | 2141 | 248 | 3.05E-89 | 278.1 |
| Contig97550 | Cha o 2 | Polygalacturonase; Major pollen allergen Cha o 2; Pectinase | Chamaecyparis obtusa | Japanese cypress | Plant | 53 | 1956 | 466 | 1.48E-85 | 275.789 |
| Contig181817 | Cry j 2 | Allergen Cry j 2 | Cryptomeria japonica | Japanese cedar | Plant | 60 | 1769 | 358 | 2.34E-84 | 270.781 |
| Contig70943 | Cla h 4 | Heat shock 70 kDa protein | Cladosporium herbarum | Fungi | Fungi | 48 | 3004 | 645 | 5.05E-84 | 281.952 |
| Contig116658 | Cla h 10 | Aldehyde dehydrogenase; Cla h 3 | Cladosporium herbarum | Fungi | Fungi | 50 | 2328 | 483 | 5.37E-60 | 207.608 |
| Contig141069 | N/A | Pollen allergen | Cryptomeria japonica | Japanese cedar | Plant | 47 | 1672 | 313 | 2.23E-59 | 155.221 |
| Contig128098 | N/A | Class I chitinase isoform 2 | Castanea sativa | Sweet chestnut | Plant | 56 | 1528 | 247 | 7.32E-58 | 191.815 |
| Contig116297 | Act c 1 | Actinidain | Actinidia deliciosa | Kiwi | Plant | 50 | 1678 | 323 | 6.34E-55 | 187.193 |
| Contig16398 | Cup s 3.3 precursor | PR5 allergen Cup s 3.3 precursor | Cupressus sempervirens | Pencil pine or Mediterranean cypress | Plant | 57 | 1445 | 241 | 2.31E-54 | 179.489 |
| Contig122189 | Asp f 6 | Superoxide dismutase [Mn] | Aspergillus fumigatus | Fungi | Fungi | 50 | 1281 | 241 | 3.89E-53 | 174.481 |
| Contig86874 | N/A | Aldehyde dehydrogenase (NAD+) | Alternaria alternata | Fungi | Fungi | 45 | 2257 | 462 | 9.06E-53 | 186.422 |
| Contig70938 | N/A | Putative luminal binding protein | Corylus avellana | Common hazel | Plant | 44 | 4214 | 721 | 4.39E-52 | 191.045 |
| Contig152875 | CPA63 | Pollen allergen CPA63 | Cryptomeria japonica | Japanese cedar | Plant | 48 | 1590 | 437 | 6.92E-52 | 180.644 |
| Contig175155 | Api m 5 | Venom dipeptidyl peptidase 4 | Apis mellifera | Honeybee | Insect | 43 | 2585 | 622 | 8.42E-42 | 158.303 |
| Contig176760 | Cup a 1 | Major allergen Cup a 1 | Hesperocyparis arizonica or Cupressus arizonica | Arizona cypress | Plant | 49 | 1577 | 253 | 2.10E-39 | 142.124 |
| Contig198455 | N/A | 48-kDa glycoprotein precursor | Corylus avellana | Common hazel | Plant | 44 | 1792 | 400 | 3.56E-36 | 135.191 |
| Contig122991 | Ana o 2 | Legumin-like protein | Anacardium occidentale | Cashew tree | Plant | 42 | 1378 | 401 | 2.51E-31 | 119.398 |
| Contig178413 | Cyn d 1 | Major pollen allergen Cyn d 1 | Cynodon dactylon | Bermuda grass | Plant | 47 | 1150 | 242 | 7.30E-26 | 99.3673 |
| Contig154078 | N/A | Pollen allergen | Chamaecyparis obtusa | Japanese cypress | Plant | 40 | 1895 | 435 | 9.82E-26 | 104.375 |
| Contig181406 | N/A | Subtilisin Carlsberg | Bacillus sp. | Bacillus | Bacteria | 44 | 3643 | 246 | 1.49E-22 | 95.5153 |
| Contig163717 | N/A | Pollen allergen | Cryptomeria japonica | Japanese cedar | Plant | 44 | 1663 | 260 | 4.98E-20 | 87.4261 |
| Contig140011 | N/A | Peanut agglutinin precursor | Arachis hypogaea | Peanut | Plant | 48 | 2216 | 273 | 8.64E-19 | 81.6481 |
| Contig155816 | Bla g 2 allergen | Bla g 2 allergen variant | Blattella germanica | German cockroach | Insect | 46 | 1842 | 240 | 3.19E-16 | 74.3294 |
| Contig189186 | Cla h 8 | Mannitol 2-dehydrogenase [NADP(+)] | Cladosporium herbarum | Fungi | Fungi | 43 | 1355 | 278 | 3.56E-15 | 69.3218 |
| Contig157948 | Cry j 2 | Major pollen allergen; Polygalacturonase; Pectinase | Cryptomeria japonica | Japanese cedar | Plant | 41 | 1014 | 249 | 1.47E-14 | 68.1662 |
| Contig144878 | Apr M | Prepro Apr M | Bacillus sp. | Bacillus | Bacteria | 43 | 4317 | 253 | 1.08E-09 | 55.8398 |
Contig, contiguous sequence; Seq. Len, sequence length; Align. Len, alignment length; %, percentage; E-value, expect value; N/A, not applicable
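Hits like those tabulated above are often narrowed by E-value, percent homology and alignment length before follow-up work such as epitope mapping. The sketch below shows one way to do that in Python. It is an illustration only: the `Hit` class, the `filter_hits` function and all three thresholds are hypothetical choices, not criteria stated by the study. The four example rows are copied from the table.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    contig: str
    allergen: str
    species: str
    homology_pct: float
    align_len: int
    e_value: float


def filter_hits(hits, max_e_value=1e-50, min_homology=60.0, min_align_len=80):
    """Keep hits that pass all three cutoffs, strongest (lowest E-value) first.

    The default cutoffs are illustrative assumptions, not the study's criteria.
    """
    kept = [
        h for h in hits
        if h.e_value <= max_e_value
        and h.homology_pct >= min_homology
        and h.align_len >= min_align_len
    ]
    return sorted(kept, key=lambda h: h.e_value)


# Four rows taken from the table above.
hits = [
    Hit("Contig165540", "Asp f 23", "Aspergillus fumigatus", 76, 417, 4.93e-169),
    Hit("Contig134658", "Cuc m 1", "Cucumis melo", 56, 735, 9.67e-135),
    Hit("Contig185872", "Cry j 1", "Cryptomeria japonica", 61, 361, 9.33e-108),
    Hit("Contig144878", "Apr M", "Bacillus sp.", 43, 253, 1.08e-09),
]

strong = filter_hits(hits)
for h in strong:
    print(h.contig, h.allergen, h.homology_pct, h.e_value)
```

With these assumed cutoffs, Contig165540 and Contig185872 are retained, while Contig134658 fails the homology cutoff and Contig144878 fails both the homology and E-value cutoffs. Stricter or looser thresholds would change the retained set, so any real analysis should justify its cutoffs separately.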