| Literature DB >> 24354426 |
Elma Mj Salentijn1, Danny G Esselink, Svetlana V Goryunova, Ingrid M van der Meer, Luud J W J Gilissen, Marinus J M Smulders.
Abstract
BACKGROUND: Wheat gluten is important for the industrial quality of bread wheat (Triticum aestivum L.) and durum wheat (T. turgidum L.). Gluten proteins are also the source of immunogenic peptides that can trigger a T cell reaction in celiac disease (CD) patients, leading to inflammatory responses in the small intestine. Various peptides with three major T cell epitopes involved in CD are derived from alpha-gliadin fraction of gluten. Alpha-gliadins are encoded by a large multigene family and amino acid variation in the CD epitopes is known to influence the immunogenicity of individual gene family members. Current commercial methods of gluten detection are unable to distinguish between immunogenic and non-immunogenic CD epitope variants and thus to accurately quantify the overall CD epitope load of a given wheat variety. Such quantification is indispensable for correct selection of wheat varieties with low potential to cause CD.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24354426 PMCID: PMC3890609 DOI: 10.1186/1471-2164-14-905
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Amino acid sequence of a typical alpha-gliadin. Amino acid sequence of a typical alpha-gliadin (gi|289718578|gb|ADD17012.1|) and location of CD epitopes. Consecutively the signal peptide, repetitive domain (blue undelined), polyglutamine repeat 1, unique domain 1, polyglutamine repeat 2 and unique domain 2 are shown (according to Anderson and Greene [20]). In bold, conserved cysteins; underlined, location of PCR primers; in italics, motifs for sequence trimming.
Figure 2Overview of the sequence analysis pipeline.
454 run statistics
| N samples | 23 | 24 | 48 | 95 |
| Raw data | 123,965 | 258,722 | 580,979 | 963,666 |
| Passed reads | 74,038 | 197,945 | 356,862 | 628,845 |
| Contigs > 20 reads | 241 | 269 | 911 | 1421 |
| Length (min, median, max) | 177, 261, 321 | 177, 261, 305 | 173, 262, 300 | |
| Passed reads | 42,162 | 158,054 | 222,085 | 422,301 |
| % of raw reads | 34% | 61% | 38% | |
| % of passed reads | 57% | 80% | 62% |
Figure 3Neighbor-joining topology tree of alpha-gliadins. Neighbor-joining topology tree of alpha-gliadins (unique deduced protein fragments, UPFs). I, II and III in red circles = NJ-topology groups. Green triangles = signature of the D sub-genome; black triangle = signature of the A sub-genome; blue circles = signature of sub-genome B.
Sub-genome specific amino acid motifs
| Motif of two amino acids, ‘YS’ | A | 118 |
| PQLPYL, PPQLPYP, LPQLPYP, QLPYPQPQPFPP | B | 42 |
| PQPQLPYPQ | D | 4 |
| PQPQLPYPQ + B sub-genome motif | D/B | 2 |
| No specific motif detected | Not assigned | 5 |
| Total | 171 |
Amino acid motifs in alpha-gliadin proteins that are specific for the different sub-genomes of wheat. n = number of unique alpha-gliadin protein fragments (UPFs) with the specific motif.
Figure 4Alignment of the ten most abundant unique alpha-gliadin fragments. The amino acid sequences of the ten overall most abundant unique alpha-gliadin protein fragments (repetitive domain; UPFs) with in red the motifs that are indicative for a specific sub-genome of wheat. In grey, amino acid variation compared to UPF-P1. Overall% = normalised transcript abundances overall 94 samples
The natural variation in CD epitopes in
| α1a-variant 1 | PF |
| α1a-variant 2 | PFPQPQL |
| α1a-variant 3 | PF |
| α1a-variant 4 | PFP |
| α1a-variant 5 | PFPQ |
| α2-variant 1 | PQPQLPY |
| α2-variant 2 | |
| α2-variant 3 | PQPQL |
| α2-variant 4 | P |
| α2-variant 5 | |
| α2-variant 6 | F |
| α2-variant 7 | F |
| α3-variant 1 | F |
| α3-variant 2 | F |
| α3-variant 3 | F |
| α3-variant 4 | F |
| α3-variant 5 | F |
| α3-variant 6 | F |
| α3-variant 7 | FRPQQ |
The natural sequence variation in HLA-DQ2.5 restricted T cell epitopes involved in CD (in their natural, non deamidated form); DQ2.5-glia-α1a and DQ2.5-glia-α1b, DQ2.5-glia-α2, DQ2.5-glia-α3, as present in cDNAs of T. turgidum accessions. Canonical CD epitope sequences in bold. * variants that have not previously been found in T. aestivum[11].
The main unique alpha-gliadin protein fragments in ten different alpha-gliadin profiles
| 32 | 25 | 60 | 60 | 62 | 67 | 54 | 31 | 42 | 20 | ||
| | 7 | | | 15 | | | 15 | 36 | 23 | PFPQPQL | |
| | | 18 | | | | | 6 | | | ||
| | 5 | | | | | | | | | PF | |
| | | | | | 15 | | | | | ||
| | | | | | 9 | | | | | ||
| | 5 | | 8 | 9 | | | | 4 | 15 | ||
| | | 6 | | | | | | | | ||
| | | | | | | 19 | | | 3 | ||
| | | | 14 | | | | | | | PFPQ | |
| | | | | 3 | | | | | | PF | |
| | | | | | | | | | 8 | ||
| 25 | | | | | | | | | | ||
| | 7 | | | | | | | | | PF | |
| | | | 10 | | | | | | | ||
| 20 | | | | | | | | | | ||
| | | | | | | | 4 | | 5 | ||
| | | | | | | 7 | | | | ||
| | | | | | | | | 5 | 7 | PF | |
| | | | | | | 9 | | | | PFP | |
| 7 | | | | | | | | | | ||
| | | | | 5 | | | | | | ||
| | | | | | | | 6 | | | PF | |
| | | | 4 | | | | | | | ||
| | | | | | | | 3 | | | PFP | |
| | | | | | | | | | 4 | ||
| | | | | | | | 4 | | | PFPQPQLPY | |
| | | | | | | | 4 | | | PFPQPQLPY | |
| | | | | | | | 3 | | | PFP | |
| | 11 | | | | | | | | | ||
| | | | | | | | 3 | | | ||
| | 9 | | | | | | | | | PFP | |
| | | | | | | | 3 | | | ||
| | 7 | | | | | | | | | ||
| | 4 | | | | | | | | | ||
| | 3 | | | | | | | | | ||
| 84 | 83 | 84 | 96 | 94 | 91 | 89 | 82 | 87 | 85 |
The main UPFs for expression profiles (n = 10) and their average normalised transcript abundances are shown. The main UPF of profiles have an average normalised transcript abundance > 2, in average over all samples with a specific profile, and are expressed in all the plants with that profile. The part of the amino acid sequence of the UPFs that harbours DQ2.5-glia α1 to -α3 is depicted, with P to S substitutions and other substitutions depicted in bold; underlined are canonical DQ2.5-glia a1, -α2 and α3 epitopes. Each expression profile, group 1–10, represents a total of respectively, 33998, 2779, 181184, 37917, 28857, 19863, 33105, 36973, 43252 and 3019 alpha-gliadin transcript 454 sequences. In case of identical amino acid fragments, the different proteins can be distinguished by differences in sequences outside the depicted sequence.
Figure 5Alpha-gliadin expression profiles of durum wheat plants. The deduced unique alpha-gliadin protein fragments (UPFs) were differentially present in transcripts among the samples, at normalised transcript abundances ranging from zero (light grey) to 75.48 (2487/3295, for UPF- P1). The plants were clustered based on their UPF expression profiles using hierarchical clustering (average linkage groups, Pearson correlation). Heat map: normalised transcript abundances zero = light grey; normalised transcript abundance ~1 = black; normalised transcript abundance ~1.5 to > 2 = clear red.
Figure 6Correlations matrix of expression profiles. Comparison of the alpha-gliadin expression profiles found in plants sampled from durum wheat accessions (Pearson’s r). Two cDNA samples were analysed from each plant. A = accession name; B = expression profile number; C = plant number; D = cDNA sample number.
Figure 7Sequence depth and number of unique alpha gliadin gene clusters. The correlation (Pearson’s r) between the sequence read depth of a sample and the number of different unique alpha-gliadin gene clusters detected in plants (n = 49) with expression profile 3.
Figure 8Geographical distribution of alpha-gliadin expression profiles. Numbers 1 to 10 are the different alpha-gliadin expression profiles observed in the T. turgidum genotypes. Some profiles are only found in Northern regions (6, 7) whereas others are limited to Southern regions (10, 5 and 1).
Figure 9CD epitope abundance in different alpha-gliadin expression profiles. Among the 77 plants ten different alpha-gliadin expression profiles are recognized (expression profile 1–10). The number of CD-epitopes per 100 transcripts is shown for each profile (profile 1 to 10) for respectively (a) DQ2.5-glia-α1a and DQ2.5-glia-α1b, (b) DQ2.5-glia-α2, (c) DQ2.5-glia-α3 and (d) the total HLADQ2.5 epitopes. Each data point is, the number per 100 transcripts for CD epitopes = ∑171 UPFs (number per UPF for a CD epitope core x normalised transcript abundances per UPF), for a single plant.