| Literature DB >> 20064259 |
Susan B Altenbach1, William H Vensel, Frances M Dupont.
Abstract
BACKGROUND: The gamma gliadins are a complex group of proteins that together with other gluten proteins determine the functional properties of wheat flour. The proteins have unusually high levels of glutamine and proline and contain large regions of repetitive sequences. While most gamma gliadins are monomeric proteins containing eight conserved cysteine residues, some contain an additional cysteine residue that enables them to be linked with other gluten proteins into large polymers that are critical for flour quality. The ability to differentiate among the gamma gliadins is important for studies of wheat flour quality because proteins with similar sequences can have different effects on functional properties.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20064259 PMCID: PMC2827424 DOI: 10.1186/1471-2229-10-7
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Best matches of consensus sequences from Butte 86 contigs to NCBI non-redundant database.
| Contig # | # ESTs | NCBI Accession1 | Identity | Source | Type |
|---|---|---|---|---|---|
| 1 | 192 | [GenBank: | 1195/1227 | mRNA | |
| 2 | 34 | [GenBank: | 1187/1200 | gene | |
| 3 | 183 | [GenBank: | 1088/1094 | gene | |
| 4 | 14 | [GenBank: | 1005/1007 | gene | |
| 5 | 27 | [GenBank: | 1139/1191 | gene | |
| 6 | 17 | [GenBank: | 1048/1054 | gene | |
| 7 | 4 | [GenBank: | 1083/1087 | gene | |
| 8 | 82 | [GenBank: | 1083/1123 | mRNA | |
| 94 | 3 | [GenBank: | 848/908 | gene | |
| [GenBank: | 848/908 | gene | |||
| 10 | 103 | [GenBank: | 1024/1078 | gene | |
| 115 | 3 | [GenBank: | 668/668 | mRNA |
1 with best score by blastn against nr/nt database, no filters, no masks, last searched on 5/11/09.
2 includes seven ESTs assigned to both contigs #1 and #8.
3 includes nine ESTs assigned to both contigs #3 and 10.
4 contig is missing a portion of the 3' end of coding region.
5 contig is missing a portion of the 5' end of the coding region.
Characteristics of full-length proteins encoded by Butte 86 contigs.
| Region II | # Immunogenic Peptides1 | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Contig # | # amino acids2 | # Cys | MW3 | pI3 | # amino acids | # Pro | # Gln | QQPQQPFPQ4 | PQQPFPQQPQQ5 | PQQSFPQQQ6 | IIQPQQPAQ7 | |
| 1 | 328 | 8 | 35481 | 7.5 | 156 | 50 | 78 | 6 | 2 | 0 | 1 | |
| 2 | 327 | 8 | 35148 | 7.9 | 149 | 41 | 73 | 7 | 3 | 1 | 1 | |
| 3 | 302 | 9 | 32294 | 7.5 | 136 | 38 | 63 | 4 | 0 | 1 | 1 | |
| 4 | 302 | 9 | 32569 | 7.8 | 136 | 36 | 65 | 3 | 0 | 1 | 1 | |
| 5 | 358 | 8 | 38899 | 7.6 | 174 | 51 | 84 | 9 | 4 | 1 | 0 | |
| 6 | 285 | 8 | 30581 | 8.1 | 119 | 35 | 59 | 4 | 0 | 1 | 1 | |
| 7 | 291 | 8 | 30975 | 7.9 | 114 | 31 | 54 | 2 | 0 | 1 | 1 | |
| 8 | 332 | 9 | 36035 | 7.8 | 160 | 50 | 77 | 6 | 2 | 0 | 1 | |
| 10 | 286 | 9 | 30361 | 7.5 | 120 | 34 | 56 | 4 | 0 | 1 | 1 | |
1 epitopes likely to play a role in celiac disease.
2 including signal peptide.
3 mature protein.
4 described by Qiao et al. [15].
5 described by Molberg et al. [16].
6 described by Sjostrom et al. [17].
7 described by Arentz-Hansen et al. [18].
Figure 1Alignment of full-length gamma gliadins deduced from consensus sequences of Butte 86 contigs. Alignments were performed with ClustalW2 using a blosum matrix and default settings. Identical residues are indicated by asterisks, conserved substitutions by colons and semi-conserved substitutions by periods. The eight conserved cysteine residues are enclosed in boxes. The position of the extra cysteine residue in #3, 4, 8 and 10 is indicated with an arrow.
Comparison of gamma gliadins encoded by Butte 86 contigs to proteins in NCBI and proteins encoded by contigs from other EST assemblies.
| Butte 86 Contig # | # Amino acids | NCBI Accession #1 | Identity | TaGI 10.0 Contig # | Identity | TaGI 11.0 Contig # | Identity | US Wheat Genome Project Contig # | Identity2 | HarvEST 1.14 Contig # | Identity2 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 328 | [GenBank: | 326/337 | TC2347103 | 194/197 | TC2918573 | 181/193 | 185244 | nd | 14880 | 328/328 |
| TC250086 | 328/328 | TC280138 | 327/328 | 19014,5 | nd | ||||||
| TC250043 | 326/337 | ||||||||||
| 2 | 327 | [GenBank: | 326/327 | TC250310 | 327/327 | TC327998 | 327/327 | 188714 | nd | 4446 | 327/327 |
| TC3330203 | 160/198 | 149393,4 | nd | ||||||||
| 3 | 302 | [GenBank: | 300/302 | TC249992 | 300/302 | TC308951 | 299/302 | 18764 | 299/302 | 2079 | 302/302 |
| 4 | 302 | [GenBank: | 302/302 | TC250000 | 302/302 | TC308951 | 276/302 | 18752 | 301/302 | 149393,4 | nd |
| TC337354 | 269/302 | 149453 | 212/230 | ||||||||
| 5 | 358 | [GenBank: | 342/362 | TC233870 | 357/358 | TC3573196 | 357/358 | 188737 | nd | 15356 | 358/358 |
| TC250310 | 306/358 | ||||||||||
| 6 | 285 | [GenBank: | 283/285 | TC249991 | 285/285 | TC3299763 | 254/264 | 18814 | 285/285 | 15101 | 285/285 |
| TC332920 | 265/285 | ||||||||||
| 7 | 291 | [GenBank: | 290/291 | TC232926 | 291/291 | TC296897 | 290/291 | 18753 | 289/291 | 4448 | 290/291 |
| 19008 | 291/291 | ||||||||||
| 8 | 332 | [GenBank: | 317/337 | TC250043 | 317/337 | TC280138 | 313/332 | 185244 | nd | 14880 | 314/332 |
| TC2347103 | 194/197 | TC281237 | 318/339 | 2713 | 176/176 | 46049 | nd | ||||
| TC250086 | 314/332 | TC2918573 | 183/185 | 19014,5 | nd | ||||||
| 474323 | 188/188 | ||||||||||
| 93 | 289 | [GenBank: | 266/294 | TC2499963 | 204/227 | TC293526 | 283/289 | 16926 | 288/289 | 14942 | 288/289 |
| TC2500313 | 251/255 | ||||||||||
| 10 | 286 | [GenBank: | 284/302 | TC249992 | 284/302 | TC308951 | 283/302 | 18764 | 283/302 | 2079 | 286/302 |
| TC3377603 | 142/180 | 188424 | nd | 2095 | 283/286 | ||||||
| 113 | 163 | [GenBank: | 163/163 | TC233421 | 163/163 | TC277980 | 163/163 | 165853 | 163/163 | 4328 | 163/163 |
1 accession with highest score in BLASTp search of non-redundant protein sequences using BLOSUM62, no compositional adjustments, no filters, no masks. Database last searched on 5/08/09.
2 protein identities that were not determined because consensus sequences of contigs contained frameshifts or stop codons in the reading frame are indicated by nd.
3 partial coding sequence.
4 consensus sequence contains frameshift or stop codon in reading frame.
5 complement of consensus sequence.
6 contig contains a portion of a LMW-GS sequence at 5' end.
7 encodes alpha gliadin.
8 encodes identical protein to Butte 86 contig, but does not contain any ESTs from Butte 86.
9 encodes LMW-glutenin subunit.
Figure 2Phylogram showing relationships among full-length gamma gliadins encoded by Butte 86 contigs. Proteins containing nine cysteine residues are indicated with asterisks.
Figure 3Phylogram of gamma gliadins containing nine cysteine residues. Proteins containing the sequence PFCQQPQRTIPQ are denoted with a blue circle, those containing PFCQQPQQTIPQ are denoted with a black circle and those containing PFCEQPQRTIPQ are denoted with a red circle. For proteins from diploid or tetraploid species, the source of the sequence is indicated in italics. For proteins from hexaploid wheat, only the cultivar is indicated.
Evidence for expression of sequences encoding gamma gliadins with odd numbers of cysteine residues.
| Protein accession # | Source | # Cysteine residues | Region of missing/additional cysteine | Reference | Sequence used to search NCBI1 | # ESTs in NCBI 2 |
|---|---|---|---|---|---|---|
| [GenBank: | 7 | III | Arentz-Hansen et al. [ | DCQVMRQQ | 1 | |
| [GenBank: | 7 | V | Qi et al. [ | VLQTLPT | 0 | |
| [GenBank: | 7 | IV | Qi et al. [ | LAQIPQQLQ | 0 | |
| [GenBank: | 7 | V | Chen et al. [ | EAIRSLVLQTLPSM | 0 | |
| [GenBank: | 9 | II | Anderson et al. [ | HTFPQPQQT | 0 | |
| [GenBank: | 9 | II | Qi et al. [ | PF | 0 | |
| [GenBank: | 9 | II | Snegaroff, unpublished | PFPQSQQQ | 0 | |
| [GenBank: | 9 | V | Chen et al. [ | RHTDPATTVSA | 3 | |
| [GenBank: | 9 | V | Chen et al. [ | RHTDPATTVSA | 3 | |
| [GenBank: | 9 | II | Chen et al. [ | PFPQPQQPF | 0 |
1 The extra cysteine in proteins containing nine cysteines or residue substituted for cysteine in proteins containing seven cysteines is underlined.
2 determined by tBLASTn search of non-human, non-mouse ESTs, limited to Triticum, word size 2, expect 30000, PAM30, no compositional adjustment, no filters, no masks, database last searched on 7/22/09.
Butte 86 gamma gliadins identified by MS/MS from wheat flour.
| Butte 86 contig # | Band # 1 | Total # peptides 2 | % Coverage3 | Unique peptides4 | Enzyme used to generate peptide5 |
|---|---|---|---|---|---|
| 2 | 126 | 14 | 36.0 | SQQQQVGQGSL | CH |
| QQLPQPQQPQQSFPQQQR | CH | ||||
| SQQQQVGQGSLVQGQGIIQPQQPAQL | CH | ||||
| NIQVDPSGQVQWLQQQLVPQLQQPL | CH | ||||
| LQQPQQPFPQPQQQLPQPQQPQQ | TH | ||||
| 4 | 188 | 22 | 56.5 | SIIMQQEQRQGVQIRRPL | CH |
| LQPQQPQQSFPQQQQPL | CH | ||||
| VSPDCSTINAPF | CH | ||||
| VSPDCSTINAPFASIVVGIGGQ | CH | ||||
| LQPQQPQQSFPQQQQPLIQL | CH | ||||
| LQPQQPQQSFPQQQQPLIQLSL | CH | ||||
| CH | |||||
| IIMQQEQRQG | TH | ||||
| IIMQQEQRQGVQ | TH | ||||
| LQPQQPQQSFPQQQQPLIQ | TH | ||||
| VDPGYQVHWPQQQPFPQPQQP | TH | ||||
| VHWPQQQPFPQPQQP | TH | ||||
| NFLLQQCNPVSLVSSLISMILPR | TR | ||||
| 5 | 145 | 31 | 44.8 | VPPECSIIRAPF6 | CH |
| IQPSLQQR | CH | ||||
| IQPSLQQRL | CH | ||||
| SQQQQLGQGTL6 | CH | ||||
| QSFPQQQRPF | CH | ||||
| ILLPLSQQQQL6 | CH | ||||
| TQQPQQPFPQFQQPHQPF6 | CH | ||||
| VQGQGIIQPQQLAQLEAIRSL6 | CH | ||||
| SQQQQLGQGTLVQGQGIIQPQQL6 | CH | ||||
| SQQQQLGQGTLVQGQGIIQPQQLAQLEAIRSL6 | CH | ||||
| ILLPLSQQQQLGQGTL6 | CH | ||||
| SQQQQLGQGTLVQGQGIIQPQQLAQL6 | CH | ||||
| SQQPQQAFPQPQQTFPHQPQQQVPQPQQPQQPF | CH | ||||
| HQPQQQFPQPQQPQQSFPQQQRPF | CH | ||||
| HQPQQQFPQPQQPQQSFPQ | CH | ||||
| AFPQPQQTFPHQPQQQVPQPQQPQQPF | TH | ||||
| FHQPQQQFPQPQQPQQ | TH | ||||
| FHQPQQQFPQPQQPQQSFPQQQRP | TH | ||||
| IQPSLQQR | TR | ||||
| QLAQLEAIR | TR | ||||
| PFIQPSLQQR | TR | ||||
| QSFPQQQRPFIQPSLQQR | TR | ||||
| 6 | 186 | 16 | 36.5 | RQPQQPF | CH |
| ASIVAGISGQ | CH | ||||
| YQQPQQTFPQPQ | CH | ||||
| LAQIPRQ | TH | ||||
| IQILRPLFQ | TH | ||||
| IIQPQQPAQYEVIRS | TH | ||||
| FRQPQQPFY | TH | ||||
| IIQPQQPAQYE | TH | ||||
| YQQPQQTFPQPQQ | TH | ||||
| FRQPQQPFYQQPQQTFPQPQQ | TH | ||||
| FYQQPQQTFPQPQQ | TH | ||||
| 7 | 170 | 13 | 43.4 | LQPHQPF | CH |
| ASIVASIGGQ | CH | ||||
| QGVQILVPL | CH | ||||
| SQQQQVGQGIL | CH | ||||
| SQQQQVGQGILVQGQGIIQPQQPAQL | CH | ||||
| VYVPPYCST | TH | ||||
| LQPHQPFSQQPQQ | TH | ||||
| APFASIVASIGGQ | TR | ||||
| QPFPQQPQQPYPQQPQQPFPQTQQPQQPFPQSK | TR | ||||
| 11 | 149 | 4 | 27.6 | YQQQQVGQGTLVQGQGIIQPQQPAQL | CH |
| SFPQQQPPF | TH | ||||
| 17 | 180 | 7 | 24.9 | ASIVADIGGQ | CH |
| 87 | 22.0 | KAPFASIVADIGGQ | CH | ||
| APFASIVADIGGQ | TR | ||||
| 38 | 169 | 16 | 37.5 | ANIDAGIGGQ | CH |
| 108 | 39.7 | GIIQPQQPAQLEGIRSLVL | CH | ||
| VPPNCSTINVPY | CH | ||||
| VPPNCSTINVPYANIDAGIGGQ6 | CH | ||||
| VTILRPLFQ6 | TH | ||||
| INVPYANIDAGIGGQ | TH | ||||
| NFLLQQCNHVSLVSSLVSIILPR | TR | ||||
1 detailed in Vensel et al. (in preparation).
2 obtained by interrogation of "subset" database with MS/MS data.
3 excluding 19 amino acid signal peptide.
4 sequences of peptides obtained by MS/MS that are not shared by other Butte 86 gamma gliadins. Peptide containing the extra cysteine is indicated in italics.
5 CH, chymotrypsin; TH, thermolysin; TR, trypsin.
6 peptides that distinguish Butte 86 sequence from closest NCBI match.
7 MS/MS data consistent with either Butte 86 gamma gliadin #1 or #8.
8 MS/MS data consistent with either Butte 86 gamma gliadin #3 or #10.