| Literature DB >> 27190000 |
Wei Zhang1, Jianhong Xu2, Jeffrey L Bennetzen3, Joachim Messing1.
Abstract
Seed storage proteins (SSP) in cereals provide essential nutrition for humans and animals. Genes encoding these proteins have undergone rapid evolution in different grass species. To better understand the degree of divergence, we analyzed this gene family in the subfamily Chloridoideae, where the genome of teff (Eragrostis tef) has been sequenced. We find gene duplications, deletions, and rapid mutations in protein-coding sequences. The main SSPs in teff, like other grasses, are prolamins, here called eragrostins. Teff has γ- and δ-prolamins, but has no β-prolamins. One δ-type prolamin (δ1) in teff has higher methionine (33%) levels than in maize (23-25%). The other δ-type prolamin (δ2) has reduced methionine residues (<10%) and is phylogenetically closer to α prolamins. Prolamin δ2 in teff represents an intermediate between δ and α types that appears to have been lost in maize and other Panicoideae, and was replaced by the expansion of α-prolamins. Teff also has considerably larger numbers of α-prolamin genes, which we further divide into five sub-groups, where α2 and α5 represent the most abundant α-prolamins both in number and in expression. In addition, indolines that determine kernel softness are present in teff and the panicoid cereal called foxtail millet (Setaria italica) but not in sorghum or maize, indicating that these genes were only recently lost in some members of the Panicoideae Moreover, this study provides not only information on the evolution of SSPs in the grass family but also the importance of α-globulins in protein aggregation and germplasm divergence.Entities:
Keywords: gene copy number variation; grass genomes; seed protein genes
Mesh:
Substances:
Year: 2016 PMID: 27190000 PMCID: PMC4943188 DOI: 10.1093/gbe/evw117
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogenetic relationships among several common cereals. Names of family, subfamily, tribe, and species are shown. This is a generalized tree of relationships adapted from previous studies (Kellogg 2001; Vincentini et al. 2008; Xu and Messing 2009) with no computational support and phylogenetic distances were not drawn to scale.
FAlcohol soluble proteins from mature tefs seed resolved in a 15% SDS-PAGE gel. Lane 1, protein markers. Lane 2–4, extractable prolamin components by 70% ethanol. The types of 15 and 10 kDa eragrostins were predicted with no experimental validations.
FPhylogenetic analysis of teff eragrostins. Genomic sequences of all eragrostins from Dabbi were obtained with PCR and traditional sequencing (Methods). A few eragrostins from Tsedey that are absent from Dabbi and a few representative maize genes were also used for phylogenetic analysis. The sequences of the eragrostins used can be found in supplementary table S2, Supplementary Material online. A phylogenetic tree was drawn using the MEGA5 program with the Neighbor-Joining method.
Protein Composition of Teff Storage Proteins
| Ala (A) | 5.5 | 4.1 | 6.5 | 3.9 | 7.0 | 13.2 | 7.4 | 4.7 | 5.8 | 12.2 | 5.4 |
| Arg (R) | 1.3 | 1.8 | 6.5 | 0.0 | 0.4 | 0.6 | 2.5 | 0.6 | 1.2 | 0.9 | 1.1 |
| Asn (N) | 0.5 | 0.8 | 2.4 | 0.8 | 0.4 | 6.0 | 1.8 | 2.9 | 2.9 | 3.5 | 3.8 |
| Asp (D) | 0.5 | 1.0 | 2.4 | 0.8 | 0.9 | 0.0 | 1.8 | 0.0 | 0.0 | 0.0 | 0.5 |
| Cys (C) | 3.7 | 3.6 | 9.7 | 8.7 | 5.3 | 3.6 | 1.2 | 1.2 | 0.6 | 2.2 | 0.5 |
| Gln (Q) | 25.8 | 24.3 | 8.1 | 11.8 | 8.8 | 14.4 | 39.9 | 41.5 | 40.7 | 24.3 | 39.1 |
| Glu (E) | 2.4 | 2.3 | 8.1 | 0.0 | 0.0 | 0.0 | 1.8 | 0.0 | 1.2 | 0.0 | 0.5 |
| Gly (G) | 13.2 | 12.9 | 3.2 | 3.9 | 16.7 | 1.2 | 1.8 | 1.8 | 0.6 | 1.3 | 1.1 |
| His (H) | 5.8 | 6.5 | 2.4 | 0.8 | 0.4 | 1.2 | 1.8 | 1.2 | 1.2 | 1.3 | 1.1 |
| Ile (I) | 1.1 | 0.8 | 4.8 | 1.6 | 1.8 | 5.4 | 4.3 | 4.1 | 5.8 | 6.1 | 5.4 |
| Leu (L) | 2.6 | 3.1 | 4.8 | 2.4 | 2.2 | 4.2 | 7.4 | 8.8 | 8.7 | 11.7 | 10.9 |
| Lys (K) | 1.1 | 0.8 | 5.6 | 0.8 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Met (M) | 3.9 | 4.1 | 7.3 | 29.9 | 18.9 | 9.0 | 0.0 | 2.3 | 0.6 | 1.7 | 0.5 |
| Phe (F) | 3.4 | 3.1 | 0.0 | 2.4 | 5.3 | 7.2 | 7.4 | 4.7 | 6.4 | 5.2 | 4.9 |
| Pro (P) | 13.2 | 15.0 | 5.6 | 10.2 | 9.2 | 9.6 | 4.3 | 3.5 | 5.8 | 7.8 | 4.9 |
| Ser (S) | 7.4 | 7.2 | 5.6 | 12.6 | 13.2 | 10.2 | 2.5 | 6.4 | 4.1 | 5.7 | 4.3 |
| Thr (T) | 2.4 | 2.6 | 8.1 | 6.3 | 4.8 | 5.4 | 4.3 | 4.1 | 2.9 | 4.8 | 3.3 |
| Trp (W) | 1.3 | 1.3 | 2.4 | 0.8 | 0.0 | 1.8 | 1.8 | 0.6 | 0.6 | 0.9 | 1.1 |
| Tyr (Y) | 3.7 | 3.1 | 4.0 | 2.4 | 4.4 | 3.6 | 4.3 | 4.1 | 2.9 | 4.8 | 4.9 |
| Val (V) | 1.3 | 1.6 | 2.4 | 0.0 | 0.4 | 3.6 | 3.7 | 7.6 | 8.1 | 5.7 | 6.5 |
FComparison of the Ha locus and Ha-like genes in wheat (T. aestivum [ta]), brachypodium (B. sylvaticum [bs]), rice (Oryza sativa [os]), teff (E. tef [et]), foxtail millet (S. italica [si]) and sorghum (Sorghum bicolor [bc]). (A) Comparison of loci orthologous to Ha. Ha and Ha-like loci of wheat, B. sylvaticum, rice and sorghum were from Charles et al. (2009). The teff Ha-like locus was from Scaffold1023:14107-23158 (http://www.tef-research.org/genome.html) and the foxtail millet Ha-like locus was downloaded from NCBI as XM_004963284. Gene sizes are not drawn to scale. Solid blue represents expressed genes while unfilled genes indicate pseudo-genes or partial genes. (B) Comparison of tryptophan-rich domains of Ha-like genes. (C) Phylogenetic analysis of the relatedness of Ha-like genes. The phylogenetic tree shown was drawn with MEGA5 using the Maximum Likelihood method.
FComparison of organization of storage proteins in (A) teff and (B) maize. CW, cell wall; PB, protein body; SB,: starch body.
FDevelopment of PBs in teff endosperm. (A) First layer of endosperm. (B) Fourth layer of endosperm. (C) Fifth layer of endosperm. Arrowheads point to electron-dense PBs.
Summary of Kernel Softness, SSP Content, and Storage Structures of SSPs in Several Grass Species
| Wheat | hard or soft | 3 | PBs and PSVs | HMW | 6–12 | HMW-glutenins | |
| γ | 25–38 | LMW-glutenins | |||||
| 38–50 | α-gliadins | ||||||
| γ-gliadins | |||||||
| ω-gliadins | |||||||
| Rice | hard | 1 | PBs and PSVs | γ | 40 | 13 kDa Ory13 | |
| 60 | 13 kDa Ory16 | ||||||
| 16 kDa Ory13 | |||||||
| δ | 10 kDa Ory10 | ||||||
| Teff | hard | 4 | PBs and PSVs | γ | 15 | 50 kDa γ-eragrostin | |
| 27 kDa γ-eragrostin | |||||||
| δ | 25 | 15 kDa δ-eragrostins | |||||
| 10 kDa δ-eragrostins | |||||||
| α | 60 | 22 kDa α-eragrostins | |||||
| 19 kDa α-eragrostins | |||||||
| Maize | hard | 2 | PBs | γ | 20–25 | 50 kDa γ-zein | |
| 27 kDa γ-zein | |||||||
| 16 kDa γ-zein | |||||||
| 15 kDa β-zein | |||||||
| δ | <5 | 18 kDa δ-zein | |||||
| 10 kDa δ-zein | |||||||
| α | 60–70 | 22 kDa α-zein | |||||
| 19 kDa α-zein |
The percentage of different types of prolamins in total wheat prolamins is based on a previous report (Payne et al. 1984), with ∼80% of wheat proteins being glutenins and gliadins.
The percentages of different prolamin species in total prolamins were from a previous study (Ogawa et al. 1987).
The percentage of different types of teff prolamins in total prolamins was calculated by protein band densitometry from Image J, following a previous protocol (Garcia et al. 2015).
Percentages of different zeins in total maize prolamins were from previous reports (Thompson and Larkins 1994; Wu et al. 2009).