| Literature DB >> 26435710 |
Fred Y Peng1, Zhiqiu Hu2, Rong-Cai Yang3.
Abstract
Early flowering is an important trait influencing grain yield and quality in wheat (Triticum aestivum L.) and barley (Hordeum vulgare L.) in short-season cropping regions. However, due to large and complex genomes of these species, direct identification of flowering genes and their molecular characterization remain challenging. Here, we used a bioinformatic approach to predict flowering-related genes in wheat and barley from 190 known Arabidopsis (Arabidopsis thaliana (L.) Heynh.) flowering genes. We identified 900 and 275 putative orthologs in wheat and barley, respectively. The annotated flowering-related genes were clustered into 144 orthologous groups with one-to-one, one-to-many, many-to-one, and many-to-many orthology relationships. Our approach was further validated by domain and phylogenetic analyses of flowering-related proteins and comparative analysis of publicly available microarray data sets for in silico expression profiling of flowering-related genes in 13 different developmental stages of wheat and barley. These further analyses showed that orthologous gene pairs in three critical flowering gene families (PEBP, MADS, and BBX) exhibited similar expression patterns among 13 developmental stages in wheat and barley, suggesting similar functions among the orthologous genes with sequence and expression similarities. The predicted candidate flowering genes can be confirmed and incorporated into molecular breeding for early flowering wheat and barley in short-season cropping regions.Entities:
Year: 2015 PMID: 26435710 PMCID: PMC4576011 DOI: 10.1155/2015/874361
Source DB: PubMed Journal: Int J Plant Genomics ISSN: 1687-5389
Distributions of 204 flowering genes over five chromosomes and seven known functional groups in Arabidopsis compiled through searches in the literature and TAIR.
| Gene type | Functional group | AT1 | AT2 | AT3 | AT4 | AT5 | Total |
|---|---|---|---|---|---|---|---|
| Protein coding | Autonomous | 0 | 2 | 1 | 3 | 1 | 7 |
| Flower development | 19 | 3 | 7 | 3 | 9 | 41 | |
| Gibberellin | 9 | 2 | 3 | 3 | 2 | 19 | |
| Pathway integration | 2 | 1 | 0 | 2 | 2 | 7 | |
| Photoperiod | 9 | 9 | 3 | 1 | 10 | 32 | |
| Regulation | 8 | 15 | 16 | 10 | 20 | 69 | |
| Vernalization | 1 | 1 | 2 | 3 | 8 | 15 | |
| Subtotal |
|
|
|
|
|
| |
|
| |||||||
| MicroRNA | 2 | 4 | 0 | 2 | 6 | 14 | |
|
| |||||||
| Total |
|
|
|
|
|
| |
Flowering-related genes in barley and wheat that are annotated on the basis of top BLAST hits in Arabidopsis thaliana expressed as the percentage of characterized and uncharacterized proteins/enzymes in current ENSEMBL annotation.
| Ensembl annotation status | New annotation | |
|---|---|---|
| Barley | Wheat | |
| Uncharacterized | 93.1% (256) | 96.2% (866) |
| Characterized | 6.9% (19) | 3.8% (34) |
|
| ||
| Total | 275 | 900 |
Structural characteristics of flowering-related genes in Arabidopsis thaliana (AT), Triticum aestivum (TA), and Hordeum vulgare (HV).
|
AT ( |
TA ( |
HV ( | ||||
|---|---|---|---|---|---|---|
| Mean | Range | Mean | Range | Mean | Range | |
| Transcripts per gene | 1.4 | 1–5 | 1.0 | 1-1 | 2.8 | 1–27 |
| Gene length (bp) | 3161 | 182–16871 | 3815 | 240–20952 | 4328 | 404–15512 |
| Exons per gene | 6.5 | 1–48 | 5.7 | 1–42 | 4.5 | 1–20 |
| Exon size (bp) | 466 | 79–4165 | 565 | 42–5550 | 878 | 87–5211 |
| Intron size (bp) | 468 | 78–2316 | 924 | 58–7291 | 856 | 44–5912 |
| Protein length (aa) | 529 | 77–3529 | 444 | 52–3250 | 500 | 50–2056 |
The numbers of flowering genes used for the summary statistics are shown in parentheses. Single-exon genes (no introns) were excluded for intron size calculation. bp, base pair; aa, amino acid.
Figure 1The multiple sequence alignment of OG5_178217 including conserved MADS-box domain and K-box region (a) and PEBP family proteins (b). (a) The MADS-box domain (IPR002100) is marked blue, and the K-box region (IPR002487) marked red. (b) The PEBP domain (IPR008914) in Arabidopsis FT protein encoded by AT1G65480 spans from 27 to 161 amino acids.
Figure 2Phylogeny of PEBP (a) and MADS (b) family proteins in Arabidopsis, wheat, and barley. The PEBP proteins include 11 sequences in OG5_146543 (see Supplemental file 2), and a PEBP protein Pp1s34_16V6 in moss was used as an outgroup to root the phylogenetic tree. The MADS AGL12 proteins include six sequences in OG5_178217, and a MADS protein Phpat.004G002000.1 in moss was used as an outgroup to root the phylogenetic tree. The support value on each node is the Bayesian posterior probability. The scale bar denotes the number of nucleotide replacements per site.
Summary of the three public transcriptome datasets, genome characteristics, and numbers of expressed flowering genes of Arabidopsis, wheat, and barley.
| Arabidopsis | Wheat | Barley | |
|---|---|---|---|
| Accession/cultivar | Columbia | Chinese Spring | Morex |
| Ploidy | Diploid (2 | Hexaploid (2 | Diploid (2 |
| Genome size | 135 Mb | 17 Gb | 5.3 Gb |
| Number of total predicted genes | 27,416 | 9,8897 | 24,287 |
| Number of genes on GeneChip | 22,814 | 61,290 | 22,840 |
| PLEXdb experiment ID | AT40 | TA3 | BB3 |
| Number of tissues sampleda | 63 | 13 | 15 |
| Number of predicted flowering genes | 190 | 900 | 273 |
| Number of flowering genes expressed (% of predicted flowering genes) | 189 (99%) | 676 (75%) | 248 (91%) |
aTissues of wheat include germinating seed (coleoptile, root, and embryo), seedling (root, crown, and leaf), immature inflorescence, floral organs before anthesis (bracts, pistils, and anthers), 3–5 DAP caryopsis, 22 DAP embryo, and 22 DAP endosperm; tissues of barley include germinating seed (coleoptile, root, and embryo), seedling (root, crown, and leaf), immature inflorescence, floral organs before anthesis (bracts, pistils, and anthers), 3–5 DAP caryopsis, 10 DAP caryopsis, 16 DAP caryopsis, 22 DAP embryo, and 22 DAP endosperm. See Supplemental file 5 for the normalized, log2-transformed expression values of the flowering genes in different organs or developmental stages.
Figure 3Overview of expression of flowering genes in different tissues and development stages in Arabidopsis (a), wheat (b), and barley (c). The expression data Arabidopsis, wheat, and barley was from 63, 13, and 15 tissue types, respectively (Table 4, Supplemental file 5). The heat maps were created by hierarchical clustering using complete linkage method with the heatmap.2 function in R. The same color key shown is used for all the three heat maps.
Figure 4Expression profiles of orthologous genes in the two ortholog groups of the PEBP, MADS-box, and BBX families in wheat and barley. (a): PEBP OG5_158796, (b): PEBP OG5_146543, (c): MADS OG5_178217, (d): MADS OG5_144912, (e): BBX OG5_178368, and (f): OG5_170758.
The average percentage of protein sequence similarity of flowering genes in the seven functional groups in Arabidopsis, wheat, and barley.
| Functional group | AT versus TA | AT versus HV | HV versus TA |
|---|---|---|---|
| Autonomous | 62.72 (16.91) | 53.36 (14.48) | 97.32 (2.28) |
| Flower development | 68.77 (11.83) | 63.23 (12.39) | 95.19 (6.21) |
| Gibberellin | 61.27 (15.07) | 54.71 (10.95) | 96.85 (2.01) |
| Pathway integration | 74.75 (13.39) | 58.95 (16.97) | 95.44 (3.81) |
| Photoperiod | 61.44 (17.99) | 57.37 (16.25) | 94.48 (5.99) |
| Regulation | 69.78 (13.04) | 61.97 (13.49) | 96.49 (3.75) |
| Vernalization | 58.88 (8.31) | 44.63 (13.04) | 81.64 (15.04) |
AT versus TA, comparison of flowering protein sequences between Arabidopsis and wheat; AT versus HV, comparison of flowering protein sequences between Arabidopsis and barley; HV versus TA, comparison of flowering protein sequences between barley and wheat. The values in the parentheses are standard deviations.