| Literature DB >> 35873966 |
Guang Yang1, Ying Zhang1, Xinyu Wei1, Licao Cui2, Xiaojun Nie1.
Abstract
Transcription factor (TF) is a class of the sequence-specific DNA-binding proteins that modulate the transcription of target genes, and thus regulate their expressions. Variations in TF are the crucial determinants for phenotypic traits. Although much progress has been made in the functions of TF genes in wheat, one of the most important staple crops globally, the diversity of TF genes in wheat and its progenitors are not well understood, especially the agronomically promising haplotypes have not yet been characterized. Here, we identified a total of 6,023 TF genes from hexaploid wheat through a genome-search method and classified them into 59 gene families based on the conserved domain. The characteristics and dN/dS values of these genes showed evidently selective effects. Based on re-sequencing data, we found a strong genetic bottleneck among these TF genes on A and D subgenomes while no found in B subgenome during wheat domestication. Combined with selective signals and known QTLs on the whole genome, 21 TF genes were preliminarily found to be associated with yield-related traits. The haplotype frequency of these TF genes was further investigated in bread wheat and its progenitors and 13 major haplotypes were the casual loci related to key traits. Finally, the tissue-specific TF genes were also identified using RNA-seq analysis. This study provided insights into the diversity and evolution of TF genes and the identified TF genes and excellent haplotypes associating with traits will contribute to wheat genetic improvement.Entities:
Keywords: genetic diversity; haplotype; transcription factor; wheat; yield traits
Year: 2022 PMID: 35873966 PMCID: PMC9305608 DOI: 10.3389/fpls.2022.899292
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1The characteristics of TF genes and the dN (number of substitutions per non-synonymous site) and dS (number of substitutions per synonymous site) values of pairs. (A) Percentage of TF genes on three subgenomes. (B) Density of TF genes on three subgenomes. (C) Numbers of exon of TF and non-TF genes. ***P-value of Student’s test < 0.01. (D) Distribution of cDNA length of TF and non-TF genes. (E) Distribution of protein length of TF and non-TF genes. (F) Distribution of dN/dS ratio of paired TF genes. (G) Distribution of dS value of paired TF genes.
FIGURE 2SNP density and population analysis of TF genes. (A). The number of SNPs within TF gene region in each group. (B) The number of SNPs within TF promotor region in each group. (C–E) The density of SNPs within gene and promotor region on panels (C) A, (D) B, and (E) D subgenomes. (F) The Fst values between different groups. The values on the lines represent Fst.
FIGURE 3Function analysis and expression heatmap of TF genes. (A) Summary of total gene density, TF density, and the distribution of 124 known QTLs of 14 agronomic traits of wheat. (B) Top 20 PO and TO enrichment terms of TF genes. (C) Expression profiles of TF genes in eight different tissues and time points. Clusters 1–7 on the right represent classification according to gene expression specificity. DAA represents days after anthesis; DPA represents days post-anthesis.
FIGURE 4An example of key TF gene TraesCS4A02G130000. (A) π ratio between wild and domesticated emmer on the A subgenome. Black dotted line represents the position of TraesCS4A02G130000. Red box represents the position of one QTL related to plant height (HT). (B) Gene structure and SNP position of TraesCS4A02G130000. (C) Haplotype frequency of TraesCS4A02G130000. (D) Heatmap of TraesCS4A02G130000 expression in different tissues and at different developmental time points. DAA represents days after anthesis; DPA represents days post-anthesis.