Literature DB >> 31215411

Genome-wide identification, phylogenetic and expression analysis of the heat shock transcription factor family in bread wheat (Triticum aestivum L.).

Min Zhou1,2,3, Shigang Zheng1, Rong Liu1,2, Jing Lu1,2, Lu Lu1, Chihong Zhang1, Zehou Liu1, Congpei Luo1,2, Lei Zhang1, Levi Yant3, Yu Wu4.   

Abstract

BACKGROUND: Environmental toxicity from non-essential heavy metals such as cadmium (Cd), which is released from human activities and other environmental causes, is rapidly increasing. Wheat can accumulate high levels of Cd in edible tissues, which poses a major hazard to human health. It has been reported that heat shock transcription factor A 4a (HsfA4a) of wheat and rice conferred Cd tolerance by upregulating metallothionein gene expression. However, genome-wide identification, classification, and comparative analysis of the Hsf family in wheat is lacking. Further, because of the promising role of Hsf genes in Cd tolerance, there is need for an understanding of the expression of this family and their functions on wheat under Cd stress. Therefore, here we identify the wheat TaHsf family and to begin to understand the molecular mechanisms mediated by the Hsf family under Cd stress.
RESULTS: We first identified 78 putative Hsf homologs using the latest available wheat genome information, of which 38 belonged to class A, 16 to class B and 24 to class C subfamily. Then, we determined chromosome localizations, gene structures, conserved protein motifs, and phylogenetic relationships of these TaHsfs. Using RNA sequencing data over the course of development, we surveyed expression profiles of these TaHsfs during development and under different abiotic stresses to characterise the regulatory network of this family. Finally, we selected 13 TaHsf genes for expression level verification under Cd stress using qRT-PCR.
CONCLUSIONS: To our knowledge, this is the first report of the genome organization, evolutionary features and expression profiles of the wheat Hsf gene family. This work therefore lays the foundation for targeted functional analysis of wheat Hsf genes, and contributes to a better understanding of the roles and regulatory mechanism of wheat Hsfs under Cd stress.

Entities:  

Keywords:  Expression profiles; Genome-wide analysis; Hsf; Wheat

Mesh:

Substances:

Year:  2019        PMID: 31215411      PMCID: PMC6580518          DOI: 10.1186/s12864-019-5876-x

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Heat shock proteins (HSPs) perform important roles not only in cellular protection against stress-related damage, but also in the regular folding, intracellular distribution, and degradation of proteins. These functions facilitate organismal survival under stressful conditions [1, 2]. Heat shock transcription factors (Hsfs) modulate the expression of HSPs, and participate in various aspects of protein homeostasis, such as refolding, assembly and transporting damaged proteins, which sustain protein stability [3-7]. Hsfs share a core structure consisting of an N-terminal DNA binding domain (DBD) and an adjacent bipartite oligomerization domain (HR-A/B) [6, 8]. Some Hsfs also share a leucine-rich nuclear export signal (NES) for nuclear export, a nuclear localization signal (NLS) essential for nuclear import,, and short peptide motifs (AHA motifs) for activator functions [9-12]. Based on the characteristics of their HR-A/B domain and phylogenetic comparisons, plant Hsf genes may be classified into three broad groups: A, B and C [6, 8]. The HR-A/B regions of class B Hsfs are relatively compact, not including any insertions, while all class A and class C HSFs have an outspread HR-A/B region due to an insertion of 21 (class A) and seven (class C) amino acid residues [6]. This classification is also supported by differences in the flexible linkers between the DBD domain and HR-A/B domain, which consists of 9 to 39, 50 to 78, and 14 to 49 amino acid residues in class A, B and C Hsfs, respectively [6, 9]. Recent studies indicate that Hsfs are engaged in plant development and growth, as well as in response to abiotic stresses such as salt, cold, drought and cadmium challenge [7, 9, 13–19]. For example, HsfA9 is related to seed maturation and embryogenesis in sunflowers and Arabidopsis [14-16]. HsfA4a is involved in cadmium tolerance in wheat [19]. Due to the essential modulatory functions of Hsf genes in plants [16-18], the Hsf gene family has been studied in the models Arabidopsis thaliana and rice (Oryza sativa), and nonmodels such as poplar (Popupus trichocarpa), maize (Zea mays), and apple (Malus domestica) [5, 6, 9, 20–22]. However, the Hsf gene family in the bread wheat (Triticum aestivum) has not been systematically examined. Bread wheat is one of the most widely grown and consumed crops worldwide [23]. Bread wheat is hexaploid (2n = 6x = 42; AABBDD genome), originating from two amphiploidization events: the first hybridization producing the tetraploid wheat species (2n = 4x = 28, genome AABB) was between the Triticum urartu (2n = 2x = 14, genome AA) and presumably Aegilops speltoides, belonging to the section Sitopsis (2n = 2x = 14, genome SS); the second hybridization was between the tetraploid wheat and Aegilops tauschii (2n = 2x = 14, genome DD) [24, 25]. Therefore, bread wheat has a huge and highly complex genome with three subgenomes (A, B and D) and ~17Gb total size [26], leading to great challenges for genomic studies. Recently, however, a quality draft genome of hexaploid ‘Chinese Spring’ wheat has provided the foundation upon which we can investigate wheat gene families and to clearly recognize homologous gene copies in these three sub-genomes [27]. Further, it has allowed the study of interactions of loci during polyploidization and the retention and dispersion of homologous gene [28, 29]. Here we first perform an in silico genome-wide study to comprehensively identify members of the wheat Hsf gene family. Next, to characterize evolutionary and functional features, we determine chromosome locations, gene structures, conserved protein domains, phylogenetic relationships and expression profiles for this family. Our study provides a foundation for downstream targeted functional investigation of wheat Hsf genes, and will be allow for better understanding of the molecular mechanisms by which Hsfs regulate in growth, development and stress resilience in wheat.

Results

Genome-wide identification and classification of Hsf family in wheat

Through the availability of the genome sequence, it is possible for the first time to identify all the Hsf family members in wheat. In this study, we identified a total of 78 genes as Hsf members in the wheat genome, designating the predicted wheat Hsf genes TaHsf1 to TaHsf78. Members of the Hsf gene family have been broadly subdivided into Classes A, B, and C according to differences in the length of the flexible linkers between the A and B parts of the HR-A/B regions. In the TaHsf gene family, 38, 16 and 24 genes were accordingly assigned to Classes A, B and C, respectively. Within the A clade, 8 distinct subclades (A1, to A8) were resolved. The B-type Hsf genes were grouped into a separate clade subdivided into three groups (B1, B2 and B4). And the C-type genes were subdivided into two groups (C1 and C2). We further performed a BLASTN search against the wheat expressed sequence tag (EST) using the 78 identified Hsfs as queries to verify the existence and completeness of this set of wheat Hsfs. Results showed that most of the TaHsfs were supported by EST hits except 2 Hsfs (TaHsf57 and TaHsf75). We speculated these 2 unsupportted TaHsfs might not be expressed under any the assayed conditions or may be expressed at very low level that cannot be easily detected. Among the supported TaHsf genes, TaHsf8 has the largest number of EST hits, with 49, followed by TaHsf21 and TaHsf27 with 48 and 30 hists, respectively. Chromosome localization analysis found that 4 TaHsfs did not have corresponding chromosomal locations, and that the remaining 74 TaHsf genes were distributed on all of the 21 wheat chromosomes. Chromosome 3B contained the most Hsf genes with 8, followed by 4B, 5A and 5D, with each harboring 6, then 3A with 5, and finally 6A, 6B and 6D with one each. The predicted lengths of the putative TaHsf proteins ranged from 209 to 701 amino acids, with the molecular weights (Mw) ranging from 22.72 to 73.92 kDa and theoretical isoelectric points (PI) ranging from 4.67 to 9.50 (Table 1).
Table 1

The list of the putative wheat Hsf genes

NamesEnsemble Gene IDChromosome locationEST countlength (bp)ExonsIntronsAmino acid length (aa)PIMW (kDa)
TaHsf1 Trae_4AL_8577C148Bscaffold_288809_4AL: 49,335-56,655267321325214.9457.34
TaHsf2 Trae_5BL_E15759DADscaffold_404129_5BL: 211,116-217,536266421214715.1852.89
TaHsf3 Trae_5DL_B1D24781B1scaffold_433347_5DL: 108,916-114,305285390214874.9554.60
TaHsf4 Trae_5AL_16AD8DEECscaffold_375092_5AL: 45,746-49,54443799213465.4538.98
TaHsf5 Trae_5DL_6EB179C88scaffold_434875_5DL: 17,703-21,44583743213485.3938.90
TaHsf6 ndscaffold_640974_U: 63,006-67,19064185213535.5939.72
TaHsf7 Trae_2AS_CF07F4EC2scaffold_113503_2AS: 55,860-61,955266096214134.9945.60
TaHsf8 Trae_2BS_ECF9B4EB4scaffold_148328_2BS: 27,356-32,557495202214055.0644.92
TaHsf9 Trae_2DS_B6872CB84scaffold_177319_2DS: 131,495-137,254275760214124.8545.43
TaHsf10 Trae_3AL_E15419B88scaffold_194616_3AL: 22,466-26,62024155213146.1435.42
TaHsf11 TRAES3BF002300100CFDscaffold_221589_3B: 97,736-102,70044965323965.0943.99
TaHsf12 ndscaffold_379543_5AL: 6399-8897102499323725.3741.13
TaHsf13 ndscaffold_433195_5DL: 110,106-113,56093455213775.4241.59
TaHsf14 ndscaffold_116363_2AS: 3667-661572949214676.0651.62
TaHsf15 Trae_2AS_53BFA14C7scaffold_114504_2AS: 30,086-34,71074625435025.9555.44
TaHsf16 Trae_2BS_1484A7516scaffold_146118_2BS: 176,336-179,44073105214755.9452.08
TaHsf17 Trae_2DS_070CE3D50scaffold_177422_2DS: 92,871-96,20583335214995.754.78
TaHsf18 Trae_3AL_463ABD4BFscaffold_196554_3AL: 30,462-32,936152475214325.3648.37
TaHsf19 TRAES3BF029100010CFDscaffold_223991_3B: 37,660-40,120152461214415.1849.46
TaHsf20 Trae_3DL_8FD0F859Bscaffold_249383_3DL: 45,727-48,075162349214335.3548.45
TaHsf21 Trae_1AL_7D6DC73FCscaffold_001183_1AL: 39,381-43,295483915324484.9150.25
TaHsf22 ndscaffold_031159_1BL: 70,097-72,25592159324454.9449.92
TaHsf23 ndscaffold_061383_1DL: 26,931-29,06582135324425.1149.70
TaHsf24 Trae_6AS_1537629B3scaffold_487059_6AS: 7273-11,03273760214585.2149.87
TaHsf25 Trae_6BS_25E162197scaffold_513816_6BS: 34,066-37,53983474214555.3349.92
TaHsf26 Trae_6DS_C59B6322Fscaffold_543918_6DS: 1556-555674001214585.1649.86
TaHsf27 Trae_1AL_A4B5C1474scaffold_003124_1AL: 28,946-32,10130315654368541.70
TaHsf28 Trae_1BL_5D8D6B865scaffold_031443_1BL: 79,599-83,003273405433644.8941.02
TaHsf29 Trae_1DL_B5A84E4C8scaffold_061579_1DL: 62,790-66,102293313433705.0342.03
TaHsf30 Trae_4AS_52EB860E7scaffold_307193_4AS: 64,786-67,745132960213415.0739.63
TaHsf31 Trae_4BL_2E125A702scaffold_321575_4BL: 50,126-53,221123096213415.0739.59
TaHsf32 Trae_4DL_AF19ABC7Dscaffold_342984_4DL: 44,562-50,805136244433415.0239.49
TaHsf33 ndscaffold_559301_7AL: 9972–11,83541864543104.6733.78
TaHsf34 ndscaffold_579527_7BL: 16,166-18,11151946433514.9437.90
TaHsf35 ndscaffold_605087_7DL: 18,736-21,00752272433514.8237.98
TaHsf36 Trae_4AS_02B607421scaffold_306492_4AS: 132,616-136,430103815433835.2242.84
TaHsf37 Trae_4BL_542B1DA85scaffold_322416_4BL: 4327-821593889433845.342.87
TaHsf38 Trae_4DL_EE941086Escaffold_344014_4DL: 13,486-17,31093825433845.342.92
TaHsf39 Trae_5AL_D369204D3scaffold_374310_5AL: 146,720-151,848285129212989.532.14
TaHsf40 Trae_5BL_F80E01D65scaffold_404669_5BL: 141,516-147,139275624212989.3132.28
TaHsf41 Trae_5DL_431CCA490scaffold_433651_5DL: 31,056-36,542285487212989.232.06
TaHsf42 Trae_2AL_D3B2C21A7scaffold_094650_2AL: 33,644-35,17021527212956.1231.99
TaHsf43 ndscaffold_712376_U: 1–7151715212099.522.72
TaHsf44 ndscaffold_019033_1AS: 12,760-16,705253946324044.942.04
TaHsf45 Trae_5BL_FCB1625F3scaffold_404935_5BL: 109,416-113,225273810327019.2273.92
TaHsf46 ndscaffold_433530_5DL: 41,946-43,807261862213974.8941.11
TaHsf47 Trae_7AS_937121AF8scaffold_570040_7AS: 14,527-16,33561809213745.4440.45
TaHsf48 Trae_7BS_03F39ED94scaffold_592325_7BS: 110,144-112,89562752323745.3340.33
TaHsf49 Trae_7DS_10A9C68FAscaffold_621446_7DS: 14,666-16,58061915213675.539.79
TaHsf50 Trae_2DS_01A0E5F7Ascaffold_178567_2DS: 15,585-18,51842934213206.5535.31
TaHsf51 ndscaffold_642758_U: 53,288-55,87552587213206.6635.26
TaHsf52 ndscaffold_374067_5AL: 30,626-32,51071885213887.8541.35
TaHsf53 ndscaffold_404268_5BL: 201,437-203,32581889213887.8941.46
TaHsf54 ndscaffold_433663_5DL: 11,036-12,82081785213888.4241.39
TaHsf55 ndscaffold_201352_3AL: 447–165501209322775.5431.15
TaHsf56 ndscaffold_194514_3AL: 70,656-72,59121936212946.2632.57
TaHsf57 ndscaffold_220888_3B: 91,006-92,35601351213225.4635.55
TaHsf58 TRAES3BF021000010CFDscaffold_220882_3B: 116,126-117,71021585213255.9435.72
TaHsf59 ndscaffold_249994_3DL: 60,736-62,25021515213216.1635.38
TaHsf60 ndscaffold_249450_3DL: 110,687-117,555166869322257.1125.63
TaHsf61 ndscaffold_193607_3AL: 163,384-164,530131147212366.9126.05
TaHsf62 TRAES3BF005500020CFDscaffold_223354_3B: 26,214-27,330141117212278.3524.69
TaHsf63 ndscaffold_250779_3DL: 25,456-26,755111300212418.7626.40
TaHsf64 TRAES_3BF025700020CFD_c1scaffold_231430_3B: 2326-340041075102375.1126.12
TaHsf65 ndscaffold_223198_3B: 70,994-72,09521102212375.9926.61
TaHsf66 TRAES3BF025700030CFDscaffold_224063_3B: 9915-11,221271307102746.9829.98
TaHsf67 Trae_4BL_86572BB6Dscaffold_321958_4BL: 10,751-11,96591215102645.4329.06
TaHsf68 Trae_4BL_F6C3B5069scaffold_320289_4BL: 21,946-23,120251175102758.430.21
TaHsf69 Trae_4BL_5091DE58Escaffold_320289_4BL: 33,386-34,49021105102574.8828.50
TaHsf70 ndscaffold_320675_4BL: 112,161-113,54051380212735.729.45
TaHsf71 ndscaffold_344468_4DL: 19,506-20,710141205102766.4630.27
TaHsf72 Trae_4DL_FA07D8414scaffold_343739_4DL: 22,666-23,88541220102765.3229.85
TaHsf73 ndscaffold_376864_5AL: 4896-601041115102736.1530.17
TaHsf74 ndscaffold_375679_5AL: 69,576-70,900151325102295.0825.56
TaHsf75 ndscaffold_641118_U: 187,271-188,37501105212685.6929.99
TaHsf76 Trae_7AL_6931AA68Bscaffold_558532_7AL: 22,876-24,495131620212666.4428.23
TaHsf77 ndscaffold_577398_7BL: 12,506-14,356141851212445.6126.12
TaHsf78 ndscaffold_609477_7DL: 1–1636161636212636.1128.04
The list of the putative wheat Hsf genes

Conserved domains analysis of TaHsf

We identified five conserved domains by sequence alignment approaches (Table 2). All the TaHsf predicted proteins contained a highly conserved DBD domain, forming with a three helical bundles (H1, H2 and H3) and four-stranded antiparallel β-sheet in their N-terminal regions. However, within the Hsf family, the length of the DBD domain was quite different. We then used the MARCOIL tool to detect the presence of a property of the HR-A/B, the coiled-coil structure characteristic of leucine zipper-type protein interaction domains. We found that most of the TaHsfs proteins consisted of NES and NLS domains, which are vital for shuttling Hsfs between the nucleus and cytoplasm. As was expected in the A-type TaHsfs, additional sequence comparisons identified AHA domain in the middle of the C-terminal activation domains. By contrast, these domains were not detected in the B- and C-type TaHsfs. To further predict and verify domains in the TaHsfs proteins, we used the Multiple EM for Motif Elicitation (MEME) motif search tool. Using this, we found thirty corresponding consensus motifs (Additional file 1: Figure S1, Additional file 2). Compared with class B and C, the members of class A contained the greatest number of conserved motifs (22), with the majority (12) detected in TaHsf1 and TaHsf3. The conserved motifs 1, 2, 4, 5, 8 16 represented the DBD domain. Motif 1 was found in 77 members of TaHsf family (except for TaHsf33). Regarding coiled-coiled structures, motif 3 was detected in class A and class C TaHsfs family, while motif 7 was detected in class B. The conserved motifs 10, 20, 22, 23, 25, 28, 30 were identified as NLS domains. Motifs 10 and 25 represented NLS domains in class A, whereas NLS domains were represented by motifs 20, 23, 28 and 30 in class B, motifs 22 and 23 in class C. Motif 15 represented NES domains, and motif11 was identified as characteristic AHA domains. Thus through the combination of the two methods, predicted DBD domains and HR-A/B domains were observed in each TaHsfs and varied greatly in size and sequence.
Table 2

Functional domains of TaHsfs

NamesProtein type (A-B-C)DBDHR-A/BNLSNESAHA
TaHsf1 A1a38–128163–227(245)RRIVAANKKRR(508)LTEQMGLLAHA2(464)DSFWEQFLCA
TaHsf2 A1a1–73109–173(191)RRIVAANKKRR(458)LTEQMGLLAHA2(414)DSFWEQFLCA
TaHsf3 A1a1–91125–189(207)RRIVAANKKRR(474)LTEQMGLLAHA2(430)DSFWEQFLCA
TaHsf4 A2a38–128143–207(223)RKELEDAISNKRRRRndAHA1(313)DDFWEDLL
TaHsf5 A2a40–130145–209(225)RKELEDAISNKRRRRndAHA1(315)DDFWEDLL
TaHsf6 A2a45–135150–214(230)RKELEDAISNKRRRRndAHA(320)DDFWEDLL
TaHsf7 A2b43–133149–213(229)RKELHDAISKKRRRR(400)KMGYLAHA1(370)DNFWEELL
TaHsf8 A2b44–134150–214(230)SKELHDAISKKRRRR(392)KMGYFAHA1(362)DNFWEGLL
TaHsf9 A2b43–133149–213(229)RKELHDAISKKRRRR(399)KMGYLAHA1(369)DNFWEELL
TaHsf10 A2b42–132148–212(228)RKELHDAMSKKRRRSndnd
TaHsf11 A2b41–131147–211(227)RKELHDAMSKKRRRSndAHA1(353)DDFWEELM
TaHsf12 A2e66–156178–242(260)RKELAEALLSKKRGRndAHA1(314)ESFWKELL
TaHsf13 A2e66–156180–244(262)RKELAEALLSKKRGRndAHA1(320)ESFWKELL
TaHsf14 A349–139175–221(248)RVKRKFLKHVndnd
TaHsf15 A384–174210–256(283)RVKRKFLKHVndnd
TaHsf16 A380–170206–252(279)RVKRKFLKHVndnd
TaHsf17 A381–171207–253(280)RVKRKFLKHVndnd
TaHsf18 A4a13–103126–183(198)KKRR(419)MTEKLGHLAHA1(244)LNSLENFFKE AHA2(370)DGFWQQFLTE
TaHsf19 A4a13–103126–183(198)KKRR(428)MTEKLGHLAHA1(244)LNSLENFFKE AHA2(379)DGFWQQFLTE
TaHsf20 A4a13–103126–183(198)KKRR(420)MTKKLGHLAHA1(244)LNSLENFFKE AHA2(370)DGFWQQFLTE
TaHsf21 A4d25–115138–195(220)KKRR(430)ITQQMGHLAHA1(267)LVSMEKLVQR AHA2(386)DLFWERFLTD
TaHsf22 A4d23–113136–193(219)KKRR(432)ITEQMGHLAHA1(267)LVSMEKLVRR AHA2(388)DLFWERFLTD
TaHsf23 A4d23–113136–193(218)KKRR(429)ITEQMGHLAHA1(270)LVSMEKLVQR AHA2(385)DLFWERFLTD
TaHsf24 A520–111131–188(199)KMAEASSMFADALHKKnd(414)DNFWEQFLTE
TaHsf25 A520–111131–188(199)KMAEASSMFADALHKKnd(414)DNFWEQFLTE
TaHsf26 A520–111131–188(199)KMAEASSMFADALHKKnd(414)DNFWEQFLTE
TaHsf27 A6a52–142159–223(238)KRKELEDAISKKRRR(352)IDELGQQLGYL(322)SDFWAELFSD
TaHsf28 A6a48–138155–219(234)KRKELEDAISKKRRR(348)IDELAQQLGYL(318)NDFWAELFSD
TaHsf29 A6a54–144161–225(240)KRKELEDAISKKRRR(354)IDELAQQLGYL(324)NDFWAELFSD
TaHsf30 A6b46–136153–217(232)KLKDLEDGYPTKRRRnd(311)DDFWEELLSE
TaHsf31 A6b46–136153–217(232)KLKDLEDGYPTKRRRnd(311)DDFWEELLSE
TaHsf32 A6b46–136153–217(232)KLKDLEDAYSNKRRRnd(311)DDFWEELLSE
TaHsf33 A7b47–138150–175ndnd(246)TDMIWYELL
TaHsf34 A7b49–139173–223ndnd(298)TDMIWYELL
TaHsf35 A7b49–139173–223ndnd(298)TDMIWYELL
TaHsf36 A837–127173–230ndndnd
TaHsf37 A837–127173–230ndndnd
TaHsf38 A837–127173–230ndndnd
TaHsf39 B127–117172–209ndndnd
TaHsf40 B130–120174–211ndndnd
TaHsf41 B130–120174–211ndndnd
TaHsf42 B2a13–103157–193(223)KRSREndnd
TaHsf43 B2a26–116170–206ndndnd
TaHsf44 B2c42–132215–251(321)KRARDndnd
TaHsf45 B2c176–266349–385(455)KRARDndnd
TaHsf46 B2c42–132215–251(321)KRARDndnd
TaHsf47 B2d32–122192–228(300)KRMRHndnd
TaHsf48 B2d32–122192–228(300)KRMRHndnd
TaHsf49 B2d32–122192–228(293)KRMRHndnd
TaHsf50 B4b40–130201–237(299)KKKRndnd
TaHsf51 B4b39–129200–236(299)KKKRndnd
TaHsf52 B4c26–117207–243(336)PVGA(362)LALENDDLnd
TaHsf53 B4c26–117207–243(336)PVGA(362)LALESDDLnd
TaHsf54 B4c26–117207–243(336)PVGA(362)LALESDDLnd
TaHsf55 C1a21–111121–164ndndnd
TaHsf56 C1a1–84121–171ndndnd
TaHsf57 C1a21–111154–197ndndnd
TaHsf58 C1a25–115159–202ndndnd
TaHsf59 C1a25–115159–202ndndnd
TaHsf60 C1a21–111149–192ndndnd
TaHsf61 C1b19–109126–169ndndnd
TaHsf62 C1b19–109131–174ndndnd
TaHsf63 C1b19–109131–174ndndnd
TaHsf64 C2a1–7597–140(168)KRPR
TaHsf65 C2a24–114134–177(202)KRPRndnd
TaHsf66 C2a19–109132–175(203)KRPRndnd
TaHsf67 C2a12–102124–167(195)QRPRndnd
TaHsf68 C2a21–111135–178(206)KRPRndnd
TaHsf69 C2a20–110132–175(203)KKPRndnd
TaHsf70 C2a24–114135–178(205)KRPRndnd
TaHsf71 C2a23–113135–178(206)KRRRndnd
TaHsf72 C2a24–114141–184(211)KRPRndnd
TaHsf73 C2a30–120143–186(207)NRPRndnd
TaHsf74 C2a1–84106–149(177)KRPRndnd
TaHsf75 C2a23–113132–175(198)KRLRndnd
TaHsf76 C2b15–105132–175(204)KRARndnd
TaHsf77 C2b1–8497–153ndndnd
TaHsf78 C2b13–103129–172(201)KRARndnd

DBD DND-binding domain, HR-A/B OD (oligomerisation domain), heptad pattern of hydrophobic amino acid residues; NLS: Nuclear localization signal, NES Nuclear export signal. AHA Activator motifs, romatic (W, F, Y), larger ydrophobic (L, I, V) and cidic (E, D) amino acid residues; Numbers in brackets reveals the position of the first amino acid present in the putative NLS, NES, and AHA in the C-terminal; nd: no domains detectable by sequence similarity

Functional domains of TaHsfs DBD DND-binding domain, HR-A/B OD (oligomerisation domain), heptad pattern of hydrophobic amino acid residues; NLS: Nuclear localization signal, NES Nuclear export signal. AHA Activator motifs, romatic (W, F, Y), larger ydrophobic (L, I, V) and cidic (E, D) amino acid residues; Numbers in brackets reveals the position of the first amino acid present in the putative NLS, NES, and AHA in the C-terminal; nd: no domains detectable by sequence similarity

Phylogenetic analysis in wheat Hsf proteins

To further evaluate the phylogenetic relationships amidst Hsf families, the Hsf conserved amino acid sequences (from the beginning of the DNA-binding domain to the end of the HR-A/B region) of 39 proteins from wheat (Triticum aestivum L.), 21 proteins from Arabidopsis (A. thaliana), 25 from rice (O. sativa), 24 from brachypodium (B. distachyon) and 30 from maize (Z. mays) were used to construct a phylogenetic tree (Fig. 1). According to this tree, class HsfA showed the maximum number of subclasses among the three major groups, and contained eight smaller clusters of which five (A6, A2, A8, A1 and A7) were closer to class HsfC than class HsfA3, A4 and A5. Two HsfA6 members from Arabidopsis (At5g43840 and At3g22830) were not clustered with the HsfA6 subclass from other plant species, but were closer to the HsfA7 subclass. Brachypodium Hsfs were closer to wheat Hsf proteins compared with Arabidopsis, maize and rice, which was in line with the botanical classification.
Fig. 1

Phylogenetic tree of Hsf proteins from wheat, Arabidopsis, rice, brachypodium and maize. The N-proximal regions (from the start of the DNA-binding domain to the end of the HR-A/B region) of Hsf proteins were used to construct an unrooted neighbor-joining tree with MEGA6.0 (with pairwise deletion and Poisson correct). For Hsf proteins of Arabidopsis (prefixed by AT), rice (prefixed by Os), Brachyposium (prefixed by Bradi) and maize (prefixed by ZM), both locus ID and subclass numbers are given. TaHsf proteins are marked in red

Phylogenetic tree of Hsf proteins from wheat, Arabidopsis, rice, brachypodium and maize. The N-proximal regions (from the start of the DNA-binding domain to the end of the HR-A/B region) of Hsf proteins were used to construct an unrooted neighbor-joining tree with MEGA6.0 (with pairwise deletion and Poisson correct). For Hsf proteins of Arabidopsis (prefixed by AT), rice (prefixed by Os), Brachyposium (prefixed by Bradi) and maize (prefixed by ZM), both locus ID and subclass numbers are given. TaHsf proteins are marked in red

Genome distribution and gene duplication of TaHsf gene family

We next determined chromosomal locations of TaHsf genes by leveraging the available wheat genome annotation information (Fig. 2). A total of 25, 26 and 23 TaHsf genes are found in the A, B and D sub-genomes, respectively (B > A > D). The distribution of Hsf genes was not even across the chromosomes. There were 7, 9, 17, 13, 16, 3 and 9 genes in the group 1 to group 7 chromosomes, which reveal obvious differences between group 3, 4, 5 and other four groups. Chromosome 3B had the highest number of Hsf genes with 8, while chromosome 6A, 6B and 6D all had only one Hsf gene eachs. These results suggest that Hsf gene duplication events may have happened in wheat 3, 4 and 5 group chromosomes during wheat formation and the evolution of gene families in the different sub-genome is independent, which may relate to gene function.
Fig. 2

Chromosomal localizations and the homologous TaHsf genes in wheat A, B and D sub-genomes

Chromosomal localizations and the homologous TaHsf genes in wheat A, B and D sub-genomes Gene duplication is frequently revealed in plant genomes, resulting from polyploidization or through tandem and segmental duplication related to replication [30]. Here, we found 17 homologous gene groups with a copy on each of A, B and D homologous chromosome, and 7 gene pairs with a copy on only 2 of the 3 homologous chromosomes, while the other 13 genes were not found as homologs (Fig. 2, Additional file 3). Our results indicate that gene loss may happen throughout the wheat Hsf gene family, leading to the loss of some homologous copies. Moreover, these homologous genes are clustered in group 3, 4 and 5 chromosomes, which was in line with the above analysis of chromosome localization, suggesting that group 3, 4 and 5 chromosomes subjected less sequence loss and interaction impact compared to other homologous chromosome groups. In addition, 17 pairs of duplicated genes from different sub-genomes were also found, containing 3 duplication events in the same chromosome and 14 segmental duplication events between different chromosomes, indicating that the duplication events could play important roles in the extension of the Hsf genes in wheat genome (Fig. 3, Additional file 3).
Fig. 3

Duplicated Hsf gene pairs identified in wheat. Seven homologous groups of wheat chromosomes are depicted in different colors. Duplicated gene pairs are depicted in corresponding colors and linked using lines with the corresponding color

Duplicated Hsf gene pairs identified in wheat. Seven homologous groups of wheat chromosomes are depicted in different colors. Duplicated gene pairs are depicted in corresponding colors and linked using lines with the corresponding color

Phylogenetic analysis of Hsfs between the T. urartu, A. tauschii, and wheat orthologs

We also identify the Hsfs gene in the diploid ancestors of wheat, T. urartu and A. tauschii, to investigate the change of Hsf number in transition from diploidy to hexaploidy within a given subgenome. Results showed that 16 and 15 putative Hsfs were identified in T. urartu and A. tauschii through our methods, respectively (Additional file 4). Total 16 T. urartu-Hsfs, 25 T. aestivum-A-Hsfs, 15 A. tauschii-Hsfs, and 23 T. aestivum-D-Hsfs gene sequences were applied to build gene trees. 16 pairs of T. urartu-wheat A genome orthologs were mapped to T. urartu chromosomes with 2 on 1A, 2 on 2A, 4 on 3A, 3 on 4A, 2 on 5A, 1 on 6A and 2 on 7A (Fig. 4). 15 pairs of A. tauschii-wheat D genome orthologs were mapped to A.tauschii chromosomes with 2 on 1D, 3 on 2D, 3 on 3D, 2 on 4D, 3 on 5D, 1 on 6D and 1 on 7D (Fig. 4). The majority of the orthologs (75 and 66.67% for T. urartu and A. tauschii, respectively) belonged to class A, as expected due to the high proportional composition of this type (48.72%) among the identified wheat Hsf genes. Moreover, the chromosome locations of the majority of wheat Hsf genes and their orthologs in T. urartu and A. tauschii corresponded to one another (Additional file 5).
Fig. 4

Collinear analysis for the Hsf gene family among wheat, T.urartu and A.tauschii. The green annulus on the top left represent chromosomes of A. tauschii and the blue annulus on the top right represent chromosomes of T. urartu. Different colors represent seven homologous groups of wheat chromosomes. Homeologous genes of each group are linked by lines with corresponding color

Collinear analysis for the Hsf gene family among wheat, T.urartu and A.tauschii. The green annulus on the top left represent chromosomes of A. tauschii and the blue annulus on the top right represent chromosomes of T. urartu. Different colors represent seven homologous groups of wheat chromosomes. Homeologous genes of each group are linked by lines with corresponding color

Modulatory network between TaHsf genes with other wheat genes

In order to comprehend the interactions between TaHsfs and other wheat genes, the modulatory network of them (Fig. 5) was predicted via the orthology-based method [31]. Results showed that 15 TaHsfs were shown to have homology with Arabidopsis genes and the 420 gene pairs of network interactions were found with the average of 28 gene per TaHsf, suggesting the TaHsfs were broadly engaged in the regulatory network and biological process in wheat. Among these, 292 genes interacted with TaHsfA and 128 genes interacted with TaHsfB. TaHsf16 (A3) was found to interact with 77 wheat genes, including Hsp81.4, ZF2, HBT and HSP90.1, suggesting it was mainly participated in response to stress, metal ion binding, cell differentiation and protein folding. TaHsf18 (A4a) was found to interact with 24 wheat genes, including ZAT6, STZ and S6K2, suggesting it was mainly engaged in metal ion binding, intracellular signal transduction and negative regulation of cell proliferation. TaHsf50 (B4b) was predicted to interact with 88 wheat genes, including MYB15, MYB70, ZFP2, FMA, and HB31, suggesting it is engaged primarily in the regulation of transcription, asmonic acid, metal ion binding and DNA binding. TaHsf44 (B2c) was found to interact with 30 wheat genes including AGC2–1, WRKY39, BAG6 and NF-YC2, suggesting it is mainly engaged in defense response, calmodulin binding, response to heat and flower development (Additional files 6, 7). Moreover, GO and KEGG pathway descriptions of those interacting genes were analyzed to understand the potential function and pathway of the 15 TaHsfs (Fig. 6). The 15 TaHsf interacting genes were significantly enriched for transcription, DNA-templating, response to heat, transcription factor activity, sequence-specific DNA binding and calmodulin binding (Fig. 6a). Significantly enriched pathways included plant hormone signal transduction, PI3K-Akt signaling pathway, and protein processing in endoplasmic reticulum (Fig. 6b).
Fig. 5

An interaction network of TaHsf genes in wheat based on the orthologs in Arabidopsis. Fifteen TaHsfs were found to have homology with Arabidopsis genes and the 420 gene pairs of network interactions

Fig. 6

Functional and KEGG pathway categories of 15 TaHsfs interacting with wheat genes. a Top 10 GO categories that are enriched in 15 TaHsfs interacting with wheat genes according to –log10Pvalues. GOs included biological process, cellular component and molecular function. b Top 20 KEGG pathways that are enriched in 15 TaHsfs interacting with wheat genes according to enrichment scores

An interaction network of TaHsf genes in wheat based on the orthologs in Arabidopsis. Fifteen TaHsfs were found to have homology with Arabidopsis genes and the 420 gene pairs of network interactions Functional and KEGG pathway categories of 15 TaHsfs interacting with wheat genes. a Top 10 GO categories that are enriched in 15 TaHsfs interacting with wheat genes according to –log10Pvalues. GOs included biological process, cellular component and molecular function. b Top 20 KEGG pathways that are enriched in 15 TaHsfs interacting with wheat genes according to enrichment scores

Tissue-specific expression patterns of TaHsf genes

Using available RNA-seq data for five different tissues, the tissue specificity of the TaHsf genes was investigated to focus on the temporal and spatial expression patterns and putative functions of Hsf genes in wheat growth and development. According to FPKM values, we found that the expression levels of the TaHsfs varied significantly in different tissues (Fig. 7). TaHsf10 (A2b), TaHsf15 (A3), TaHsf16 (A3), TaHsf17 (A3), TaHsf30 (A6b), TaHsf32 (A6b), TaHsf50 (B4b), TaHsf58 (C1a), TaHsf66 (C2a) and TaHsf72 (C2a) exhibit low expression abundance in endosperm, inner pericarp and outer pericarp, while TaHsf1 (A1a), TaHsf2 (A1a), TaHsf3 (A1a), TaHsf4 (A2a), TaHsf8 (A2b), TaHsf9 (A2b), TaHsf20 (A4a), TaHsf21 (A4d), TaHsf36 (A8) and TaHsf41 (B1) had high expression abundances. Furthermore, the expression levels of the TaHsfs varied significantly in different grain layers over development (Additional file 1: Figure S2).
Fig. 7

Heat map of the expression profiles of 46 TaHsf genes in five different tissues (grain, leaf, root, spike and stem). Log2 transformed FPKM values were used to create the heat map. The red or green colors stand for the higher or lower relative abundance of each transcript in each sample. Z represent Zadoks scale, a decimal code for the growth stages of cereals. P-value< 0.05 were regarded as statistically significant

Heat map of the expression profiles of 46 TaHsf genes in five different tissues (grain, leaf, root, spike and stem). Log2 transformed FPKM values were used to create the heat map. The red or green colors stand for the higher or lower relative abundance of each transcript in each sample. Z represent Zadoks scale, a decimal code for the growth stages of cereals. P-value< 0.05 were regarded as statistically significant

Expression patterns of TaHsf genes under abiotic stresses

To study the roles of TaHsf genes in response to abiotic stresses, expression of all TaHsf genes in response to drought, heat, and Cd stress was investigated using RNA sequencing data. All 46 wheat Hsf genes revealed different expression patterns under these dynamic conditions. Among them, the expression levels of TaHsf2 (A1a) and TaHsf21 (A4d) were both down-regulated under drought, heat, drought and heat stresses, while the expression of TaHsf4 (A2a), TaHsf15 (A3), TaHsf16 (A3), TaHsf17 (A3), TaHsf28 (A6a) and TaHsf41 (B1) was up-regulated (Additional file 1: Figure S3). According to our RNA sequencing data (Additional file 8) [31], expression levels of TaHsf3 (A1a), TaHsf4 (A2a), TaHsf5 (A2a), TaHsf16 (A3), TaHsf18 (A4a), TaHsf20 (A4a), TaHsf31 (A6b) and TaHsf32 (A6b) were up-regulated under Cd stress, while the expression of TaHsf7 (A2b), TaHsf8 (A2b), TaHsf9 (A2b), TaHsf26 (A5) and TaHsf50 (B4b) was down-regulated (Fig. 8).
Fig. 8

Heat map of the expression profiles of TaHsf genes under Cd treatment. FPKM values were used to create the heat map. The red or green colors indicate the higher or lower relative abundance or each transcript in each sample

Heat map of the expression profiles of TaHsf genes under Cd treatment. FPKM values were used to create the heat map. The red or green colors indicate the higher or lower relative abundance or each transcript in each sample

Verification of the expression of TaHsf in wheat under cd stress by qRT-PCR

According to the expression analysis based on diverse RNA sequencing data above, we obtained an overview of expressed TaHsfs under various agriculturally-relevant stressors. To further verify these results we selected a subset of these TaHsfs to detect their expression levels in root under Cd stress through qRT-PCR. Results showed that compared with H17CK group, levels of TaHsf3 (A1a), TaHsf4 (A2a), TaHsf5 (A2a), TaHsf16 (A3), TaHsf18 (A4a), TaHsf20 (A4a), TaHsf31 (A6b) and TaHsf32 (A6b) were significantly increased, while levels of TaHsf7 (A2b), TaHsf8 (A2b), TaHsf9 (A2b), TaHsf26 (A5) and TaHsf50 (B4b) were significantly decreased (P < 0.05, Fig. 9). The qRT-PCR results were highly consistent with that of RNA sequencing data, confirming that it is reasonable to use RNA sequencing data to evaluate the expression level of transcripts in wheat Cd-response.
Fig. 9

Verification of the expression level of TaHsfs by qRT-PCR analysis. Relative expression levels of 13 TaHsfs under Cd treatment. * represents P < 0.05 vs H17CK

Verification of the expression level of TaHsfs by qRT-PCR analysis. Relative expression levels of 13 TaHsfs under Cd treatment. * represents P < 0.05 vs H17CK

Discussion

A growing body of evidence shows that Hsfs play essential roles in plant developmental and defense processes [16, 32–35]. Due to growing numbers of quality genomes available, putative functions of Hsf family genes have been predicted in many species, from the model plants Arabidopsis [13], rice [5] and maize [36], now to other crops, such as apple [21], Chinese cabbage [37], Chinese white pear [38] and pepper [39]. However, despite the global impact of wheat, as well as the importance of environmental Cd contamination, there has been limited investigation into the molecular basis of Cd accumulation, and the Hsf family in wheat. Here we took advantage of the high quality wheat reference genome, to first identify 78 Hsf wheat genes and to characterize these bioinformatically (Table 1). A first contrast lies on the sheer quantity of these genes in wheat: while we identify 78 in wheat, there are only 21 Hsfs in Arabidopsis, 25 in rice, 30 in maize, 29 in Chinese white pear and 25 in apple [5, 13, 36, 38]. The vast majority of Hsfs can be categorized into three classes: A, B and C. The quantity of class A in Arabidopsis, rice, maize, Chinese white pear and apple are 15, 13, 16, 19 and 16, respectively. Class B Hsfs amount to 5, 8, 9, 8 and 7, in the five plants respectively. Finally, class C is represented by 1, 9, 4, 2 and 2, respectively. In contrast, of 78 putative wheat Hsf genes, 38 belonged to class A, 16 to class B and 24 to class C. Thus class C is relatively expanded in wheat in contrast to these other genomes. We next investigated occurrences of possible gene duplication, which contributes differentially to the extension of specific gene families in plant genomes, and results from polyploidization or tandem and segmental duplication related [30, 40, 41]. In wheat, we found that homologous genes are gathered in group 3, 4 and 5 chromosomes, which was in line with the above analysis of chromosome localization. These results indicated that compared to other homologous chromosome groups, group 3, 4 and 5 chromosomes suffered less sequence loss and interaction impact. Three duplication events with the same chromosome and 14 segmental duplication events between various chromosomes were identified, suggesting that in wheat genome, the duplication events could play important roles in the extension of the Hsf cascade genes. A previous study indicated that more than 90% of the enhancement in regulatory genes in the Arabidopsis lineage were facilitated via genome duplications [42]. Compared with tandem duplications, segmental Hsf gene duplications were more often. This situation appeared in Arabidopsis, maize, poplar [21, 22, 36], and also in wheat. Our phylogenetic analysis indicated that compared with Arabidopsis, maize and rice, brachypodium Hsfs were nearer to wheat Hsf proteins, which was in line with broader classifications. Identification of Hsf genes in wheat and its diploid ancestors, T. urartu and A. tauschii, which suggesting that the number of Hsf in a known subgenome was increased in transition from diploidy to hexaploidy (for A subgenome, 16 to 25 genes, and for D subgenome, 15 to 23 genes). These results further indicate that gene gain happened broadly during the formation of hexaploid [27]. Moreover, protein-protein regulatory interactions were constructed to provide inference of mechanisms of life activities and to explore potential biological functions for unknown proteins. Results showed that TaHsf18 (A4a) interacts with 24 wheat genes, including ZAT6, STZ and S6K2, suggesting it was mainly engaged in metal ion binding, intracellular signal transduction, and the negative regulation of cell proliferation. A previous study indicated that ZAT6 coordinately activates the expression of phytochelatin synthesis-related gene and positively modulate Cd accumulation and tolerance by directly targeting GSH1 in Arabidopsis [43]. HsfA4a was also engaged in cadmium tolerance in wheat [19], suggesting it might be involved in metal ion binding via interacting with ZAT6 to further play a role in cadmium tolerance in wheat. TaHsf50 (B4b) interacts with 88 wheat genes, including MYB15, MYB70, ZFP2, FMA, and HB31, suggesting it is involved in regulation of transcription, regulation of jasmonic acid, metal ion binding and DNA binding. It has been reported that MYB15 is required for the defense-induced synthesis of G-rich lignin and the constitutive synthesis of the coumarin metabolite scopoletin, both of which contribute to disease resistance against a hemibiotrophic bacterial pathogen [44]. TaHsf44 (B2c) was found to interact with 30 wheat genes including AGC2–1, WRKY39, BAG6 and NF-YC2, suggesting it is engaged in defense response, calmodulin binding, response to heat and flower development. AtBAG6 can induce programmed cell death in yeast and plants [45]. Aspartyl protease-mediated cleavage of BAG6 plays an important role in autophagy and fungal resistance in plants [46]. GO analysis showed that 15 TaHsfs interacted genes were significantly enriched for transcription, DNA-templating, response to heat, transcription factor activity, sequence-specific DNA binding and calmodulin binding. It has been reported that Hsf family has a unique role as master modulators of thermotolerance, and were essential for plants survival under serious heat stress [9, 47]. Furthermore, we characterize wheat Hsf genes that expression throughout tissues and development stages. Many of these genes were highly expressed across development. For example, TaHsf2, 3, 20, 17 and 45 were high expressed in roots, stems, leaves, spikes and grains including whole endosperm, starchy endosperm, transfer cells and aleurone layer, as well as seed coats during different developmental stages. It has been reported that Hsfs were involved in plant growth and development [9, 16]. Our results further indicated that Hsf genes play important regulatory roles in wheat growth, development and reproductive processes. In addition, we comprehensively analyzed the expression levels of Hsf genes in response to drought, heat and Cd stresses to predict potential roles. The expression of most Hsf genes were differentially regulated in response to a given stress, which strongly suggests that they may be vital stress response genes. A previous study indicated that Hsfs are involved in responses to the abiotic stress as heat, cold, salt, drought and cadmium [13, 17, 19]. Our results first comprehensively illustrate that Hsf genes likely play important regulatory roles in wheat Cd stress response. Therefore, these genes stand as strong functional candidates for followup research into Cd stress in wheat.

Conclusion

We present the first comprehensive identification and characterization of the wheat Hsf gene family. Through the latest available wheat genome information, total 78 putative wheat Hsf gens were identified through a genome-wide search, and categorized into class A, B and C subfamilies based on conserved motifs. Chromosome localizations, gene structures, conserved protein motifs, and phylogenetic relationship of these TaHsfs were comprehensively analyzed and strongly supported these classifications. Moreover, the gene duplication and homologous genes between wheat A, B and D sub-genome were also surveyed. Expression profiles of these TaHsfs through development and under various abiotic stresses were surveyed and provide strong functional candidates for followup work. Finally, through qRT-PCR analysis, 13 TaHsf genes were selected to verify their expression level in wheat under Cd stress, which provide top candidates for further functional analysis of Hsf genes in response to wheat Cd stress.

Methods

Identification and classification of Hsf gene family in wheat

The Hsf gene family was identified following the method as described by Wang et al. with some modifications [48]. First, to construct a local protein database, all the wheat (T. aestivum L.) protein sequences available were downloaded from the Ensemble database (http://plants.ensembl.org/index.html). Then, the database were searched with 100 known Hsf gene sequences collected from A. thaliana (21), O. sativa (25), B. distachyon (24) and Z. mays (30) using the local BLASTP program with an e-value of le-5 and identity of 50% as the threshold. Moreover, a self-blast of these sequences was performed to remove redundancy, the physical localizations of all candidate Hsf genes were checked and redundant sequences with the same chromosome location were rejected. Furthermore, all obtained Hsf protein sequences were analyzed to detect DBD domains and coiled-coil structures by the SMART and MARCOIL programs (SMART: http://smart.embl-heidelberg.de/, MARCOIL: http://toolkit.tuebingen.mpg.de/marcoil). Those protein sequences lacking the DBD domain or a coiled-coil structure were removed. Finally, to verify the existence of all the obtained sequences, BLASTN similarity searches against the wheat ESTs deposited in the NCBI database were performed. The theorectical pI (isoelectric point) and Mw (molecular weight) of the putative Hsf from T. aestivum L were calculated using compute pI/Mw tool online (http://web.expasy.org/compute_pi/), respectively. Classification of the three different groups A, B and C was based on structural characteristics and phylogenetic comparisons [49, 50].

Gene structure construction, protein domain and motif analysis

Gene structure information were obtained from the Ensemble plants database (http://plants.ensembl.org/index.html). Conserved domains annotation was performed using Pfam (http://pfam.xfam.org/search), SMART (http://smart.embl-heidelberg.de/) and Heatster online tools [39]. All full-length amino acid sequences of the TaHsfs were used to identify conserved domain motifs by the Multiple Em for Motif Elicitation (MEME) tool [51]. The parameters were set as follows: maximum numbers of different motifs, 30; minimum motif width, 4; maximum motif width, 50.

Chromosomal locations and gene duplication

Genes were mapped onto chromosomes by identifying their chromosomal position provided in the wheat genome database. Gene duplication events of Hsf genes in wheat were investigated based on the following three criteria: (a) the alignment covered > 80% of the longer gene; (b) the aligned region had an identity > 80% [52]. In order to visualize the duplicated regions in the T. aestivum genome, lines were drawn between matching genes using Circos-0.67 program (http://circos.ca/).

Phylogenetic analysis

The N-terminal Hsf protein sequences containing the DBD and HR-A/B regions and parts of the linker between these two regions from A. thaliana, O. sativa, B. distachyon, Z. mays and T. aestivum L. were performed for multiple alignments by CLUSTALW and the results of alignment were used to construct phylogenetic tree using the NJ method in MEGA (version 6.0) [53]. Bootstrap test method was adopted and the replicate was set to 1000.

Analysis of the TaHsf family orthologs in T. urartu and A. tauschii

The wheat- T. aestivum, wheat-T. urartu and wheat-A .tauschii Hsf genes were used to construct phylogenetic trees using neighbor-joining method with 1000 bootstrap replicates. According to these orthologous Hsf genes, a collinear map of the T. urartu-wheat A genome and A. tauschii-wheat B genome was created using genome visualization tool CIRCOS according to these orthologous Hsf genes. The locations of Hsf orthologous genes on the chromosomes of T. urartu and A. tauschii were obtained from the database published by Ling et al. [23] and Jia et al. [54], respectively.

Network interaction analysis

The interaction network involving the TaHsf genes was based on the orthologous genes between Wheat and Arabidopsis using the AraNet V2 tool (http://www.inetbio.org/aranet/) [48]. Enrichment analysis was implemented by BiNGO, a cytoscape plugin, for gene ontology analysis and identifying processes and pathways of specific gene sets. Over-represented GO full categories were identified with a significance threshold of 0.01.

The TaHsf gene expression analysis by RNA-seq data

To study the expression of TaHsf genes in different organs and response to stress, the wheat expression database (http://wheat.pw.usda.gov/WheatExp/) was used The FPKM (fragments per kilobase of transcript per million fragments mapped) value was calculated for each Hsf gene, the log2 transformed values of the TaHsf genes were used for heat map generation. P-values < 0.05 were taken as statistically significant thresholds [55].

Plant materials, growth conditions, and treatments

The plant of wheat cultivar Chuanyu17, a high-Cd-accumulating cultivar, was planted in growth chambers at 23 ± 1 °C with a photoperiod of 16 h light/8 h dark. One-week-old seedlings were treated with 0 (H17CK) and 100 μM CdCl2 for 24 h (H17Cd). Roots from the plants with similar size were harvested separately and washed three times with deionized water. All the plant samples from three biological replicates were frozen in liquid nitrogen immediately and stored at − 80 °C for RNA extraction.

RNA extraction and real-time quantitative RT-PCR (qRT-PCR) analysis

Total RNA was extracted from roots of Chuanyu17 in H17CK and H17Cd groups using TRIzol Reagent (Invitrogen, USA) according to the manufacturer’s instructions. RNA was quantified by using NanoDrop-1000 and RNA integrity was checked by electrophoresis. First strand cDNA was synthesized using HiScript IIQ RT SuperMix (Vazyme, R223–1). The primers used in the qRT-PCR analyses are listed in Additional file 9. β-actin was used as an internal control. The qRT-PCR was carried out using QuantiFast® SYBR® Green PCR kit (Qiagen, 204,054) according to the manufacturer’s instructions. Each treatment was repeated three times. The expression levels were calculated from the 2-ΔΔCt value [ΔΔCt = (CT target/Cd - CT actin/Cd - (CT target/control - CT actin/control)] [45]. Figure S1. Motifs identified by MEME tools in Wheat Hsfs. Thirty motifs (1–30) were identified and indicated by different color. Motif location and combined p-value were represented. Motif 9 was found in TaHsf5, 6, 9, 10, 11, 13, 17, 18, 20, 23, 27, 28, 30, 31, 32, 45, 46, 52, 56, 59, 60, 64, 65, 66, 68, 73 and 75 which was covered by other motifs. Figure S2. Heat map of the expression profiles of TaHsf genes in different grain layers and a developmental timecourse. Log2 transformed FPKM values were used to establish the heat map. The red or green colors stand for the higher or lower relative abundance of each transcript in each sample. P-value< 0.05 were regarded as statistically significant. DPA means days post-anthesis. Figure S3. Heat map of the expression profiles of TaHsf genes under drought and heat stress treatments. Log2 transformed FPKM values were used to create the heat map. The red or green colors indicate the higher or lower relative abundance or each transcript in each sample. P-value< 0.05 were regarded as statistically significant. (PDF 580 kb) Motif sequences identified by MEME tools. Motif numbers corresponded to the motifs in Additional file 1: Figure S1. (XLSX 10 kb) The homologous TaHsf genes in wheat A, B and D sub-genomes and the Duplicated genes pairs identified in wheat (XLSX 11 kb) The list of the putative Hsf genes for A.tauschii and T.urartu (XLSX 11 kb) Details of TaHsfs and corresponding orthologs Hsfs in T.urartu and A.tauschii (XLSX 11 kb) The detail of 15 TaHsf orthologous genes in Arabidopis thaliana (XLSX 10 kb) Detail information of Network of TaHsf with other genes (XLSX 40 kb) Expression profiles of TaHsf in wheat under Cd stress (XLSX 13 kb) The Primers for TaHsfs. (XLSX 10 kb)
  54 in total

Review 1.  Molecular chaperones in the cytosol: from nascent chain to folded protein.

Authors:  F Ulrich Hartl; Manajit Hayer-Hartl
Journal:  Science       Date:  2002-03-08       Impact factor: 47.728

Review 2.  Arabidopsis and the heat stress transcription factor world: how many heat stress transcription factors do we need?

Authors:  L Nover; K Bharti; P Döring; S K Mishra; A Ganguli; K D Scharf
Journal:  Cell Stress Chaperones       Date:  2001-07       Impact factor: 3.667

3.  In the complex family of heat stress transcription factors, HsfA1 has a unique role as master regulator of thermotolerance in tomato.

Authors:  Shravan Kumar Mishra; Joanna Tripp; Sybille Winkelhaus; Bettina Tschiersch; Klaus Theres; Lutz Nover; Klaus-Dieter Scharf
Journal:  Genes Dev       Date:  2002-06-15       Impact factor: 11.361

Review 4.  More than folding: localized functions of cytosolic chaperones.

Authors:  Jason C Young; José M Barral; F Ulrich Hartl
Journal:  Trends Biochem Sci       Date:  2003-10       Impact factor: 13.807

5.  Modeling gene and genome duplications in eukaryotes.

Authors:  Steven Maere; Stefanie De Bodt; Jeroen Raes; Tineke Casneuf; Marc Van Montagu; Martin Kuiper; Yves Van de Peer
Journal:  Proc Natl Acad Sci U S A       Date:  2005-03-30       Impact factor: 11.205

6.  AtBAG6, a novel calmodulin-binding protein, induces programmed cell death in yeast and plants.

Authors:  C H Kang; W Y Jung; Y H Kang; J Y Kim; D G Kim; J C Jeong; D W Baek; J B Jin; J Y Lee; M O Kim; W S Chung; T Mengiste; H Koiwa; S S Kwak; J D Bahk; S Y Lee; J S Nam; D J Yun; M J Cho
Journal:  Cell Death Differ       Date:  2006-01       Impact factor: 15.828

7.  The balance of nuclear import and export determines the intracellular distribution and function of tomato heat stress transcription factor HsfA2.

Authors:  D Heerklotz; P Döring; F Bonzelius; S Winkelhaus; L Nover
Journal:  Mol Cell Biol       Date:  2001-03       Impact factor: 4.272

Review 8.  Heat stress response in plants: a complex game with chaperones and more than twenty heat stress transcription factors.

Authors:  Sanjeev Kumar Baniwal; Kapil Bharti; Kwan Yu Chan; Markus Fauth; Arnab Ganguli; Sachin Kotak; Shravan Kumar Mishra; Lutz Nover; Markus Port; Klaus-Dieter Scharf; Joanna Tripp; Christian Weber; Dirk Zielinski; Pascal von Koskull-Döring
Journal:  J Biosci       Date:  2004-12       Impact factor: 1.826

9.  A seed-specific heat-shock transcription factor involved in developmental regulation during embryogenesis in sunflower.

Authors:  Concepción Almoguera; Anabel Rojas; Juan Díaz-Martín; Pilar Prieto-Dapena; Raul Carranco; Juan Jordano
Journal:  J Biol Chem       Date:  2002-09-12       Impact factor: 5.157

10.  Characterization of C-terminal domains of Arabidopsis heat stress transcription factors (Hsfs) and identification of a new signature combination of plant class A Hsfs with AHA and NES motifs essential for activator function and intracellular localization.

Authors:  Sachin Kotak; Markus Port; Arnab Ganguli; Frank Bicker; Pascal von Koskull-Döring
Journal:  Plant J       Date:  2004-07       Impact factor: 6.417

View more
  10 in total

1.  Genome-wide identification and expression analysis of the GSK gene family in wheat (Triticum aestivum L.).

Authors:  Peipei Zhang; Linghui Zhang; Tao Chen; Fanli Jing; Yuan Liu; Jingfu Ma; Tian Tian; Delong Yang
Journal:  Mol Biol Rep       Date:  2022-01-27       Impact factor: 2.316

2.  The heat stress transcription factor family in Aegilops tauschii: genome-wide identification and expression analysis under various abiotic stresses and light conditions.

Authors:  Harsha Samtani; Aishwarye Sharma; Jitendra P Khurana; Paramjit Khurana
Journal:  Mol Genet Genomics       Date:  2022-09-16       Impact factor: 2.980

3.  Evolution and co-evolution: insights into the divergence of plant heat shock factor genes.

Authors:  Ramya Parakkunnel; K Bhojaraja Naik; C Susmita; Vanishree Girimalla; K Udaya Bhaskar; K V Sripathy; C S Shantharaja; S Aravindan; Sanjay Kumar; Suman Lakhanpaul; K V Bhat
Journal:  Physiol Mol Biol Plants       Date:  2022-05-19

4.  Genome-wide characterization of tea plant (Camellia sinensis) Hsf transcription factor family and role of CsHsfA2 in heat tolerance.

Authors:  Xuyang Zhang; Wenluan Xu; Dejiang Ni; Mingle Wang; Guiyi Guo
Journal:  BMC Plant Biol       Date:  2020-05-29       Impact factor: 4.215

5.  Genome-Wide Identification and Low Temperature Responsive Pattern of Actin Depolymerizing Factor (ADF) Gene Family in Wheat (Triticum aestivum L.).

Authors:  Ke Xu; Yong Zhao; Sihang Zhao; Haodong Liu; Weiwei Wang; Shuhua Zhang; Xueju Yang
Journal:  Front Plant Sci       Date:  2021-02-24       Impact factor: 5.753

6.  Heat shock transcription factor (Hsf) gene family in common bean (Phaseolus vulgaris): genome-wide identification, phylogeny, evolutionary expansion and expression analyses at the sprout stage under abiotic stress.

Authors:  Qi Zhang; Jing Geng; Yanli Du; Qiang Zhao; Wenjing Zhang; Qingxi Fang; Zhengong Yin; Jianghui Li; Xiankai Yuan; Yaru Fan; Xin Cheng; Jidao Du
Journal:  BMC Plant Biol       Date:  2022-01-14       Impact factor: 4.215

Review 7.  Recent Advances in Minimizing Cadmium Accumulation in Wheat.

Authors:  Min Zhou; Zhengguo Li
Journal:  Toxics       Date:  2022-04-12

8.  Characterization of the Wheat Heat Shock Factor TaHsfA2e-5D Conferring Heat and Drought Tolerance in Arabidopsis.

Authors:  Huihui Bi; Jingnan Miao; Jinqiu He; Qifan Chen; Jiajun Qian; Huanhuan Li; Yan Xu; Dan Ma; Yue Zhao; Xuejun Tian; Wenxuan Liu
Journal:  Int J Mol Sci       Date:  2022-03-03       Impact factor: 5.923

9.  Large-scale analyses of heat shock transcription factors and database construction based on whole-genome genes in horticultural and representative plants.

Authors:  Tong Yu; Yun Bai; Zhuo Liu; Zhiyuan Wang; Qihang Yang; Tong Wu; Shuyan Feng; Yu Zhang; Shaoqin Shen; Qiang Li; Liqiang Gu; Xiaoming Song
Journal:  Hortic Res       Date:  2022-02-19       Impact factor: 7.291

10.  Genome-wide identification, transcriptome analysis and alternative splicing events of Hsf family genes in maize.

Authors:  Huaning Zhang; Guoliang Li; Cai Fu; Shuonan Duan; Dong Hu; Xiulin Guo
Journal:  Sci Rep       Date:  2020-05-15       Impact factor: 4.379

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.