| Literature DB >> 31083365 |
Bobin Liu1, Jianjun Hu2, Jin Zhang3,4.
Abstract
Heat shock transcription factors (Hsfs), which function as the activator of heat shock proteins (Hsps), play multiple roles in response to environmental stress and the development of plants. The Hsf family had experienced gene expansion via whole-genome duplication from a single cell algae to higher plants. However, how the Hsf gene family went through evolutionary divergence after genome duplication is unknown. As a model wood species, Populus trichocarpa is widely distributed in North America with various ecological and climatic environments. In this study, we used P. trichocarpa as materials and identified the expression divergence of the PtHsf gene family in developmental processes, such as dormant bud formation and opening, catkins development, and in response to environments. Through the co-expression network, we further discovered the divergent co-expressed genes that related to the functional divergence of PtHsfs. Then, we studied the alternative splicing events, single nucleotide polymorphism distribution and tertiary structures of members of the PtHsf gene family. In addition to expression divergence, we uncovered the evolutionary divergence in the protein level which may be important to new function formations and for survival in changing environments. This study comprehensively analyzed the evolutionary divergence of a member of the PtHsf gene family after genome duplication, paving the way for further gene function analysis and genetic engineering.Entities:
Keywords: Populus; alternative splicing; co-expression network; gene duplication; gene expression; heat shock transcription factors; protein structure; single nucleotide polymorphism
Mesh:
Substances:
Year: 2019 PMID: 31083365 PMCID: PMC6563006 DOI: 10.3390/cells8050438
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Figure 1The Hsf gene family underwent gene expansion in evolution history. (A) The gene number of Hsf genes in different plants. Note that the Hsf gene number was multiplied from algae to land plants. Ostreococcus lucimarinus (Osluc), Volvox carteri (Vocar), Marchantia polymorpha (Mapol), Physcomitrella patens (Phpat), Selaginella moellendorffii (Semoe), Amborella trichopoda (Amtri), Arabidopsis thaliana (Artha), Populus trichocarpa (Potri), Oryza sativa (Orsat), Zea mays (Zemay), Gnetum montanum (Gnmon), Taxus baccata (Tabac), Ginkgo biloba (Gibil) and Picea abies (Piabi). (B) Evolutionary relationship of the Hsfs from Osluc, Vocar, Mapol, Phpat, Semoe, Amtri, Artha and Potri. Note that every subfamily Hsf genes in Arabidopsis and poplar were expanded from Amborella trichopoda, the oldest angiosperm. (C) Syntenic relationships of duplicated genes in the PtHsf family.
Figure 2The expression patterns of PtHsf and the pairwise correlation co-efficiency. (A) Expression patterns of PtHsfs across various tissues. (1) Pre-dormant bud I, (2) Pre-dormant bud II, (3) Early dormant bud, (4) Late dormant bud, (5) Fully open bud (6) GW9592.ZK 10 male early, (7) GW9840.ZE 30 male early, (8) GW9911.ZK 51 male mid (9) BESC423.ZL 7 female early, (10) BESC842.ZI 22 female late, (11) BESC443.ZG 43 female receptive (12) Leaf immature standard, (13) Leaf young standard, (14) Leaf first fully expanded standard, (15) Root standard, (16) Root tip standard, (17) Root ammonia, (18) Root nitrate, (19) Root urea, (20) Stem node standard, (21) Stem inode standard, (22) Stem ammonia, (23) Stem nitrate, (24) Stem urea. (B) Pairwise comparison of Pearson correlation co-efficiency (PCC, r).
Figure 3The promoter similarity between 10 gene pairs in the PtHsf family. (A) Conserved blocks located in the promoter region of PtHsf gene pairs. (B) Conserved block number and total conserved length between the PtHsf pairs’ promoter. (C) The correlation coefficiency between the total conserved length and the PCC of the expression (Figure 2) of PtHsf pairs. (D) The correlation coefficiency between the paralogous pair duplication date and the PCC of expression.
Figure 4The co-expression network of PtHsfs. (A) Co-expression network of PtHsfs. Blue nodes indicate PtHsfs and red nodes indicate other transcription factors (TFs). (B) Gene ontology (GO) enrichment analysis co-expression sub-networks of ten PtHsfs genes. Yellow to red represents -log10 transformed false discovery rate and node size indicates the percentage of GO enriched genes. GO enrichment was shown in Table S1.
The co-expressed genes and enrichment analysis.
| Gene Name | Co-Expressed Gene # | Co-Expressed TF # | Enriched GO Term | Enriched Protein Domain | Enriched Pathway |
|---|---|---|---|---|---|
|
| 268 | 8 | 9 | 9 | 1 |
|
| 281 | 2 | 17 | 20 | 1 |
|
| 458 | 21 | 57 | 2 | 1 |
|
| 439 | 6 | 16 | 41 | 2 |
|
| 84 | 3 | n.a. | 2 | n.a. |
|
| 275 | 24 | n.a. | 5 | n.a. |
|
| 17 | 4 | n.a. | n.a. | n.a. |
|
| 0 | 0 | n.a. | n.a. | n.a. |
|
| 137 | 4 | n.a. | n.a. | n.a. |
|
| 626 | 21 | 56 | 28 | 1 |
|
| 6 | 0 | n.a. | n.a. | n.a. |
|
| 0 | 0 | n.a. | n.a. | n.a. |
|
| 91 | 4 | n.a. | 13 | 1 |
|
| 54 | 3 | n.a. | n.a. | n.a. |
|
| 8 | 1 | n.a. | n.a. | n.a. |
|
| 17 | 2 | n.a. | n.a. | n.a. |
|
| 15 | 2 | n.a. | n.a. | n.a. |
|
| 269 | 11 | 19 | 1 | 1 |
|
| 18 | 0 | n.a. | 6 | 1 |
|
| 0 | 0 | n.a. | n.a. | n.a. |
|
| 39 | 3 | n.a. | n.a. | n.a. |
|
| 27 | 1 | n.a. | n.a. | n.a. |
|
| 103 | 5 | n.a. | 2 | n.a. |
|
| 315 | 10 | 53 | n.a. | 1 |
|
| 176 | 6 | 7 | 11 | 3 |
|
| 391 | 29 | 59 | 5 | n.a. |
|
| 200 | 8 | 6 | 11 | 3 |
|
| 0 | 0 | n.a. | n.a. | n.a. |
|
| 154 | 12 | n.a. | 2 | n.a. |
|
| 2 | 1 | n.a. | n.a. | n.a. |
n.a.: no significant term available.
The gene list and corresponding transcripts of PtHsfs.
| Class | Gene Name | Gene ID | Transcript ID | AS type a | Resultant |
|---|---|---|---|---|---|
| Class A |
| Potri.003G095000 | Potri.003G095000.1 | (primary transcript) | |
| Potri.003G095000.2 | A5SS | 3′ UTR change | |||
|
| Potri.013G079800 | Potri.013G079800.1 | (primary transcript) | ||
|
| Potri.001G138900 | Potri.001G138900.3 | (primary transcript) | ||
| Potri.001G138900.2 | A3SS | loss partial of DBD | |||
|
| Potri.006G226800 | Potri.006G226800.4 | (primary transcript) | ||
| Potri.006G226800.2 | A5SS, SE | loss partial of C-terminal and 3′ UTR change | |||
| Potri.006G226800.3 | A5SS | loss partial of C-terminal and 3′ UTR change | |||
|
| Potri.006G115700 | Potri.006G115700.2 | (primary transcript) | ||
|
| Potri.011G071700 | Potri.011G071700.1 | (primary transcript) | ||
|
| Potri.014G141400 | Potri.014G141400.1 | (primary transcript) | ||
| Potri.014G141400.2 | RI | loss DBD | |||
|
| Potri.004G062300 | Potri.004G062300.1 | (primary transcript) | ||
| Potri.004G062300.2 | A3SS | loss partial DBD | |||
|
| Potri.017G059600 | Potri.017G059600.1 | (primary transcript) | ||
|
| Potri.001G320900 | Potri.001G320900.1 | (primary transcript) | ||
|
| Potri.010G082000 | Potri.010G082000.2 | (primary transcript) | ||
| Potri.010G082000.1 | SE | 3′ UTR change | |||
|
| Potri.008G157600 | Potri.008G157600.1 | (primary transcript) | ||
|
| Potri.005G214800 | Potri.005G214800.2 | (primary transcript) | ||
| Potri.005G214800.4 | SE, SE | 3′ UTR change | |||
| Potri.005G214800.1 | SE | 3′ UTR change | |||
| Potri.005G214800.3 | SE, SE, SE | 3′ UTR change | |||
| Potri.005G214800.5 | SE, SE | 3′ UTR change | |||
| Potri.005G214800.6 | RI | N-terminal | |||
|
| Potri.002G048200 | Potri.002G048200.1 | (primary transcript) | ||
| Potri.002G048200.2 | RI | 3′ UTR change | |||
| Potri.002G048200.3 | SE | 3′ UTR change | |||
| Potri.002G048200.4 | RI, SE | 3′ UTR change | |||
|
| Potri.008G136800 | Potri.008G136800.2 | (primary transcript) | ||
| Potri.008G136800.6 | A5SS | 3′ UTR change | |||
|
| Potri.010G104300 | Potri.010G104300.2 | (primary transcript) | ||
| Potri.010G104300.1 | SE | 3′ UTR change | |||
|
| Potri.006G148200 | Potri.006G148200.2 | (primary transcript) | ||
| Class B |
| Potri.007G043800 | Potri.007G043800.1 | (primary transcript) | |
|
| Potri.012G138900 | Potri.012G138900.1 | (primary transcript) | ||
| Potri.012G138900.2 | SE, RI | 3′ UTR change | |||
| Potri.012G138900.3 | RI | 3′ UTR change | |||
|
| Potri.001G108100 | Potri.001G108100.1 | (primary transcript) | ||
| Potri.001G108100.3 | A3SS | 5′ UTR change | |||
| Potri.001G108100.2 | A5SS | 5′ UTR change | |||
|
| Potri.015G141100 | Potri.015G141100.1 | (primary transcript) | ||
| Potri.015G141100.2 | SE | loss partial of internal sequence | |||
|
| Potri.006G049200 | Potri.006G049200.1 | (primary transcript) | ||
|
| Potri.016G056500 | Potri.016G056500.1 | (primary transcript) | ||
|
| Potri.002G124800 | Potri.002G124800.1 | (primary transcript) | ||
|
| Potri.009G068000 | Potri.009G068000.1 | (primary transcript) | ||
|
| Potri.014G027100 | Potri.014G027100.3 | (primary transcript) | ||
| Potri.014G027100.1 | RI | loss partial of C-terminal | |||
|
| Potri.001G273700 | Potri.001G273700.1 | (primary transcript) | ||
|
| Potri.004G042600 | Potri.004G042600.1 | (primary transcript) | ||
|
| Potri.011G051600 | Potri.011G051600.1 | (primary transcript) | ||
| Class C |
| Potri.T137400 | Potri.T137400.1 | (primary transcript) |
a: Alternative splicing types include skipped exon (SE), alternative 5′ splice sites (A5SS), alternative 3′ splice sites (A3SS), mutually exclusive exons (MXE), and retained intron (RI).
Figure 5The identified single nucleotide polymorphisms (SNPs) in PtHsf genes from 549 P. trichocarpa individuals based on whole genome re-sequencing. (A) The SNPs were classified based on their locations: (a) intron, (b) UTR 5 prime, (c) UTR 3 deleted, (d) UTR 3 prime, (e) start gained, (f) start lost, (g) stop gained, (h) stop lost, (i) frame shift, (j) codon insertion, (k) codon deletion, (l) codon change plus codon deletion, (m) codon change plus codon insertion, (n) splice site donor, (o) splice site acceptor, (p) synonymous stop, (q) synonymous coding, (r) non-synonymous coding. (B) Examples of the SNP frequency and location of paralogous pairs PtHsf-A1a/A1c and PtHsf-B2a/B2c. Details of SNPs were listed in Table S3.
The non-synonymous coding single nucleotide polymorphisms (SNPs) located in the domain of PtHsfs.
| Gene Name | DBD | HR-A/B | AHA | Others (NLS, NES, or RD) |
|---|---|---|---|---|
|
| Chr03:12188111 G->T (W105C) | Chr03:12188394 A->G (M200V) | ||
| Chr03:12188143 A->T (Q116L) | ||||
|
| Chr13:6946101 T->C (K107E) | Chr13:6946028 G->A (S131F) | Chr13:6945777 T->G (K215Q) <NLS> | |
| Chr13:6945966 C->A (V152F) | ||||
| Chr13:6945909 G->C (R171G) | ||||
|
| Chr01:11154369 C->T (R211Q) | Chr01:11154258 T->C (E248G) | Chr01:11153373 G->A (A543V) | |
| Chr01:11154213 C->T (R263K) | ||||
| Chr01:11154182 G->T (S273R) | ||||
|
| Chr06:23831182 C->A (S94I) | Chr06:23830387 C->G (M171I) | ||
| Chr06:23830564 C->A (W112C) | Chr06:23830281 T->C (T207A) | |||
|
| Chr06:9090513 G->T (P15H) | Chr06:9088646 C->T (V367I) | ||
| Chr06:9090444 A->G (I38T) | Chr06:9088592 G->A (P385S) | |||
| Chr06:9089476 G->A (S90F) | ||||
| Chr06:9089434 C->A (R104M) | ||||
|
| Chr11:6830049 T->A (N37Y) | Chr11:6829640 C->T (D132N) | Chr11:6828857 G->A (L393F) <NES> | |
| Chr11:6830025 G->T (P45T) | Chr11:6829620 T->A (K138N) | |||
| Chr11:6829607 C->A (A143S) | ||||
| Chr11:6829480 C->G (R185P) | ||||
| Chr11:6829475 A->T (L187M) | ||||
|
| Chr14:10767449 A->C (S6R) | Chr14:10766450 T->C (K131R) | Chr14:10765533 T->C (I437V) <NES> | |
| Chr14:10766596 C->A (Q82H) | Chr14:10766449 C->A (K131N) | |||
| Chr14:10766553 T->G (I97L) | Chr14:10766399 T->C (Q148R) | |||
| Chr14:10766333 C->G (S170T) | ||||
| Chr14:10766295 C->A (G183C) | ||||
| Chr14:10766294 C->G (G183A) | ||||
|
| Chr04:5146215 G->A (P25L) | Chr04:5145571 T->A (M187L) | Chr04:5145100 C->G (V344L) | |
| Chr04:5146204 G->A (P29S) | ||||
| Chr04:5146203 G->T (P29Q) | ||||
| Chr04:5146185 T->G (Q35P) | ||||
| Chr04:5146182 C->G (S36T) | ||||
|
| Chr17:5478669 C->T (S43N) | Chr17:5477318 G->C (H138D) | Chr17:5476292 T->C (M480V) <NES> | |
| Chr17:5477446 T->C (K95R) | Chr17:5477236 T->A (Q165L) | |||
| Chr17:5477207 C->T (E175K) | ||||
|
| Chr01:32571330 G->T (F23L) | Chr01:32569699 T->C (K144R) | ||
| Chr01:32571193 T->C (N69S) | Chr01:32569688 T->C (K148E) | |||
| Chr01:32569818 G->T (H104Q) | Chr01:32569582 A->C (L183R) | |||
|
| Chr10:10826222 C->A (S64I) | Chr10:10825317 G->C (L156V) | Chr10:10825077 T->A (I236F) <NLS> | |
| Chr10:10826111 G->T (T101K) | Chr10:10825283 T->C (K167R) | Chr10:10824747 A->C (L346V) <NES> | ||
| Chr10:10825416 T->C (K123E) | Chr10:10825281 G->T (Q168K) | Chr10:10824724 C->G (L353F) <NES> | ||
| Chr10:10825251 T->C (R178G) | ||||
| Chr10:10825200 C->T (V195I) | ||||
| Chr10:10825148 G->A (A212V) | ||||
|
| Chr08:10686052 C->T (P18S) | Chr08:10686988 T->A (L129Q) | Chr08:10687618 A->C (Y339S) <NES> | |
| Chr08:10687036 T->C (V145A) | ||||
| Chr08:10687083 A->C (I161L) | ||||
| Chr08:10687099 G->A (R166Q) | ||||
| Chr08:10687140 A->G (S180G) | ||||
|
| Chr05:22774061 G->T (A74S) | Chr05:22774833 G->T (R156L) | Chr05:22775403 C->T (T346I) <NES> | |
| Chr05:22774724 G->A (G120R) | Chr05:22774866 G->A (R167K) | |||
| Chr05:22774951 C->G (D195E) | ||||
| Chr05:22774954 A->C (Q196H) | ||||
|
| Chr02:3141391 T->C (D56G) | Chr02:3140451 G->C (H171Q) | Chr02:3140246 T->A (T240S) <NLS> | |
| Chr02:3141341 C->T (V73I) | Chr02:3140405 C->T (A187T) | |||
| Chr02:3141326 A->G (Y78H) | Chr02:3140318 T->A (M216L) | |||
| Chr02:3141309 A->T (N83K) | ||||
| Chr02:3140561 T->C (R135G) | ||||
|
| Chr08:9112715 G->A (E251K) <NLS> | |||
|
| Chr10:12489151 T->G (M19L) | |||
| Chr10:12489048 T->C (K53R) | ||||
| Chr10:12487083 T->A (D78V) | ||||
| Chr10:12487082 A->T (D78E) | ||||
| Chr10:12487060 C->T (G86R) | ||||
| Chr10:12486987 C->T (R110Q) | ||||
|
| Chr06:12750969 T->C (K103E) | Chr06:12750442 C->G (V206L) | Chr06:12750381 C->T (S226N) <NLS> | |
| Chr06:12750927 A->G (S117P) | Chr06:12750409 T->G (K217Q) | |||
| Chr06:12750921 T->C (N119D) | ||||
| Chr06:12750920 T->A (N119I) | ||||
| Chr06:12750618 G->A (P147L) | ||||
|
| Chr07:3779417 T->A (E174V) | |||
| Chr07:3779389 C->A (L183F) | ||||
|
| Chr12:15349619 G->A (D44N) | Chr12:15350210 G->T (E166D) | Chr12:15350412 G->A (G234R) <NLS> | |
| Chr12:15349721 T->C (F78L) | Chr12:15350293 C->G (S194W) | Chr12:15350397 G->T (V229F) <RD> | ||
| Chr12:15350046 T->A (L112M) | ||||
| Chr12:15350051 A->T (L113F) | ||||
| Chr12:15350067 A->G (R119G) | ||||
|
| Chr01:8582316 C->G (D55E) | Chr01:8582880 C->T (S209L) | Chr01:8583138 T->C (V295A) <NLS> | |
| Chr01:8582335 G->A (D62N) | ||||
| Chr01:8582651 A->C (T133P) | ||||
|
| Chr15:14827877 A->T (E56D) | Chr15:14828338 A->G (E175G) | Chr15:14828531 G->T (E239D) <RD> | |
| Chr15:14827921 G->T (R71I) | ||||
| Chr15:14827935 A->C (K76Q) | ||||
|
| Chr06:3535947 A->G (K152R) | |||
| Chr06:3535970 G->T (V160F) | ||||
| Chr06:3535988 A->T (T166S) | ||||
|
| Chr16:3748780 T->A (I36L) | Chr16:3747556 G->T (T162N) | ||
| Chr16:3748759 T->C (T43A) | Chr16:3747537 G->C (N168K) | |||
| Chr16:3748734 G->T (A51E) | Chr16:3747526 T->C (K172R) | |||
| Chr16:3747757 C->A (R95L) | ||||
|
| Chr02:9388712 C->G (T49S) | Chr02:9389317 T->C (S209P) | Chr02:9389755 G->C (G355R) <NES> | |
| Chr02:9388795 G->T (V77F) | Chr02:9389350 G->A (D220N) | Chr02:9389656 T->C (S322P) <RD> | ||
|
| Chr09:6774240 C->A (L80I) | Chr09:6774986 A->C (K167Q) | Chr09:6775257 G->A (G257E) <RD> | |
| Chr09:6775259 G->T (V258F) <RD> | ||||
|
| Chr14:2300725 A->C (M211L) | Chr14:2301145 A->T (M351L) <NES> | ||
| Chr14:2301156 C->A (D354E) <NES> | ||||
|
| Chr01:28036766 T->G (N81K) | Chr01:28037553 A->C (N169H) | ||
| Chr01:28037311 T->C (V88A) | ||||
| Chr01:28037358 G->A (A104T) | ||||
|
| Chr04:3233937 G->T (P53H) | Chr04:3232326 G->C (T195S) | ||
| Chr04:3232591 A->T (S107T) | ||||
| Chr04:3232540 A->T (L124M) | ||||
|
| Chr11:4423922 C->A (A45S) | Chr11:4422794 G->A (T162M) | ||
| Chr11:4423915 T->C (D47G) | Chr11:4422692 G->T (T196N) | |||
| Chr11:4423904 C->A (D51Y) | Chr11:4422679 T->C (I200M) | |||
| Chr11:4423865 C->T (E64K) | ||||
| Chr11:4423834 G->T (A74D) | ||||
| Chr11:4423825 G->A (S77L) | ||||
| Chr11:4422971 T->C (K103R) | ||||
| Chr11:4422954 G->T (Q109K) | ||||
| Chr11:4422936 C->T (E115K) | ||||
| Chr11:4422931 C->A (K116N) | ||||
|
| scaffold_294:25641 C->G (G149A) | |||
Figure 6The conserved protein motifs and 3D structures of PtHsf proteins. (A) The diagram represents the motif domain distribution in PtHsfs. (B) Structural analysis of PtHsf proteins. Protein 3D structure of PtHsfs from each subfamily and each subclade were predicted.