| Literature DB >> 28523015 |
Abstract
Transcription factors (TFs) are essential regulators of gene expression in a cell; the entire repertoire of TFs (TFome) of a species reflects its regulatory potential and the evolutionary history of the regulatory mechanisms. In this work, I give an overview of fungal TFs, analyze TFome dynamics, and discuss TF families and types of particular interest. Whole-genome annotation of TFs in more than 200 fungal species revealed ~80 families of TFs that are typically found in fungi. Almost half of the considered genomes belonged to basidiomycetes and zygomycetes, which have been underrepresented in earlier annotations due to dearth of sequenced genomes. The TFomes were analyzed in terms of expansion strategies genome- and lineage-wise. Generally, TFomes are known to correlate with genome size; but what happens to particular families when a TFome is expanding? By dissecting TFomes into single families and estimating the impact of each of them, I show that in fungi the TFome increment is largely limited to three families (C6 Zn clusters, C2H2-like Zn fingers, and homeodomain-like). To see whether this is a fungal peculiarity or a ubiquitous eukaryotic feature, I also analyzed metazoan TFomes, where I observed a similar trend (limited number of TFome-shaping families) but also some important differences connected mostly with the increased complexity in animals. The expansion strategies of TF families are lineage-specific; I demonstrate how the patterns of the TF families' distributions, designated as "TF signatures," can be used as a taxonomic feature, e.g., for allocation of uncertain phyla. In addition, both fungal and metazoan genomes contain an intriguing type of TFs. While usually TFs have a single DNA-binding domain, these TFs possess two (or more) different DNA-binding specificities. I demonstrate that dual-specific TFs comprising various combinations of all major TF families are a typical feature of fungal and animal genomes and have an interesting evolutionary history involving gene duplications and domain losses.Entities:
Keywords: DNA-binding domains; TFome; Zn2 Cys6 Zn cluster TF; fungal genomes; gene family expansions; transcription factors
Year: 2017 PMID: 28523015 PMCID: PMC5415576 DOI: 10.3389/fgene.2017.00053
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
TF-type DNA-binding domains typically found in fungal species.
| IPR000005 | Helix-turn-helix, AraC type |
| IPR000007 | Tubby, C-terminal |
| IPR000197 | Zinc finger, TAZ-type |
| IPR000232 | Heat shock factor (HSF)-type, DNA-binding |
| IPR000327 | POU-specific |
| IPR000418 | Ets |
| IPR000551 | Bacterial regulatory protein, MerR |
| IPR000571 | Zinc finger CCCH-type |
| IPR000679 | Zinc finger, GATA-type |
| IPR000792 | Bacterial regulatory protein, LuxR |
| IPR000814 | TATA-box binding |
| IPR000818 | TEA/ATTS |
| IPR000835 | Bacterial regulatory protein, MarR |
| IPR000843 | Bacterial regulatory protein, LacI |
| IPR000944 | Transcriptional regulator, Rrf2 |
| IPR000967 | Zinc finger, NF-X1-type |
| IPR001034 | Bacterial regulatory protein, DeoR N-terminal |
| IPR001083 | Copper fist DNA-binding |
| IPR001138 | Zn2 Cys6 Zn_cluster |
| IPR001275 | DM DNA-binding |
| IPR001289 | CCAAT-binding TF, subunit B |
| IPR001356 | Homeobox |
| IPR001387 | Helix-turn-helix type 3 |
| IPR001471 | Pathogenesis-related TF and ERF, DBD |
| IPR001523 | Paired box protein, N-terminal |
| IPR001699 | Transcription factor, T-box |
| IPR001766 | Fork head transcription factor |
| IPR001808 | Bacterial regulatory protein, Crp |
| IPR001845 | Bacterial regulatory protein, ArsR |
| IPR001878 | Zinc finger, CCHC-type |
| IPR002059 | Cold-shock protein, DNA-binding |
| IPR002100 | Transcription factor, MADS-box |
| IPR002197 | Helix-turn-helix, Fis-type |
| IPR002653 | Zinc finger, A20-type |
| IPR003150 | DNA-binding RFX |
| IPR003163 | APSES-type DNA-binding domain |
| IPR003316 | E2F/dimerisation partner (TDP) |
| IPR003656 | Zinc finger, BED-type predicted |
| IPR003657 | DNA-binding WRKY |
| IPR003902 | Transcriptional regulator, GCM-like |
| IPR003958 | TF CBF/NF-Y/archaeal histone |
| IPR004022 | DDT |
| IPR004181 | Zinc finger, MIZ-type |
| IPR004198 | Zinc finger, C5HC2-type |
| IPR004333 | Transcription factor, SBP-box |
| IPR004645 | DNA-binding protein Tfx |
| IPR004823 | TATA box binding protein associated factor (TAF) |
| IPR004826 | Maf transcription factor |
| IPR004827 | Basic-leucine zipper (bZIP) TF |
| IPR005011 | SART-1 protein |
| IPR006780 | YABBY protein |
| IPR006856 | Mating-type protein MAT alpha 1 |
| IPR007087 | Zinc finger, C2H2-type |
| IPR007196 | CCR4-Not complex component, Not1 |
| IPR007396 | Negative transcriptional regulator |
| IPR007604 | CP2 transcription factor |
| IPR007889 | Helix-turn-helix, Psq |
| IPR008895 | YL1 nuclear |
| IPR008917 | Eukaryotic transcription factor, Skn-1-like |
| IPR008967 | p53-like transcription factor, DNA-binding |
| IPR009044 | ssDNA-binding transcriptional regulator |
| IPR009057 | Homeodomain-like |
| IPR009061 | Putative DNA binding |
| IPR009395 | GCN5-like 1 |
| IPR010666 | Zinc finger, GRF-type |
| IPR010770 | SGT1 |
| IPR010919 | SAND-like |
| IPR010921 | Trp repressor/replication initiator |
| IPR010982 | Lambda repressor-like, DNA-binding |
| IPR010985 | Ribbon-helix-helix |
| IPR011598 | Helix-loop-helix DNA-binding |
| IPR012294 | Transcription factor TFIID, C-terminal |
| IPR013921 | TATA-binding related factor |
| IPR013932 | TATA-binding protein interacting (TIP20) |
| IPR015988 | STAT transcription factor, coiled coil |
| IPR016032 | Signal transduction response regulator, C-term. effector |
| IPR016177 | DNA-binding, integrase-type |
| IPR024061 | NDT80 DNA-binding domain |
| IPR025659 | Tubby C-terminal-like domain |
Fungal-specific families.
Fungal TF gene families.
| APSES |
| Bacterial regulatory protein, ArsR |
| Bacterial regulatory protein, Crp |
| Bacterial regulatory protein, DeoR N-terminal |
| Bacterial regulatory protein, LacI |
| Bacterial regulatory protein, LuxR |
| Bacterial regulatory protein, MarR |
| Bacterial regulatory protein, MerR |
| BESS |
| bZIP |
| bZIP + Helix-loop-helix DNA-binding |
| bZIP + Homeodomain-like |
| bZIP + C2H2 |
| C2H2/CCHC/CCCH ZF + Homeodomain |
| C2H2/CCHC/CCCH/C5HC2 |
| CBF/NF-Y/archaeal histone |
| CCAAT-BindingTF |
| CCR4-Not complex component, Not1 |
| Cold-shock DBD |
| Copper fist |
| Copper fist+Zn finger |
| DM DNA-binding |
| DNA-binding protein Tfx |
| DNA-binding WRKY |
| DNA-binding, integrase-type |
| E2F_TDP |
| Ets |
| Fork head TF |
| GATA |
| GATA + Homeo |
| GATA + Zn cluster |
| GCM-like |
| GCN5L1 |
| Heat shock factor (HSF)-type |
| HLH, helix-loop-helix |
| HTH/Homeodomain-like |
| lambda repressor(-like)/POU |
| Homeodomain+lambda repressor-like |
| LexA |
| MADS-box/SRF |
| Maf TF |
| Mating-type protein MATα1 |
| No apical meristem (NAM) protein |
| p53 |
| Putative DNA binding |
| Rel homology |
| RFX_DNA_binding |
| Ribbon-helix-helix |
| Rrf2 |
| SAND-like |
| SART-1 |
| SBP-box |
| SGT1 |
| Signal transduction response regulator, C-term. |
| Skn-1 |
| ssDNA-binding transcriptional regulator |
| STAT |
| TATA box binding protein associated factor (TAF) |
| TATA-binding protein interacting (TIP20) |
| TATA-binding related factor |
| T-box |
| TEA/ATTS |
| TFIID |
| Trp repressor/replication initiator |
| Tubby TF |
| Viral DNA-binding protein |
| YL1 |
| zf-A20 |
| zf-BED |
| zf-GRF |
| zf-MIZ |
| zf-TAZ |
| Zn cluster |
| Zn cluster + bZIP |
| Zn_cluster+C2H2+Homeodomain |
| Zn_cluster+Homeodomain |
| Zn cluster + C2H2/CCHC/CCCH-type Zn fingers |
| Zn-finger, NF-X1 type |
Fungal-specific TFgFs (the specificity is assumed based on the involved fungal-specific domains).
Found exclusively in ascomycetes.
Only in Zygomycetes.
Figure 1(A) Distribution of TF gene families in fungal genomes. (B) Two “tails” belong to asco- and basidiomycetes. Each dot represents a species.
Figure 2Contribution of responsive families to TFome expansion in fungal genomes. (A) The sum of five responsive families (dark blue circles) follows the profile of whole TFomes. The effect can be explained by the input of just three families (Zn cluster, C2H2-like, and HD-like; orange circles). (B) The contribution of Zn clusters can be illustrated by subtraction of this family from the whole TFome. Without the Zn cluster family, TFomes are confined to the lower “tail.”
Figure 3TFome dynamics in Metazoa: different TFome growth speed in chordates and ecdysozoans. Chordates: logTF = 1,17logP; Ecdysozoans: logTF = 0.5logP, where P is the number of protein-coding genes and TF is the number of TFs. Each dot represents a species.
Figure 4Example of a single-species expansion. Glucocorticoid receptor-like TF gene family in animals: only in one species (C. elegans) the family expands to 270 TFs, whereas in other ecdysozoan genomes it does not exceed 35. Each dot represents the number of GR-like TFs in one genome.
Figure 5The role of abundant, small, responsive, and non-responsive families in the TFome dynamics in Metazoa. (A) Ecdysozoans (Arthropoda and Nematoda). Gray arrows point at the deviating non-responsive families (green triangles), which show unexpected increase. (B) Ecdysozoans after the correction for single-species expansions. (C) Chrodates.
Figure 6Predictiveness of TF signatures. (A) TF signatures based on 6 TFgFs are built for the main fungal phyla and Ascomycota subphyla (Pezizomycotina, of filamentous Ascomycota, and Saccharomycotina). The signatures built for the training sets (Ascomycota without respective class) are shown in the colored field. The test sets (Separate classes) are placed next to the corresponding training sets. (B) Statistical significance of the TFgF ratios used for construction of the signatures. Four classes belonging to Pezizomycotina are confronted with three main fungal phyla (Zoopagomycota is not shown because of the small size of the set). Asterisks mark statistically significant differences between the phyla and ascomycete classes.
Figure 7TF signatures in the phylogenetic background. 9 TFgFs used for the signature are listed in the color legend.
Dual-specific TFs in fungal and metazoan genomes.
| Zn cluster+C2H2-like ZF | 140 (73) | 23 | – | Ubiquitous in fungi | |
| C2H2-like ZF+HD | 115 (60) | 27 | 53 (96) | 7 | Ubiquitous |
| Zn_cluster+C2H2-like+HD | 64 (34) | 3 | – | Ascomycete-specific | |
| GATA+HD | 60 (31) | 3 | 6 (11) | 2 | F: Rare in ascomycetes. M: Not in chordates |
| Zn_cluster+HD | 35 (18) | 3 | – | ||
| Copper fist+Zn cluster | 29 (15) | 2 | – | – | Not found in zygomycetes |
| bZIP+HLH | 22 (12) | 2 | – | – | |
| Zn cluster+bZIP | 10 (5) | 1 | – | ||
| bZIP+C2H2 | 9 (5) | 1 | 42 (76) | 5 | F: Not found in basidiomycetes. M: Higher in fish, not in worms. |
| HD+lambda repressor-like | 7 (4) | 7 | 39 (71) | 9 | F: Mostly found in zygomycetes. M: Not in insects; higher in fish; expansion in lancelet |
| bZIP+HD-like | 5 (3) | 1 | – | – | Not found in zygomycetes |
| GATA+Zn cluster | 5 (3) | 1 | – | – | |
| GR-like+GATA | – | – | 55 (100) | 13 | Ubiquitous in Metazoa |
| HD+GR-like | – | – | 55 (100) | 19 | Higher in fish |
| CUT, HD+CUT | – | – | 53 (96) | 12 | Higher in fish |
| zf-C2H2+GR-like | – | – | 12 (22) | 137 | Insect-specific |
| C2H2+GATA | – | – | 9 (16) | 2 | Arthropoda-specific |
The analysis was run in 191 fungal and 55 animal TFomes. No dual-specific TFs were found in Microsporidia and protists with one exception of an expansion of GR-like+GATA in amoeba. In fungi, the dsTFs are almost totally absent in yeast (Saccharomycotina as well as Taphrinomycotina) The number in parentheses shows the percentage of genomes in the respective sets (fungal or metazoan), which possess the TF. “Max” is the maximal number of TFs per species. F, fungi; M, metazoa.
Figure 8Phylogeny and domain structures of . The dual specificity TFs are marked with asterisks.
Figure 9Alignment of the domain regions of . Gray bars underline the domains; red asterisks mark the key residues essential for binding. The dual TFs are in the red frame.