| Literature DB >> 19284633 |
Debra L Fulton1, Saravanan Sundararajan, Gwenael Badis, Timothy R Hughes, Wyeth W Wasserman, Jared C Roach, Rob Sladek.
Abstract
Unravelling regulatory programs governed by transcription factors (TFs) is fundamental to understanding biological systems. TFCat is a catalog of mouse and human TFs based on a reliable core collection of annotations obtained by expert review of the scientific literature. The collection, including proven and homology-based candidate TFs, is annotated within a function-based taxonomy and DNA-binding proteins are organized within a classification system. All data and user-feedback mechanisms are available at the TFCat portal (http://www.tfcat.ca).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19284633 PMCID: PMC2691000 DOI: 10.1186/gb-2009-10-3-r29
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Transcription factor data resources
| Resource | Organism | Reference/URL |
| Human KZNF Gene Catalog | Human | Huntley |
| Database of bZIP Transcription Factors | Human | Ryu |
| The | Fly | Adryan |
| wTF2.0: a collection of predicted | Worm | Reece-Hoyes |
TFCat catalog statistics
| Total number of genes annotated | 1,058 | 100% |
| Proportion of genes with positive TF judgments | 882 | 83% |
| Proportion of positive TFs with DNA-binding activity | 571 | 65% |
| Proportion of DNA-binding TFs that are (double-stranded) sequence-specific | 535 | 94% |
TFCat judgment classifications
| Judgment classification | Number of annotations | % of annotations |
| TF gene | 733 | 61.9 |
| TF gene candidate | 256 | 21.7 |
| Probably not a TF - no evidence that it is a TF | 41 | 3.5 |
| Not a TF - evidence that it is not a TF | 30 | 2.5 |
| Indeterminate - there is no evidence for or against this gene's role as a TF | 114 | 9.6 |
| TF evidence conflict - there is evidence for and against this gene's role as a TF | 10 | 0.8 |
TFCat taxonomy classifications
| Taxonomy classification | Number of annotations | % of annotations |
| Basal transcription factor | 39 | 3.7 |
| DNA-binding: non-sequence-specific | 30 | 2.9 |
| DNA-binding: sequence-specific | 591 | 56.5 |
| DNA-binding: single-stranded RNA/DNA binding | 20 | 1.9 |
| Transcription factor binding: TF co-factor binding | 315 | 30.1 |
| Transcription regulatory activity: heterochromatin interaction/binding | 51 | 4.9 |
DNA-binding TF gene classification counts
| Protein group | Protein group description | Protein family | Protein family description | Gene count | Predicted occurrences |
| 1.1 | Helix-turn-helix group | 2 | Homeodomain family | 131 | 160 |
| 1.1 | Helix-turn-helix group | 100 | Myb domain family | 7 | 16 |
| 1.1 | Helix-turn-helix group | 109 | Arid domain family | 5 | 5 |
| 1.1 | Helix-turn-helix group | 999 | No family level classification | 2 | 2 |
| 1.2 | Winged helix-turn-helix | 13 | Interferon regulatory factor | 7 | 7 |
| 1.2 | Winged helix-turn-helix | 15 | Transcription factor family | 10 | 11 |
| 1.2 | Winged helix-turn-helix | 16 | Ets domain family | 23 | 23 |
| 1.2 | Winged helix-turn-helix | 101 | GTF2I domain family | 2 | 12 |
| 1.2 | Winged helix-turn-helix | 102 | Forkhead domain family | 26 | 26 |
| 1.2 | Winged helix-turn-helix | 103 | RFX domain family | 4 | 4 |
| 1.2 | Winged helix-turn-helix | 111 | Slide domain family | 1 | 1 |
| 2.1 | Zinc-coordinating group | 17 | Beta-beta-alpha-zinc finger family | 79 | 450 |
| 2.1 | Zinc-coordinating group | 18 | Hormone-nuclear receptor family | 43 | 43 |
| 2.1 | Zinc-coordinating group | 19 | Loop-sheet-helix family | 1 | 1 |
| 2.1 | Zinc-coordinating group | 104 | GATA domain family | 7 | 12 |
| 2.1 | Zinc-coordinating group | 105 | Glial cells missing (GCM) domain family | 2 | 2 |
| 2.1 | Zinc-coordinating group | 106 | MH1 domain family | 3 | 3 |
| 2.1 | Zinc-coordinating group | 114 | Non methyl-CpG-binding CXXC domain | 2 | 4 |
| 2.1 | Zinc-coordinating group | 999 | No family level classification | 2 | 2 |
| 3 | Zipper-type group | 21 | Leucine zipper family | 41 | 64 |
| 3 | Zipper-type group | 22 | Helix-loop-helix family | 71 | 71 |
| 4 | Other alpha-helix group | 28 | High mobility group (Box) family | 24 | 28 |
| 4 | Other alpha-helix group | 29 | MADS box family | 4 | 4 |
| 4 | Other alpha-helix group | 107 | Sand domain family | 3 | 3 |
| 4 | Other alpha-helix group | 115 | NF-Y CCAAT-binding protein family | 2 | 2 |
| 5 | Beta-sheet group | 30 | TATA box-binding family | 1 | 2 |
| 6 | Beta-hairpin-ribbon group | 34 | Transcription factor T-domain | 11 | 11 |
| 6 | Beta-hairpin-ribbon group | 108 | Methyl-CpG-binding domain, MBD family | 2 | 2 |
| 7 | Other | 37 | Rel homology region family | 10 | 10 |
| 7 | Other | 38 | Stat protein family | 6 | 6 |
| 7 | Other | 110 | Runt domain family | 3 | 3 |
| 7 | Other | 112 | Beta_Trefoil-like domain family | 2 | 2 |
| 7 | Other | 113 | DNA-binding LAG-1-like domain family | 2 | 2 |
| 8 | Enzyme group | 47 | DNA polymerase-beta family | 1 | 7 |
| 999 | Unclassified structure | 901 | CP2 transcription factor domain family | 3 | 3 |
| 999 | Unclassified structure | 902 | AF-4 protein family | 1 | 1 |
| 999 | Unclassified structure | 903 | DNA binding homeobox and different transcription factors (DDT) domain family | 1 | 1 |
| 999 | Unclassified structure | 904 | AT-hook domain family | 3 | 6 |
| 999 | Unclassified structure | 905 | Nuclear factor I - CCAAT-binding transcription factor (NFI-CTF) family | 3 | 3 |
Large cluster ranking criteria
| Rank | Implication for unannotated genes in cluster | Fraction of observed approximate mean TFs (OAMTF) | |
| 1 | The majority of genes are likely TFs | 95% | |
| 0.10 ≤ | 2 | A higher proportion of genes are likely TFs | 75% |
| 0.03 ≤ | 3 | A higher proportion of genes are likely not TFs | 35% |
| 0.00 ≤ | 4 | The majority of genes are likely not TFs | 15% |