| Literature DB >> 25262351 |
Hong-Mei Zhang1, Teng Liu1, Chun-Jie Liu1, Shuangyang Song1, Xiantong Zhang1, Wei Liu1, Haibo Jia1, Yu Xue1, An-Yuan Guo2.
Abstract
Transcription factors (TFs) are key regulators for gene expression. Here we updated the animal TF database AnimalTFDB to version 2.0 (http://bioinfo.life.hust.edu.cn/AnimalTFDB/). Using the improved prediction pipeline, we identified 72 336 TF genes, 21 053 transcription co-factor genes and 6502 chromatin remodeling factor genes from 65 species covering main animal lineages. Besides the abundant annotations (basic information, gene model, protein functional domain, gene ontology, pathway, protein interaction, ortholog and paralog, etc.) in the previous version, we made several new features and functions in the updated version. These new features are: (i) gene expression from RNA-Seq for nine model species, (ii) gene phenotype information, (iii) multiple sequence alignment of TF DNA-binding domains, and the weblogo and phylogenetic tree based on the alignment, (iv) a TF prediction server to identify new TFs from input sequences and (v) a BLAST server to search against TFs in AnimalTFDB. A new nice web interface was designed for AnimalTFDB 2.0 allowing users to browse and search all data in the database. We aim to maintain the AnimalTFDB as a solid resource for TF identification and studies of transcription regulation and comparative genomics.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25262351 PMCID: PMC4384004 DOI: 10.1093/nar/gku887
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Comparison of data contents between two versions of AnimalTFDB
| AnimalTFDB | Version 1.0 | Version 2.0 |
|---|---|---|
| Species | 50 | 65 |
| TF families | 72 | 70 |
| TF genes | 52 722 | 72 336 |
| Co-factor genes | 9066 | 21 053 |
| CRFs genes | 3476 | 6502 |
| Annotation | ||
| -gene function description | No | Yes |
| -expression | No | Yes |
| -phenotype | No | Yes |
| Multi-alignment of DBDs and their WebLogo | No | Yes |
| Phylogenetic tree | No | Yes |
| TF prediction server | No | Yes |
| BLAST search | No | Yes |
Summary of the expression data and TF numbers of model species in AnimalTFDB 2.0
| Species | Lineage | Expressiona | TF (%)b | Expressed TF (%)c | Co-factor (%)b | Expressed co-factor (%)c | CRF (%)b | Expressed CRF (%)c |
|---|---|---|---|---|---|---|---|---|
| Primate | CA ( | 1691 (7.4%) | 1589 (94.0%) | 462 (2.0%) | 430 (93.1%) | 155 (0.7%) | 140 (90.3%) | |
| Primate | TI ( | 1418 (6.5%) | 964 (68.0%) | 378 (1.7%) | 291 (77.0%) | 118 (0.5%) | 95 (80.5%) | |
| Rodent | TI ( | 1485 (6.5%) | 1227 (82.6%) | 397 (1.7%) | 390 (98.2%) | 122 (0.5%) | 118 (96.7%) | |
| Rodent | TI ( | 1375 (6.0%) | 1137 (82.7%) | 382 (1.7%) | 374 (97.9%) | 118 (0.5%) | 116 (98.3%) | |
| Laurasiatheria | TI ( | 1280 (6.4%) | 1141 (89.1%) | 378 (1.9%) | 376 (99.5%) | 121 (0.6%) | 121 (100.0%) | |
| Bird | TI ( | 858 (5.5%) | 769 (89.6%) | 329 (2.1%) | 325 (98.8%) | 98 (0.6%) | 98 (100.0%) | |
| Fish | DS ( | 2345 (8.9%) | 1756 (74.9%) | 315 (1.2%) | 306 (97.1%) | 100 (0.4%) | 97 (97.0%) | |
| Insect | TI ( | 604 (4.3%) | 594 (98.3%) | 160 (1.1%) | 158 (98.8%) | 53 (0.4%) | 51 (96.2%) | |
| Nematoda | TI ( | 706 (3.4%) | 684 (96.9%) | 130 (0.6%) | 130 (100.0%) | 40 (0.2%) | 39 (97.5%) |
aCA, cancer; TI, tissue; DS, development stage; CL, cell line; CT, cell type. Number in the bracket is the number of data sets of that type. The TI (16,24) of human indicates there are 16 mRNA data sets and 24 protein data sets for human tissue expression data. All other gene expression data are from RNA-seq mRNA expression.
bThe percentages in brackets are the percentages of TF (co-factor or CRF) genes in the protein-coding genes of genomes.
cThe percentages in brackets are the percentages of expressed TF (co-factor or CRF) genes.
Figure 1.The new annotations and tools in AnimalTFDB 2.0. (A) The multiple sequence alignment of TF DBDs, the weblogo and phylogenetic tree based on the alignment in each TF family. (B) The TF prediction server and examples of prediction result. (C) The BLAST search server. (D) One example of gene expression information. (E) The gene phenotype information.