| Literature DB >> 20011068 |
Abstract
The ETS proteins are a family of transcription factors (TFs) that regulate a variety of biological processes. We made genome-wide analyses to explore the classification of the ETS gene family. We identified 207 ETS genes which encode 321 ETS TFs from ten animal species. Of the 321 ETS TFs, 155 contain only an ETS domain, about 50% contain a ETS_PEA3_N or a SAM_PNT domain in addition to an ETS domain, the rest (only four) contain a second ETS domain or a second ETS_PEA3_N domain or an another domain (AT_hook or DNA_pol_B). A Neighbor-Joining phylogenetic tree was constructed using the amino acid sequences of the ETS domain of the ETS TFs. The results revealed that the ETS genes of the ten species can be divided into two distinct groups. Group I contains one nematode ETS gene and 18 vertebrate animal ETS genes. Group II contains the majority of the ETS TFs and can be further divided into eleven subgroups. The sequence motifs outside the DNA-binding domain and the conservation of the exon-intron structural patterns of the ETS TFs in human, cattle, and chicken further support the phylogenetic classification among these ETS TFs. Extensive duplication of the ETS genes was found in the genome of each species. The duplicated ETS genes account for ~69% of the total of ETS genes. Furthermore, we also found there are ETS gene clusters in all of the ten animal species. Statistical analysis of the Gene Ontology annotations of the ETS genes showed that the ETS proteins tend to be related to RNA biosynthetic process, biopolymer metabolic process and macromolecule metabolic process expected from the common GO categories of transcriptional factors. We also discussed the functional conservation and diversification of ETS TFs.Entities:
Keywords: ETS transcription factors; genome-wide; metazoan animal; phylogenetic analysis
Year: 2009 PMID: 20011068 PMCID: PMC2789578 DOI: 10.4137/ebo.s2948
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
ETS TF subfamilies and distribution of ETS TFs in different subfamilies and in the ten species of animal kingdom. First row: numbers of proteins, second row: numbers of genes encoding these proteins.
| Subfamily | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Species | ETS | ETS& SAM_PNT | ETS& ETS_PEA3_N | ETS + 2 | ETS& DNA_pol_B | ETS& ETS_PEA3_N + 2 | ETS& AT_hook& SAM_PNT | Total No ETS TFs | Total No TFs | ETS TFs/TFs (%) |
| Human | 30 | 34 | 7 | 0 | 0 | 0 | 0 | 71 | 2740 | 2.6 |
| 15 | 11 | 3 | 29 | 1453 | 2.0 | |||||
| Mouse | 24 | 20 | 7 | 0 | 0 | 0 | 0 | 51 | 2416 | 2.1 |
| 14 | 10 | 4 | 28 | 1377 | 2.1 | |||||
| Rat | 19 | 13 | 4 | 0 | 0 | 0 | 0 | 36 | 1669 | 2.1 |
| 15 | 10 | 3 | 28 | 1167 | 2.4 | |||||
| Cattle | 15 | 10 | 3 | 0 | 1 | 0 | 0 | 29 | 1381 | 2.1 |
| 14 | 10 | 3 | 1 | 28 | 1136 | 2.5 | ||||
| Chicken | 9 | 15 | 5 | 0 | 0 | 1 | 1 | 31 | 1084 | 2.9 |
| 7 | 10 | 3 | 1 | 1 | 22 | 720 | 3.1 | |||
| Zebrafish | 19 | 11 | 4 | 0 | 0 | 0 | 0 | 34 | 1853 | 1.8 |
| 11 | 7 | 3 | 21 | 1234 | 1.9 | |||||
| Frog | 11 | 8 | 1 | 0 | 0 | 0 | 0 | 20 | 981 | 2.0 |
| 11 | 8 | 1 | 20 | 981 | 2.0 | |||||
| Sea squirt | 8 | 12 | 0 | 0 | 0 | 0 | 0 | 20 | 518 | 3.9 |
| 4 | 7 | 11 | 374 | 3.7 | ||||||
| Fruit fly | 11 | 7 | 0 | 0 | 0 | 0 | 0 | 18 | 963 | 1.9 |
| 6 | 5 | 11 | 579 | 1.9 | ||||||
| Nematode | 9 | 1 | 0 | 1 | 0 | 0 | 0 | 11 | 1159 | 0.9 |
| 8 | 1 | 1 | 10 | 793 | 1.3 | |||||
| Total | 155 | 131 | 31 | 1 | 1 | 1 | 1 | 321 | ||
| 104 | 79 | 20 | 1 | 1 | 1 | 1 | 207 | |||
ETS TFs are classified into different subfamilies according to the DNA-binding domains they contain. For example, ETS TFs in subfamily ETS contain only an ETS domain, those in subfamily ETS&SAM_PNT contain an ETS domain and a SAM_PNT domain, those in subfamily ETS + 2 contain two ETS domains, et al.
Figure 1Unrooted ML trees of the ETS genes based on the amino acid sequences of the ETS-domain of all ETS TFs. The scale bar corresponds to 0.5 amino acid substitutions per residue. Different colors denote different species, red: human, magenta: mouse, orange: rat lime: cattle, green: chicken, blue: frog, darkblue: sea squirt, purple: zebrafish, yellow: fruit fly, black: nematode.
Figure 2The whole sequence logo of the ETS domain and its core conserved sequence MAY(DE) KLSR(GA)LRYYY. The over all height of each stack indicated the sequence conservation at that position, whereas the height of symbols within each stack reflects the relative frequency of the corresponding amino acid.
Core conserved sequence of the ETS domain in different (sub)groups in the ML tree.
| Group | Sequence logo | Group | Sequence logo | Group | Sequence logo |
|---|---|---|---|---|---|
| SPI | MTYQKMARALRNYG | ESE | MTYEKLSRALRYYY | TEL | MTYEKMSRALRHYY |
| ELF | MNYETMGRALRYYY | DETS4 | MNYDKLSRSLRQYY | PEA3 | MNYDKLSRSLRYYY |
| ELK | MNYDKLSRALRYYY | ETS | MNYEKLSRGLRYYY | GABP | MNYEKLSRALRYYY |
| ERF | MNYDKLSRALRYYY | ERG | MNYDKLSRALRYYY | CEETS | MNYDKMSRGLRYFY |
Figure 3Distribution of ETS genes on the chromosomes of human. Chromosome numbers are indicated at the top of each bar. The small blue box on the chromosome indicated the position of the ETS gene with its name beside it. The duplicated ETS genes (either on different chromosomes or residing nearby) are connected with a single line. The red lines indicate the ETS gene clusters (genes reside tandem next to one another within a 200 kb genomic region).
Figure 4Clustering of 29 human, 27cattle and 20 chicken ETS genes by three methods. A) ML phylogenetic analysis reconstructed by phyML 3.0. B) Patters of exon-intron structure. Filled grey boxes: ETS domains; filled green boxes: SAM_PNT domains; filled blue boxes: ETS_PEA3_N domains; white boxes: other exon regions; lines; introns. C) MEME motif search results aligned based on the DNA-binding domains represented as white boxes. Conserved motifs are indicated in numbered color boxes.
Abbreviations: SAM, SAM_PNT domain; PEA, ETS_PEA3_N domain; ETS, ETS domain.
The significant gene ontology (GO) terms of the ETS gene (FDR < 0.05).
| Category | GO term | GO definition | Species |
|---|---|---|---|
| Molecular Function | 0043565 | sequence-specific DNA binding | human, mouse, rat, chicken, sea squirt, fruit fly |
| 0003676 | nucleic acid binding | human, mouse, rat, chicken, fruit fly | |
| 0003677 | DNA binding | human, mouse, rat, chicken, fruit fly | |
| 0003700 | transcription factor activity | human, mouse, rat, chicken, fruit fly | |
| 0030528 | transcription regulator activity | human, mouse, rat, chicken, fruit fly | |
| 0016563 | transcription activator activity | human, mouse | |
| Cellular Component | 0005634 | nucleus | human, mouse, rat, chicken |
| 0043231 | intracellular membrane-bound organelle | human, mouse, chicken | |
| 0043227 | membrane-bound organelle | human, mouse, chicken | |
| 0043229 | intracellular organelle | human, mouse | |
| 0043226 | organelle | human, mouse | |
| 0044424 | intracellular part | human, mouse | |
| 0005622 | Intracellular | mouse | |
| Biological Process | 0006350 | Transcription | human, mouse, rat, chicken, fruit fly |
| 0010467 | gene expression | human, mouse, rat, chicken, fruit fly | |
| 0010468 | regulation of gene expression | human, mouse, rat, chicken, fruit fly | |
| 0019219 | regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | human, mouse, rat, chicken, fruit fly | |
| 0019222 | regulation of metabolic process | human, mouse, rat, chicken, fruit fly | |
| 0050789 | regulation of biological process | human, mouse, rat, chicken, fruit fly | |
| 0050794 | regulation of cellular process | human, mouse, rat, chicken, fruit fly | |
| 0031323 | regulation of cellular metabolic process | human, mouse, rat, chicken, fruit fly | |
| 0006139 | nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | human, mouse, rat, chicken | |
| 0006351 | Transcription, DNA-dependent | human, mouse, rat, chicken | |
| 0006355 | regulation of transcription, DNA-dependent | human, mouse, rat, chicken | |
| 0016070 | RNA metabolic process | human, mouse, rat, chicken | |
| 0032774 | RNA biosynthetic process | human, mouse, rat, chicken | |
| 0045449 | regulation of transcription | human, mouse, rat, fruit fly | |
| 0065007 | biological regulation | human, mouse, chicken | |
| 0043283 | biopolymer metabolic process | human, mouse, chicken | |
| 0044237 | cellular metabolic process | human, mouse | |
| 0044238 | primary metabolic process | human, mouse | |
| 0043170 | macromolecule metabolic process | human, mouse | |
| 0006357 | regulation of transcription from RNA polymerase II promoter | human, mouse | |
| 0008152 | metabolic process | human, mouse | |
| 0006366 | transcription from RNA polymerase II promoter | human | |
| 0045935 | positive regulation of nucleobase, nucleoside, nucleotide and nucleic acid metabolic process | mouse | |
| 0045941 | positive regulation of transcription | mouse | |
| 0031325 | positive regulation of cellular metabolic process | mouse | |
| 0009893 | positive regulation of metabolic process | mouse |