| Literature DB >> 25679776 |
Libing Shen1, Gangbiao Liu1, Yangyun Zou1, Zhan Zhou1, Zhixi Su1, Xun Gu2.
Abstract
RNA sequencing (RNA-Seq) technology provides the detailed transcriptomic information for a biological sample. Using the RNA-Seq data of six organs from nine vertebrate species, we identified a number of organ-specifically expressed or repressed orthologous genes whose expression patterns are mostly conserved across nine species. Our analyses show the following results: (i) About 80% of these genes have a chordate or more ancient origin and more than half of them are the legacy of one or multiple rounds of large-scale gene duplication events. (ii) Their evolutionary rates are shaped by the organ in which they are expressed or repressed, e.g. the genes specially expressed in testis and liver generally evolve more than twice as fast as the ones specially expressed in brain and cerebellum. The organ-specific transcription factors were discriminated from these genes. The ChIP-seq data from the ENCODE project also revealed the transcription-related factors that might be involved in regulating human organ-specifically expressed or repressed genes. Some of them are shared by all six human organs. The comparison of ENCODE data with mouse/chicken ChIP-seq data proposes that organ-specifically expressed or repressed orthologous genes are regulated in various combinatorial fashions in different species, although their expression features are conserved among these species. We found that the duplication events in some gene families might help explain the quick organ/tissue divergence in vertebrate lineage. The phylogenetic analysis of testis-specifically expressed genes suggests that some of them are prone to develop new functions for other organs/tissues.Entities:
Mesh:
Year: 2015 PMID: 25679776 PMCID: PMC4332667 DOI: 10.1371/journal.pone.0116872
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The number of specifically expressed or repressed (OSER) orthologous clusters in each organ/tissue.
| Organ/Tissue | No. of specifically expressed clusters | No. of specifically repressed clusters |
|---|---|---|
| Brain | 98 | 9 |
| Cerebellum | 49 | 15 |
| Heart | 139 | 44 |
| Kidney | 130 | 2 |
| Liver | 140 | 154 |
| Testis | 453 | 46 |
| Nervous tissue | 269 | 52 |
*The OSER clusters in nervous tissue show no significant expression difference between brain and cerebellum while have a distinct expression pattern between nervous tissues and the other organs.
The evolutionary origins of OSER clusters in each organ/tissue.
| Origin | Brain | Cerebellum | Nervous tissue | Heart | Kidney | Liver | Testis |
|---|---|---|---|---|---|---|---|
| Chordata (sea squirt) | 86 | 54 | 250 | 134 | 112 | 232 | 404 |
| Craniata (lamprey) | 13 | 7 | 43 | 29 | 12 | 30 | 46 |
| Gnathostomata (zebrafish) | 4 | 3 | 20 | 19 | 6 | 25 | 37 |
| Tetrapoda (clawed frog) | 3 | 0 | 4 | 1 | 2 | 5 | 3 |
| Amniota (lizard or chicken) | 1 | 0 | 4 | 0 | 0 | 2 | 9 |
| Total | 107 | 64 | 321 | 183 | 132 | 294 | 499 |
Figure 1Phylogenetic tree of MACC1 gene family.
A Ciona intestinalis gene was selected as the outgroup to root the tree and only the cladogram is shown. The tree node where a possible large-scale duplication event happened is marked with a filled black square ■. 7 means the gene’s expression level is higher than 95% of all genes expressed in the organ. 6 is between 95% and 85%. 5 is between 85% and 65%. 4 is between 65% and 35%. 3 is between 35% and 15%. 2 is between 15% and 5%. 1 is lower than 5%. 0 means no expression at all. N is not available.
Figure 2Distribution of the number orthologous clusters in the gene families containing OSER genes.
Figure 3Classification of OSER clusters according to their evolutionary origins.
Figure 4Comparison of the evolutionary rate of specifically expressed and repressed clusters in seven organs/tissues.
The total phylogenetic tree branch length of each one-to-one OSER cluster was used to represent its evolutionary rate.
Organ-specifically expressed or repressed transcriptional factors.
| Organ/Tissue | Organ-specifically expressed TF | Organ-specifically repressed TF |
|---|---|---|
| Brain | BCL11A, TSHZ3, MKX, ZEB2, SATB1 | HBP1, KLF11 |
| Cerebellum | BAZ1B, FOXJ2, CTCF, FUBP1, ZFPM2, CHD7, ZNF385C | KLF10 |
| Nervous tissue | CRTC1, ZNF365, TEF, CAMTA1, SOX2, MYT1L | ELF1, GATA5, GATA6, ZNF217, TGIF1 |
| Heart | TBX18, GATA6, ZNF366 | SALL1 |
| Kidney | TFEC, GATA3, SIM1, TFCP2L1, EHF, ELF5, ELF3, FOXI1 | ZFPM2 |
| Liver | CREB3L3, TBX3, NRBF2 | MYEF2, GLI3, SOX9, ZNF462, ZNF827, CAMTA1, ZNF536, TRPS1 |
| Testis | TCFL5, TBX4, BAZ2B, E2F8, TMF1, LZTFL1, E2F7, ZNF217, HSF5, CNOT10, ZNF438, MTF1, BAZ1A, TRAF3IP1, NR6A1, TBPL1 | TFEB, NFIB, CTBP1, KLF15, IRF2, PPARGC1A, JDP2 |
Figure 5Phylogenetic tree of GATA transcription factor family.
A Ciona intestinalis gene was selected as the outgroup to root the tree and only the cladogram is shown. The tree node where a possible large-scale duplication event happened is marked with a filled black square ■. 7 means the gene’s expression level is higher than 95% of all genes expressed in the organ. 6 is between 95% and 85%. 5 is between 85% and 65%. 4 is between 65% and 35%. 3 is between 35% and 15%. 2 is between 15% and 5%. 1 is lower than 5%. 0 means no expression at all. N is not available.
Five most common transcription-related factors might be involved in regulating OSER genes.
| OSER gene group | Proximal binding TFs (the number of OSER genes it might control) | Distal binding TFs (the number of OSER genes it might control) |
|---|---|---|
| Brain-specifically expressed genes | REST(9), RAD21(7), CTCF(5), FOXA1(3), ZZZ3(3) | EP300(10), CTCF(3), SPI1(3), RAD21(2), TCF12 (2) |
| Cerebellum-specifically expressed genes | REST(5), CTCF(2), TAL1(2), E2F1(2), GATA2(2) | EP300(5), RAD21(2), CEBPB(2), SP1(2), STAT3(2) |
| Nervous-tissue-specifically expressed genes | REST(19), CTCF(15), RAD21(12), ZNF263(7), SUZ12(6) | EP300(30), SPI1(11), RAD21(10), SP1(10), CTCF(9) |
| Heart-specifically expressed genes | CTCF(8), GATA2(7), ESR1(7), TCF4(5), E2F1(4) | EP300(22), CTCF(6), CEBPB(5), RAD21(4), YY1(4) |
| Kidney-specifically expressed genes | RAD21(7), GATA2(5), CTCF(5), ESR1(5), TFAP2C(4) | EP300(24), SPI1(12), EBF1(11), RAD21(10), BATF(10) |
| Liver-specifically expressed genes | FOXA1(20), FOXA2(19), EP300(14), HNF4A(13), TCF4(13) | EP300 (33), TAL1(22), JUND(19), GATA2(18), POL2(17) |
| Testis-specifically expressed genes | E2F4(42), E2F1(22), CTCF(21), MYC(15), NFYA (14) | EP300(29), FOXA2(12), RAD21(12), FOXA1(12), JUND(11) |
Figure 6Venn diagram of transcription-related factors in five organs/tissues.
A, the proximal-binding transcription-related factors in five organs/tissues. B, the distal-binding transcription-related factors in five organs/tissues. Nervous tissue includes the transcription-related factors in brain and cerebellum.
Comparison of human ChIPseq data with mouse/chicken ChIPseq data.
| Transcription factor | Number of proximal binding OSER gene | Proximal binding overlap with human | Number of distal binding OSER gene | Distal binding overlap with human |
|---|---|---|---|---|
| Human CTCF | 72 | 51 | ||
| Mouse CTCF | 704 | 49 (68%) | 819 | 25 (49%) |
| Chicken CTCF | 95 | 4 (5.6%) | 291 | 15 (29%) |
| Human EP300 | 28 | 172 | ||
| Mouse EP300 | 291 | 9 (32%) | 563 | 78 (45%) |