| Literature DB >> 21689478 |
Chia-Sheng Chuang1, Tun-Wen Pai, Chin-Hua Hu, Wen-Shyong Tzou, Margaret Dah-Tsyr Chang, Hao-Teng Chang, Chih-Chia Chen.
Abstract
BACKGROUND: Hypoxia-inducible factors (HIFs) are transcription factors that play a crucial role in response to hypoxic stress in living organisms. The HIF pathway is activated by changes in cellular oxygen levels and has significant impacts on the regulation of gene expression patterns in cancer cells. Identifying functional conservation across species and discovering conserved regulatory motifs can facilitate the selection of reference species for empirical tests. This paper describes a cross-species functional pathway mapping strategy based on evidence of homologous relationships that employs matrix-based searching techniques for identifying transcription factor-binding sites on all retrieved HIF target genes.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21689478 PMCID: PMC3121119 DOI: 10.1186/1752-0509-5-S1-S3
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Figure 1A mapped HIF functional pathway with zebrafish as the query species and humans as the reference species is shown. The initial HIF pathway was acquired from the KEGG database, and the mapped functional pathway between zebrafish and humans was constructed on the basis of the homologous relationship. The novel parameters OrthRate and ParaRate are proposed for indicating the degree of functional conservation across species regarding a specific biological function. Higher ParaRates are shown within rectangular boxes with darker backgrounds. The rectangular boxes with white background represent no orthologous relationship at the compared node between 2 species.
Statistics of the OrthRate and ParaRate parameters within mapped functional pathways between 2 model species with humans as the reference
| Phylogeny | OrthNum | OrthRate | ParaNum | ParaRate | |
|---|---|---|---|---|---|
| HSA | 1 | 14 | N/A | 24 | N/A |
| MMU | 2 | 14 | 14/14 (100%) | 26 | 26/24 (108.3%) |
| GGA | 3 | 13 | 13/14 (92.9%) | 20 | 20/24 (83%) |
| XTR | 4 | 9 | 9/14 (64.3%) | 8 | 8/24 (33%) |
| DRE | 5 | 12 | 12/14 (85.7%) | 23 | 23/24 (95.8%) |
| CIN | 6 | 9 | 9/14 (64.3%) | 9 | 9/24 (37.5%) |
The first column, Phylogeny, shows the evolutionary distance from HSA to the other 5 model species. OrthNum represents the total number of genes involved within the subpathway of HIF responses, and OrthRate represents the functional conservation of the HIF pathway between 2 species (with HSA as the reference species). ParaNum shows the total number of paralogous genes involved within the subpathway of HIF responses, and ParaRate represents the relative possibility of gene replacement within the HIF functional pathway between 2 species (with HSA as the reference species). HSA, H. sapiens; MMU, M. musculus; GGA, G. gallus; DRE, D. rerio; XTR, X. tropicalis; CIN, C. intestinalis.
Identification of HRE motifs for HIF orthologous target genes
| Model Species | HIF orthologous target genes (KEGG: Orthology K05448) | ||
|---|---|---|---|
| HSA | ENSG00000119630 | ENSG00000112715 | ENSG00000173511 |
| MMU | ENSMUSG00000004791 | ENSMUSG00000023951 | ENSMUSG00000024962 |
| GGA | ENSGALG00000010290 | N/A | N/A |
| DRE | ENSDARG00000034700 | N/A | N/A |
| XTR | ENSXETG00000016375 | N/A | N/A |
| CIN | ENSCING00000014020 | N/A | N/A |
The locations of HRE motifs were detected within the HIF orthologous genes for 6 model species (the PSSM pattern of V$HIF_STKR_2005 and the core segment “CGTG” pattern were required to exactly match and shown in bold and underline). The regions were identified within 2,000 base pairs upstream of the orthologous gene set of KEGG: Orthology K05448 (VEGFA_B); only the first 3 detected HREs are shown here. The Ensembl gene IDs are shown on the top of each identified HRE motif. Only 2 HRE motifs were found in XTR within the assigned promoter region. HSA, H. sapiens; MMU, M. musculus; GGA, G. gallus; DRE, D. rerio; XTR, X. tropicalis; CIN, C. intestinalis
The number of identified HIF-related TFBSs within an orthologous gene set and a randomly selected gene set
| Comparison for HIF orthologous gene set | |||||||
|---|---|---|---|---|---|---|---|
| TF ID | HSA | MMU | GGA | CIN | DRE | XTR | # of species possessing the identical TFBS (0.85/0.80/0.75) |
| V$HIF_STKE_2005 | 0/0/2 | 1/1/3 | 1/1/3 | 0/0/1 | 0/0/2 | 0/0/0 | 2/2/5 |
| V$HIF1_Q5 | 1/3/7 | 2/2/12 | 1/3/10 | 2/2/6 | 1/1/4 | 0/1/4 | 5/6/6 |
| V$HIF1_Q3 | 1/3/10 | 2/2/12 | 1/4/9 | 2/3/7 | 1/1/4 | 0/0/11 | 5/5/6 |
| V$AHRHIF_Q6 | 5/5/29 | 3/3/24 | 5/5/22 | 6/6/23 | 3/3/21 | 2/2/18 | 6/6/6 |
| Total identified motifs | 7/11/48 | 8/8/51 | 8/13/44 | 8/11/37 | 5/5/31 | 2/3/33 | - |
| Comparison for a randomly selected gene set | |||||||
| V$HIF_STKE_2005 | 0/0/1 | 0/0/0 | 1/1/4 | 0/1/2 | 1/1/2 | 0/0/0 | 2/3/4 |
| V$HIF1_Q5 | 0/0/3 | 0/0/3 | 4/5/6 | 2/2/8 | 2/3/13 | 0/1/4 | 3/4/6 |
| V$HIF1_Q3 | 0/0/4 | 0/0/1 | 4/4/6 | 2/3/14 | 2/2/7 | 0/0/11 | 3/3/6 |
| V$AHRHIF_Q6 | 1/1/21 | 0/0/21 | 5/5/20 | 5/5/16 | 7/7/27 | 2/2/18 | 5/5/6 |
| Total identified motifs | 1/1/29 | 0/0/25 | 14/15/36 | 9/11/40 | 12/13/49 | 2/3/33 | - |
Comparison of TFBS conservation of the HIF orthologous gene set and a randomly selected gene set with 3 different cut-off values. (Upper half) The orthologous genes were obtained from 6 model species at the “VEGF” node within the HIF pathway (KEGG: Orthology K05448). These genes include ENSG00000119630 (HSA), ENSMUSG00000004791 (MMU), ENSGALG00000010290 (GGA), ENSCING00000014020 (CIN), ENSDARG00000034700 (DRE), and ENSXETG00000016375 (XTR). Three HIF-related TFBSs selected from the TRANSFAC library (V$HIF1_Q3, V$HIF1_Q5, and V$AHRHIF1_Q6) and 1 customized HRE motif (V$HIF_STKE_2005) were employed for scanning through all of the selected genes. Each cell in the table represents the numbers of retrieved TFBSs using 3 different cut-off values of 0.85, 0.80, and 0.75. For example, the V$AHRHIF_Q6 for HSA is 5/5/29, which represents the total number of V$AHRHIF_Q6 motifs that can be identified within the promoter region of the Homo sapiens VEGF gene by setting cut-off values at 0.85, 0.80, and 0.75. (Lower half) A set of genes without an orthologous relationship, randomly selected from 6 species. These genes include ENSG00000001460 (HSA), ENSMUSG00000073489 (MMU), ENSGALG00000000002 (GGA), ENSCING00000002344 (CIN), ENSDARG00000000086 (DRE), and ENSXETG00000016375 (XTR). Regarding the HIF function, most of the orthologous genes from various species indeed contain HIF TFBSs; however, they are sparsely distributed within the randomly selected genes. The rightmost columns also reflect the fact that the total number of species possessing identical HIF TFBSs is relatively higher within the orthologous gene set, since they are expected to retain HIF functional characteristics through the evolutionary process of speciation. HSA, H. sapiens; MMU, M. musculus; GGA, G. gallus; DRE, D. rerio; XTR, X. tropicalis; CIN, C. intestinalis
The number of identified HIF-related TFBSs within a paralogous gene set and a randomly selected gene set of zebrafish species
| Comparison for the paralogous genes | ||||
|---|---|---|---|---|
| TF ID | KEGG dre30682 | KEGG dre403049 | KEGG dre678512 | # of gene possessing the identical TFBS (0.85/0.80/0.75) |
| V$HIF_STKE_2005 | 0/0/2 | 0/0/0 | 0/0/2 | 0/0/2 |
| V$HIF1_Q5 | 1/1/4 | 0/0/3 | 2/3/3 | 2/2/3 |
| V$HIF1_Q3 | 1/1/4 | 0/0/2 | 2/3/5 | 2/2/3 |
| V$AHRHIF_Q6 | 3/3/21 | 0/0/15 | 4/4/30 | 2/2/3 |
| Total identified motifs | 5/5/31 | 0/0/20 | 8/10/40 | - |
| Comparison for randomly selected genes | ||||
| G1 | G2 | G3 | ||
| V$HIF_STKE_2005 | 0/0/1 | 0/0/0 | 0/0/0 | 0/0/1 |
| V$HIF1_Q5 | 1/1/5 | 0/0/0 | 0/0/4 | 1/1/2 |
| V$HIF1_Q3 | 1/1/9 | 0/0/1 | 0/0/3 | 1/1/3 |
| V$AHRHIF_Q6 | 3/3/25 | 0/0/13 | 1/1/21 | 2/2/3 |
| Total identified motifs | 5/5/40 | 0/0/14 | 1/1/28 | - |
Comparison of TFBS conservation of the HIF paralogous gene set and a randomly selected gene set from the zebrafish (DTR) with 3 different cut-off values. (Upper half) The paralogous genes were obtained from the “VEGF” node within the HIF pathway. These genes include ENSDARG00000034700 (KEGG: dre30682), ENSDARG00000069640 (KEGG: dre403049), and ENSDARG00000056624 (KEGG: dre678512). The selected HIF TFBS patterns are the same as those in Table 3. Each cell in the table represents the number of retrieved TFBSs using 3 different cut-off values of 0.85, 0.80, and 0.75. For example, the V$AHRHIF_Q6 for the first DTR gene (ENSDARG00000034700) is 3/3/21 which represents the total number of V$AHRHIF_Q6 motifs can be identified within the promoter region (2,000 base pairs) of the Danio rerio VEGF gene by setting cut-off values of 0.85, 0.80, and 0.75. (Lower half) A set of genes without a paralogous relationship, randomly selected from zebrafish. These genes include ENSDARG00000040535 (G1), ENSDARG00000068465 (G2), and ENSDARG00000010610 (G3). Regarding the HIF function, most of the paralogous genes from a given node in the HIF functional pathway contain the corresponding HIF TFBSs but appeared less frequently in the randomly selected genes. The rightmost columns also reflect the tendency that the total number of gene possessing identical HIF TFBSs is relatively higher within the paralogous gene set since they might retain HIF functional characteristics through the evolutionary process of gene duplication.