| Literature DB >> 23432897 |
Nazia Parveen1, Ayesha Masood, Nouman Iftikhar, Bushra F Minhas, Rashid Minhas, Uzma Nawaz, Amir Ali Abbasi.
Abstract
BACKGROUND: Human genome is enriched with thousands of conserved non-coding elements (CNEs). Recently, a medium throughput strategy was employed to analyze the ability of human CNEs to drive tissue specific expression during mouse embryogenesis. These data led to the establishment of publicly available genome wide catalog of functionally defined human enhancers. Scattering of enhancers over larger regions in vertebrate genomes seriously impede attempts to pinpoint their precise target genes. Such associations are prerequisite to explore the significance of this in vivo characterized catalog of human enhancers in development, disease and evolution.Entities:
Mesh:
Year: 2013 PMID: 23432897 PMCID: PMC3599049 DOI: 10.1186/1471-2164-14-122
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
The association of the human CNE-enhancers with their target gene bodies by comparative syntenic analyses and through comparison of the reporter expression induced by CNE-enhancers with the reported endogenous expression patterns of the neighboring genes
| 39 | 1 | √ | √ | √ | ¯ |
| 2 | 2 | √ | √ | ¯ | √ |
| 1 | 1 | √ | √ | ¯ | √ |
| 1 | 1 | √ | √ | √ | * |
| 41 | 2 | ¯ | √ | ¯ | √ |
| 7 | 3 | ¯ | √ | ¯ | √ |
| 44 | 1 | ¯ | √ | √ | ¯ |
| 56 | 1 | ¯ | √ | ¯ | √ |
| 1 | 1 | ¯ | √ | √ | * |
This table provides the following information about human CNE-enhancers.
(i) The number of target genes associated with each CNE-enhancer analyzed.
(ii) Whether they are duplicated or not.
(iii) Their association with target gene is supported by either syntenic analysis or by endogenous expression analysis or by both.
* Expression pattern is not known.
MGI: Mouse Genome Informatics.
CNE: Conserved non-coding element.
Figure 1Human CNE-enhancers associated with target genes using orthology mapping. (A) Comparative syntenic analysis of human, mouse, chicken and frog orthologous loci depicts the conserved presence of both the paralogous SP4 and SP8 in the nearest vicinity of the CNE-enhancer (hs110: light green vertical line). Analogy in the expression pattern of the CNE-enhancer and both of these paralogs suggest the association of this CNE-enhancer with both SP4 and SP8. (B) Increasing the depth of our synteny comparison by including orthologous loci from teleost fish lineage, it became evident that only SP8 maintains its physical linkage with the CNE-enhancer in fish. Based on uninterrupted physical proximity over longer period of evolutionary time (450 Myr) SP8 was considered as probable target for this CNE-enhancer. Genes are color-coded. Direction of arrow depicts the direction of gene transcription. Light green vertical line depicts the position of CNE-enhancer. Horizontal black line depicts scale.
Figure 2Human CNE-enhancers duplication history and their genomic range of action. (A) This pie chart shows the extent of duplication events in our selected subset of CNE-enhancers (192 enhancers in total). The majority of our selected enhancers are unduplicated. About 10% of the enhancers have duplicated copies only in teleost fishes, 9% are duplicated before teleost-tetrapod split and 3% enhancers have duplicated copies only in human/tetrapod. (B) This pie chart represents the distribution of distances between CNE-enhancers and target gene bodies to estimate the optimal distance at which an enhancer can access its concerned promoter. About 19% of enhancers are within the introns of their predicted target gene bodies, 24% are within a range of 0-200 kb, 17% are at a distance of 201-400 bp, 10% lie in the range of 401-600 kb, 14% in the range of 601-999 kb and 16% of the enhancers are located at distance of >1 Mb from their target genes.
Figure 3Predicting the target genes of human enhancers by paralogy and orthology mapping of locus architecture. (A) CNE-enhancers when duplicated are retained with their target genes. Comparing the genic content of CNE-EBF containing paralogous loci in human genome and their orthologous loci in multiple fish lineages unmistakably suggests that duplicated copies of human CNE-EBF enhancer (hs230) are associated with the regulation of paralogous copies of EBF family members, i.e. EBF1, EBF3 and EBF4. (B) Unduplicated CNE-enhancer (hs529) is associated with its target gene SH3GL2 by tracing the differential loss of unrelated bystander genes among orthologous loci. Genes are color-coded. Direction of arrow depicts the direction of gene transcription. Light green vertical line depicts the position of CNE-enhancer. Horizontal black line depicts scale.
Figure 4CNE-enhancer induced reporter expression was compared with the endogenous expression of neighboring genes to find their probable target. In those cases where comparative synteny analysis alone was not sufficient to establish unambiguous associations among CNE-enhancers and their target gene bodies, we carefully compared the CNE-enhancer induced reporter expression pattern with the reported endogenous expression pattern of neighboring genes. (A) Both TSHZ3 and ZNF536 genes depict conserved syntenic associations with a human CNE-enhancer (hs858), suggesting either or both of these genes are under the regulatory control of this enhancer. However comparing the reporter expression induced by this CNE-enhancer with the reported endogenous expression pattern of TSHZ3 and ZNF536 depicts a precise analogy in the expression domain of ZNF536 and this enhancer interval. (B) CNE-enhancer (hs1305) within an intron of human MRPS28 gene is not associated with the promoter of the same gene, rather applying the same strategy discussed above in panel A we infer that this CNE-enhancer is acting at a distance of ~150.8 kb on HEY1 and STMN2 genes. Genes are color-coded. Direction of arrow depicts the direction of gene transcription. Light green vertical line depicts the position of CNE-enhancer. Horizontal black line depicts scale.
Figure 5Human enhancer regions can access their target promoters from a distance of ~2 MB. Evolutionary conserved syntenic association between a CNE-enhancer (hs191) and human NR2F1 and harmony in their expression pattern domains in mice infers the functional association among them. The distance between the transcription start site of human NR2F1 and concerned cis-acting site is ~2 MB. Genes are color-coded. Direction of arrow depicts the direction of gene transcription. Light green vertical line depicts the position of CNE-enhancer. Horizontal black line depicts scale.
Probability table of transcription factor binding sites in training and control data sets
| 1 | Tal1b-E47S | 54/159=0.34 | 6/100=0.06 |
| 2 | NF-kappaβ | 29/159=0.18 | 10/100=0.10 |
| 3 | n-MYC | 74/159=0.46 | 7/100=0.07 |
| 4 | ARNT | 61/159=0.38 | 17/100=0.17 |
| 5 | USF | 61/159=0.38 | 7/100=0.07 |
| 6 | c-REL | 58/159=0.36 | 20/100=0.20 |
| 7 | MEF | 55/159=0.35 | 10/100=0.10 |
| 8 | FREAC | 94/159=0.59* | 14/100=0.14 |
| 9 | AML-1 | 94/159=0.59* | 25/100=0.25 |
| 10 | c-FOS | 119/159=0.74* | 23/100=0.23 |
| 11 | HNF-3beta | 119/159=0.74* | 31/100=0.31 |
| 12 | SOX17 | 136/159=0.85* | 44/100=0.44 |
| 13 | HFH | 150/159=0.94* | 51/100=0.51* |
| 14 | SOX-5 | 151/159=0.95* | 44/100=0.44 |
* TFs with P(TF) greater than 0.50 defined as group-2. The table present Sr. No 8 to 14 in Training data set as the only group of TFs with high probability of TFs so the most likely co-occurring TFs in CNS brain specific enhancers compared to TFs with P(TF) less than 0.5 in training and control data sets.
Figure 63D loading plots. A: Central nervous system specific human enhancers exhibit three distinctive and internally compact clusters exposing the interactive pattern of TFBSs. B: The control data set of human non-conserved and non-coding elements do not present any vivid cluster structure for the fourteen TFBSs that formed internally closely packed clusters in CNS specific enhancers (A), thus elucidating the significance of clusters in panel A. TFBSs are color coded.
List of transcription factors having over-representative occurrence in brain specific -acting sites
| AML1 | RUNT-type transcription factor | spinal cord, hindbrain | [ |
| ARNT | Helix-loop-helix DNA-binding | spinal cord, forebrain, midbrain, hindbrain | [ |
| cFOS | Basic-leucine zipper (bZIP) transcription factor | hindbrain | [ |
| c-REL | Immunoglobulin-like fold | CNS (components not defined yet) | [ |
| FREAC | Winged helix-turn-helix transcription repressor | CNS (components not defined yet) | [ |
| HFH | Winged helix-turn-helix transcription repressor DNA | CNS (components not defined yet) | [ |
| HNF3B | Winged helix-turn-helix transcription repressor DNA | Spinal-cord, forebrain midbrain, hindbrain | [ |
| MEF2 | myocyte-specific enhancer factor 2A | forebrain, hindbrain | [ |
| NF-KappaB | Immunoglobulin-like fold | spinal cord, forebrain, midbrain, hindbrain | [ |
| nMYC | Helix-loop-helix DNA-binding | spinal cord, forebrain, midbrain, hindbrain | [ |
| SOX17 | High mobility group, HMG1/HMG2 | spinal cord, forebrain, midbrain, hindbrain | [ |
| SOX5 | High mobility group, HMG1/HMG2 | spinal cord, hindbrain, midbrain | [ |
| Talbeta-E47S | Helix-loop-helix DNA-binding | spinal cord, forebrain, midbrain, hindbrain | [ |
| USF | Helix-loop-helix DNA-binding | spinal cord, forebrain, hindbrain | [ |
This table also provides references of literature (last column) that addresses the type/function and endogenous expression pattern of transcription factors.