| Literature DB >> 19008958 |
Hong Sun1, Geir Skogerbø, Zhen Wang, Wei Liu, Yixue Li.
Abstract
Large numbers of sequence elements have been identified to be highly conserved among vertebrate genomes. These highly conserved elements (HCEs) are often located in or around genes that are involved in transcription regulation and early development. They have been shown to be involved in cis-regulatory activities through both in vivo and additional computational studies. We have investigated the structural relationships between such elements and genes in six vertebrate genomes human, mouse, rat, chicken, zebrafish and tetraodon and detected several thousand cases of conserved HCE-gene associations, and also cases of HCEs with no common target genes. A few examples underscore the potential significance of our findings about several individual genes. We found that the conserved association between HCE/HCEs and gene/genes are not restricted to elements by their absolute distance on the genome. Notably, long-range associations were identified and the molecular functions of the associated genes do not show any particular overrepresentation of the functional categories previously reported. HCEs in close proximity are found to be linked with different set of gene/genes. The results reflect the highly complex correlation between HCEs and their putative target genes.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19008958 PMCID: PMC2579482 DOI: 10.1371/journal.pone.0003727
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The number of HCEs and genes corresponding to the same HGLB.
In a minority of HGLBs a single gene is associated with one or more HCEs. More often, several HCEs were associated with a number of common genes.
Twenty-two single-gene HGLBs.
| Number of HCEs associated | Gene name | GeneOntology annotation |
| 15 |
| phosphorylase activity; pyridoxal phosphate binding; transferase activity, transferring glycosyl groups |
| 13 |
| metal ion binding; sequence-specific DNA binding; transcription factor activity; zinc ion binding |
| 8 |
| Not available |
| 7 |
| ATP binding; contributes_to activin receptor activity; growth factor binding; inhibin beta-A binding; magnesium ion binding; manganese ion binding; nucleotide binding; protein self-association; receptor activity; transferase activity |
| 7 |
| ATP binding; nucleoside-triphosphatase activity; nucleotide binding |
| 7 |
| purinergic nucleotide receptor activity, G-protein coupled; receptor activity; rhodopsin-like receptor activity |
| 6 |
| DNA-directed RNA polymerase activity; hydrolase activity; phosphoprotein phosphatase activity |
| 5 |
| hedgehog receptor activity |
| 5 |
| potassium ion binding; protein binding; voltage-gated ion channel activity; voltage-gated potassium channel activity |
| 5 |
| metal ion binding; protein binding; zinc ion binding |
| 4 |
| hydrolase activity; receptor activity; transmembrane receptor protein tyrosine phosphatase activity |
| 4 |
| metal ion binding; protein binding; zinc ion binding |
| 4 |
| metal ion binding; nucleic acid binding; zinc ion binding |
| 4 |
| cysteine-type endopeptidase activity; ubiquitin thiolesterase activity |
| 2 |
| 2-polyprenyl-6-methoxy-1,4-benzoquinone methyltransferase activity; O-methyltransferase activity; hexaprenyldihydroxybenzoate methyltransferase activity; transferase activity |
| 2 |
| protein binding |
| 2 |
| cysteine-type endopeptidase activity; protein binding; ubiquitin thiolesterase activity |
| 2 |
| Not available |
| 2 |
| RNA binding; nucleotide binding |
| 1 |
| chloride ion binding; extracellular ligand-gated ion channel activity |
| 1 |
| Not available |
| 1 |
| protein binding |
The functional description of associated genes is based on the Gene Ontology annotation.
Figure 2Human genomic environment of the three cases in which multi-HCEs are associated with only one gene.
The three genes are all “trans-dev” associated, and labeled with surrounding oval. A: Six HCEs clustered in a 0.5 Mb region are linked with the disease-associated gene CTDP1. The CTDP1 gene and the six HCEs are located more than 5 Mb apart. B: Thirteen HCEs are associated with the single gene TSHZ3 (ZNF537). C: Five HCEs are linked with the gene VPS41.
Percentage of overlapping length of HGLBs and GRBs (%).
| Min | Median | Mean | Max | |
| Over the length of GRB | 1.5 | 12.9 | 24.6 | 1 |
| Over the length of HGLB | 0.1 | 2.7 | 7.4 | 1 |
Figure 3A sketch map of interlaced HGLBs.
Two HGLBs interlaced each other on human chromosome 4. The two HGLBs reside in two different chromosomes in the mouse, rat, zebrafish and tetraodon genomes respectively. The conserved associations between HCEs and genes are not restricted to elements in relative proximity on the genome. HCEs belonging to the same cluster are divided into two HCE blocks, which are linked with two different sets of genes. (To compact the figure, several genes and HCEs are ignored and the size of the associated genomic region is not adjusted to the real scale. Rectangles represent genes while bars stand for HCEs. Lines link ortholgous sequence elements, and are labeled with the same color for the same HGLB block.)
The distances of HCE-gene pairs in the human genome (Kb).
| Min | Median | Mean | Max | |
| Distance of HCE-gene pairs | 1.7 | 6,285 | 15,107 | 82,725 |
The distances were measured from midpoint to midpoint.
The length of HCE/gene blocks and the associated number of HCE(s)/gene(s).
| Min | Median | Mean | Max | ||
| HCE block | Length (Kb) | 0.074 | 173 | 3,700 | 76,299 |
| Number of HCEs | 1 | 4 | 7 | 58 | |
| Gene block | Length (Kb) | 4 | 1,999 | 16,108 | 76,782 |
| Number of genes | 1 | 2 | 3 | 17 | |
Lengths (Kb) are measured based on the human genomic annotation.
The number of HCEs with consistent genomic location in the six genomes.
| exonic | intronic | intergenic | genic | |
| Number of HCEs | 35 | 34 | 243 | 86 |
| Percentage (%) | 3.7 | 3.6 | 25.7 | 9.1 |