| Literature DB >> 19323843 |
Hong Sun1, Geir Skogerbø, Xiaohui Zheng, Wei Liu, Yixue Li.
Abstract
BACKGROUND: A number of vertebrate highly conserved elements (HCEs) have been detected and their genomic interval distances have been reported to be more conserved than protein coding genes among mammalian genomes. A characteristic of the human - non-mammalian comparisons is a bimodal distribution of relative distance difference of conserved consecutive HCE pairs; and it is difficult to attribute such profile to a random assortment. We therefore undertook an analysis of the human genomic regions confined by consecutive HCE pairs common to eight genomes (human, mouse, rat, chicken, frog, zebrafish, tetradon and fugu).Entities:
Mesh:
Substances:
Year: 2009 PMID: 19323843 PMCID: PMC2667192 DOI: 10.1186/1471-2164-10-133
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1RDD distribution for HCE pairs common to the five genomes.
Figure 2Median |RDD| for HCE pairs of IHRs. Median | RDD| of IHR2s were much higher than that of IHR1s for the comparison of any two pair-wise genomes.
Number of HCE pairs with different genomic locations
| Group 1 | Group 2 | total | |
| Exonic-Exonic | 29 | 16 | 45 |
| Intronic-Intronic | 62 | 57 | 119 |
| Intergenic-Intergenic | 77 | 111 | 188 |
| Exonic-Intronic | 16 | 12 | 28 |
| Exonic-Intergenic | 4 | 4 | 8 |
| Intronic-Intergenic | 0 | 15 | 15 |
| Total | 188 | 215 | 403 |
Distance between intergenic IHRs and their nearest genes.
| Distance (Kb) | |||||
| Number | Min | Median | Mean | Max | |
| Intergenic IHR1 | 75 | 0.6 | 85.3 | 113.2 | 482.0 |
| Intergenic IHR1 with CpG islands | 11 | 2.5 | 22.8 | 60.5 | 398.1 |
| Intergenic IHR2 | 106 | 0.4 | 113.0 | 150.8 | 652.1 |
| Intergenic IHR2 with CpG islands | 15 | 2.9 | 55.8 | 92.6 | 421.2 |
Percentage of repeated base pairs within IHRs.
| IHR1 | IHR2 | ||||
| Observed | Expected | Observed | Expected | ||
| Percentage of IHRs containing repeat (%) | 43 (0.001) | 74 | 97 (0.948) | 94 | |
| Average percentage of repeated base pairs (%) | SINE | 4.26 (0.001) | 16.76 | 11.00 (0.001) | 13.51 |
| LINE | 2.37 (0.001) | 26.63 | 13.42 (0.001) | 20.96 | |
| LTR | 0.60 (0.001) | 10.81 | 4.67 (0.001) | 8.53 | |
| Low_complexity | 4.86 (1) | 0.73 | 0.72 (0.984) | 0.58 | |
| scRNA | 0 (0.001) | 0.005 | 0.006 (0.783) | 0.004 | |
| DNA | 0.75 (0.001) | 3.88 | 2.78 (0.146) | 3.06 | |
| RNA | 0.03 (0.967) | 0.005 | 0.001 (0.404) | 0.005 | |
| srpRNA | 0 (0.001) | 0.008 | 0.0008 (0.110) | 0.008 | |
| snRNA | 0 (0.001) | 0.02 | 0.01 (0.716) | 0.01 | |
| tRNA | 0 (0.001) | 0.003 | 0.0009(0.219) | 0.003 | |
| rRNA | 0 (0.001) | 0.007 | 0.005(0.635) | 0.006 | |
| Simple_repeat | 1.04 (0.476) | 1.17 | 1.02 (0.828) | 0.92 | |
| Satellite | 0 (0.001) | 0.49 | 0 (0.001) | 0.36 | |
Randomly selected human genomic regions were used to test the significance, the fraction of times in which the random sample set scored lower average scores than those of the IHRs provided the basis for the statistical significance. p value was given in the bracket.
Enrichment of human indels within IHRs.
| IHR1 | IHR2 | |||
| Observed | Expected | Observed | Expected | |
| Number | 30 (0.241) | 27 | 174 (0.001) | 156 |
| Average percentage of deleted base pairs (%) | 4.53 (0.107) | 1.66 | 0.49 (0.226) | 0.39 |
| Average percentage of inserted base pairs (%) | 0.03 (0.679) | 0.05 | 0.01 (0.801) | 0.01 |
Randomly selected human genomic regions provided the background to test the significance, p value was given in the bracket.
Figure 3Correlation between the percentage of conserved sequence and the length of the two groups of IHRs. Circles represent the data for IHR1 and stars for IHR2.
Length of conserved fractions and distance between two consecutive conserved fractions within IHRs.
| Length of conserved fractions (bp) | Distance in between two consecutive conserved fractions (bp) | |||||
| Total number of conserved fractions | Median | Mean | Median | Mean | ||
| Human | IHR1 | 1460 | 37 | 73 | 54 | 327 |
| IHR2 | 10041 | 39 | 76 | 100 | 894 | |
| p value (IHR1, IHR2) | 0.1798 | 2.2e-16 | ||||
| Chicken | IHR1 | 727 | 37 | 65 | 50 | 569 |
| IHR2 | 3504 | 42 | 72 | 74 | 1302 | |
| p value (IHR1, IHR2) | 0.0020 | 8.3e-11 | ||||
| Frog | IHR1 | 576 | 158 | 233 | 140 | 867 |
| IHR2 | 2531 | 156 | 226 | 249 | 1463 | |
| p value (IHR1, IHR2) | 0.6452 | 6.2e-06 | ||||
| Zebrafish | IHR1 | 560 | 65 | 96 | 144 | 903 |
| IHR2 | 1863 | 73 | 104 | 241 | 1686 | |
| p value (IHR1, IHR2) | 0.0018 | 2.3e-05 | ||||
Two samples Wilcoxon test was used to test the significance.