| Literature DB >> 21092170 |
Barbara Hutter1, Matthias Bieg, Volkhard Helms, Martina Paulsen.
Abstract
BACKGROUND: Genomic imprinting is an evolutionary conserved mechanism of epigenetic gene regulation in placental mammals that results in silencing of one of the parental alleles. In order to decipher interactions between allele-specific DNA methylation of imprinted genes and evolutionary conservation, we performed a genome-wide comparative investigation of genomic sequences and highly conserved elements of imprinted genes in human and mouse.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21092170 PMCID: PMC3091771 DOI: 10.1186/1471-2164-11-649
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General sequence properties of human genes
| imprinted | maternally expressed | paternally expressed | autosomal | |
|---|---|---|---|---|
| G+C content of genes | 46.56% | 46.91% | 45.98% | 45.47% |
| CpGobs/CpGexp of genes | 0.305 | 0.305 | 0.315 | 0.280 |
| gene length (bp) | 29641 | 46976 | 26829 | 22422 |
| intron length (bp) | 1637* | 1652 | 1633 | 1549 |
| length of intergenic regions (bp) | 50879* | 49483 | 55322 | 26410 |
| coverage of introns with purely intronic PCSs | 0.74% | 0.69% | 1.09% | 0.45% |
| purely intronic PCSs per 10 kb of intron per gene | 1.72 | 1.35 | 2.04 | 1.06 |
| coverage of intergenic regions with PCSs | 0.66% | 0.74% | 0.64% | 1.11% |
| intergenic PCSs per 10 kb per gene | 1.29 | 1.41 | 1.29 | 2.17 |
* p < 0.01 (Wilcoxon test for comparison with autosomal genes)
Gene length: length of transcribed DNA sequence from most upstream transcriptional start site to most downstream transcriptional termination site; intergenic region: For this, the intergenic segments between any two genes were cut into two halves. Each of them was assigned to the closest gene. As intergenic region of a gene we then added the lengths of the halved upstream intergenic segment and of the halved downstream intergenic segment. Since the features show a highly skewed distribution, medians are given instead of averages and standard deviations.
Features of different PCS classes in human
| group | number of PCSs | conser-vation score | length (bp) | G+C content | PCSs with ≥ 1 CpG | CpGobs/CpGexp of PCSs with ≥ 1 CpG | (TpG+CpA)/(2·CpG) ratio of PCSs with ≥ 1 CpG |
|---|---|---|---|---|---|---|---|
| imprinted | 321 | 392 | 69 | 65.93% | 94.38% | 0.79 | 0.73 |
| maternal | 170 | 417 | 75 | 66.48% | 95.29% | 0.85a | 0.59 |
| paternal | 151 | 379 | 62 | 65.44% | 92.72% | 0.74* | 0.83* |
| autosomal | 55931 | 385 | 58 | 67.48% | 95.30% | 0.83 | 0.68 |
| imprinted | 1120 | 320* | 40 | 38.89%**** | 32.05%*** | 0.51**** | 2.50***** |
| maternal | 719 | 320 | 41 | 39.39%**** | 33.38%*** | 0.52** | 2.50***** |
| paternal | 401 | 320 | 40 | 38.10%** | 29.68% | 0.50** | 2.50**** |
| autosomal | 365258 | 329 | 40 | 36.54% | 25.42% | 0.42 | 3.83 |
| imprinted | 1787 | 325**** | 40**** | 39.04%***** | 33.97%*** | 0.47**** | 2.83***** |
| maternal | 832 | 320**** | 39*** | 35.78%c | 25.00%b | 0.53****a | 3.00**** |
| paternal | 955 | 329*** | 41 | 42.31%*** | 41.78%*** | 0.45* | 2.50***** |
| autosomal | 588309 | 338 | 43 | 36.36% | 27.32% | 0.42 | 4.00 |
| imprinted | 2357 | 325**** | 39**** | 38.46%***** | 29.87%*** | 0.45**** | 3.00***** |
| maternal | 1304 | 320***** | 39**** | 36.72%c | 25.61%b | 0.46**** | 3.00***** |
| paternal | 1053 | 329*** | 40 | 40.74%***** | 35.14%*** | 0.44 | 3.00***** |
| autosomal | 844703 | 338 | 42 | 36.36% | 25.49% | 0.39 | 4.33 |
* p < 0.01, ** p < 0.005, *** p < 0.001, **** p < 1e-4, ***** p < 1e-10 for comparison with autosomal PCSs (χ2 test for fractions of PCSs, Wilcoxon test for all other features). If maternally and paternally expressed genes differ significantly from each other, this is indicated as following: a p < 0.05, b p < 0.005, c p < 1e-4
Unique PCSs do not overlap with protein encoding exons, repetitive elements, or CpG islands. The shown absolute numbers of PCSs are not normalized by the lengths of the analyzed genomic segments. Hence they give information about the amount of available data but not about frequencies or coverage of PCSs in the listed genomic segments.
Figure 1Distribution of repetitive elements in phastCons sequences. PCSs that contain at least 1 bp of a repetitive element were summed up in categories according to the first overlapping repeat. The most prominent enrichment is that of PCSs overlapping with LINE-1 repeats (L1, red) in the intergenic regions of imprinted genes relative to the autosomal genome. MER: medium reiteration frequency repeats; MIR: mammalian-wide interspersed repeats; LTR: long terminal repeats; low compl./simple: low complexity regions and simple repeats; L2: LINE-2 repeats.
Figure 2Conservation score and length of exonic phastCons sequences. Using the human genome as a reference, conservation scores (A) and lengths (B) of PCSs that overlap with coding exons were determined. Compared to autosomal data (black bars), the PCSs of maternally expressed genes (red bars) are shorter and have lower conservation scores whereas PCSs of paternally expressed ones (blue bars) are similar to PCSs of autosomal genes.
Figure 3Distribution of phastCons sequences at exon boundaries. Shown are the frequencies of PCSs in introns and exons as a function of their distance from the closest exon boundary in a semi-logarithmic representation. The y-axis shows the normalized PCS density calculated as the number of PCSs per 100 bp bin divided by the number of introns in the respective group (imprinted, autosomal). On the x-axis, the position of the exon boundary is marked as zero. PCSs with negative distances are either partially or completely located in exons (thick black bar). Most of these are concentrated within a distance of 2000 bp from the exon boundary. Likewise, the number of PCSs that are completely located in introns (thin black line; positive distances) decreases rapidly within increasing distance. Beyond a distance of 10 kb (not shown here), only a few PCSs are found. Imprinted genes (red stars) possess more PCSs within about ± 2000 bp of the exon boundary than autosomal genes (blue diamonds).
6-mers enriched in intronic PCSs of imprinted genes
| human imprinted | mouse imprinted | ||||
|---|---|---|---|---|---|
| 6.21 | 4.08 | 5.91 | 4.04 | ||
| 3.51 | 4.31 | 3.64 | 3.82 | ||
| gc | 5.79 | 4.47 | 5.46 | 4.39 | |
| gc | 3.62 | 4.32 | 5.37 | 4.70 | ATF, v-Myb |
| g | 6.02 | 3.65 | 9.36 | 5.92 | |
| g | 4.31 | 6.65 | 5.96 | 6.52 | ATF |
| gggc | 3.82 | 4.01 | 3.70 | 3.99 | |
| gggg | 5.30 | 4.28 | 4.75 | 3.61 | R |
| gt | 10.53 | 7.32 | 10.92 | 5.60 | |
| tc | 4.51 | 4.18 | 4.68 | 3.96 | |