| Literature DB >> 24923327 |
Katrin Rademacher1, Christopher Schröder2, Deniz Kanber1, Ludger Klein-Hitpass3, Stefan Wallner4, Michael Zeschnigk1, Bernhard Horsthemke5.
Abstract
Imprinting of the human RB1 gene is due to the presence of a differentially methylated CpG island (CGI) in intron 2, which is part of a retrocopy derived from the PPP1R26 gene on chromosome 9. The murine Rb1 gene does not have this retrocopy and is not imprinted. We have investigated whether the RB1/Rb1 locus is unique with respect to these differences. For this, we have compared the CGIs from human and mouse by in silico analyses. We have found that the human genome does not only contain more CGIs than the mouse, but the proportion of intronic CGIs is also higher (7.7% vs. 3.5%). At least 2,033 human intronic CGIs are not present in the mouse. Among these CGIs, 104 show sequence similarities elsewhere in the human genome, which suggests that they arose from retrotransposition. We could narrow down the time points when most of these CGIs appeared during evolution. Their methylation status was analyzed in two monocyte methylome data sets from whole-genome bisulfite sequencing and in 18 published methylomes. Four CGIs, which are located in the RB1, ASRGL1, PARP11, and PDXDC1 genes, occur as methylated and unmethylated copies. In contrast to imprinted methylation at the RB1 locus, differential methylation of the ASRGL1 and PDXDC1 CGIs appears to be sequence dependent. Our study supports the notion that the epigenetic fate of the retrotransposed DNA depends on its sequence and selective forces at the integration site.Entities:
Keywords: CpG islands; epigenetics; evolution; methylation; retrocopy
Mesh:
Year: 2014 PMID: 24923327 PMCID: PMC4122923 DOI: 10.1093/gbe/evu125
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Location of CGIs in the Human and Mouse Genomes with Regard to the CCDS
| CGIs | Human (hg19) | Mouse (mm10) | ||
|---|---|---|---|---|
| Number | Percentage | Number | Percentage | |
| CGIs (UCSC) | 27,537 | — | 15,997 | — |
| CGIs (analyzed) | 28,396 | 100 | 16,643 | 100 |
| CGIs in gene | 17,807 | 62.71 | 12,587 | 75.63 |
| CGIs not in gene | 10589 | 37.29 | 4,056 | 24.37 |
| TSS | 503 | 1.77 | 474 | 2.85 |
| TSS + 5′-UTR | 3,412 | 12.02 | 3,029 | 18.20 |
| TSS + 5′-UTR + CDS (Exon) | 1,078 | 3.80 | 1,189 | 7.14 |
| TSS + 5′-UTR + CDS (Exon) + CDS (Intron) | 5,223 | 18.39 | 4,692 | 28.19 |
| TSS + 5′-UTR + CDS (Exon) + CDS (Intron) + 3′-UTR | 275 | 0.97 | 107 | 0.64 |
| 5′-UTR | 491 | 1.73 | 303 | 1.82 |
| 5′-UTR + CDS (Exon) | 93 | 0.33 | 112 | 0.67 |
| 5′-UTR + CDS (Exon) + CDS (Intron) | 273 | 0.96 | 225 | 1.35 |
| 5′-UTR + CDS (Exon) + CDS (Intron) + 3′-UTR | 27 | 0.10 | 13 | 0.08 |
| CDS (Exon) | 845 | 2.98 | 781 | 4.69 |
| CDS (Exon) + CDS (Intron) | 2,453 | 8.64 | 758 | 4.55 |
| CDS (Exon) + CDS (Intron) + 3′-UTR | 620 | 2.18 | 222 | 1.33 |
| CDS (Exon) + 3′-UTR | 70 | 0.25 | 39 | 0.23 |
| 3′-UTR | 270 | 0.95 | 64 | 0.38 |
Note.—The table shows the distribution of human and mouse CGIs dependent on their CCDS location. In addition to the total number, the percentages are given. Five classes for CGI characterization are defined, where a CGI can overlap no, one, or more classes. The classes are: TSS (200-bp region upstream of the TSS), 5′-UTR, CDS (Exon), CDS (Intron), and 3′-UTR (see supplementary fig. S1, Supplementary Material online).
aThe analyzed number of CGIs is higher than the downloaded number from the UCSC browser, because one CGI can belong to more than one CCDS.
Methylation Analysis of 104 Human/Nonmurine Intronic CGIs
| CGIs | All Analyzed CGIs | Methylation ( | Methylation (20% ≥ | Methylation ( |
|---|---|---|---|---|
| Methylome 1 | ||||
| Numbers | 104 | 15 | 12 | 77 |
| Methylation | 77 | 2 | 54 | 95 |
| Coverage | 14 | 8 | 12 | 12 |
| Methylome 2 | ||||
| Numbers | 104 | 18 | 11 | 75 |
| Methylation | 77 | 4 | 68 | 95 |
| Coverage | 8 | 5 | 8 | 9 |
Note.—The table summarizes degree of methylation (%) and number of CGIs analyzed in two monocyte methylome data set (methylome 1: 1,929,952,791 reads, duplication rate 0.22, mapping efficiency 0.99, and conversion 0.994; methylome 2: 1,407,767,072 reads, duplication rate 0.15, mapping efficiency 0.98, and conversion 0.996). CGIs are divided into three classes, corresponding to their methylation level. Methylation less than 20% (unmethylated), methylation between 20% and 80% (candidates for differential methylation), and methylation more than 80% (methylated).
Read Analysis and CpG Methylation of CGIs with Intermediate Methylation Levels (Methylome 1)
| CGI_ID | Gene | Chr. | Length (bp) | Mean Methylation (%) | Mean Coverage | Number of Reads | Reads Unmethylated (<20% methylation) | Reads Methylated (>80% methylation) | Reads Partially Methylated (≥20% and <80% methylation) | CpG Methylation | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (Number) | (%) | (Number) | (%) | (Number) | (%) | VAR | SD | |||||||
| 1911_1_hg19 | 1 | 214 | 81 | 18 | 70 | 2 | 3 | 48 | 69 | 20 | 29 | 0.01 | 0.11 | |
| 4675_1_hg19 | 3 | 326 | 73 | 19 | 108 | 8 | 7 | 70 | 65 | 30 | 28 | 0.02 | 0.12 | |
| 4754_1_hg19 | 3 | 463 | 32 | 9 | 66 | 31 | 47 | 15 | 23 | 20 | 30 | 0.09 | 0.29 | |
| 9009_1_hg19 | 7 | 210 | 78 | 8 | 24 | 2 | 8 | 14 | 58 | 8 | 33 | 0.08 | 0.29 | |
| 15205_1_hg19 | 12 | 1219 | 84 | 12 | 188 | 7 | 4 | 143 | 76 | 38 | 20 | 0.03 | 0.18 | |
| 15290_1_hg19 | 12 | 624 | 23 | 14 | 153 | 57 | 37 | 41 | 27 | 55 | 36 | 0.08 | 0.27 | |
| 19870_1_hg19 | 16 | 207 | 58 | 15 | 45 | 17 | 38 | 17 | 38 | 11 | 24 | 0.19 | 0.44 | |
| 20636_1_hg19 | 17 | 506 | 58 | 6 | 44 | 4 | 9 | 20 | 45 | 20 | 45 | 0.08 | 0.29 | |
| 25767_1_hg19 | 21 | 403 | 70 | 17 | 93 | 22 | 24 | 50 | 54 | 21 | 23 | 0.10 | 0.32 | |
Note.—This table shows the results of the read methylation and CpG methylation analysis of 14 candidate CGIs of methylome 1 (for methylome 2, see supplementary table S2, Supplementary Material online). In addition to CGI_ID, gene, chromosome, length, mean methylation, mean coverage, and number of reads, the reads are divided into three classes: Unmethylated, methylated, and partially methylated. The last column shows the results of the CpG methylation analyses, variance (VAR), and standard derivation (SD) over all single CpGs were calculated. Bold: CGIs that might be differentially methylated (partially methylated reads ≤20% and VAR ≤0.05).
FMethylation patterns of three intronic CGIs. For each CGI, a histogram showing the distribution of reads with different levels of methylation as well as a bar plot showing the methylation levels of each CpG across all reads within the CGI is shown. Red indicates the percentage of methylated CpGs and blue the percentage of unmethylated CpGs. (A) CGI 14414_1_hg19 (ASRGL1) is mainly covered by unmethylated and highly methylated reads. Each CpG has approximately 50% of methylation. These results indicate that this CGI might be differentially methylated. (B) The CGI, 19870_1_hg19 (SLC7A5), also, is mainly covered by unmethylated and highly methylated reads, however, the CpGs do not have approximately 50% methylation; whereas the 5′-end of the CGI is nearly unmethylated, the 3′-end is highly methylated. This result indicates that this CGI is not differentially methylated. (C) CGI 9009_1_hg19 (MAD1L1) does not show a bimodal distribution of methylation, and the methylation level of individuals CpGs is highly variable. This result indicates that this CGI is not differentially methylated.
CGI Methylation Levels in Other Tissues
| Sample_Name | Cell/Tissue | 14414_1_hg19 | 15224_1_hg19 | 16634_1_hg19 | 19100_1_hg19 | 20632_1_hg19 | |||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Meth. (%) | Cov. | Meth. (%) | Cov. | Meth. (%) | Cov. | Meth. (%) | Cov. | Meth. (%) | Cov. | ||
| Monocyte methylome 1 | Monocyte | 55 | 10 | 42 | 11 | 63 | 13 | 75 | 11 | 22 | 7 |
| Monocyte methylome 2 | Monocyte | 49 | 9 | 46 | 8 | 72 | 8 | 79 | 6 | 2 | 2 |
| Frontal_cortex_normal_1 | Cortex | 84 | 27 | 53 | 38 | 76 | 48 | 28 | 57 | 4 | 33 |
| Frontal_cortex_normal_2 | Cortex | 83 | 20 | 63 | 33 | 76 | 32 | 42 | 34 | 13 | 24 |
| Frontal_cortex_AD_1 | Cortex | 89 | 22 | 52 | 30 | 76 | 38 | 39 | 45 | 14 | 33 |
| Frontal_cortex_AD_2 | Cortex | 88 | 36 | 69 | 43 | 76 | 56 | 41 | 40 | 16 | 44 |
| IMR90 | Lung, fetal, fibroblast | 56 | 18 | 53 | 13 | 82 | 19 | 53 | 32 | 25 | 11 |
| Colon_Primary_Normal | Colon | 52 | 38 | 67 | 42 | 79 | 59 | 44 | 47 | 12 | 23 |
| Human sperm | Sperm | 9 | 4 | 1 | 5 | 5 | 7 | 7 | 11 | 6 | 16 |
| Adult liver replicate 1 | Liver | 82 | 61 | 81 | 74 | 71 | 77 | 54 | 50 | 10 | 28 |
| Adult liver replicate 2 | Liver | 56 | 53 | 52 | 45 | 61 | 57 | 47 | 49 | 11 | 38 |
| Hippocampus middle replicate 1 | Hippocampus middle | 88 | 62 | 67 | 50 | 77 | 65 | 36 | 55 | 19 | 26 |
| Hippocampus middle replicate 2 | Hippocampus middle | 87 | 35 | 69 | 39 | 77 | 48 | 30 | 46 | 16 | 26 |
| Fetal heart (119) | Fetal heart | 64 | 39 | 38 | 37 | 58 | 51 | 34 | 53 | 7 | 27 |
| Fetal thymus (1,238) | Fetal thymus | 65 | 29 | 29 | 43 | 49 | 42 | 34 | 63 | 22 | 36 |
| Fetal adrenal (1,244) | Fetal adrenal | 52 | 33 | 37 | 26 | 67 | 29 | 52 | 44 | 10 | 26 |
| Fetal muscle leg (1,243) | Fetal muscle leg | 55 | 35 | 28 | 44 | 54 | 38 | 32 | 52 | 5 | 29 |
| Fetal brain (515) | Fetal brain | 78 | 22 | 46 | 21 | 51 | 30 | 19 | 34 | 10 | 20 |
Note.—For each CGI, mean methylation (meth.) and mean coverage (cov.) are specified.
aData published in this article.
bData published by Ziller et al. (2013).
FSingle molecule methylation analysis of the intronic ASRGL1, PARP11, and PDXDC1 CGIs in heterozygous individuals. Three examples are shown. The amplicons cover only parts of the CGIs, and some include flanking CpGs. The left part of the figure shows all amplicon reads, whereas the right part shows the sequence reads sorted by SNP allele. (A) Methylation pattern of the ASRGL1 CGI. The first two CpGs do not belong to the CGI. (B) Methylation pattern of the PARP11 CGI. (C) Methylation pattern of the PDXDC1 CGI. The last four CpGs not belong to the CGI.
Evolution Analysis of 86 Human/Nonmurine Intronic CGIs
| Suborder | Haplorrhini | Strepsirrhini | ||||||
|---|---|---|---|---|---|---|---|---|
| Parvorder | Catarrhini | Platyrrhini | ||||||
| Superfamily | Hominoidea | Cercopithecoidea | ||||||
| Organism | Human | Chimpanzee | Gorilla | Orang-utan | Gibbon | Rhesus | Marmoset | Bushbaby |
| Number of CGIs | 86 | 73 | 67 | 59 | 59 | 44 | 29 | 7 |
Note.—The table gives an overview about the 86 human/nonmurine intronic CGIs and their evolution. The number stand for the human/nonmurine intronic CGIs which are present in the analyzed organism. In addition, the superfamilies, parvorders, and suborders are specified.
FEvolutionary origin of the five CGIs with intermediate methylation levels. The figure illustrates a simplified genealogy (distances are not scaled) of all analyzed primate genomes. The red arrows indicate the time point when the CGIs entered the germ line.
FStructure of the introns containing CGIs with intermediate methylation levels. The figure shows the location of the intronic CGIs and their putative ancestral origin. (A) Intron 3 of the ASRGL1 gene containing the CGI 14414_1_hg19 and the retro-RCC2. (B) Intron 1 of the PARP11 gene containing CGI 15224_1_hg19 and the retro-OTUD4. (C) Intron 1 of the PDXDC1 gene containing CGI 19100_1_hg19 and retro-KIAA2013. (D) Intron 1 of the MYOD1 gene with CGI 20632_1_hg19 and the putative ancestral origin, a CGI on the X chromosome. (E) Intron 2 of the RB1 gene containing the CGI 16634_1_hg19 and retro-PPP1R26P1.