| Literature DB >> 19906696 |
David Serre1, Byron H Lee, Angela H Ting.
Abstract
DNA methylation is an epigenetic modification involved in both normal developmental processes and disease states through the modulation of gene expression and the maintenance of genomic organization. Conventional methods of DNA methylation analysis, such as bisulfite sequencing, methylation sensitive restriction enzyme digestion and array-based detection techniques, have major limitations that impede high-throughput genome-wide analysis. We describe a novel technique, MBD-isolated Genome Sequencing (MiGS), which combines precipitation of methylated DNA by recombinant methyl-CpG binding domain of MBD2 protein and sequencing of the isolated DNA by a massively parallel sequencer. We utilized MiGS to study three isogenic cancer cell lines with varying degrees of DNA methylation. We successfully detected previously known methylated regions in these cells and identified hundreds of novel methylated regions. This technique is highly specific and sensitive and can be applied to any biological settings to identify differentially methylated regions at the genomic scale.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19906696 PMCID: PMC2811030 DOI: 10.1093/nar/gkp992
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 2.Sampling curves for HCT116 and DICERex5. The curves show the number of 100-bp windows identified as being methylated (y-axis) for a given number of sequences randomly drawn from the entire dataset (x-axis). Blue lines represent sampling curves for HCT116, and red lines correspond to data for DICERex5. The solid lines show the results obtained when considering all significantly methylated windows (≥4 reads), while the dash lines represent the results for highly methylated loci (≥10 reads) only. The vertical black line shows the number of reads used in this study.
Experimental validation by bisulfite sequencing
| Locus analyzed | MiGS (No. of methylated windows) | Bisulfite Seq. (% MetC) | ||||||
|---|---|---|---|---|---|---|---|---|
| Coordinates | Name | Total | HCT116 | DICERex5 | DKO | HCT116 | DICERex5 | DKO |
| Significantly methylated in HCT116 and DICERex5 in MiGS assay | ||||||||
| chr19:1507384-1507703 | 5 | 5 | 5 | 5 | 99 | 99 | 41 | |
| chr5:92949531-92949838 | 4 | 4 | 4 | 4 | 100 | 99 | 37 | |
| chr1:154452833-154453297 | 5 | 5 | 5 | 3 | 98 | 98 | 31 | |
| chr1:211190670-211190970 | 4 | 3 | 2 | 0 | 76 | 28 | 16 | |
| chrX:21302569-21302868 | 4 | 4 | 4 | 1 | 95 | 96 | 13 | |
| chr1:247108292-247108567 | 4 | 4 | 4 | 4 | 98 | 98 | 11 | |
| chrX:23262922-23263235 | 4 | 4 | 4 | 2 | 94 | 96 | 6 | |
| chr1:164401866-164402277 | 5 | 5 | 5 | 1 | 93 | 94 | 1 | |
| chr16:33869047-33869405 | Peri-16 | 5 | 5 | 5 | 0 | 96 | 96 | 1 |
| chr10:83624002-83624444 | 5 | 5 | 5 | 0 | 94 | 92 | 1 | |
| chr20:26136535-26136825 | 4 | 3 | 3 | 0 | 99 | 98 | 1 | |
| chr1:47654991-47655443 | 6 | 6 | 6 | 0 | 97 | 98 | 1 | |
| chr10:79066821-79067106 | 4 | 4 | 4 | 0 | 98 | 99 | 1 | |
| chr9:23810766-23810968 | 3 | 3 | 3 | 0 | 98 | 98 | 1 | |
| chr9:25667441-25667703 | 4 | 4 | 4 | 0 | 95 | 92 | 1 | |
| chr10:31649125-31649519 | 5 | 5 | 1 | 0 | 72 | 25 | 0 | |
| chr3:128830800-128831068 | 3 | 3 | 3 | 0 | 87 | 34 | NA | |
| No evidence of methylation in HCT116 and DICERex5 in MiGS assay | ||||||||
| chr19:44113033-44113323 | S/M | 4 | 0 | 0 | 0 | 1 | NA | NA |
| chr8:145209542-145209851 | 4 | 0 | 0 | 0 | 1 | NA | NA | |
| chr7:105539547-105539910 | 5 | 0 | 0 | 0 | 0 | NA | NA | |
| chr18:10515854-10516103 | 4 | 0 | 0 | 0 | 0 | NA | NA | |
| chr1:143920369-143920682 | 4 | 0 | 0 | 0 | 0 | NA | NA | |
aName of the closest RefSeq gene.
bBisulfite sequencing not performed.
cSARS2/MRPS12 bidirectional promoter.
Mapping distribution of DNA sequence reads
| HCT116 | DICERex5 | DKO | |
|---|---|---|---|
| No. of reads | 19 041 613 | 18 315 610 | 4 393 056 |
| Reads mapped uniquely to Human Genome | 7 102 330 (37%) | 6 920 203 (38%) | 1 207 400 (27%) |
| Non-repeat | 5 767 100 | 5 620 190 | 977 692 |
| Repeat elements | 1 335 230 | 1 300 013 | 229 708 |
| Reads mapped to multiple locations on the Human Genome | 1 978 515 (10%) | 2 138 088 (12%) | 439 995 (10%) |
| Non-repeat | 811 412 | 793 801 | 103 734 |
| Repeat elements | 1 167 103 | 1 344 287 | 336 261 |
| Reads mapped to the unassembled Human Genome | 4 210 471 (22%) | 3 811 890 (21%) | 167 253 (4%) |
| Non-repeat | 3 368 209 | 3 083 404 | 159 548 |
| Repeat elements | 842 262 | 728 486 | 7705 |
| Others not mapped | 5 694 794 (30%) | 5 445 429 (30%) | 2 578 408 (59%) |
Figure 1.Bisulfite sequencing of newly identified methylated regions. Bisulfite sequencing was performed on (A) peri-centromeric CpG island on chromosome 16, (B) promoter CpG island of PTCHD1 and (C) ZEB1 intronic CpG island in HCT116, DICERex5 and DKO cells. Genomic coordinates (NCBI Build 36.1) for the bisulfite sequencing amplicon of chromosome 16 peri-centromeric CpG island are shown. Positions relative to the transcription start site are indicated for the amplicons of PTCHD1 and ZEB1. Each circle represents a CpG dinucleotide with black circles representing methylated cytosines and white ones representing unmethylated cytosines. Each row represents one individual allele sequenced. Rectangles above each cell line represent the non-overlapping 100-bp windows covered by each amplicon. Methylated windows identified by MiGS are shaded in gray while unmethylated windows are in white.
Distribution of DNA methylation
| Hg18 | Distribution of methylation | Methylation at CpG islands | |||||
|---|---|---|---|---|---|---|---|
| HCT116 | DICERex5 | DKO | HCT116 | DICERex5 | DKO | ||
| 5′-end | 0.82% | 11.29% | 10.98% | 4.83% | 83.66% (7×) | 83.54% (7×) | 76.02% (4×) |
| 3′-end | 0.71% | 2.59% | 2.56% | 2.14% | 51.14% (40×) | 50.82% (40×) | 43.06% (29×) |
| Genic | 41.20% | 50.20% | 50.26% | 60.92% | 40.47% (98×) | 40.23% (97×) | 31.81% (67×) |
| Intergenic | 57.26% | 35.92% | 36.20% | 32.11% | 36.51% (151×) | 35.88% (147×) | 23.49% (82×) |
| Total | 100% | 100% | 100% | 100% | 44.20% (90×) | 43.67% (88×) | 31.52% (52×) |
aProportion of the human genome (NCBI Build 36.1) located within 500 bp of a RefSeq gene transcription start site (5′-end), within 500 bp of a RefSeq gene transcription stop site (3′-end), in the body of a RefSeq gene (Genic), or between RefSeq genes (Intergenic).
bDistribution of methylated 100-bp windows according to their genomic location.
cProportion of methylated windows that overlap with annotated CpG islands. The preferential enrichment of methylation at CpG islands is shown in brackets and is calculated by dividing the proportion of methylated windows in CpG islands by the proportion of methylated windows not in CpG islands in each genomic context.
Gene expression changes of methylated genes upon demethylation
| Chemical demethylation | Genetic demethylation | |||||
|---|---|---|---|---|---|---|
| Mean change | Mean change | |||||
| 5′-end methylation | 3031 | 1.57 | <2.2 × 10−16 | 1.81 | 2.2 × 10−16 | |
| No methylation | 16 695 | 1.12 | 1.03 | |||
| 5′-end CpG island methylation | 2465 | 1.63 | 1.1 × 10−3 | 1.91 | 3.6 × 10−4 | |
| 5′-end non CpG island methylation | 566 | 1.33 | 1.45 | |||
| 5′-end non CpG island methylation | 566 | 1.33 | 2.4 × 10−3 | 1.45 | 4.4 × 10−7 | |
| No methylation | 16 695 | 1.12 | 1.03 | |||
| 5′end methylation | 3031 | 1.57 | <2.2 × 10−16 | 1.81 | <2.2 × 10−16 | |
| Genic or 3′-end methylation | 5294 | 0.99 | 0.95 | |||
aAverage of ratios of gene expression in DAC-treated HCT116 cells over those in HCT116 cells.
bAverage of ratios of gene expression in DKO cells over those in HCT116 cells.