| Literature DB >> 16224102 |
Alexander Meissner1, Andreas Gnirke, George W Bell, Bernard Ramsahoye, Eric S Lander, Rudolf Jaenisch.
Abstract
We describe a large-scale random approach termed reduced representation bisulfite sequencing (RRBS) for analyzing and comparing genomic methylation patterns. BglII restriction fragments were size-selected to 500-600 bp, equipped with adapters, treated with bisulfite, PCR amplified, cloned and sequenced. We constructed RRBS libraries from murine ES cells and from ES cells lacking DNA methyltransferases Dnmt3a and 3b and with knocked-down (kd) levels of Dnmt1 (Dnmt[1(kd),3a-/-,3b-/-]). Sequencing of 960 RRBS clones from Dnmt[1(kd),3a-/-,3b-/-] cells generated 343 kb of non-redundant bisulfite sequence covering 66212 cytosines in the genome. All but 38 cytosines had been converted to uracil indicating a conversion rate of >99.9%. Of the remaining cytosines 35 were found in CpG and 3 in CpT dinucleotides. Non-CpG methylation was >250-fold reduced compared with wild-type ES cells, consistent with a role for Dnmt3a and/or Dnmt3b in CpA and CpT methylation. Closer inspection revealed neither a consensus sequence around the methylated sites nor evidence for clustering of residual methylation in the genome. Our findings indicate random loss rather than specific maintenance of methylation in Dnmt[1(kd),3a-/-,3b-/-] cells. Near-complete bisulfite conversion and largely unbiased representation of RRBS libraries suggest that random shotgun bisulfite sequencing can be scaled to a genome-wide approach.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16224102 PMCID: PMC1258174 DOI: 10.1093/nar/gki901
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Reduced representation bisulfite sequencing. Genomic DNA is digested to completion using a restriction enzyme (here BglII). After size-selection an adapter is added. The DNA is denatured, and unmethylated cytosines are bisulfite-converted to uracil. The two resulting C-poor strands are no longer complementary to each other. Primers specific for the converted adapter sequence are used to fill-in the second (G-poor) strand and for PCR amplification. PCR products are cloned and sequenced. Sequences generated from RRBS libraries are projected onto the genome by searching against a reduced representation database of BglII fragments that had been size-selected and bisulfite-converted in silico.
Figure 2Generation of Dnmt1, Dnmt3a and Dnmt3b deficient ES cells. (A) Dnmt3a/3b homozygous double knockout ES cells have been described earlier (43). The knockdown virus is expressing a Dnmt1 shRNA, whereas the control is not. The infection was termed Passage 0. After the infection ES cells were passaged four times on feeders followed by two additional passages under feeder-free conditions (Passage 6). Number of viral integrations were determined by Southern blotting and clones with single integration were selected (data not shown). (B) Western blot analysis. The status of the different Dnmts is indicated above. The knockdown ES cells showed a significant reduction in Dnmt1 levels compared with their sister clone. c/c is a previously reported Dnmt1 null ES line (16).
Figure 3Methylation status of the Dnmt-deficient ES cells. All knockdown and control ES cells were analyzed at Passage 6 after infection. (A) Minor satellite repeat methylation. HpaII digests of genomic DNA were hybridized to minor satellite probe pMR150. MspI is an isoschizomere of HpaII and cuts irrespective of the methylation status (i.e. appearance of a ladder in HpaII lane indicates loss of methylation). The status of the different Dnmts is shown above the Southern blot. All knockdown and control ES cell lines were generated as described in Figure 2. Each knockdown line contains a single lentiviral integration (data not shown). (B) IAP methylation. HpaII-digested genomic DNA was hybridized to an IAP probe. (C) COBRA analysis for imprinted genes. Genomic DNA was bisulfite treated and after PCR amplification of H19, Snrpn, Peg1 and Peg3 a restriction digest was performed to analyze the methylation status of the differentially methylated regions (U = unmethylated, M = methylated). The second (smaller) fragment of the methylated and digest product is not shown. (D) Total mCpG quantification by NNA. The spots corresponding to CpG and mCpG are indicated in the upper left panel. The per cent mCpG/(CpG+mCpG) are displayed in each panel (estimated error 5%).
Sequencing and methylation statistics
| ES cell line | Dnmt[1kd, 3a−/−,3b−/−] | wild-type |
|---|---|---|
| Colonies picked | 960 | 192 |
| Bisulfite sequencing reads | 876 | 186 |
| Insert in plus orientation | 153 | 50 |
| Plus read complete | 38 | 23 |
| Insert in minus orientation | 723 | 136 |
| Minus read complete | 719 | 134 |
| Complete bisulfite sequencing reads | 757 | 157 |
| Genome hits | 676 | 148 |
| Non-redundant genome hits | 609 | 147 |
| Total bp of non-redundant genome hits | 342556 | 80692 |
| Cytosines in aligned genome sequence | 66212 | 15296 |
| 5-Methylcytosine (mC) | 38 (0.06%) | 707 (4.6%) |
| CpG in aligned genome sequence | 3458 | 594 |
| mCpG | 35 (1.0%) | 533 (90%) |
| CpA in aligned genome sequence | 23 046 | 5601 |
| mCpA | 0 (0%) | 135 (2.4%) |
| CpT in aligned genome sequence | 25 505 | 5924 |
| mCpT | 3 (0.01%) | 39 (0.7%) |
| CpC in aligned genome sequence | 14203 | 3177 |
| mCpC | 0 (0%) | 0 (0%) |
aExcludes growth failures, sequencing failures, mixed clones, vector-only clones and a total of nine reads that showed no bisulfite conversion at all.
bSequenced strand is the bisulfite-converted C-poor strand.
cHigh-quality sequence across entire length of BglII fragment.
dSequenced strand is the G-poor complementary strand of the bisulfite-converted strand.
eIncludes sequences that are duplicated in the genome. BglII fragments that were hit more than once were counted only once.
Figure 4Size distributions of the sequenced clones from each library. RRBS reads from wild-type ES cells (black) had a mean of 553 bp and an SD of 17 bp. Dnmt[1kd,3a−/−,3b−/−] reads were (570 ± 20) bp in size (grey bars). The size distributions of the two libraries were overlapping but not identical.
Fraction (in per cent) of various types of sequences in the mouse reference genome, the 500–600 bp BglII reduced representation thereof (RR genome) and RRBS sequences from Dnmt-deficient and wild-type ES cells
| Genome | RR genome | RRBS | RRBS | |
|---|---|---|---|---|
| GC content | 42.0 | 41.5 | 43.7 | 43.1 |
| CpG islands | 0.4 | 0.1 | 0.1 | 0.0 |
| ENSEMBL genes | 34.3 | 35.0 | 41.9 | 35.3 |
| Promoter | 5.0 | 5.0 | 7.0 | 4.7 |
| SINEs | 8.2 | 2.7 | 2.6 | 2.3 |
| LINEs | 19.2 | 10.2 | 10.7 | 11.3 |
| LTR elements | 9.9 | 2.9 | 2.9 | 4.3 |
| MER DNA elements | 0.9 | 0.2 | 0.1 | 0.2 |
aRepeat and GC content were taken from Ref. (41).
bCpG islands were taken from the mm6 mouse genome assembly on the UCSC genome browser.
cFraction of genome sequence that falls within gene bounds of non-overlapping ENSEMBL gene models.
dFraction of RRBS sequences with significant hits to the ENSEMBL gene fraction of the genome.
eFraction of genome sequence that falls within 5 kb upstream of the transcription start site of ENSEMBL gene models.
fFraction of RRBS sequences with significant hits to regions 5 kb upstream of transcription start sites.
Figure 5Targeted bisulfite sequencing of specific loci. Ten loci for which RRBS sequencing indicated mCpGs in Dnmt-deficient cells and 10 loci that were devoid of methylation were bisulfite re-sequenced using specific primers in wild-type (top), 3a/b double knockout (middle) and Dnmt[1kd,3a−/−,3b−/−] cells (bottom). Shown are two examples of each set. Each row represents a single sequenced molecule. Filled squares are methylated CpGs and empty ones indicate unmethylated sites. The asterisk indicates the original clone sequenced from the library.