| Literature DB >> 26512698 |
Hilal Varinli1,2,3, Aaron L Statham4, Susan J Clark5,6, Peter L Molloy7, Jason P Ross8.
Abstract
Combined Bisulfite Restriction Analysis (COBRA) quantifies DNA methylation at a specific locus. It does so via digestion of PCR amplicons produced from bisulfite-treated DNA, using a restriction enzyme that contains a cytosine within its recognition sequence, such as TaqI. Here, we introduce COBRA-seq, a genome wide reduced methylome method that requires minimal DNA input (0.1-1.0 mg) and can either use PCR or linear amplification to amplify the sequencing library. Variants of COBRA-seq can be used to explore CpG-depleted as well as CpG-rich regions in vertebrate DNA. The choice of enzyme influences enrichment for specific genomic features, such as CpG-rich promoters and CpG islands, or enrichment for less CpG dense regions such as enhancers. COBRA-seq coupled with linear amplification has the additional advantage of reduced PCR bias by producing full length fragments at high abundance. Unlike other reduced representative methylome methods, COBRA-seq has great flexibility in the choice of enzyme and can be multiplexed and tuned, to reduce sequencing costs and to interrogate different numbers of sites. Moreover, COBRA-seq is applicable to non-model organisms without the reference genome and compatible with the investigation of non-CpG methylation by using restriction enzymes containing CpA, CpT, and CpC in their recognition site.Entities:
Keywords: CHH; COBRA; DNA methylation; enhancer; next generation sequencing; non CpG; non-model organism; reduced representation; restriction enzymes
Year: 2015 PMID: 26512698 PMCID: PMC4690032 DOI: 10.3390/genes6041140
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Details of oligos used in constructing GW-COBRA and LA-COBRA methylome libraries. The less common sequence abbreviations are: 5 (5mC), b (5' two biotin groups) and p (5' phosphorylation).
| Primer Name | Primer Sequence (5'–3')* | Purification Method |
|---|---|---|
| PAGE and HPLC | ||
| PAGE and HPLC | ||
| CTA CAC TCT TTC CCT ACA CGA CGC TCT TCC GAT CT | PAGE and HPLC | |
| CGA GAT CGG AAG AGC GTC GTG TAG GGA AAG AGT GTA G | PAGE and HPLC | |
| PAGE and HPLC | ||
| PAGE and HPLC | ||
| CAA GCA GAA GAC GGC ATA CGA GCT CTT CCG ATC T | PAGE | |
| HPLC | ||
| PAGE | ||
| AAT GAT ACG GCG ACC ACC GAG ATC TAC ACT CTT TCC CTA CAC GAC GCT CTT CCG ATC T | HPLC | |
| CAA GCA GAA GAC GGC ATA CGA GCT CTT CCG ATC T | HPLC | |
| AAT GAT ACG GCG ACC ACC GA | HPLC | |
| CAA GCA GAA GAC GGC ATA CGA | HPLC |
Figure 1Flowchart of key steps in constructing COBRA-seq libraries with minor steps omitted for clarity. The common steps in both methods are shown on light yellow background. The GW-COBRA and LA-COBRA specific steps are on light blue and pink backgrounds respectively. The less common sequence abbreviations are: mC (5mC), b (two biotin groups sequentially placed on the 5'-end) and p (5' phosphorylation).
Figure 2GW-COBRA and LA-COBRA library construction results. (A) Sonicated genomic DNA isolated from HCT116 cell line (lane 1), 50 ng of Adapter-2 ligated library material (lane 2). Column purified Adapter-2 ligated library material (lane 3); (B) PCR amplification test of bisulfite treated library materials. Lane 1, 2, 3 and 4 are produced by 6, 7, 8 and 9 cycles of PCR using GW-A2-FwdP and COBRA-A2-RevP primers from Table 1 with GW-COBRA library material as the template respectively. Lane 5, 6, 7 and 8 are produced by 6, 7, 8 and 9 cycles of PCR using LA-A2+T7-FwdP and COBRA-A2-RevP primers from Table 1 with LA-COBRA library material as the template, respectively; (C) Final library products: GW-COBRA and LA-COBRA methylome libraries of HCT116 DNA amplified using flowcell or LADS primer pairs respectively (Lane 2 and 4). Lane 1 and 3 are negative controls for GW-COBRA and LA-COBRA libraries, respectively; (D) Bioanalyzer results of GW-COBRA and LA-COBRA methylome sequencing libraries, respectively, prepared from HCT116 cell line DNA. The final library fragments ranged between 150–500 bp with an average size of 257 and 360 bp for GW-COBRA and LA-COBRA, respectively.
Figure 3Empirical comparison of HCT116 COBRA-seq methylome with RRBS and 450K array data. (A) Comparison of the GW-COBRA and LA-COBRA log transformed read counts aligning to the same genome position (5,756,193 TaqI sites, R2 = 0.905); (B) Venn diagram comparing coverage of CpG sites by the 3 methods; (C) Combined methylome sequencing data of GW-COBRA and LA-COBRA (Joint-COBRA; median coverage of 17 reads per strand) compared with HCT116 cell line WGBS data (902,990 shared CpG sites, R2 = 0.384); (D) with HCT116 cell line 450K array data (31,988 shared CpG sites, R2 = 0.439) and (E) with HCT116 cell line RRBS data (95,766 shared CpG sites; median coverage of 82.14 reads per strand, R2 = 0.443); (F) Population mean (μ) coverage of 24.6 in M fraction only using simulated data; (G) Comparison of methylation scores of WGBS with 450K (463,300 shared CpG sites); (H) WGBS wish RRBS (978,735 shared CpG sites); (I) RRBS with 450K (46,374 shared CpG sites).
Figure 4IGV screenshots on the selected colon cancer associated genes: (A) BCAT1 and (B) EHD3. i. Methylation levels determined by WGBS, 450K arrays, RRBS GW-COBRA and LA-COBRA respectively; ii. Genome coverage and stacking of cleaned GW-COBRA reads; iii. Genome coverage and stacking of cleaned LA-COBRA reads; iv. Genomic locations of accessible COBRA TaqI sites, 450K array probes and accessible RRBS sites.
Figure 5Comparison of CpG coverage of COBRA-seq with WGBS and commonly used methylome sampling methods by genomic regions, as pie charts or bar diagram for proportion of enhancer sites (WGBS covers a total of 205,740 CpGs in enhancers). (A) WGBS covers 28,217,448 CpG sites; (B) COBRA-seq with TaqI, HpyCH4IV, BstuI and CviQI covers 15.9%, 16.9%, 4.3%, and 5.2% of the total CpG sites respectively; C. 450K arrays, RRBS, SuBLiME and DREAM cover 1.7%, 16.7%, 75.8%, and 1.8% of the total CpG sites respectively.
Qualitative comparison of selected methylome methods (*are the preferred genomic input, low input will increase variation due to sampling).
| Methylome Methods | Complexity Reduction Type | M or M + U Fraction | Methylome Sampling (Yes/No) | Favour of Enrichment Towards | Comments |
|---|---|---|---|---|---|
| WGBS [ | N/A | M + U | N/A | N/A | High cost, can detect non-CpG methylation, genomic input* (0.05–0.1 μg). |
| MBDCap-Seq [ | Affinity capture | M | No | CpG-rich | Dependent on CpG density, effected by salt concentration, covers about 18% of the CpGs [ |
| MeDIP [ | Affinity capture | M | No | CpG-rich | Bias towards 5mC-rich regions, Captures single-stranded DNA, prone to technical variability, coverage is read-depth dependent, input = 0.15–5 μg. |
| SuBLiME [ | Methylated cytosine capture | M | Yes | CpG-rich and poor | Substitutes biotin-14-dCTP or biotin-14-dGTP at the position of the 5mC in bisulfite treated DNA, input = 2 μg. |
| COBRA-seq | Restriction digest | M | Yes | CpG-rich and poor | Provides relative methylation levels like MeDIP/MBD-Cap, 1%–17% of the CpGs for single digest, can detect non-CpG methylation, input = 0.1–1 μg. |
| Nimblegen SeqCap [ | Hybridization capture | M + U | Yes | CpG-rich and poor | “Off-the-shelf” version for human genome only/similar regions covered as 450K array, can be customized [ |
| Agilent SureSelect [ | Hybridization capture | M + U | Yes | CpG-rich and poor | Available for human genome only, covers 3.7 million CpG sites, input = 0.5 μg. |
| 450K array [ | Microarray | M + U | Yes | CpG-rich | Arrays comes in 12 per slide, available only for humans, not readily customized, input = 0.5–1 μg. |
| RRBS [ | Restriction digest | M + U | Yes | CpG-rich/medium | Can detect non-CpG methylation, input = 0.1–0.3 μg. |
| Methyl-Seq [ | Restriction digest | M + U | Yes | CpG-rich | Assesses 0.25 to 1.3 million CCGG sites in human genome by difference in read fractions in |
| CHARM [ | Restriction digest | M + U | Yes | CpG-rich and poor | Array-based and available for human, mice and rat, assesses 3.5 to 7 million CpG sites, input = 5 μg. |
| DREAM [ | Restriction digest | M + U | Yes | CpG-rich | Assesses methylation at ~0.15 million sites in human genome by sequential |