Literature DB >> 25789165

Paediatric leukaemia DNA methylation profiling using MBD enrichment and SOLiD sequencing on archival bone marrow smears.

Nicholas Cl Wong1, Gavin D Meredith2, George Marnellos3, Miroslav Dudas2, Mandy Parkinson-Bates4, Minhee Suh Halemba4, Zac Chatterton4, Jovana Maksimovic5, David M Ashley6, Francoise Mechinaud7, Jeffrey M Craig8, Richard Saffery9.   

Abstract

BACKGROUND: Acute Lymphoblastic Leukaemia (ALL) is the most common cancer in children. Over the past four decades, research has advanced the treatment of this cancer from a less than 60% chance of survival to over 85% today. The causal molecular mechanisms remain unclear. Here, we performed sequencing-based genomic DNA methylation profiling of eight paediatric ALL patients using archived bone marrow smear microscope slides.
FINDINGS: SOLiD™ sequencing data was collected from Methyl-Binding Domain (MBD) enriched fractions of genomic DNA. The primary tumour and remission bone marrow sample was analysed from eight patients. Four patients relapsed and the relapsed tumour was analysed. Input and MBD-enriched DNA from each sample was sequenced, aligned to the hg19 reference genome and analysed for enrichment peaks using MACS (Model-based Analysis for ChIP-Seq) and HOMER (Hypergeometric Optimization of Motif EnRichment). In total, 3.67 gigabases (Gb) were sequenced, 2.74 Gb were aligned to the reference genome (average 74.66% alignment efficiency). This dataset enables the interrogation of differential DNA methylation associated with paediatric ALL. Preliminary results reveal concordant regions of enrichment indicative of a DNA methylation signature.
CONCLUSION: Our dataset represents one of the first SOLiD™MBD-Seq studies performed on paediatric ALL and is the first to utilise archival bone marrow smears. Differential DNA methylation between cancer and equivalent disease-free tissue can be identified and correlated with existing and published genomic studies. Given the rarity of paediatric haematopoietic malignancies, relative to adult counterparts, our demonstration of the utility of archived bone marrow smear samples to high-throughput methylation sequencing approaches offers tremendous potential to explore the role of DNA methylation in the aetiology of cancer.

Entities:  

Keywords:  Childhood leukaemia; DNA methylation; Epigenetics; NGS; SOLiD MBD-Seq

Mesh:

Year:  2015        PMID: 25789165      PMCID: PMC4364676          DOI: 10.1186/s13742-015-0050-0

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Data description

This project was approved by the Royal Children’s Hospital Human Research Ethics Committee (RCH HREC# 29140C). We have performed Methyl-Binding Domain protein 2 (MBD2) enrichment and isolated fractions of DNA from 40 individuals for sequencing on the Sequencing by Oligonucleotide Ligation and Detection (SOLiD™) sequencing platform (SOLiD™MBD-Seq, Life Technologies, Carlsbad, USA). MBD2 has been shown to bind to double-stranded methylated DNA molecules and used to interrogate the human methylome [1]. By comparing the enriched fraction to the "input" total genomic DNA fraction, genomic regions of DNA methylation can be inferred after sequencing both fractions. The samples analysed are comprised of the following: three model cell lines, JWL (an in-house non-leukaemic cell line [2]), CEM-CCRF (childhood T-cell acute lymphoblastic leukaemia [ALL] cell line) and K562 (adult chronic myelogenous leukaemia cell line). From two non-leukaemic individuals (pbsc1 and pbsc2), peripheral blood mononuclear cells were sampled and four haematopoietic cell populations (CD34-positive, CD19-positive, CD33-positive and CD45-positive) were isolated for SOLiD™MBD-Seq analysis. From another two non-leukaemic individuals (bm9 and bm10), the same haematopoietic cell populations were isolated from bone marrow. Eight cases of childhood ALL were analysed with the identifiers 135, 197, 292, 316, 362, 367, 378 and 386 at diagnosis (leuk) and 28 days post induction chemotherapy (rem). A third set of samples was taken at relapse (lap) for cases 197, 316, 362 and 367 (Table 1).
Table 1

Samples analysed in this study and sequencing metrics

Sequencing chemistrySampleTotalTagsUniqueTagsAlignment efficiency
SOLiD v3JWL(1ug)-E82,825,33225,878,33031.24%
JWL(5ug)-E41,496,63616,004,58338.57%
CEM-CCRF-E70,843,05423,576,68933.28%
K562-E67,818,65619,407,27328.62%
Leuk316-E86,478,57029,043,37833.58%
Lap316-E74,223,31119,719,25826.57%
Rem316-E77,702,36623,783,04330.61%
NB-Leuk84,461,47144,789,62853.03%
SOLiD v4bm9_cd19-E38,269,89222,810,02059.60%
bm9_cd19-I37,784,06734,877,52692.31%
bm9_cd33-E10,416,7436,266,45760.16%
bm9_cd33-I42,759,15839,428,52192.21%
bm9_cd34-E51,318,75829,465,95657.42%
bm9_cd34-I46,036,93842,428,71592.16%
bm9_cd45-E12,914,6099,361,35972.49%
bm9_cd45-I33,483,40931,189,19093.15%
bm10_cd19-E45,846,78823,820,76851.96%
bm10_cd19-I42,916,29239,433,44291.88%
bm10_cd33-E18,881,63512,678,55967.15%
bm10_cd33-I36,392,68833,622,17392.39%
bm10_cd34-E1,455,904976,83567.09%
bm10_cd34-I43,740,86938,954,34489.06%
bm10_cd45-E47,832,60531,267,13365.37%
bm10_cd45-I56,272,35951,839,72592.12%
Lap197-E23,613,06917,771,92275.26%
Lap197-I40,839,84237,160,82390.99%
Lap316-E22,312,02918,406,67182.50%
Lap316-I65,985,86959,587,55990.30%
Lap362-E26,103,26918,291,09870.07%
Lap362-I43,663,52939,464,17490.38%
Lap367-E30,436,84822,390,18873.56%
Lap367-I61,426,57155,393,21590.18%
Leu135-E28,518,31922,184,99877.79%
Leu135-I66,384,95359,496,16789.62%
Leu197-E40,781,90521,042,70051.60%
Leu197-I100,952,57688,986,49388.15%
Leu292-E37,383,29027,245,48872.88%
Leu292-I81,946,81373,469,30889.65%
Leu316-E19,691,03515,476,01178.59%
Leu316-I49,443,95744,399,79389.80%
Leu362-E26,155,13719,508,96274.59%
Leu362-I52,718,69947,588,02390.27%
Leu367-E30,436,84822,390,18873.56%
Leu367-I61,426,57155,393,21590.18%
Leu378-E37,963,48025,159,30466.27%
Leu378-I63,946,62157,452,71889.84%
Leu386-E34,541,20726,730,83877.39%
Leu386-I85,783,79576,829,08889.56%
pbsc1_cd19-E28,300,79817,236,82560.91%
pbsc1_cd19-I42,994,20339,873,84592.74%
pbsc1_cd33-E28,441,23717,222,14960.55%
pbsc1_cd33-I41,190,71938,084,03692.46%
pbsc1_cd34-E30,595,32618,228,85459.58%
pbsc1_cd34-I40,582,29637,607,61892.67%
pbsc1_cd45-E21,807,67311,901,50854.57%
pbsc1_cd45-I44,739,46141,071,01391.80%
pbsc2_cd19-E35,937,65620,893,97658.14%
pbsc2_cd19-I39,678,92636,769,93992.67%
pbsc2_cd33-E35,344,00922,387,89163.34%
pbsc2_cd33-I32,507,10030,204,90092.92%
pbsc2_cd34-E25,845,40113,736,29653.15%
pbsc2_cd34-I48,706,31544,827,41392.04%
pbsc2_cd45-E32,212,43221,627,45267.14%
pbsc2_cd45-I47,235,29043,581,36692.26%
Rem135-E36,998,27824,794,22567.01%
Rem135-I123,775,359108,611,09087.75%
Rem197-E32,669,97920,692,19063.34%
Rem197-I72,248,56964,234,95988.91%
Rem292-E40,308,56129,524,87873.25%
Rem292-I56,187,55350,646,66890.14%
Rem316-E29,052,09823,634,22181.35%
Rem316-I60,566,39654,693,71190.30%
Rem362-E30,583,56824,759,91180.96%
Rem362-I59,138,76853,532,11190.52%
Rem367-E27,854,95020,285,93872.83%
Rem367-I53,616,04248,017,38889.56%
Rem378-E19,428,37214,820,51376.28%
Rem378-I58,333,37051,624,94288.50%
Rem386-E26,909,68119,355,78671.93%
Rem386-I83,504,22774,547,83689.27%
Total3,671,922,9552,741,473,29774.66%
Samples analysed in this study and sequencing metrics Genomic DNA from archived bone marrow smear microscope slides from ALL patients, cells and cell lines were extracted as previously described [3] and used for the enrichment of CpG methylation with the MethylMiner™ Methylated DNA enrichment kit (Life Technologies) according to the manufacturer’s protocols. The fragmented input genomic DNA (I) and enriched E5 fraction (E) were isolated from each sample for library preparation and sequencing using SOLiD™ v3 and v4 chemistry according to the manufacturer’s protocols (Life Technologies). Single and paired-end SOLiD™ sequencing reads were aligned using LifeScope™ Genomic Analysis Suite (Life Technologies) with default parameters against the hg19 reference genome. Alignment efficiency (the ratio of uniquely aligned reads to total sequenced reads for each sample) ranged from 26.57% to 93.15% across all samples in this study (Table 1). Alignments were then processed using MACS (Model-based Analysis for ChIP-Seq) [4] and HOMER (Hypergeometric Optimization of Motif EnRichment) [5,6] to identify enrichment peaks. This study is unique in a number of ways. This is the first sequencing-based DNA methylation profiling study in childhood ALL using archived bone marrow samples of similar quality to formalin-fixed paraffin-embedded (FFPE) tissue samples [7]. We have selected samples that have been interrogated using an orthogonal platform, the Illumina Infinium Human Methylation 450K BeadArray [3,8], and included replicate samples to assess the reproducibility of SOLiD™MBD-Seq and to identify regions of differential DNA methylation of interest to childhood ALL. We performed replicate DNA methylation enrichment analysis using the JWL cell line with 1 μg and 5 μg of starting genomic DNA to determine if 1 μg of starting material was sufficient for DNA methylation enrichment. This was less than the recommended quantity but a typical amount obtainable from our primary patient samples. We isolated four haematopoietic cell populations (CD34, CD19, CD33, CD45) at major stages of development corresponding to the arrested stages of development in paediatric leukaemia. This was achieved by positive selection using fluorescent-labelled antibodies and Fluorescent Activated Cell Sorting (FACS) from four individuals. This would enable us to track changes in DNA methylation between cell lineages and contrast them with leukaemic cells. After MACS enrichment peak analysis, a large proportion of peaks were common between the CD19 cells from three individuals, confirming the premise of tissue-specific DNA methylation profiles in haematopoietic cells (Figure 1A).
Figure 1

Venn diagrams summarising peak region overlaps between samples analysed in this study. Overlapping peak regions are shown after MACS peak analysis. (A) Peaks on chromosome 21 from three non-leukaemic individuals where CD19 cells were positively selected using FACS. A high degree of overlapping peaks were observed. (B) Peaks from matching leukaemic and remission samples from individual 135. Although there are some overlapping peaks (183), there are a substantial number of distinct peaks in each sample. (C) The extent of overlapping peaks between 3 leukaemic samples. (D) The extent of overlapping peaks between 3 remission samples.

Venn diagrams summarising peak region overlaps between samples analysed in this study. Overlapping peak regions are shown after MACS peak analysis. (A) Peaks on chromosome 21 from three non-leukaemic individuals where CD19 cells were positively selected using FACS. A high degree of overlapping peaks were observed. (B) Peaks from matching leukaemic and remission samples from individual 135. Although there are some overlapping peaks (183), there are a substantial number of distinct peaks in each sample. (C) The extent of overlapping peaks between 3 leukaemic samples. (D) The extent of overlapping peaks between 3 remission samples. When comparing DNA methylation enrichment peaks between leukaemic and remission samples (tumour versus normal) from the same individual, distinct enrichment peaks are seen; these are likely to correlate to disease state (Figure 1B). The number of overlapping peaks between leukaemic and remission samples were fewer compared to the haematopoietic cell analyses (Figure 1C and 1D) and could be indicative of the difference in sample qualities. For each of the samples analysed in this study, we have generated track hubs that can be uploaded and visualised on the UCSC Genome Browser. This permits the immediate visualisation of regions of differential DNA methylation with potential biological significance. Moreover, we have performed Infinium analysis on these samples, and visualisation using the Genome Browser permits direct comparison to other publicly available data such as The Cancer Genome Atlas (TCGA) [9] and TARGET (Therapeutically Applicable Research to Generate Effective Treatments) [10]. This also permits further analysis and comparison to publicly available data using the Galaxy [11,12] and Cistrome [13] web servers. In summary, our data represent one of the first DNA methylation enrichment analyses using SOLiD™MBD-Seq on archival bone marrow smears from children diagnosed with ALL. Such specimens are readily available in most pathology laboratories across the world and are amenable to genomic-scale analysis, as we have demonstrated here. These data should prove valuable for other DNA methylation studies in childhood ALL in haematopoeitic cell development.

Availability of supporting data

Supporting data is available from the GigaScience Database, GigaDB [14] and at NCBI under BioProject PRJNA272864.

Data file details

SRA Files included BioProject PRJNA272864 MACS and HOMER output files of peaks and peak locations Track Hubs for UCSC Genome Browser
  10 in total

1.  Galaxy: a platform for interactive large-scale genome analysis.

Authors:  Belinda Giardine; Cathy Riemer; Ross C Hardison; Richard Burhans; Laura Elnitski; Prachi Shah; Yi Zhang; Daniel Blankenberg; Istvan Albert; James Taylor; Webb Miller; W James Kent; Anton Nekrutenko
Journal:  Genome Res       Date:  2005-09-16       Impact factor: 9.043

2.  A distinct DNA methylation signature defines pediatric pre-B cell acute lymphoblastic leukemia.

Authors:  Nicholas C Wong; David Ashley; Zac Chatterton; Mandy Parkinson-Bates; Hong Kiat Ng; Minhee S Halemba; Adam Kowalczyk; Justin Bedo; Qiao Wang; Katrina Bell; Elizabeth Algar; Jeffrey M Craig; Richard Saffery
Journal:  Epigenetics       Date:  2012-06-01       Impact factor: 4.528

3.  Trisomy 20p resulting from inverted duplication and neocentromere formation.

Authors:  L Voullaire; R Saffery; J Davies; E Earle; P Kalitsis; H Slater; D V Irvine; K H Choo
Journal:  Am J Med Genet       Date:  1999-08-06

4.  Epigenetic deregulation in pediatric acute lymphoblastic leukemia.

Authors:  Zac Chatterton; Leah Morenos; Francoise Mechinaud; David M Ashley; Jeffrey M Craig; Alexandra Sexton-Oates; Minhee S Halemba; Mandy Parkinson-Bates; Jane Ng; Debra Morrison; William L Carroll; Richard Saffery; Nicholas C Wong
Journal:  Epigenetics       Date:  2014-01-06       Impact factor: 4.528

5.  The Cancer Genome Atlas Pan-Cancer analysis project.

Authors:  John N Weinstein; Eric A Collisson; Gordon B Mills; Kenna R Mills Shaw; Brad A Ozenberger; Kyle Ellrott; Ilya Shmulevich; Chris Sander; Joshua M Stuart
Journal:  Nat Genet       Date:  2013-10       Impact factor: 38.330

6.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities.

Authors:  Sven Heinz; Christopher Benner; Nathanael Spann; Eric Bertolino; Yin C Lin; Peter Laslo; Jason X Cheng; Cornelis Murre; Harinder Singh; Christopher K Glass
Journal:  Mol Cell       Date:  2010-05-28       Impact factor: 17.970

7.  Cistrome: an integrative platform for transcriptional regulation studies.

Authors:  Tao Liu; Jorge A Ortiz; Len Taing; Clifford A Meyer; Bernett Lee; Yong Zhang; Hyunjin Shin; Swee S Wong; Jian Ma; Ying Lei; Utz J Pape; Michael Poidinger; Yiwen Chen; Kevin Yeung; Myles Brown; Yaron Turpaz; X Shirley Liu
Journal:  Genome Biol       Date:  2011-08-22       Impact factor: 13.583

8.  Paediatric leukaemia DNA methylation profiling using MBD enrichment and SOLiD sequencing on archival bone marrow smears.

Authors:  Nicholas Cl Wong; Gavin D Meredith; George Marnellos; Miroslav Dudas; Mandy Parkinson-Bates; Minhee Suh Halemba; Zac Chatterton; Jovana Maksimovic; David M Ashley; Francoise Mechinaud; Jeffrey M Craig; Richard Saffery
Journal:  Gigascience       Date:  2015-03-18       Impact factor: 6.524

9.  MBD-isolated Genome Sequencing provides a high-throughput and comprehensive survey of DNA methylation in the human genome.

Authors:  David Serre; Byron H Lee; Angela H Ting
Journal:  Nucleic Acids Res       Date:  2009-11-11       Impact factor: 16.971

10.  Model-based analysis of ChIP-Seq (MACS).

Authors:  Yong Zhang; Tao Liu; Clifford A Meyer; Jérôme Eeckhoute; David S Johnson; Bradley E Bernstein; Chad Nusbaum; Richard M Myers; Myles Brown; Wei Li; X Shirley Liu
Journal:  Genome Biol       Date:  2008-09-17       Impact factor: 13.583

  10 in total
  2 in total

Review 1.  Next-Generation Sequencing Approaches in Cancer: Where Have They Brought Us and Where Will They Take Us?

Authors:  Veronique G LeBlanc; Marco A Marra
Journal:  Cancers (Basel)       Date:  2015-09-23       Impact factor: 6.639

2.  Paediatric leukaemia DNA methylation profiling using MBD enrichment and SOLiD sequencing on archival bone marrow smears.

Authors:  Nicholas Cl Wong; Gavin D Meredith; George Marnellos; Miroslav Dudas; Mandy Parkinson-Bates; Minhee Suh Halemba; Zac Chatterton; Jovana Maksimovic; David M Ashley; Francoise Mechinaud; Jeffrey M Craig; Richard Saffery
Journal:  Gigascience       Date:  2015-03-18       Impact factor: 6.524

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.