Literature DB >> 27322744

Distinct evolution and dynamics of epigenetic and genetic heterogeneity in acute myeloid leukemia.

Sheng Li1, Francine E Garrett-Bakelman2, Stephen S Chung3, Mathijs A Sanders4, Todd Hricik3, Franck Rapaport3, Jay Patel3, Richard Dillon5, Priyanka Vijay6, Anna L Brown7,8,9, Alexander E Perl10, Joy Cannon10, Lars Bullinger11, Selina Luger10, Michael Becker12, Ian D Lewis7,9,13, Luen Bik To9,13, Ruud Delwel4, Bob Löwenberg4, Hartmut Döhner11, Konstanze Döhner11, Monica L Guzman2, Duane C Hassane2, Gail J Roboz2, David Grimwade5, Peter J M Valk4, Richard J D'Andrea7,8,9, Martin Carroll10, Christopher Y Park14,15, Donna Neuberg16, Ross Levine3, Ari M Melnick2, Christopher E Mason1,17.   

Abstract

Genetic heterogeneity contributes to clinical outcome and progression of most tumors, but little is known about allelic diversity for epigenetic compartments, and almost no data exist for acute myeloid leukemia (AML). We examined epigenetic heterogeneity as assessed by cytosine methylation within defined genomic loci with four CpGs (epialleles), somatic mutations, and transcriptomes of AML patient samples at serial time points. We observed that epigenetic allele burden is linked to inferior outcome and varies considerably during disease progression. Epigenetic and genetic allelic burden and patterning followed different patterns and kinetics during disease progression. We observed a subset of AMLs with high epiallele and low somatic mutation burden at diagnosis, a subset with high somatic mutation and lower epiallele burdens at diagnosis, and a subset with a mixed profile, suggesting distinct modes of tumor heterogeneity. Genes linked to promoter-associated epiallele shifts during tumor progression showed increased single-cell transcriptional variance and differential expression, suggesting functional impact on gene regulation. Thus, genetic and epigenetic heterogeneity can occur with distinct kinetics likely to affect the biological and clinical features of tumors.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27322744      PMCID: PMC4938719          DOI: 10.1038/nm.4125

Source DB:  PubMed          Journal:  Nat Med        ISSN: 1078-8956            Impact factor:   53.440


INTRODUCTION

Acute Myeloid Leukemia (AML) is a predominantly fatal hematopoietic malignancy[1-3]. Even when leukemia cells appear to have been eradicated from the bone marrow after chemotherapy treatment, most patients eventually relapse. Therefore, understanding how subpopulations of AML cells are resilient to chemotherapy and give rise to progressively more refractory disease is of utmost importance. Several mechanisms are hypothesized to endow sub-populations of cells within AML tumors with the capacity to survive exposure to therapeutic agents. These mechanisms include the presence of subsets of quiescent cancer stem cells with inherently greater self-renewal properties and reduced sensitivity to chemotherapy drugs[4]. Somatic mutations that can facilitate the expansion of putative leukemia stem cell populations can also emerge during disease establishment and progression[5-8]. Genetic heterogeneity within tumors can also increase evolutionary fitness[9,10]. Genetic diversity among individual cells within a tumor presumably provides the greatest chance for a subset of cells with particular combinations of mutations (single nucleotide variants (SNVs), insertions and deletions (INDELs) as well as copy number aberrations (CNAs) and translocations) to survive when challenged by cytotoxic drugs. Genetic heterogeneity has been appreciated in AML since early karyotyping studies[11,12], and tumor heterogeneity is also relevant in solid tumors and lymphoid malignancies[13-19], where several studies have linked the degree of genetic heterogeneity to clinical outcome[16,20]. However, cells from patients with AML feature a paucity of genetic lesions compared to most solid tumors[21,22], suggesting that other factors, such as epigenetic changes, could contribute to the aggressive behavior of AML. Indeed, aberrant epigenetic patterning is a hallmark of AML[23,24]. Gain or loss of cytosine methylation at specific loci disrupts promoter activity and gene expression regulation[23,25,26]. Epigenetic marks have great plasticity and may vary over time or through exposure to environmental stimuli. Thus, there is potential for epigenetic diversification to emerge in tumor cell populations and to change during disease progression. Sequence-based profiling can be used to determine quantitative cytosine methylation at base-pair resolution. DNA methylation at sequential CpGs (cytosine-phosphate-guanine dinucleotides) can create a phased epigenetic pattern (epiallele) that represents a native “cellular barcode”[27,28]. Such data can be used to measure epigenetic heterogeneity, which has been associated with worse clinical outcomes in lymphoid malignancies[28-31]. Yet in contrast to AML, lymphoid malignancies display a high burden of somatic mutations and can arise from cell types such as B- and T-cells that are naturally prone to epigenetic heterogeneity[32]. It is has been suggested that genetic mutations and/or cytogenetic abnormalities and epigenetic modifications would diversify along similar lines during disease progression[28,33]. This may be a selection process of genetic and epigenetic changes, which enhance tumor fitness through expression of oncogenes and/or repression of tumor suppressors. Alternatively, this may be a process of genetic diversification and clonal selection[34] in parallel with epigenetic modifications[35], leading to the same outcome. However, the interplay and/or independence of genetic versus epigenetic heterogeneity in AML have not been thoroughly explored. Here we performed the first large-scale comparative genomics and epigenomics study in serial specimens obtained from patients with AML. We assembled a cohort of 138 clinically annotated, paired AML patient samples (diagnosis and relapse; Supplementary Tables 1 and 2). Patient samples were characterized with genome-wide methylome sequencing, using a modified version of reduced representation bisulfite sequencing (RRBS)[36,37], Enhanced Reduced Representation of Bisulfite Sequencing (ERRBS)[38,39]. In a subset of this cohort, whole exome-sequencing (WES) was performed in paired patient samples with matched germline control DNA, RNA sequencing (RNA-seq) was performed in paired patient samples, and single-cell RNA-sequencing (scRNA-seq) in the patient with the most timepoints. This allowed for epigenetic heterogeneity assessment and comparison to genetic and transcriptional variance during disease progression. The epigenetic features explored in this study are conceptually quite different than the epigenetic signatures that define uniform patterns of differentially methylated genes[23,24]. Notably, we find that genetic and epigenetic allelic variations follow distinct, and often independent, kinetics and patterns. Moreover, shifts in epiallele composition are linked to transcriptional deregulation at the global and single-cell levels, and higher epiallele burden is linked to clinically more aggressive disease. Collectively, our data show that epigenetic allelic diversification occurs during AML establishment and disease progression, and may be independent of the genetic landscape.

RESULTS

Epiallele burden is linked to poor clinical outcome

We defined loci with epigenetic allelic variance in our patient cohort using the Methclone compositional entropy equation[40]. By definition, the epigenetic state of each locus is comprised of cytosine methylation at four consecutive CpG dinucleotides. Each of the possible 16 CpG methylation patterns at these loci is an “epiallele” (Supplementary Fig. 1). Epigenetically shifted loci (“eloci”) occur when the epiallele proportions at these sites undergo a statistically significant entropy shift (calculated by delta Boltzmann entropy ΔS < Δ90) in their composition when comparing two specimens[40]. The global metric EPM (eloci per million loci), which normalizes for the variable depth of coverage per specimen and the number of loci measured[40], was used to determine the overall magnitude of epiallele shifting across the genome and “epiallele burden.” Shifting can include both gain and/or loss of epialleles between two specimens. The epiallele and eloci measurements were determined using methylome data from ERRBS (Supplementary Table 3) and validated in a subset of specimens using orthogonal methylome sequencing methods on two different platforms (Agilent and Roche; Supplementary Fig. 2). We evenly divided our patients into two cohorts based on their EPM values (highest and lowest 50%). There was no difference in the burden of somatic mutations between the two groups (Supplementary Fig. 3). We then plotted clinical outcome as “time to relapse,” and we observed that patients with high EPM at diagnosis compared to NBM had a shorter time to relapse as compared to the low EPM cohort (P = 0.0396, Mantel-Cox log rank test, Fig. 1a). This association was most significant for EPM assessed based on promoter-associated eloci (P = 0.0077, Mantel-Cox log rank test, Fig 1b).
Figure 1

EPM levels at diagnosis compared to normal bone marrow segregate patients into two groups with distinct clinical outcomes. (a) Time to relapse analysis for patients (n = 137) with high (red) or low (black) EPM values at diagnosis compared to normal bone marrow. (b) Time to relapse analysis for patients (n = 137) with high (red) or low (black) EPM values assessed from promoter-annotated eloci (loci in promoters that were shared by at least 75% of patients were included). (c) Time to relapse analysis for patients with high (blue) or low (green) somatic mutation burden in diagnosis samples (n = 48). Mantel-Cox log rank test was used for the survival analysis (ac).

We also performed a multivariate analysis using linear regression to determine if EPM retained its association with clinical outcome independent of other clinical variables. We found that EPM maintained a significant association with clinical outcome (P = 0.024, Cox proportional hazards regression model), taking into account age, white blood cell count at the time of diagnosis, and gender. In contrast, none of these other criteria were independently linked to time to relapse in this cohort (Table 1). In contrast to EPM, the burden of somatic mutations based on WES (n = 48; Supplementary Table 4) was not significantly associated with time to relapse (P = 0.272, Mantel-Cox log rank test, Fig. 1c).
Table 1

Multivariate analysis of EPM association with time to relapse. Multivariate Cox proportional hazards regression model used relapse time as response variable, and tested EPM and clinical parameters as variables in the entire cohort (n = 127).

127 Patients
VariableP valueHazard Ratio
EPM0.0241.559
Age0.9300.994
Gender0.3031.223
WBC count0.3390.999

Relapsed AMLs display variable changes in epiallele burden

To further understand the nature of epiallele shifting in AML, we compared diagnostic and relapsed AML specimen methylomes to NBMs (Fig. 2a). We observed that all AML patients displayed substantial epiallele shifting both at diagnosis and at relapse compared to NBMs (Fig. 2b,c). Although the degree of EPM increase was variable, it was increased across all patients regardless of disease stage (Fig. 2d). We next compared EPM in individual patients at relapse versus diagnosis (Fig 2a). In this case we observed significant intra-patient variation (Fig. 2d,e). Although most patients exhibited a considerable degree of epiallele shifting during disease progression (median log10(EPM) = 2.29), the magnitude of this difference was highly variable. Indeed, a few patients (8%) exhibited no significant epigenetic changes (EPM = 0; Fig. 2e) between diagnosis and relapse. Overall, these data suggest that global increases in epigenetic allele shifting is a universal feature of AML relative to NBM controls, but, in contrast, epiallele shifting is highly variable within individual patients during disease progression. Notably, the changes in epiallele burden were independent of patients’ age, percent blast purity of specimens, disease FAB subtype and abundance of somatic mutations (Supplementary Fig. 4ah).
Figure 2

AML is characterized by high epiallele shift and variance. (a) Schematic diagram representing the DNA methylation patterns compared between CD34+ normal bone marrow controls (NBM), diagnostic AML and relapsed AML patient samples. (b,c) log10(EPM) values of diagnostic (b) and relapsed (c) patient samples versus NBMs. (d) Violin plot of the EPM values between the AML patient samples and NBMs and intra-patient relapse versus diagnosis (Wilcoxon rank sum tests: ***P < 0.001). (e) log10(EPM) values between AML diagnosis and relapse samples.

We also noted that relative to NBM controls, eloci were significantly overrepresented at CpG islands and promoters (P = 2.9×10−3 and P = 0.027 respectively; Wilcoxon signed rank tests) in AML patients at diagnosis (Supplementary Fig. 5af). Eloci were also significantly enriched at enhancers at diagnosis (based on ChIP-seq data in human CD34+ cells; active enhancers: P = 1.4×10−4; poised enhancers: P = 7.5×10−3; Wilcoxon signed rank tests; Supplementary Fig. 5jk). At relapse there was more significant enrichment for eloci at intronic and intergenic regions (P = 5.3×10−3 and P = 7.6×10−4 respectively; Wilcoxon signed rank tests; Supplementary Fig. 5hi). These differences point towards a significant disruption of enhancer states during AML progression.

Independent patterns of epigenetic and genetic diversity

The EPM measurement provides assessment of epigenetic allele shifting at the global level, but does not provide information on how specific eloci might vary during leukemia progression. This latter information would be similar to how genetic mutations are evaluated individually in tumors. From the genetic perspective, AML progression has been examined in limited numbers of patients using next generation sequencing and other techniques[5,41-44]. These studies showed that while most genetic lesions did not change during disease progression, some were gained or lost at relapse. In a subset of patients there were no new somatic mutations at relapse. Our current dataset enabled a simultaneous comparison of epigenetic and genetic allele burdens, discerning if and how their composition might vary during leukemia progression, and whether these two phenomenon correlate with each other. To address each of these points, we compared eloci occurring at diagnosis and separately at relapse to NBMs. We then defined three categories of eloci: 1) eloci unique to diagnosis (diagnosis-specific), 2) eloci unique to relapse (relapse-specific), and 3) eloci that were present both at diagnosis and relapse (shared). We examined the eloci in several ways. First, we performed K-means clustering based on the proportion of diagnosis-specific, relapse-specific or shared eloci present in each patient (Supplementary Fig. 6a and Fig. 3a). This analysis segregated AML patients into cohorts as follows: 1) a significant predominance of diagnosis-specific eloci (cluster 1, n = 57, P = 5.3 × 10−11, Wilcoxon signed rank test), 2) mostly shared eloci and no predominance of diagnosis- or relapse-specific eloci (cluster 2, n = 39), and 3) a significant predominance of relapse-specific eloci (cluster 3, n = 42, P = 4.5×10−13, Wilcoxon signed rank test; Fig. 3a). There was no significant link between patients belonging to any of these three clusters with their age, white blood cell count, or FAB classification (Supplementary Fig. 6bd).
Figure 3

Disease stage-specific epiallele patterns define unique subsets of AML patients. (a) Proportions of eloci that are diagnosis-specific (light green), shared (green), or relapse-specific (dark green) are shown for each cluster defined using K-means clustering. (b) Proportions of somatic mutations that were diagnosis-specific (light blue), shared (blue), or relapse-specific (dark blue) are shown for the subset of patients with exome-sequencing data within each cluster defined by the abundance of eloci in (a). (c,d) Number of somatic mutations (log10) for each eloci cluster at diagnosis (c; P = 0.048) or relapse (d; P = 0.008). (e,f) Proportion of somatic mutations whose variant allele frequency are increased (e; P = 0.367) or decreased (f; P = 0.0012) by 10% or more at relapse compared to diagnosis. (cf) Wilcoxon rank sum tests: *P < 0.05; **P < 0.01; NS = not significant. ID = patient counts.

We next determined the abundance of somatic genetic mutated alleles (SNVs and INDELs) for patients for whom WES was available (n = 48). We examined the abundance of somatic mutations at diagnosis and relapse within each of the three epigenetically defined patient clusters. We annotated somatic mutations into diagnosis-specific, relapse-specific or shared between the two time points, similar to the criteria used in Fig. 3a. The proportions of the three somatic mutation categories did not follow the same distribution as the eloci in cluster 1 & 3 (cluster 1, P = 0.89, cluster 3, P = 0.40; Wilcoxon signed rank tests; Fig. 3b). For example, samples in epigenetic cluster 1 exhibited significantly lower frequencies of somatic mutations at both diagnosis and relapse, compared with samples in cluster 3 (Fig. 3c, P = 0.048, Fig. 3d, P = 0.008, Wilcoxon rank sum tests; Supplementary Table 5). Concordantly, a reciprocal analysis using the median mutation burden per subject to divide patients revealed enrichment for higher mutations rates in cluster 3 patients (Supplementary Fig. 6e). These data indicate distinct combinations of genetic and epigenetic sub-types between the clusters of patients that are defined by the eloci. We next examined whether specific individual somatic mutations might be linked to the tendency to develop eloci. Such considerations are of interest, given that many recurrent somatic mutations in AML occur in genes encoding epigenetic modifiers[11,45]. However, there was no significant association between specific mutations in genes previously reported to be recurrent in AML[11,45] and the abundance of eloci (all genes P > 0.05, Chi-square test with Monte Carlo simulation; Supplementary Fig. 6f; Supplementary Table 6). As in many tumors, somatic mutations in AML occur in a heterogeneous pattern, representing the presence of genetically distinct subclones. Thus, it is possible that the genetic clonal complexity of AML might mirror the abundance of eloci. Using sciClone[46] we determined that 32.5% of patients exhibited increasing clonal complexity, 20% exhibited decreasing clonal complexity, and 47.5% exhibited no change in clonal complexity between diagnosis and relapse (Supplementary Fig. 7). Yet none of these clonal evolution patterns were enriched in epigenetically defined clusters (P = 0.915 chi-square test; Supplementary Fig. 8), further indicating independent kinetics of genetic and epigenetic changes during AML evolution. We also evaluated whether changes in variant allele frequency (VAF) of somatic mutations between diagnosis and relapse were linked to the pattern of eloci occurrence. Each patient exhibited variable numbers of alleles that decreased, increased or remained stable in frequency in diagnosis and relapse samples. We focused our assessment on the patients in epigenetic clusters 1 and 3, which exhibit the most differences in eloci distributions between diagnosis and relapse samples. Somatic mutations determined in diagnosis and relapsed cases compared to matched germline controls were assessed for changes in variant allele frequencies during disease progression. Increased VAF was defined by a minimum gain of 10 % (Fig. 3e), and decreased VAF was defined by a minimum loss of 10% (Fig. 3f). Among patients in epigenetic clusters 1 and 3, there was no significant difference in the tendency to gain somatic mutation VAF at relapse (Fig. 3e and Supplementary Table 5). However, among epigenetic cluster 3 patients there was significantly lower proportion of somatic mutations with reduced VAF upon disease relapse (P = 0.0012, Wilcoxon rank sum test; Fig. 3f and Supplementary Table 5). These data suggest that disease progression in AML can be classified based on the DNA methylation heterogeneity patterns. Furthermore, epigenetic heterogeneity and genetic diversification do not necessarily follow the same kinetics between diagnosis and relapse. We next determined if the differences in epiallele shift patterns in cluster 1 and 3 were associated with perturbations in the transcriptional landscape. We were able to perform RNA-sequencing in paired diagnostic and relapsed patient specimens representing patients from each cluster (1 and 3; n = 19; Supplementary Table 7). A supervised analysis (see methods) revealed 114 genes differentially expressed between diagnostic samples in cluster 1 and cluster 3 (Supplementary Table 8). These included the cell cycle inhibitor CDKN1A and transcription factor SPI-B, which are expressed higher in cluster 3 and FOXC1 which is expressed higher in cluster 1. Overall, we detected upregulation of protein kinase and other signaling genes in cluster 1 and significant enrichment of genes linked to inflammation and immune response in cluster 3 specimens (Supplementary Table 9). Differential expression of transcription factors, inflammatory and signal transduction gene sets between the two groups suggests a potential for alternative mechanisms of oncogenesis between the patients in epigenetic clusters 1 and 3.

Epiallele and mutation shifts throughout AML progression

To create a higher resolution view of the progression kinetics of genetic and epigenetic heterogeneity in AML, we performed whole genome sequencing (WGS) and ERRBS on leukemia cells from a patient (AML_130) who had five serial specimens available (T1 = diagnosis; T2 – T5 = serial relapse time points; Supplementary Tables 2, 3 and 4). We first evaluated the global epigenetic allele burden (EPM) for each time point compared to NBMs (n = 14 NBMs; Fig. 4a). We observed a marked increase (150%) in epiallele burden occurring at T2. In contrast there was little increase (2.7%) in the number of somatic mutations at this time-point compared to germline DNA from the same patient (Fig. 4b). At the following three timepoints, there were relatively smaller increases in EPM. However at T4 there was a larger increase (29%) in the abundance of somatic mutations compared to other time points (5.4% or less), indicating that the initial epigenetic shift in T2 was antecedent to the later genetic evolution in AML cells from this patient at T4. Furthermore, genomic analysis did not yield evidence of acquisition or loss of any somatic mutation linked to epigenetic modifier genes at T2 that might explain the jump in epiallele diversity at T2 (Supplementary Table 10).
Figure 4

Assessment of epiallele shift and genetic changes in serial samples from a single patient. ERRBS and WGS were performed in serial samples from a single patient (AML_130: diagnosis (T1) and four relapse collections: T2T5). (a) Epiallele shift (EPM) compared to NBMs (n = 14) at each time point (error bars are the standard error of the mean). (b) Somatic mutation burden at each time point. (c) The number of eloci that are shared and unique between all time points. (e) Density plot of the dominant epiallele frequency detected at eloci across all time points. (f) Density plot of the tumor variant allele frequencies detected at each time point.

We next examined the unique occurrences of individual eloci and somatic mutations at each timepoint. Analysis of eloci revealed a tendency of the epigenome to evolve continuously over time (Fig. 4c). Whereas a majority of eloci (65.79%) were unique to a single time point, many fewer eloci were shared among two, three or four timepoints (18.78%, 10.04%, 3.64% respectively); only 1.75% of eloci were shared across all five timepoints (Supplementary Fig. 9a). In sharp contrast, genetic alleles were far more stable; 24.6% of somatic alleles remained detectable over all five timepoints (Fig. 4d, Supplementary Fig. 9b, and Supplementary Table 11). Hence, genetically defined, mutated alleles exhibited significantly greater stability over time than eloci (P < 2.2 × 10−16, chi-square test). To better discern the relationship between epigenetic and genetic heterogeneity, we examined which of the 16 possible epialleles was dominant within each elocus at each time point. The proportion of these specific epialleles was used as a measure of epigenetic clonality. Reciprocally, to evaluate genetic clonal complexity, we examined the VAF of somatic mutations at each time point. Notably, we observed a shift in dominant epiallele frequency over time. At early timepoints, the dominant epialleles tended to occur at a frequency of > 50% (Fig. 4e; median = 53%). A marked shift in epiallele frequency occurred at T3 at which time the dominant pattern shifted to a lower frequency (25%). Later on, by T5, the leukemia cells were again enriched for eloci with higher dominant epiallele frequency. However, the pattern was different when considering genetic VAF. At early timepoints (T1–T3), the median VAF was 44%. As the disease progressed, a shift to low frequency genetic mutations (VAF) did occur (Fig. 4f), but only at T5. Genetic alleles were also distinct from dominant epialleles in that they exhibited a tighter allele frequency distribution (P = 0.008, Wilcoxon rank sum test; Fig. 4e,f), perhaps linked to the greater plasticity of the epigenome versus the genome. Collectively, these higher resolution data support the notion that epigenetic and genetic heterogeneity in AML are not necessarily linked and may follow distinct patterns and distributions during disease progression.

Promoter epiallele shifts linked to transcription variance

Given that transcription is regulated through epigenetic marks, we next determined whether the presence of eloci was linked to alterations in gene expression. We focused on genes containing eloci within promoters, derived from the comparison of diagnosis to relapsed specimens (n = 19 pairs; Supplementary table 7). Genes containing shifts in epiallele composition at promoters (promoter eloci) exhibited significantly greater variance in their transcript abundance between relapse and diagnosis, as compared to genes without such epiallele shifts in their promoters (P < 0.001; Wilcoxon rank sum test; Fig. 5a,b). We next focused our analysis specifically on genes that were significantly differentially expressed when comparing relapse versus diagnosis AML specimens. Here again, a higher proportion of differentially expressed transcripts were significantly associated with promoters harboring eloci than with promoter not harboring eloci (P < 0.001; Wilcoxon signed rank test; Fig. 5c).
Figure 5

Transcriptional variance is associated with high epiallele shift at promoters. (a) Density plot of log2 fold change of transcript levels of genes with eloci within their promoters (red), and genes without eloci in their promoters (blue) as measured from bulk cell populations (n = 19 paired patient samples). (b) Violin plot of the log2 fold change variance in transcript expression from genes with or without eloci in their promoters in bulk cell populations (Wilcoxon signed rank test; P = 3.82×106). (c) Violin plot of the percentage of genes that are differentially expressed (DEGs: absolute log fold change > 1; Wilcoxon signed rank test: P = 3.82×106) with or without eloci in their promoters in bulk cell populations (Wilcoxon signed rank test). (d) Violin plots of transcript expression level variance as measured by single cell RNA-sequencing (AML_130 relapse sample) and association (ANOVA test, P < 2.2×1016) with low (< 0.05), intermediate (0.050.2) and high (0.21) epiallele shift within respective gene promoters. Wilcoxon signed rank tests and ANOVA test: ***P < 0.001.

Finally, we used an additional approach to examine potential links between promoter epigenetic heterogeneity and transcription. We performed single-cell RNA-seq (scRNA-seq) in 96 cells from the first relapse sample from AML patient 130 (T2) We found that the genes with higher epiallele heterogeneity within their promoters also displayed significantly higher levels of transcriptional heterogeneity, as measured by cell-to-cell coefficients of variation (P < 2.2×10−16, ANOVA test, Fig. 5d). Hence the presence of eloci at gene promoters resulted in greater tendency of associated genes to exhibit deregulated expression.

DISCUSSION

AML is a tumor type well suited for examining questions about epigenetic heterogeneity, since it generally manifests a relative paucity of genetic lesions[21,22] but still exhibits genetic clonal complexity and clonal evolution during progression. Mazor, et al.[33] recently examined DNA methylation at individual CpG sites and somatic mutations during tumor progression in gliomas, where a phylogenetic analysis showed a co-dependency of change for the two features examined. However, an examination of overall DNA methylation at individual CpG sites with array-based measurements cannot fully address epigenetic heterogeneity as measured by epiallele diversification. Hence the question has remained unresolved as to whether epigenetic alleles and genetic alleles follow similar, or independent, courses during disease evolution. Although others have documented epigenetic heterogeneity in B-cell neoplasms[28,29,32], these tumors mostly arise from cells that display intrinsic epigenetic heterogeneity, and the studies did not explore the link between genetic and epigenetic allelic diversity during tumor progression. This is the first report of dynamic epiallele shifting in a human tumor. Previous reports have used DNA methylation heterogeneity metrics such as epipolymorphism approaches that calculate diversity within individual specimens, as opposed to the methclone compositional entropy, which measures shifting between samples[28,29]. Hence these two approaches measure different properties of the epigenome. Herein, we establish that tumor genetic and epigenetic heterogeneity in AML may arise as independent, biologically distinct phenomena, each presumably with a unique functional significance. The degree of leukemic epigenetic allelic burden was independent of age and other clinical parameters. There was no apparent association between the degree of epigenetic heterogeneity and the presence of somatic mutations affecting epigenetic modifier genes such as DNMT3A, TET2, and IDH1/2. We also explored whether dominant epigenetic alleles behaved in a similar or distinct manner from genetic alleles during clonal evolution. When compared with epiallele shifting, or changes in dominant epiallele composition of eloci at serial timepoints, there was no association between the kinetics and pattern of genetic and epigenetic alleles during leukemic progression. It is especially notable that AML patients can be classified based on epiallele pattern kinetics and somatic mutation burdens during disease progression. Our results suggest that in at least some cases, AML patients at diagnosis may be divided into disease with predominant epiallele diversity and low somatic mutations (cluster 1: epigenetically-driven) and others with lower epiallele diversity and higher mutation burden (cluster 3; genetically-driven; Fig. 3a, c). The latter develop increasing epigenetic diversity upon progression (Fig. 3a). In both cases, genetic clonal composition remains predominantly stable (Supplementary Table 5). Most importantly, epigenetic instability is not necessarily linked to genetic instability or specific somatic mutations. This observation may point to alternative modes of dominant heterogeneity in newly diagnosed patients: one genetic and one epigenetic. Tumor heterogeneity in AML can thus not strictly be determined by either genetic or epigenetic analysis alone, and relying only on genetic mapping may underestimate the true tumor diversity. We speculate that de novo acquisition of eloci might represent the response of hematopoietic cells to environmental stresses at various points during the development and/or progression of disease. Conversely, exposure to chemotherapy may induce epigenetic plasticity, contributing to the increased eloci seen at relapse in some patients. Indeed, longitudinal profiling in a patient treated with four different regimens resulted each time in distinct patterns of epiallele shifting. It is also possible that unrecognized mutations or alterations of coding or non-coding regions could destabilize the epigenome at specific loci and contribute to epigenetic heterogeneity. One example is the protein CTCF, which can regulate cytosine methylation patterning and boundaries[47]. A CTCF haploinsufficiency murine model results in the acquisition of epigenetic hyper-variability hotspots and development of tumors[48]. In the case of B-cells, epigenetic heterogeneity may be linked to the actions of activation-induced cytosine deaminase[48]. In the case of AML, biological differences among patients with different modes of eloci progression patterns may be mediated through differential expression of hematopoietic transcription factors and functional gene sets. Importantly, the acquisition of eloci may indeed have functional consequences, since we find that genes containing eloci at their promoters display more pronounced transcriptional variability and differential expression during disease progression. A recent single-cell RNA-seq study on newly diagnosed CLL patients also showed that sites of epigenetic variability had greater cell-to-cell transcriptional plasticity[28]. A second indication of biological relevance is the inferior outcome of patients with higher global epiallele (EPM) burdens in both univariate and multivariate analyses. The fact that the link to inferior outcome is even stronger for promoter epialleles suggests that eloci that perturb transcriptional regulation are particularly important in contributing to disease phenotype. Using an epipolymorphism-based approach, we observed decreasing DNA methylation heterogeneity upon AML progression, which matches a previous report in DLBCL[29] (Supplementary Fig. 10a). However, contrary to DLBCL, the abundance of epipolymorphisms in AML did not segregate patients into distinct clinical groups based on progression free survival (Supplementary Fig. 10b) – only the epiallele measure (EPM) separated the cohorts. Epiallele shifting as detected by methclone is thus a candidate biomarker in AML, and may be indicative of more aggressive disease, perhaps reflecting greater epigenetic plasticity and adaptability in certain tumors. Given the heterogeneous nature of AML, our study was underpowered to detect specific eloci that could be used as outcome biomarkers, although there was a trend for a set of 21 promoter eloci to associate with shorter time to relapse (Supplemental Table 12). Since this study is the first to examine the longitudinal interplay of genetic and epigenetic heterogeneity in AML, it is certainly possible that genetic and epigenetic heterogeneity are linked in other means in AML or in different manners in other types of tumors. Studies in larger cohorts using improvements in sequencing technologies will likely yield additional insights. Collectively, our data point towards epigenetic heterogeneity as an important feature of AML with functional and clinical impacts, one which does not fully follow the kinetics and patterns in the genetic compartment, and which creates additional means by which a tumor can evolve.

Online MATERIALS AND METHODS

Patient characteristics

138 clinically annotated, paired AML patient samples (diagnosis and relapse; 86 males and 52 females) seen at medical centers in Australia, Germany, Netherlands, and the United States were collected. Patients with acute promyelocytic leukemia were excluded. All patients were treated according to the protocols of corresponding institutes and hospitals. The clinical and molecular characteristics of these patients are summarized in Supplementary Table 1 and detailed descriptions are provided in Supplementary Table 2. All patients were treated with combination chemotherapy (cytarabine arabinoside and an anthracycline) during induction phase followed by consolidation chemotherapy treatment with or without a stem cell transplantation in first remission per clinical center standards. Samples from serial time points were available for patient AML_130. Briefly this was a male with previous high radiation dose exposure who developed AML at age 57. Samples were available from the diagnostic time point (sample T1: leukapheresis sample) and after the following treatments: He underwent standard induction treatment (cytarabine+idarubicin) and achieved complete remission, but developed relapse prior to consolidation treatment initiation. Patient was then treated with high dose cytarabine without response (sample T2; peripheral blood sample), combination chemotherapy (sirolimus, mitoxantrone, etoposide, and cytarabine) without response (sample T3; peripheral blood sample), combination chemotherapy without response (cyclophosphamide followed by clofarabine), hydroxyurea and external beam radiation to tonsillar site of extramedullary hematopoiesis (sample T4; bone marrow aspirate), and investigational FLT3 tyrosine kinase inhibitor (KW-2449) followed by allogeneic stem cell transplant without remission (sample T5; leukapharesis sample). No treatments beyond the chemotherapy administered immediately after diagnosis in AML_130 induced a complete remission.

Sample collection and processing

Donors (AML patients and individuals without known hematological malignancies) signed informed consent according to the declaration of Helsinki for collection and use of sample materials in research protocols at the following clinical centers: Erasmus Medical Center (protocol number MEC-2015-155), Royal Adelaide Hospital and SA Pathology (Adelaide, South Australia; 1998-onwards), University of Pennsylvania (protocol number 703185), University of Rochester Medical Center (protocol number URCC ULEU07047), and the University Hospital of Ulm. Study protocols were approved by the Institutional Review Boards of corresponding institutes and hospitals (protocols above-noted), and at Weill Cornell Medicine (WCM; protocol number 0805009783). For AML patient cryopreserved specimens from the Royal Adelaide Hospital and SA Pathology (Adelaide, South Australia) collected and stored prior to 1998, the requirement for informed consent was waived by the Royal Adelaide Hospital Human Research Ethics Committee (RAH Protocol #110304b). The use of the samples obtained from RAH and SA Pathology in this specific research study was approved by the RAH HREC on September 10, 2010. Of 140 paired patient samples that were collected, 138 were successfully processed. Patient samples were collected at the time of diagnosis and within three months of clinically determined relapse. Samples were subjected to Ficoll separation on the day of collection and viably frozen or immediately subjected to nucleic acid extractions using standard techniques. De-identified samples were then provided to WCM. Viably frozen cells were available for further processing from patients AML_102 through AML_140, including serial samples from AML_130 (T1 – T5). These samples were thawed and depleted of CD3+ and CD19+ cells using magnetic beads (Miltenyi Biotec), yielding a blast percentage enrichment of 89.±9.5% as confirmed by post separation flow cytometry analysis. Lymphocytes were then isolated for germline controls using flow sorting for CD3+ and CD19+ cells on a FACSAria II cell sorter (BD Biosciences, San Jose, CA) yielding > 85% purity as confirmed by post sort flow cytometry analysis. Germline DNA was isolated from ex vivo-expanded lymphocytes for patient samples AML_074 through AML_101. Briefly, the procedure for ex vivo expansion was as follows: 1 million bone marrow cells were cultured in RPMI 1640 with L-glutamine and 10% fetal calf serum (FCS). T cell expansion was induced with T cell expander Dynabeads CD3/CD28 (Thermo Fisher). After 1 to 3 days rIL2/ml was added. Cells were subsequently purified to > 98% with CD3 microBeads (MACS Miltenyi Biotec). Flow cytometry analysis and sorting was performed using CD19 PerCP-Cy5.5 (HIB19) and CD3 PerCP-Cy5.5 (HIT3a) from Biolegend (San Diego, CA, USA) and human CD45 APC-A780 (2D1) from Ebiosciences (San Diego, CA, USA). Validation for each antibody is provided on the manufacturer’s website (http://www.biolegend.com/) and at Antibodypedia (http://www.antibodypedia.com/gene/4237/CD19/antibody/539284/302214; http://www.antibodypedia.com/gene/4581/CD3D/antibody/539101/300437) and DegreeBio (1DB_ID:1DB-001-0001093758). Lymphocytes were defined as CD45 high SSC (low) CD19+ or CD3+ cells, and leukemic blasts were defined as CD19− CD3− CD45 (low) SSC (low) cells[49]. DNA and RNA were extracted from the isolated cells using standard techniques. For patient samples AML_001 through AML_101, DNA and RNA were extracted from mononuclear cell layers using standard techniques. Normal bone marrow controls (NBM: CD34+ mononuclear cells; 7 males and 7 females) were purchased from AllCells (Alameda, CA, USA; n = 5) or isolated using magnetic bead positive selection for CD34+ (Miltenyi Biotec) from freshly collected bone marrow samples from individuals without known hematological malignancies (n = 9). Purity was verified using flow cytometry post separations to greater than 90%. All flow cytometry data were analyzed using Flowjo (TreeStar, Ashland, Oregon).

Enhanced reduced representation bisulfite sequencing (ERRBS)

ERRBS is a slightly modified version of RRBS[36,37]. Libraries were prepared by MspI restriction enzyme digestion of high molecular weight genomic DNA, followed by end repair, size selection, bisulfite conversion and library amplification as previously described[38,39]. Libraries were sequenced on a HiSeq 2000 Illumina machine using 75 bp single-end reads. Data alignment was performed to human genome hg19 as previously described (see data analysis description)[38,39]. Average reads per sample were 138,596,488 with a mean alignment rate of unique reads of 63.7%, covering on average 4,399,235 CpGs per sample at a minimum threshold coverage of 10X. Average coverage depth per CpG was 72.3 and average bisulfite conversion was 99.86% determined as previously described[38,39]. See Supplementary Table 3 for detailed sequencing statistics. A subset of patient samples were processed using SeqCap Epi 4M CpGiant Enrichment kit (NimbleGen-Roche) and SureSelectXT Human Methyl-Seq (Agilent Technologies) per manufacturer’s recommendations. These were sequenced using a paired-end 100bp approach on a HiSeq 2000 (Illumina, Inc.).

Data analysis

R[50] version 3.2.1 was utilized for data analyses. Specific R packages and other tools/software used for analyses are noted in the respective sections below.

Code availability

Analysis scripts utilized in this manuscript have been deposited at https://github.com/ShengLi/relapsed_AML

ERRBS data analysis

We performed bisulfite-treated read alignment to hg19 genome and methylation calls as previously described[38,39]. Briefly, the adaptor sequences were removed by FAR software[51]. Preprocessed reads were then aligned to human genome reference hg19 using the bismark alignment software. Epiallele shift analysis was performed using methclone[40]. Briefly, the epiallele patterns compositional changes between different samples were evaluated using methclone to calculate the combinatorial entropy (ΔS) change of epialleles at each locus. This analysis outputs the loci with a ranked list of epiallele changes defined by the entropy change. We first calculated the foreground combinatorial entropy using the epiallele composition detected, which is the sum of the entropy of each locus for two samples considered. We then calculated the background combinatorial entropy, using the epiallele composition after uniformly mixing all patterns of epialleles between two samples. The sum entropy was determined and then the entropy changes were defined by the difference between foreground and background combinatorial entropy values. Epiallele shifts per million loci (EPM) is a normalized measure of the global epiallele changes with ΔS < − 90 as the cutoff (eloci). Comparisons of AML specimens to NBMs report the average EPM determined from assessing each AML case against each NBM (n = 14). The diagnosis-specific eloci are eloci that are only detectable at diagnosis stage, when compared to NBMs. The relapse-specific eloci are eloci that are only detectable at relapse stage, when compared to NBMs. The shared eloci are eloci that are detectable at both stages. The eloci clusters were determined using K-means clustering (K = 3, Gap Statistic for Estimating the Number of Clusters) based on the proportion of eloci that are diagnosis-specific, shared, or relapse-specific. Within cluster 1 and 3, respectively, the proportion of eloci for diagnosis and relapse were compared using Wilcoxon signed rank tests within each cluster to measure the significance of dominance. Finally, we assessed the distribution of patients from each clinical center (n = 5) in the epigenetically-defined clusters. Using multinomial logistic regression analysis to model the clusters with clinical and demographic features we determined that the center identity was not equally distributed among the three clusters. However, we confirmed no technical or clinical etiology was related to this distribution for each center (age, WBC, gender, FAB classification, mutations, cytogenetics), and the sample sizes for the sub-cohorts were not powered to detect sub-clusters within each cohort. Intra-tumor global methylation heterogeneity (MH) was assessed as previously described[29]. Briefly, epipolymorphisms were determined using proportions of DNA methylation patterns at four adjacent CpGs ( , where p is the fraction of each DNA methylation pattern i among the cell population)[27]. Epipolymorphism loci shared by at least 75% of the patients (n = 104) were considered for further analysis. Because epipolymorphism is dependent on DNA methylation levels, each locus was binned by the average DNA methylation levels from 0 – 100% (21 bins in total). Therefore, for each sample, all the loci were assigned to one of the 21 different groups based on their mean DNA methylation levels. For each bin, the width was 5% (DNA methylation), except for the first and last bin (2.5% DNA methylation). The median epipolymorphism across all the loci within each bin were calculated. Then the overall epipolymorphism landscape was defined using the median epipolymorphism across the 21 bins spanning the DNA methylation levels from 0 – 100%. The intra-tumor overall MH was defined as the area under the median epipolymorphism across the spectrum of methylation percentages in 21 bins. The range of MH is 0 – 100. Higher MH represents higher overall intra-tumor heterogeneity. Each locus was covered by at least 60 sequencing reads.

Genomic Annotation

Genomic annotation reference files (CpG islands and RefSeq genes) for eloci distribution analyses were obtained from UCSC (https://genome.ucsc.edu/) using Feb. 2009 (GRCh37/hg19) assembly[52,53]. Promoters were defined as transcription start sites +/− 1kb. CpG shores were defined as 2kb flanking CpG islands, subtracted by any regions overlapping with nearby CpG islands. CpG shelves were defined by 2kb flanking CpG shores, subtracted by any regions overlapping with nearby CpG islands and shores. Enhancers were defined based on the NIH Roadmap Epigenomics Project[54] CD34 mobilized primary cell data. Active enhancers were defined as H3K4me1 and H3K27ac peaks without H3K4me3 marks, and poised enhancers were defined as H3K4me1 without H3K27ac or H3K4me3 marks. Wilcoxon signed rank tests were used to compare the proportions of eloci falling into each of the genomic annotation regions.

RNA-sequencing

RNA sequencing (RNA-seq) libraries were prepared in two batches (Supplementary Table 7) using TruSeq RNA-Seq by polyA enrichment (Illumina, Inc.) and sequenced on HiSeq2000 (Illumina, Inc.) using a 50 bp paired-end approach per manufacturer’s recommendations. All paired samples were prepared within the same batch. Alignment was performed using the STAR aligner (version 2.3.0e)[55] and human genome hg19 as reference. Aligned results were annotated using the Refseq gene model and HTSeq union mode. Gene expression data were normalized using RPKM. Blast enriched patient samples from AML_130 (lymphocyte depleted as described in Sample collection and processing) were also used for single cell isolations. Single cells were captured and mRNA isolated using Clontech’s SMARTer chemistry (v2) on the Fluidigm C1 Single Cell Auto Prep system. Illumina’s Nextera XT kit was used for library preparation prior to 100 bp paired-end sequencing on the HiSeq 2500 (Illumina, Inc.) platform. See Supplementary Table 8 for sequencing statistics. Data analysis was performed using r-make (http://physiology.med.cornell.edu/faculty/mason/lab/r-make/) for quality control, alignment and gene expression quantification. DESeq2[56] was used for differential gene expression analysis performed. The significance of Differentially Expressed Genes (DEGs) was determined using Wald significance test to compare cluster 1 compared to cluster 3. The design matrix includes batch information to control for any possible batch differences. For multiple hypothesis testing, the significance cutoff used for optimizing the independent filtering was 0.05 (Benjamini–Hochberg). A log2 fold change greater than 1.2 or lesser than −1.2 was used for significance. Transcriptional heterogeneity was assessed using single cell RNA-seq data. Specifically, the coefficient of variations and transcriptional abundance per gene across cells was determined using log2 RPKM. Genes were included if the average log2 RPKM in cells were higher than 1. Gene Ontology term enrichment analysis was performed using GEne SeT AnaLysis Toolkit[57]. A threshold of two overlapping genes was considered. Hypergeometric tests were used for significance determination and Benjamini–Hochberg[58] was used for multiple testing correction.

Whole Exome Sequencing

Whole exome capture was performed on DNA isolated from 48-paired diagnosis and relapse patient samples and patient-matched germline samples. Germline DNA (lymphocytes) was subjected to whole genome amplification (repli-g kit; Qiagen) in 25 patients due to limited materials (see Supplementary Table 2). To obtain sufficient quantities of DNA from the remaining samples, T cells were expanded ex vivo (detailed under Sample collection and processing). DNA was extracted using standard techniques. Libraries were prepared per manufacturer’s recommendation using NimbleGen SeqCap EZ Human Exome Library v3.0, Agilent Human Exon V3 (Exon 50Mb), or Agilent SureSelect Human All Exon V4 (51 MB; see Supplementary Table 2 for specification of kit use per patient tumor and germline samples) and sequenced at a minimum of 50bp single-read sequencing on a HiSeq 2000 (Illumina, Inc.) to a mean coverage per base of 73X. See Supplementary Table 4 for sequencing statistics.

Whole Genome Sequencing

Illumina TruSeq Nano DNA Library Prep Kit (Illumina, Inc.) was performed on DNA from T1–T5 time points and germline (CD19/3 positive cells) isolated from the first relapse sample of patient AML_130. Libraries were prepared per manufacturer’s recommendation and sequenced using a 101 bp paired-end sequencing approach on the HiSeq 2000 platform per manufacture’s recommendations to a mean coverage per base of 43X. See Supplementary Table 4 for sequencing statistics.

Next generation sequencing data analysis

DNA-sequencing data was analyzed to determine somatic mutations, copy number aberrations and clonal evolution patterning using publically available tools (BWA[59], GATK[60-62], Mutect[63], Varscan[64], somaticSniper[65], SnpEFF[66], XHMM[67], DNAcopy library[68], and sciClone[46]). Tool versions, inclusion, exclusion, and significance parameters, statistical tests as well as specific commands used and implemented are included in the supplementary information file and deposited into github (https://github.com/ShengLi/relapsed_AML).

Integrative analysis

The association between DEGs (derived from relapse versus diagnosis AML patient samples) and promoter-associated eloci was determined as follows. Eloci (maximum ΔS = −90) annotated to gene promoters (transcription start sites +/− 1 kb) were considered for the integrative analyses. Patients with a minimum of 30 genes with promoter-associated eloci were included (n = 19). Epiallele loci within gene promoters which exhibited a minimum ΔS greater than −2 were designated “non-eloci”. RPKM log fold change was used to measure the transcriptome level dynamics. The variance of gene expression (RPKM log fold change) was calculated for genes with or without eloci within their proximal promoters and compared across all patients using a Wilcoxon signed rank test. Genes with log fold change greater than 1 were defined as DEGs. The number of DEGs with or without eloci were compared using a Wilcoxon signed rank test. Assessment for association between epigenetic clusters and mutations in genes recurrently affected in AML was performed. The variant calling pipeline (in-house developed algorithm[69]) used BAM files generated from the sequencing data of all paired diagnostic, relapse and germline samples to identify SNVs and small insertion and deletions (indels) in driver genes associated with AML[11,45]. For the identification of these somatic genetic aberrations we used a multi-variant calling approach integrating the output of six different somatic mutation detection algorithms: MuTect[63] (version 1.17), Indelocator, GATK UnifiedGenotyper[61] (version v3.2-2-gec30cee), SAMtools[70] (version 0.1.19-44428cd), VarScan 2[64] (version 2.3.6) and Pindel[71] (version 0.2.5a7). Briefly, MuTect allowed a VAF up to 10% or 6 reads containing the variant allele in the germline control to prevent the filtering of driver mutations detectable in lymphocyte control material[72]. Indelocator was allowed to detect indels with a VAF of 2% or greater. SAMtools, GATK UnifiedGenotyper and VarScan2 were run with default parameters. Pindel was used to detect the FLT3-internal tandem duplication (FLT3-ITD) aberration specifically within exons 13, 14 and 15 of the FLT3 gene by focusing on short insertions or tandem duplications. The detected variants were aggregated in a unique variant call format (VCF) file and subsequently annotated with ANNOVAR[73]. Annotations included dbSNP[74], COSMIC[75] and population based sequencing efforts, such as the 1000 genome project. The detected variants were further characterized by multiple fragment and regional characteristics by an in-house developed algorithm previously described[69]. In brief the algorithm determines the number of high quality (alignment score > = 40) and total number of fragments (irrespective of alignment score) for each detected variant. Based on the detected alternative allele this algorithm determines the VAF for high quality and all fragments. In addition, the algorithm determines the strand bias of the reads. For each potential somatic variant the same set of fragment and regional statistics was determined for the germline sample. Somatic mutations were detected by comparing the characteristics from the diagnostic or relapse sample to the germline sample. The FLT3-ITD aberration was detected by Pindel (script deposited into https://github.com/ShengLi/relapsed_AML), however, following this approach the VAF was not accurately estimated and therefore the VAF was considered not determined in the Supplementary Table. In a subset of patients, NPM1 mutations were not detectable due to a lack of coverage in the data generated as indicated in Supplemental Table 7. Mutations were considered diagnosis specific if alternative allele frequency at diagnosis was greater than 3%, and was not detected at relapse. Mutations were considered relapse-specific if alternative allele frequency at relapse was greater than 3%, and was not detected at diagnosis. Mutations were considered shared between diagnosis and relapse if alternative allele frequency at diagnosis and at relapse were both greater than 3%. Association between the frequency of each mutation and the epigenetic clusters was assessed using a Chi-square test with Monte Carlo simulation (number of replicates = 106).

Clinical correlation analysis

The correlation of overall epiallele shift (log10 EPM) with clinical parameters was evaluated. The clinical correlation between EPM (log10) and FAB classes was performed using ANOVA test. FAB classes with a minimum of 7 patients were assessed. We used the Pearson correlation (r) between overall epiallele shift and age or blast purity post separation and Hoeffding’s D statistics for dependency test[76]. To determine which gene-associated epiallele shift loci (eloci) associated with clinical outcome we performed the following analysis: 1. We determined eloci between diagnosis samples and NBMs (we required that the epiallele region considered be covered by a minimum of 5 NBMs). 2. We annotated eloci to gene promoters (transcription start site +/− 1 kb) and excluded epiallele loci which were covered in less than ten patients. 3. The patients were divided into groups with longer (n = 69) versus shorter (n = 68) relapse free survival based on the median value of time to relapse. 4. The frequency of each elocus in the patient groups was assessed. 5. odds ratio was used to determine the association between early and late relapse group for each locus. 6. Significance was determined using a Fisher’s exact test for each locus. 7. Benjamini-Hochberg correction was applied to P values.

Survival analysis

Log rank (Mantel-Cox) test was used for survival analysis. For relapse-free survival analysis, survival endpoints in this study were time from diagnosis until AML relapse. The patients were divided by the median EPM (low EPM versus high EPM), median number of somatic mutations (low MUT versus high MUT), or median MH (low MH versus high MH) for respective comparisons. Among 138 AML patients in the cohort, time to relapse was available for 137 patients and white blood cell count was available for 127 patients. Multivariate Cox proportional hazards regression model used relapse time as response variable, and included log10 EPM, age, gender, white blood cell as variables to be tested (n = 127). P-value of each predictor was used to assess if this clinical parameter is significantly associated with relapse time.
  73 in total

1.  Mutational cooperativity linked to combinatorial epigenetic gain of function in acute myeloid leukemia.

Authors:  Alan H Shih; Yanwen Jiang; Cem Meydan; Kaitlyn Shank; Suveg Pandey; Laura Barreyro; Ileana Antony-Debre; Agnes Viale; Nicholas Socci; Yongming Sun; Alexander Robertson; Magali Cavatore; Elisa de Stanchina; Todd Hricik; Franck Rapaport; Brittany Woods; Chen Wei; Megan Hatlen; Muhamed Baljevic; Stephen D Nimer; Martin Tallman; Elisabeth Paietta; Luisa Cimmino; Iannis Aifantis; Ulrich Steidl; Chris Mason; Ari Melnick; Ross L Levine
Journal:  Cancer Cell       Date:  2015-04-13       Impact factor: 31.743

2.  Variability in DNA methylation defines novel epigenetic subgroups of DLBCL associated with different clinical outcomes.

Authors:  Nyasha Chambwe; Matthias Kormaksson; Huimin Geng; Subhajyoti De; Franziska Michor; Nathalie A Johnson; Ryan D Morin; David W Scott; Lucy A Godley; Randy D Gascoyne; Ari Melnick; Fabien Campagne; Rita Shaknovich
Journal:  Blood       Date:  2014-01-02       Impact factor: 22.113

3.  Use of CD45 fluorescence and side-scatter characteristics for gating lymphocytes when using the whole blood lysis procedure and flow cytometry.

Authors:  J K Nicholson; M Hubbard; B M Jones
Journal:  Cytometry       Date:  1996-03-15

Review 4.  Current treatment of acute myeloid leukemia.

Authors:  Gail J Roboz
Journal:  Curr Opin Oncol       Date:  2012-11       Impact factor: 3.645

Review 5.  Epigenetic modulators, modifiers and mediators in cancer aetiology and progression.

Authors:  Andrew P Feinberg; Michael A Koldobskiy; Anita Göndör
Journal:  Nat Rev Genet       Date:  2016-03-14       Impact factor: 53.242

6.  Clonal evolution in relapsed NPM1-mutated acute myeloid leukemia.

Authors:  Jan Krönke; Lars Bullinger; Veronica Teleanu; Florian Tschürtz; Verena I Gaidzik; Michael W M Kühn; Frank G Rücker; Karlheinz Holzmann; Peter Paschka; Silke Kapp-Schwörer; Daniela Späth; Thomas Kindler; Marcus Schittenhelm; Jürgen Krauter; Arnold Ganser; Gudrun Göhring; Brigitte Schlegelberger; Richard F Schlenk; Hartmut Döhner; Konstanze Döhner
Journal:  Blood       Date:  2013-05-23       Impact factor: 22.113

7.  Mutational spectrum of myeloid malignancies with inv(3)/t(3;3) reveals a predominant involvement of RAS/RTK signaling pathways.

Authors:  Stefan Gröschel; Mathijs A Sanders; Remco Hoogenboezem; Annelieke Zeilemaker; Marije Havermans; Claudia Erpelinck; Eric M J Bindels; H Berna Beverloo; Hartmut Döhner; Bob Löwenberg; Konstanze Döhner; Ruud Delwel; Peter J M Valk
Journal:  Blood       Date:  2014-11-07       Impact factor: 22.113

8.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer.

Authors:  Simon A Forbes; David Beare; Prasad Gunasekaran; Kenric Leung; Nidhi Bindal; Harry Boutselakis; Minjie Ding; Sally Bamford; Charlotte Cole; Sari Ward; Chai Yin Kok; Mingming Jia; Tisham De; Jon W Teague; Michael R Stratton; Ultan McDermott; Peter J Campbell
Journal:  Nucleic Acids Res       Date:  2014-10-29       Impact factor: 16.971

9.  WEB-based GEne SeT AnaLysis Toolkit (WebGestalt): update 2013.

Authors:  Jing Wang; Dexter Duncan; Zhiao Shi; Bing Zhang
Journal:  Nucleic Acids Res       Date:  2013-05-23       Impact factor: 16.971

10.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

View more
  138 in total

Review 1.  When to obtain genomic data in acute myeloid leukemia (AML) and which mutations matter.

Authors:  Gregory W Roloff; Elizabeth A Griffiths
Journal:  Hematology Am Soc Hematol Educ Program       Date:  2018-11-30

2.  Multilayer microfluidic array for highly efficient sample loading and digital melt analysis of DNA methylation.

Authors:  Christine M O'Keefe; Daniel Giammanco; Sixuan Li; Thomas R Pisanic; Tza-Huei Jeff Wang
Journal:  Lab Chip       Date:  2019-01-29       Impact factor: 6.799

Review 3.  Bone marrow evaluation for diagnosis and monitoring of acute myeloid leukemia.

Authors:  Mary-Elizabeth Percival; Catherine Lai; Elihu Estey; Christopher S Hourigan
Journal:  Blood Rev       Date:  2017-02-02       Impact factor: 8.250

4.  Molecular characterization of acute myeloid leukemia patients who relapse more than 3 years after diagnosis: an exome sequencing study of 31 patients.

Authors:  Luise Hartmann; Claudia Haferlach; Manja Meggendorfer; Niroshan Nadarajah; Wolfgang Kern; Torsten Haferlach; Anna Stengel
Journal:  Haematologica       Date:  2019-08-01       Impact factor: 9.941

5.  Tracking myeloid malignancies by targeted analysis of successive DNA methylation at neighboring CG dinucleotides.

Authors:  Monika Eipel; Tanja Božić; Anna Mies; Fabian Beier; Edgar Jost; Tim H Brümmendorf; Uwe Platzbecker; Wolfgang Wagner
Journal:  Haematologica       Date:  2019-01-31       Impact factor: 9.941

6.  Genomic characterization of relapsed acute myeloid leukemia reveals novel putative therapeutic targets.

Authors:  Svea Stratmann; Sara A Yones; Markus Mayrhofer; Nina Norgren; Aron Skaftason; Jitong Sun; Karolina Smolinska; Jan Komorowski; Morten Krogh Herlin; Christer Sundström; Anna Eriksson; Martin Höglund; Josefine Palle; Jonas Abrahamsson; Kirsi Jahnukainen; Monica Cheng Munthe-Kaas; Bernward Zeller; Katja Pokrovskaja Tamm; Lucia Cavelier; Linda Holmfeldt
Journal:  Blood Adv       Date:  2021-02-09

7.  Targeting the MTF2-MDM2 Axis Sensitizes Refractory Acute Myeloid Leukemia to Chemotherapy.

Authors:  Harinad B Maganti; Hani Jrade; Christopher Cafariello; Janet L Manias Rothberg; Christopher J Porter; Julien Yockell-Lelièvre; Hannah L Battaion; Safwat T Khan; Joel P Howard; Yuefeng Li; Adrian T Grzybowski; Elham Sabri; Alexander J Ruthenburg; F Jeffrey Dilworth; Theodore J Perkins; Mitchell Sabloff; Caryn Y Ito; William L Stanford
Journal:  Cancer Discov       Date:  2018-08-16       Impact factor: 39.397

8.  Integrated Cellular and Plasma Proteomics of Contrasting B-cell Cancers Reveals Common, Unique and Systemic Signatures.

Authors:  Harvey E Johnston; Matthew J Carter; Kerry L Cox; Melanie Dunscombe; Antigoni Manousopoulou; Paul A Townsend; Spiros D Garbis; Mark S Cragg
Journal:  Mol Cell Proteomics       Date:  2017-01-04       Impact factor: 5.911

9.  Targeting DNA hypermethylation: Computational modeling of DNA demethylation treatment of acute myeloid leukemia.

Authors:  Jens Przybilla; Lydia Hopp; Michael Lübbert; Markus Loeffler; Joerg Galle
Journal:  Epigenetics       Date:  2017-10-06       Impact factor: 4.528

10.  The CD200/CD200R expression level and its mechanism of action in hematological malignancy patients.

Authors:  Yulei Zhao; Guohong Su; Jie Shen; Chunyan Liu; Na Miao
Journal:  Am J Transl Res       Date:  2021-06-15       Impact factor: 4.060

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.