Liran I Shlush1, Sasan Zandi1, Amanda Mitchell2, Weihsu Claire Chen2, Joseph M Brandwein3, Vikas Gupta3, James A Kennedy2, Aaron D Schimmer4, Andre C Schuh3, Karen W Yee3, Jessica L McLeod2, Monica Doedens2, Jessie J F Medeiros2, Rene Marke5, Hyeoung Joon Kim6, Kwon Lee6, John D McPherson7, Thomas J Hudson8, Andrew M K Brown9, Fouad Yousif, Quang M Trinh9, Lincoln D Stein10, Mark D Minden4, Jean C Y Wang3, John E Dick11. 1. 1] Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada [2]. 2. Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada. 3. 1] Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada [2] Department of Medicine, University of Toronto, Toronto, Ontario M5S 2J7, Canada [3] Division of Medical Oncology and Hematology, UHN, Toronto, Ontario M5G 2M9, Canada. 4. 1] Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada [2] Department of Medicine, University of Toronto, Toronto, Ontario M5S 2J7, Canada [3] Division of Medical Oncology and Hematology, UHN, Toronto, Ontario M5G 2M9, Canada [4] Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 2M9, Canada. 5. 1] Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada [2] Radboud University, Nijmegen Medical Centre, Nijmegen 6500 HB, The Netherlands. 6. Chonnam National University Hwasun Hospital, Genome Research Center for Hematopoietic Diseases, Gwangju 519-809, South Korea. 7. 1] Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 2M9, Canada [2] Ontario Institute for Cancer Research, Toronto, Ontario M5G 0A3, Canada. 8. 1] Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 2M9, Canada [2] Ontario Institute for Cancer Research, Toronto, Ontario M5G 0A3, Canada [3] Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada. 9. Ontario Institute for Cancer Research, Toronto, Ontario M5G 0A3, Canada. 10. 1] Ontario Institute for Cancer Research, Toronto, Ontario M5G 0A3, Canada [2] Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada. 11. 1] Princess Margaret Cancer Centre, University Health Network (UHN), Toronto, Ontario M5G 2M9, Canada [2] Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada.
Abstract
In acute myeloid leukaemia (AML), the cell of origin, nature and biological consequences of initiating lesions, and order of subsequent mutations remain poorly understood, as AML is typically diagnosed without observation of a pre-leukaemic phase. Here, highly purified haematopoietic stem cells (HSCs), progenitor and mature cell fractions from the blood of AML patients were found to contain recurrent DNMT3A mutations (DNMT3A(mut)) at high allele frequency, but without coincident NPM1 mutations (NPM1c) present in AML blasts. DNMT3A(mut)-bearing HSCs showed a multilineage repopulation advantage over non-mutated HSCs in xenografts, establishing their identity as pre-leukaemic HSCs. Pre-leukaemic HSCs were found in remission samples, indicating that they survive chemotherapy. Therefore DNMT3A(mut) arises early in AML evolution, probably in HSCs, leading to a clonally expanded pool of pre-leukaemic HSCs from which AML evolves. Our findings provide a paradigm for the detection and treatment of pre-leukaemic clones before the acquisition of additional genetic lesions engenders greater therapeutic resistance.
In acute myeloid leukaemia (AML), the cell of origin, nature and biological consequences of initiating lesions, and order of subsequent mutations remain poorly understood, as AML is typically diagnosed without observation of a pre-leukaemic phase. Here, highly purified haematopoietic stem cells (HSCs), progenitor and mature cell fractions from the blood of AML patients were found to contain recurrent DNMT3A mutations (DNMT3A(mut)) at high allele frequency, but without coincident NPM1 mutations (NPM1c) present in AML blasts. DNMT3A(mut)-bearing HSCs showed a multilineage repopulation advantage over non-mutated HSCs in xenografts, establishing their identity as pre-leukaemic HSCs. Pre-leukaemic HSCs were found in remission samples, indicating that they survive chemotherapy. Therefore DNMT3A(mut) arises early in AML evolution, probably in HSCs, leading to a clonally expanded pool of pre-leukaemic HSCs from which AML evolves. Our findings provide a paradigm for the detection and treatment of pre-leukaemic clones before the acquisition of additional genetic lesions engenders greater therapeutic resistance.
There is overwhelming evidence that virtually all cancers are clonal and represent the progeny of a single cell[1-3]. However, the evolutionary trajectory that leads from the first somatic mutation to the eventual development of cancer is not well mapped. The simplest models predict that each newly acquired somatic mutation confers selective advantage to drive successive waves of clonal expansion, with the fittest clone becoming dominant. However the modern era of cancer genomics has exposed a more complex clonal architecture in many tumor types[4], where multiple genetically distinct subclones co-exist with the dominant clone[5,6]. Comparison of diagnostic and recurrent/metastatic samples obtained from the same patient has established that the latter frequently do not evolve from the dominant clone, but instead can be traced either to a minor subclone present at diagnosis, or to a putative, undetected ancestral clone[7-15]. Thus, a clear understanding of the genomic landscape of tumors is required in order to devise targeting strategies that eliminate not only the dominant clone but also the subclonal reservoirs from which recurrence can arise.Although the clonal composition of cancer lineages within individual tumors is coming into focus, the very first steps in cancer development remain poorly defined. Early and possibly initiating mutations have been identified from analysis of pre-neoplastic lesions in breast[16], lung[17], skin[18], and colon cancer[19], as well as from studies of AML cases that evolved from a prior myelodysplastic syndrome (MDS)[20]. However, key questions remain unanswered. In particular, can clinically relevant clones be traced back to a non-tumorigenic cell? Do pre-cancerous ancestral clones persist after tumor development? If so, are they present in the diagnostic sample, and do they survive treatment and persist in remission samples?Human leukemia is a disease model particularly suited to addressing these fundamental questions, due to the depth of our understanding of normal hematopoiesis and the availability of functional assays and analytic tools that allow examination of phenotypically defined populations at the single cell level[21]. In AML, a subset of cases evolve from a preceding clinically overt phase such as MDS or chronic myeloid leukemia (CML), characterized by clonal expansion of one or more blood lineages[22,23]. The founder mutations present in pre-leukemic cells are retained in the AML blasts, implicating them as putative initiating events and establishing clonal expansion as the first step in leukemogenesis. Interestingly, somatic mutations in some leukemia-associated genes such as TET2 have also been linked to multilineage clonal hematopoiesis in aging healthy individuals[24]. Insight into the phenotype of the normal cell from which clonal expansion can initiate was first provided by the pioneering studies of Fialkow in CML, which demonstrated that BCR-ABL1 arises in a multipotential HSC[25]. However, for the majority of AML cases that arise de novo without any prior clinical perturbations, insight into the cellular context and functional consequences of the earliest genetic lesions requires identification and examination of ancestral cells within the diagnostic sample. Recent studies have found that only a subset of mutations contained in AML blasts were present in HSC-enriched cell fractions isolated from AML patient samples, and that these cells were capable of non-leukemic differentiation[26,27]. Here we establish that these ancestral preL-HSC present at diagnosis are able to regenerate the entire hematopoietic hierarchy while possessing competitive repopulation advantage over non-leukemic HSC leading to clonal expansion. These preL-HSC are found in a high proportion of AML patients that carry mutations in DNMT3a and IDH2, and unlike AML blasts, they survive induction chemotherapy and persist in the bone marrow at remission, providing a potential reservoir for leukemic progression.
Results
During studies to examine intra-tumoral genetic heterogeneity in AML, deep targeted sequencing (Tar-seq, read depth ~250×) of 103 commonly mutated leukemia genes (Fig. 1a, Supplementary Table 1) was carried out on peripheral blood (PB) samples from 12 patients at diagnosis (blasts >80%) (Supplementary Table 2, 3). Normal T-cells were expanded in vitro to provide non-leukemic tissue for genetic comparison. Consistent with mutant allele frequencies reported in recent studies[28]
DNMT3a was found in 4 of 12 samples (mutant allele frequency ~50%) (Fig. 1a). Unexpectedly, in 3 of these 4 patients, DNMT3a was detected in T-cells at a low allele frequency (1–20%). Other genetic alterations including NPM1c were found only in PB but not T-cell samples, ruling out AML cell contamination of cultured T-cells. To estimate the proportion of AML cases with DNMT3a-bearing T-cells, 71 additional samples, taken at diagnosis from patients with normal cytogenetics, were screened by Sanger sequencing for DNMT3a along with other common AML mutations (Supplementary Table 4). Consistent with published data[29-31], 17 of 71 AML samples (24%) carried mutations in DNMT3a, and 15 of these 17 (88%) also carried NPM1c. For these 17 patients, the allele frequency of DNMT3a and NPM1c in CD33+ blasts, as well as corresponding freshly isolated T-cell controls, was measured by droplet digital PCR (ddPCR) at a sensitivity of 1 mutated allele in 1000 reference alleles. Whereas both DNMT3a and NPM1c were always present in blasts at similar allele frequency, DNMT3a with no evidence of NPM1c was detected in T-cells from 12 of these 17 patients (70.5%) (Fig. 1b). In addition, FLT3-ITD was detected in blasts but not the T-cells of 2 patients bearing this mutation (Extended Data Fig. 1). These data reveal the sequential order of mutation acquisition in these patients, with DNMT3a arising earlier in leukemogenesis than NPM1c and FLT3-ITD, a conclusion predicted from recent studies on bulk AML blasts showing that NPM1c and FLT3-ITD occur late and are the only genes recurrently mutated in DNMT3a AML[15,28,32]. Moreover, our findings establish that DNMT3a occurs in an ancestral cell that gives rise to both T-cells and the dominant AML clone present at diagnosis.
Figure 1
Recurrent somatic DNMT3a mutations are common in T-cells from AML patients
a, Summary of the allele frequency (%) of missense and frameshift somatic single nucleotide variants (sSNV) in AML-related genes assessed by deep targeted sequencing (read depth 250×) in AML blasts and T cells from the peripheral blood of 12 AML patients. The sSNV numbers indicated at the top of the table correspond to the numbers in Supplementary Table 3. Somatic mutations in DNMT3a (*, R882H; †, R137C) were found in both T-cells and AML blasts in Patients #9, 11 and 12. Patient #12 also had a low frequency IDH2 mutation (‡, R140L) in T cells. b, Frequency (%) of DNMT3a and NPM1c in freshly isolated CD33+ blasts (AML) and matched T-cell controls from 17 patients with normal karyotype AML, as determined by droplet digital PCR. For a and b, the length of the bars is proportional to the mutant allele frequency (the scale bar under the first column applies to all columns).
Extended Data Figure 1
FLT3-ITD is a late event in patients carrying DNMT3A mutation
PCR analysis of FLT3-ITD[50] in stem/progenitor, mature lymphoid and blast (CD45dim CD33+) cell populations from patient no. 13 (a) and no. 14 (b). FLT3-ITD was present in the blasts from both patients, and also in MLPs from patient no. 14. In contrast, DNMT3Amut without FLT3-ITD was detected in multiple non-blast cell populations (see Extended Data Fig. 2). HSC, haematopoietic stem cell; MPP, multipotent progenitor; CMP, common myeloid progenitor; MLP, multilymphoid progenitor; GMP, granulocyte monocyte progenitor; NK, natural killer cells.
To gain insight into the properties of the ancestral cell within which DNMT3a first arises, we examined additional non-leukemic hematopoietic cell populations from 11 DNMT3a AML patients. A high resolution 12-parameter sorting strategy[33-35] was employed to isolate non-leukemic hematopoietic stem and progenitor populations, including hematopoietic stem cells/multipotent progenitors (HSC/MPP), multilymphoid progenitors (MLP), common myeloid progenitors (CMP), granulocyte monocyte progenitors (GMP), and megakaryocyte erythroid progenitors (MEP), as well as mature B, T and natural killer (NK) cells within the CD33− cell fraction. Together with CD45dimCD33+ AML blasts, these highly purified, phenotypically defined normal cell populations were assessed by ddPCR for DNMT3a and NPM1c (Fig. 2, Extended Data Fig. 2, 3). DNMT3a was found together with NPM1c in CD33+ blasts from all patients. By contrast, we found DNMT3a at variable allele frequency without NPM1c across the spectrum of mature and progenitor cell populations. Results for a representative patient (#11) are shown in Fig. 2a–b. In this patient, DNMT3a was present in HSC/MPP at an allele frequency of 12–30% without detectable NPM1c. Although the clonal contribution that an individual normal HSC makes in humans is unknown, studies in higher primates estimated that single HSC provide approximately 0.5% clonal contribution during steady-state hematopoiesis[36]. Thus, the high DNMT3a allele frequency in HSC/MPP points to their clonal expansion as compared to non-mutated HSC. In Patient #11, DNMT3a was also present in all downstream progenitors at variable frequencies. The mean allele frequency among HSC/MPP, MLP and CMP was 24.6% across all patients analyzed (Extended Data Fig. 2). Importantly, even for patients in whom DNMT3a was not detected in mature cells, DNMT3a without NPM1c was found in stem/progenitor populations (Fig. 2c), providing further strong evidence that DNMT3a precedes NPM1c during leukemogenesis. Interestingly, in 3 patients (#10, 14, 16), DNMT3a was detected in CMP but not HSC/MPP, an outcome consistent with the existence of DNMT3a-bearing HSC below our detection limit that generated a clonally expanded CMP population, or possibly the existence of a preceding lesion in HSC/MPP with later acquisition of DNMT3a in CMP. Resolution of this question will require whole genome sequencing of sorted blast and phenotypically normal populations from patient samples. Our analysis also showed that in 6 of 11 patients, both DNMT3a and NPM1c were found together in MLP and/or GMP populations, pointing to the likely progenitor cell types where overt AML driven by NPM1c arises. Collectively, our findings provide key insights into the leukemogenic process in human AML and confirm historical predictions from early clonality studies of the existence of a pre-leukemic state[3,37].
Figure 2
DNMT3a mutation precedes NPM1 mutation in human AML and is present in stem/progenitor cells at diagnosis and remission
a, Flow cytometric analysis showing the gating strategy used to isolate phenotypically normal stem and progenitor cell populations from AML patient samples. Plots show analysis of samples from Patient #11: diagnosis (day 0, peripheral blood mononuclear cells), remission (day 62, CD34+ enriched bone marrow) and relapse (day 379, peripheral blood mononuclear cells). b, Allele frequency of DNMT3a and NPM1 mutations in stem/progenitor, mature lymphoid and blast (CD45dim CD33+) cell populations, as indicated, isolated from diagnosis (blue), remission (white) and relapse (red) samples of Patient #11 as determined by droplet digital PCR (ddPCR). At remission, CD33+ myeloid cells were also analyzed. c, Summary of the occurrence of DNMT3a and NPM1c in isolated stem/progenitor, mature and blast cell populations from 11 AML patient peripheral blood samples as determined by ddPCR. White, DNMT3a or NPM1c not detected; gray, DNMT3a alone; black, DNMT3a + NPM1c. NA, no population detected; HSC, hematopoietic stem cell; MPP, multipotent progenitor; MLP, multilymphoid progenitor; CMP, common myeloid progenitor; GMP, granulocyte monocyte progenitor; MEP, megakaryocyte erythroid progenitor; NK, natural killer cells. d, Graphic representation of DNMT3a allele frequency in sorted cell populations isolated from diagnosis (0 months), early (3) and late (36) remission samples of Patient #57.
Extended Data Figure 2
Frequent occurrence of DNMT3A mutation without NPM1 mutation in stem/progenitor and mature lymphoid cells in AML patients at diagnosis
a, Summary of the allele frequency (%) of DNMT3A and NPM1 mutations in stem/progenitor, mature lymphoid, and blast (CD45dim CD33+) cell populations from 11 AML patient peripheral blood samples obtained at diagnosis, as determined by droplet digital PCR (ddPCR). Phenotypically normal cell populations were isolated by fluorescence activated cell sorting according to the strategy depicted in Fig. 2a. Mutant allele frequency ~50% is consistent with a heterozygous cell population. Departures from 50% mutant allele frequency may be stochastic[51], related to clonal heterogeneity, or due to the presence of copy number variations, for example loss of the wild type allele (loss of heterozygosity) or amplification of the mutant allele. NA, no cell population detected; HSC, haematopoietic stem cell; MPP, multipotent progenitor; CMP, common myeloid progenitor; MEP, megakaryocyte erythroid progenitor; MLP, multilymphoid progenitor; GMP, granulocyte monocyte progenitor; NK, natural killer cells. Blank boxes indicate no DNMT3A or NPM1 mutation detected. b, Representative plots showing ddPCR analysis of DNMT3Amut and NPM1c allele frequency in sorted cell populations from patient no. 11. The mutant allele frequency (%) is indicated on each plot.
Extended Data Figure 3
Phenotypically normal stem/progenitor and mature cell populations are present in AML patient samples at diagnosis, remission and relapse
Flow cytometric analysis showing the gating strategy used to isolate phenotypically normal stem/progenitor and mature lymphoid cell populations from AML patient samples. Diagnosis and relapse samples are from peripheral blood; remission samples are from bone marrow.
To examine how DNMT3a affects population dynamics during leukemic progression, we undertook temporal analysis of mature and progenitor cells from 5 patients (#11, 28, 35, 55, 57) sampled at diagnosis, remission (3 months) or relapse (Fig. 2b, d, Extended Data Fig. 3, 4). Compared to diagnosis, the allele frequency of DNMT3a alone was similar or higher at remission (Patients #11, 28, 35, 57) and relapse (Patient #11, 55). Although CD33+ leukemic blasts at diagnosis always carried both mutations, CD33+ myeloid cells at remission bore only DNMT3a, suggesting that they are not AML blasts but the progeny of DNMT3a-bearing progenitors with preserved myeloid differentiation capacity. In the relapse sample of Patient #11, both mutations were present in the majority of cells, with the exception of HSC/MPP in which a proportion carried only DNMT3a. Patient #57 was a long-surviving patient that allowed a comparison of early and late (36 months) remission samples and showed a striking increase in DNMT3a allele frequency in most cell populations over time (Fig. 2d). In addition, a small proportion of CD33+ myeloid cells in the late remission PB sample contained both DNMT3a and the NPM1c mutation found at diagnosis, suggesting either regrowth of the diagnostic leukemic clone or emergence of a new clone following an independent NPM1c mutation event within the preleukemic pool. Collectively, our data indicate that the ancestral cell that bears DNMT3a without NPM1c is an HSC/MPP capable of multilineage differentiation. Moreover, these ancestral HSC/MPP survive chemotherapy, expand during remission, and might serve as a reservoir for clonal evolution leading to recurrent disease.
Extended Data Figure 4
Cells bearing mutations in DNMT3A but not NPM1 are present at diagnosis in AML patients and persist at remission and relapse
Allele frequency of DNMT3A and NPM1 mutations of patients no. 28, 35, 55, and 57 in stem/progenitor, mature and blast (CD45dim CD33+) cell populations, as determined by droplet digital PCR (ddPCR). Cells were isolated from diagnosis (blue), early remission (white), relapse (red) or late remission (yellow) samples. At remission, CD33+ myeloid cells were also analysed. HSC, haematopoietic stem cell; MPP, multipotent progenitor; MLP, multilymphoid progenitor; CMP, common myeloid progenitor; GMP, granulocyte monocyte progenitor; MEP, megakaryocyte erythroid progenitor; NK, natural killer cells.
To establish conclusively whether phenotypically defined DNMT3a-bearing HSC/MPPs are functional HSC and whether competitive repopulation advantage underlies their in vivo clonal expansion, we undertook xenograft repopulation assays. Mononuclear cells from the PB of 2 patients at diagnosis (#11, 55) with DNMT3a allele frequency in HSC/MPP of 30% and 20% respectively were transplanted into cohorts of immune-deficient mice using a limiting dilution approach and analyzed after 8 and 16 weeks (Fig. 3, Extended Data Fig. 5). In this xenograft model, leukemic engraftment is characteristically seen as a dominant myeloid (CD45dimCD33+) graft, whereas non-leukemic grafts are multilineage and contain both lymphoid (predominantly CD19+ B cells) and myeloid (CD33+) cells (Extended Data Fig. 6). For Patient #11, multilineage engraftment was seen in 24 of 35 mice, giving a calculated frequency of 1 repopulating HSC in 7.3×105 cells (Extended Data Fig. 5a). Only a single graft contained more than 50% CD33+ myeloid cells, suggesting co-engraftment by a leukemia stem cell (LSC) that was present at low frequency[38]. We analyzed by ddPCR 12 of the multilineage xenografts following 16 weeks of repopulation. Ten of these contained a high proportion of cells bearing DNMT3a without NPM1c (mean allele frequency 57%), whereas both DNMT3a and NPM1c were present in the single mouse with significant myeloid engraftment (Fig. 3b). Kinetic analysis demonstrated increasing DNMT3a allele frequency in multilineage grafts over time (Fig 3c). Similar results were found for Patient #55 (Extended Data Fig. 5 and data not shown). In contrast, cells from the relapse sample of both patients generated leukemic grafts and no multilineage grafts (Fig. 3a and data not shown), consistent with a higher LSC frequency at relapse compared to diagnosis[38]. Together, these data provide evidence that DNMT3a occurs in HSC/MPP capable of generating a long-term multilineage lympho-myeloid graft, confirming their designation as pre-leukemic HSC[26] (preL-HSC). DNMT3a also endows preL-HSC with a competitive repopulation advantage over non-mutated HSC explaining the clonal expansion of preL-HSC in patients at the time of diagnosis and during remission.
Figure 3
Pre-leukemic HSC bearing DNMT3a generate multilineage engraftment and have a competitive advantage in xenograft repopulation assays
a, Representative flow cytometric analysis of engrafted human cells harvested from NSG mouse bone marrow (BM) 16 weeks after intrafemoral (i.f.) transplantation of peripheral blood mononuclear cells (PB MNC) from diagnosis and relapse samples of Patient #11. b, Analysis of human graft composition in NSG mouse BM 16 weeks after i.f. transplantation of PB MNC from the diagnosis sample of Patient #11 across a range of cells doses. The percentage of human (CD45+) B (CD19+) and myeloid (CD33+) cells was determined by flow cytometry. Mutant allele frequency (%) in the human graft was determined by droplet digital PCR (ddPCR) analysis of sorted human cells. The length of the bars is proportional to the mutant allele frequency (the scale bar under the first column applies to all columns). c, Summary of DNMT3a allele frequency in the human graft from mice analyzed by ddPCR 8 and 16 weeks after transplantation of PB MNC from Patient #11, compared to isolated hematopoietic stem cell/multipotent progenitors (HSC/MPP) from the patient’s PB at diagnosis. *, P<0.05. Bars indicate mean and standard deviation.
Extended Data Figure 5
PreL-HSCs in the peripheral blood of AML patients generate multilineage human grafts in immunodeficient mice
Summary of results of limiting dilution experiments to assess frequency of pre-leukaemic HSCs generating multilineage grafts after xenotransplantation. Cohorts of NSG mice were transplanted intrafemorally with varying numbers of peripheral blood mononuclear cells from diagnostic samples of AML patient no. 11 (a) and no. 55 (b) and analysed after 8 or 16 weeks by flow cytometry. Engraftment was defined as >0.1% human CD45+ cells in the injected right femur. Shown is the number of mice with multilineage human grafts containing both CD33+ myeloid cells and CD33−CD19+ cells. The frequency of pre-leukaemic HSCs was calculated using the ELDA platform[49].
Extended Data Figure 6
Frequent generation of non-leukaemic multilineage human grafts following xenotransplantation of peripheral blood cells from AML patients
Summary of xenograft characteristics in 123 sublethally irradiated NSG mice transplanted intrafemorally with mononuclear peripheral blood cells from 20 AML patients at diagnosis and analysed after 8 weeks by flow cytometry. The proportion of myeloid (CD33+) and B-lymphoid (CD33−CD19+) cells in the human (CD45+) graft is shown. Leukaemic (AML) engraftment is characterized by a dominant myeloid (CD45dimCD33+) graft, whereas non-leukaemic multilineage grafts contain both lymphoid (predominantly CD33−CD19+ B cells) and myeloid (CD33+) cells. No leukaemic or multilineage graft could be detected in 65/123 mice (53%) in this cohort. Red box indicates AML grafts (27 mice, 22%); blue box indicates multilineage grafts (31 mice, 25%).
Our xenograft results indicate that when preL-HSC exists at higher frequency than LSC, non-leukemic multilineage grafts, rather than leukemic grafts, are frequently generated. Examination of our historical xenograft data from 264 diagnostic AML samples revealed that 37% did not generate any graft, 40% generated leukemia, and 23% gave rise to non-leukemic multilineage grafts (Extended Data Fig. 6). Sanger sequencing data was available for 25 samples that generated multilineage grafts (Supplementary Table 4) revealing that 10 of 25 (40%) came from patients bearing DNMT3a; IDH1/2 mutations were present in 12 patients, including 3 who had both DNMT3a and IDH1/2 mutations (Fig. 4a, 4b). It was not possible to determine conclusively whether these xenografts were generated by preL-HSC, as they were not available for mutation testing. To examine whether pre-leukemic cells also exist in patients with IDH1/2 mutations, we analyzed samples from 3 patients with IDH1 and 3 patients with IDH2 mutation by high resolution cell sorting and ddPCR. In 4 patients, no pattern of preceding mutation was detected in non-leukemic cell populations. However, in 2 patients we found IDH2 mutation without NPM1c in a number of progenitor and mature populations (Fig. 4c), suggesting that IDH2 mutation might also occur as a pre-leukemic event.
Figure 4
Identification of preL-HSC with IDH2 mutation
a, Summary of the occurrence of mutations in NPM1, DNMT3a, and IDH1/2 determined by Sanger sequencing, in AML patient peripheral blood samples (n=25) that generated a non-leukemic multilineage graft after transplantation into immune-deficient mice. b, Representation of the proportion (%) of AML patient samples with DNMT3a and/or IHD1/2 mutations among samples that generated a non-leukemic multilineage graft in xenotransplanted mice. c,
IDH2 and NPM1 mutant allele frequency (%) in stem/progenitor, mature lymphoid and blast (CD45dimCD33+) cell populations isolated from the peripheral blood of Patients #52, 64 and 77 at diagnosis, as determined by droplet digital PCR (ddPCR). Blank boxes indicate no mutation detected.
Our data predict that DNMT3a mutation may occur in healthy adults and pre-date AML diagnosis by months or even years. Through searches of exome sequence databases derived from PB (https://esp.gs.washington.edu/drupal/) we found that the frequency of the DNMT3a R882H variant (rs147001633) was 0.066% (3 in 4545). Although this was considered to be a germline variant in this healthy adult cohort, our findings raise the possibility that the mutations detected in these studies may have originated from an HSC/MPP containing an acquired somatic DNMT3a mutation that underwent clonal expansion.
Discussion
Our study provides a number of key insights into the leukemogenic process in human AML. Our findings establish the sequential order of mutation acquisition for the patients reported here: DNMT3a occurs before NPM1c and FLT3-ITD. Additionally, we provide strong evidence for the presence, at diagnosis, of preL-HSCs that are ancestral to the dominant AML clone. Based on our data, preL-HSC are prevalent among patients with DNMT3a, which account for 25% of adult AML cases; additionally, our multilineage engraftment data suggest that preL-HSC may also exist in a proportion of AML patients with IDH2 mutations. Pre-leukemic progenitors of varying phenotypes have been reported in other types of hematologic malignancies[25,39-41], although functional studies were limited. Our work supports prior studies identifying phenotypic primitive cells that bear only a subset of mutations found in AML blasts[26,27,42]. The more precise analysis of highly resolved HSC and progenitor populations that we have undertaken provides novel insight into the identity and proportional contribution of the stem/progenitor populations that acquire pre-leukemic lesions. Furthermore, our work demonstrates that DNMT3a confers a functional repopulation advantage to preL-HSC over wild type HSC in xenograft assays, which likely underlies the clonal expansion of preL-HSC observed in patients at the time of diagnosis. Our study is consistent with mouse studies showing that HSC lacking DNMT3a have a competitive growth advantage[43,44], and with a recent report predicting that the human DNMT3a results in loss of function[45].Collectively, our results support a model wherein the cell of origin for DNMT3a AML is an HSC and the initiating DNMT3a mutation results in the generation of an expanded pool of HSC and downstream progenitors, within which additional mutations such as NPM1c are acquired, driving progression to AML. In the samples studied here, our findings point to GMP and/or MLP as the likely populations in which NPM1c was acquired. Our experimental design provides a framework for the future identification of other early events in leukemogenesis, using generation of multilineage xenografts as a surrogate assay to identify AML samples that may contain preL-HSC, and for examining how these changes disrupt normal HSC function, cause clonal expansion and initiate leukemic development.Our results have broad clinical implications. Prior studies in T-ALL[46] and B-ALL[7,11,14] revealed the existence of genetically diverse subclones at diagnosis. As found originally in these diseases[47] and now in AML, in approximately 50% of patients the relapse clone is not related to the predominant clone at diagnosis but rather to a minor leukemic subclone [10,15] or to a predicted ancestral clone[47], Our direct demonstration that ancestral clones persist at remission suggests that preL-HSCs are resistant to induction chemotherapy and for some patients they might represent a reservoir from which relapse arises. If future phylogenetic single cell lineage analysis establishes this possibility, then preL-HSC should be directly targeted to prevent relapse. As new drugs are developed that effectively target mutations in DNMT3a or other genes (e.g. AG-221, an IDH2 inhibitor) that give rise to preL-HSC, there may be an opportunity to eradicate these preL-HSC clones before the acquisition of additional mutations renders them more resistant to therapy. Our findings also support broadening the definition of minimal residual disease to include not only the post-therapy survival of AML blasts and LSCs but also preL-HSC. Practically, this suggests that for patients with both DNMT3a and NPM1c, the residual level of both mutations and not NPM1c alone should be monitored. Finally, our database analysis showing that the DNMT3a R882H variant is present in blood samples from normal adults should stimulate new studies investigating whether preL-HSC are present in healthy individuals and determining the risk of progression to AML; these may in turn enable earlier diagnosis for those patients who present without prior overt hematologic disturbances. Of note, a recent report describes the development of donor-derived leukemia in a patient 27 months after receiving allogeneic HSC from an HLA-matched sibling, whose donated PB cells were later found to carry mutations in IDH2 and DNMT3a at low frequency[48]. Interestingly, the donor remains free of leukemia 10 years after his stem cell donation, suggesting that the mechanisms underlying the progression from subclinical pre-leukemic hematopoiesis to overt leukemia are complex and context dependent.
Methods
Targeted sequencing of leukemia-associated genes
Genomic DNA (gDNA) was subjected to limited whole genome amplification (RepliG, Qiagen) to obtain the required amount of input DNA for the SureSelect protocol. Amplified gDNA was mechanically sheared using the Covaris M220 Focused-ultrasonicator, and Illumina sequencing adaptors were ligated to fragments to make a sequencing library, which was then hybridized with 120mer biotinylated RNA library baits to capture the regions of interest. Baits were designed to capture the coding sequence of the 103 leukemia-associated genes listed in Supplementary Table 1 (total target size ~370 Kb). The targeted regions were pulled out using magnetic streptavidin beads and amplified. The resulting amplified library was quantified and sequenced on the Illumina HiSeq 2000 platform to an average on target coverage of 250×. Reads were aligned to the reference human genome build hg19 using Novoalign (Novocraft Inc.) and on-target single nucleotide variants (SNVs) and indels were called using the genome analysis tool kit (GATK). Somatic SNVs were called in AML blasts with a read depth of at least 30×. The list of contributors to the gene list is provided in Supplementary Note 1.
T cell isolation and expansion from primary AML samples
CD3+ cells were isolated from peripheral blood (PB) AML patient samples using EasySep (Stem Cell Technologies) and re-suspended at a concentration of 1×107cells/2 mL in RPM1 + 10% FBS-HI + rhIL-2 (250 IU/mL, Proleukin, Chiron) + anti-CD28 antibody (5 ug/mL, clone CD28.2, eBioscience). Cells were then added to one well of a 24-well plate that had been pre-coated for 2 hours with anti-CD3 antibody (Clone OKT3, eBioscience) and cultured for 4 days at 37°C with 5% CO2. Cells were harvested on day 4, resuspended in fresh RPMI + 10% FBS-HI + rhIL2 (250 IU/ml) and replated into one well of a 6-well plate. Cells were further cultured and expanded for 14–20 days, feeding with fresh full medium containing rhIL-2 (250 IU/ml) every 3–4 days. At the end of T cell expansion, the purity of CD3+ T cells was checked by flow cytometry. DNA from the cultured T cells was extracted by PureGene Cell kit (Qiagen).
Droplet digital PCR (ddPCR)
Genomic DNA (25ng) or amplified DNA (2 ul from a 1:20 dilution of a 16 hour RepliG whole genome amplification) was subjected to ddPCR in a 96-well plate according to the manufacturer’s protocol. Each sample was tested in duplicate. The plate was then loaded onto a droplet reader with a two color FAM/VIC fluorescence detector. The mutant allele frequency was calculated as the fraction of positive droplets divided by total droplets containing a target. The TaqMan probes and primers used for ddPCR are listed in Supplementary Table 5. To evaluate the detection limits of the ddPCR assay, a standard curve was generated using serial dilutions of DNA with a known mutation frequency mixed with non-mutated DNA. For probes listed in Supplementary Table 5, the minimum detection level was 1:1000 (0.1%).
Fluorescence activated cell sorting of human stem/progenitor and mature cell populations
Mononuclear cells (1×106/100 μl) from peripheral blood or bone marrow of AML patients were stained with the following antibodies (all from BD unless stated otherwise, dilution used and catalogue number in parentheses): anti-CD45RA-FITC (1:25, 555488), anti-CD90-APC (1:50, 561971), anti-CD135-Biotin (1:10, 624008), anti-CD38-PE-Cy7 (1:200, 335790), anti-CD10-Alexa-700 (1:10, 624040), anti-CD7-Pacific Blue (1:50, 642916), anti-CD45-V500 (1:200, 560777), anti-CD34-APC-Cy7 (1:100, custom made by BD, CD34 clone 581), anti-CD34-PerCP-Efluor 710 (1:100, e-Bioscience 46-0344-42), anti-CD33-PE-Cy5 (1:100, Beckman Coulter PNIM2647U), anti-CD19-PE (1:200, 349204), anti-CD3-FITC (1:100, 349201), anti-CD56-Alexafluor 647 (1:100, 557711), and Streptavidin-QD605 (1:200, Invitrogen Q10101MP). Samples from Patients #1, 10, 11 (remission sample only), 32, 35, and 55 were enriched for CD34+ cells using a Miltenyi CD34 MicroBead kit according to the manufacturer’s protocol prior to antibody staining. Cells were sorted on a FACS AriaIII to a post-sort purity of >95%.
Xenotransplantation assays
Animal experiments were performed in accordance with institutional guidelines approved by the UHN Animal Care Committee. 8 to 12-week-old female NOD/SCID/IL-2Rgc-null (NSG) mice were sublethally irradiated (225 cGy) 6–24 hours before transplantation. Mononuclear cells from AML patients were depleted of CD3+ cells by EasySep (Stem Cell Technologies) prior to intrafemoral transplantation. Mice were sacrificed 8 or 16 weeks after transplantation and human engraftment in the injected femur and non-injected bone marrow was evaluated by flow cytometry using the following human-specific antibodies (all used at 1:200, all from BD unless stated otherwise, catalogue number in parentheses): anti-CD45-APC (340943), anti-CD19-PE, anti-CD33-PE-Cy5, anti-CD3-FITC, anti-CD14-PE Texas Red (Beckman Coulter PNIM2707U), anti-CD15-Pacific Blue (642917), anti-CD38-PE-Cy7, and anti-CD34-APC-Cy7. The threshold for detection of human engraftment was 0.1% CD45+ cells. All flow cytometric analysis was performed on the LSRII (BD Biosciences). For limiting dilution assays, the frequency of repopulating cells was calculated using ELDA software[49].
Statistical analysis
For the initial targeted sequencing analysis, 12 independent patient samples were studied to capture the biologic diversity of AML. For validation of the DNMT3a findings, 71 samples were screened in order to identify at least 15 with DNMT3a mutations, as predicted by the known prevalence of DNMT3a mutation in AML. For limiting dilution analyses, at least 25 xenografts were analyzed for each patient sample to ensure a large enough sample for statistical comparison. No animals or samples were excluded from any analysis. No formal randomization method was applied when assigning animals to different experimental groups. Group allocation and outcome assessment was not done in a blinded manner, including for animal studies. Frequency estimations were generated using the ELDA software, which takes into account whether the assumptions for LDA are met (http://bioinf.wehi.edu.au/software/elda/index.html, provided by the Walter and Eliza Hall Institute)[49]. P-values were derived using two-tailed Student’s t-tests. In each group of data, estimate variation was taken into account and is indicated as standard deviation. For all graphs, * p=0.01–0.05, ** p=0.001–0.01, and *** p<0.001.
FLT3-ITD is a late event in patients carrying DNMT3A mutation
PCR analysis of FLT3-ITD[50] in stem/progenitor, mature lymphoid and blast (CD45dim CD33+) cell populations from patient no. 13 (a) and no. 14 (b). FLT3-ITD was present in the blasts from both patients, and also in MLPs from patient no. 14. In contrast, DNMT3Amut without FLT3-ITD was detected in multiple non-blast cell populations (see Extended Data Fig. 2). HSC, haematopoietic stem cell; MPP, multipotent progenitor; CMP, common myeloid progenitor; MLP, multilymphoid progenitor; GMP, granulocyte monocyte progenitor; NK, natural killer cells.
Frequent occurrence of DNMT3A mutation without NPM1 mutation in stem/progenitor and mature lymphoid cells in AML patients at diagnosis
a, Summary of the allele frequency (%) of DNMT3A and NPM1 mutations in stem/progenitor, mature lymphoid, and blast (CD45dim CD33+) cell populations from 11 AML patient peripheral blood samples obtained at diagnosis, as determined by droplet digital PCR (ddPCR). Phenotypically normal cell populations were isolated by fluorescence activated cell sorting according to the strategy depicted in Fig. 2a. Mutant allele frequency ~50% is consistent with a heterozygous cell population. Departures from 50% mutant allele frequency may be stochastic[51], related to clonal heterogeneity, or due to the presence of copy number variations, for example loss of the wild type allele (loss of heterozygosity) or amplification of the mutant allele. NA, no cell population detected; HSC, haematopoietic stem cell; MPP, multipotent progenitor; CMP, common myeloid progenitor; MEP, megakaryocyte erythroid progenitor; MLP, multilymphoid progenitor; GMP, granulocyte monocyte progenitor; NK, natural killer cells. Blank boxes indicate no DNMT3A or NPM1 mutation detected. b, Representative plots showing ddPCR analysis of DNMT3Amut and NPM1c allele frequency in sorted cell populations from patient no. 11. The mutant allele frequency (%) is indicated on each plot.
Phenotypically normal stem/progenitor and mature cell populations are present in AML patient samples at diagnosis, remission and relapse
Flow cytometric analysis showing the gating strategy used to isolate phenotypically normal stem/progenitor and mature lymphoid cell populations from AML patient samples. Diagnosis and relapse samples are from peripheral blood; remission samples are from bone marrow.
Cells bearing mutations in DNMT3A but not NPM1 are present at diagnosis in AML patients and persist at remission and relapse
Allele frequency of DNMT3A and NPM1 mutations of patients no. 28, 35, 55, and 57 in stem/progenitor, mature and blast (CD45dim CD33+) cell populations, as determined by droplet digital PCR (ddPCR). Cells were isolated from diagnosis (blue), early remission (white), relapse (red) or late remission (yellow) samples. At remission, CD33+ myeloid cells were also analysed. HSC, haematopoietic stem cell; MPP, multipotent progenitor; MLP, multilymphoid progenitor; CMP, common myeloid progenitor; GMP, granulocyte monocyte progenitor; MEP, megakaryocyte erythroid progenitor; NK, natural killer cells.
PreL-HSCs in the peripheral blood of AML patients generate multilineage human grafts in immunodeficient mice
Summary of results of limiting dilution experiments to assess frequency of pre-leukaemic HSCs generating multilineage grafts after xenotransplantation. Cohorts of NSG mice were transplanted intrafemorally with varying numbers of peripheral blood mononuclear cells from diagnostic samples of AML patient no. 11 (a) and no. 55 (b) and analysed after 8 or 16 weeks by flow cytometry. Engraftment was defined as >0.1% human CD45+ cells in the injected right femur. Shown is the number of mice with multilineage human grafts containing both CD33+ myeloid cells and CD33−CD19+ cells. The frequency of pre-leukaemic HSCs was calculated using the ELDA platform[49].
Frequent generation of non-leukaemic multilineage human grafts following xenotransplantation of peripheral blood cells from AML patients
Summary of xenograft characteristics in 123 sublethally irradiated NSG mice transplanted intrafemorally with mononuclear peripheral blood cells from 20 AML patients at diagnosis and analysed after 8 weeks by flow cytometry. The proportion of myeloid (CD33+) and B-lymphoid (CD33−CD19+) cells in the human (CD45+) graft is shown. Leukaemic (AML) engraftment is characterized by a dominant myeloid (CD45dimCD33+) graft, whereas non-leukaemic multilineage grafts contain both lymphoid (predominantly CD33−CD19+ B cells) and myeloid (CD33+) cells. No leukaemic or multilineage graft could be detected in 65/123 mice (53%) in this cohort. Red box indicates AML grafts (27 mice, 22%); blue box indicates multilineage grafts (31 mice, 25%).
Authors: Faiyaz Notta; Sergei Doulatov; Elisa Laurenti; Armando Poeppl; Igor Jurisica; John E Dick Journal: Science Date: 2011-07-08 Impact factor: 47.728
Authors: Peter J Campbell; Shinichi Yachida; Laura J Mudie; Philip J Stephens; Erin D Pleasance; Lucy A Stebbings; Laura A Morsberger; Calli Latimer; Stuart McLaren; Meng-Lay Lin; David J McBride; Ignacio Varela; Serena A Nik-Zainal; Catherine Leroy; Mingming Jia; Andrew Menzies; Adam P Butler; Jon W Teague; Constance A Griffin; John Burton; Harold Swerdlow; Michael A Quail; Michael R Stratton; Christine Iacobuzio-Donahue; P Andrew Futreal Journal: Nature Date: 2010-10-28 Impact factor: 49.962
Authors: Jay P Patel; Mithat Gönen; Maria E Figueroa; Hugo Fernandez; Zhuoxin Sun; Janis Racevskis; Pieter Van Vlierberghe; Igor Dolgalev; Sabrena Thomas; Olga Aminova; Kety Huberman; Janice Cheng; Agnes Viale; Nicholas D Socci; Adriana Heguy; Athena Cherry; Gail Vance; Rodney R Higgins; Rhett P Ketterling; Robert E Gallagher; Mark Litzow; Marcel R M van den Brink; Hillard M Lazarus; Jacob M Rowe; Selina Luger; Adolfo Ferrando; Elisabeth Paietta; Martin S Tallman; Ari Melnick; Omar Abdel-Wahab; Ross L Levine Journal: N Engl J Med Date: 2012-03-14 Impact factor: 91.245
Authors: Timothy J Ley; Li Ding; Matthew J Walter; Michael D McLellan; Tamara Lamprecht; David E Larson; Cyriac Kandoth; Jacqueline E Payton; Jack Baty; John Welch; Christopher C Harris; Cheryl F Lichti; R Reid Townsend; Robert S Fulton; David J Dooling; Daniel C Koboldt; Heather Schmidt; Qunyuan Zhang; John R Osborne; Ling Lin; Michelle O'Laughlin; Joshua F McMichael; Kim D Delehaunty; Sean D McGrath; Lucinda A Fulton; Vincent J Magrini; Tammi L Vickery; Jasreet Hundal; Lisa L Cook; Joshua J Conyers; Gary W Swift; Jerry P Reed; Patricia A Alldredge; Todd Wylie; Jason Walker; Joelle Kalicki; Mark A Watson; Sharon Heath; William D Shannon; Nobish Varghese; Rakesh Nagarajan; Peter Westervelt; Michael H Tomasson; Daniel C Link; Timothy A Graubert; John F DiPersio; Elaine R Mardis; Richard K Wilson Journal: N Engl J Med Date: 2010-11-10 Impact factor: 91.245
Authors: T Yasuda; T Ueno; K Fukumura; A Yamato; M Ando; H Yamaguchi; M Soda; M Kawazu; E Sai; Y Yamashita; M Murata; H Kiyoi; T Naoe; H Mano Journal: Leukemia Date: 2013-09-26 Impact factor: 12.883
Authors: Subarna Sinha; Daniel Thomas; Linda Yu; Andrew J Gentles; Namyoung Jung; M Ryan Corces-Zimmerman; Steven M Chan; Andreas Reinisch; Andrew P Feinberg; David L Dill; Ravindra Majeti Journal: Blood Date: 2014-11-14 Impact factor: 22.113
Authors: Michael R Burgess; Eugene Hwang; Ari J Firestone; Tannie Huang; Jin Xu; Johannes Zuber; Natacha Bohin; Tiffany Wen; Scott C Kogan; Kevin M Haigis; Deepak Sampath; Scott Lowe; Kevin Shannon; Qing Li Journal: Blood Date: 2014-10-31 Impact factor: 22.113
Authors: Sanjay S Patel; Frank C Kuo; Christopher J Gibson; David P Steensma; Robert J Soiffer; Edwin P Alyea; Yi-Bin A Chen; Amir T Fathi; Timothy A Graubert; Andrew M Brunner; Martha Wadleigh; Richard M Stone; Daniel J DeAngelo; Valentina Nardi; Robert P Hasserjian; Olga K Weinberg Journal: Blood Date: 2018-05-03 Impact factor: 22.113
Authors: Martin Culen; Zdenka Kosarova; Ivana Jeziskova; Adam Folta; Jana Chovancova; Tomas Loja; Nikola Tom; Vojtech Bystry; Veronika Janeckova; Dana Dvorakova; Jiri Mayer; Zdenek Racil Journal: J Cancer Res Clin Oncol Date: 2018-05-02 Impact factor: 4.553
Authors: Takayo Sasaki; Juan Carlos Rivera-Mulia; Daniel Vera; Jared Zimmerman; Sunny Das; Michelle Padget; Naoto Nakamichi; Bill H Chang; Jeff Tyner; Brian J Druker; Andrew P Weng; Curt I Civin; Connie J Eaves; David M Gilbert Journal: Exp Hematol Date: 2017-04-19 Impact factor: 3.084