Literature DB >> 25644247

Quantifying ultra-rare pre-leukemic clones via targeted error-corrected sequencing.

A L Young1, T N Wong2, A E O Hughes1, S E Heath2, T J Ley2, D C Link2, T E Druley1.   

Abstract

Entities:  

Mesh:

Year:  2015        PMID: 25644247      PMCID: PMC4497921          DOI: 10.1038/leu.2015.17

Source DB:  PubMed          Journal:  Leukemia        ISSN: 0887-6924            Impact factor:   11.528


× No keyword cloud information.
The quantification of rare clonal and subclonal populations from a heterogeneous DNA sample has multiple clinical and research applications for the study and treatment of leukemia. Specifically, in the hematopoietic compartment, recent reports demonstrate the presence of subclonal variation in normal and malignant hematopoiesis,[1,2] and leukemia is now recognized as an oligoclonal disease.[3] Currently, clonal heterogeneity in leukemia is studied using next-generation sequencing (NGS) targeting subclone-specific mutations. With this method, detecting mutations at 2–5% variant allele fraction (VAF) requires costly and time-intensive deep resequencing and identifying lower frequency variants is impractical regardless of sequencing depth. Recently, various methods have been developed to circumvent the error rate of NGS.[4, 5] These methods tag individual DNA molecules with unique oligonucleotide indexes, which enable error correction after sequencing. Here we present a direct application of error-corrected sequencing (ECS) to study clonal heterogeneity during leukemogenesis and validate the accuracy of this method with a series of benchmarking experiments. Specifically, we demonstrate the ability of ECS to identify leukemia-associated mutations in banked pre-leukemic blood and bone marrow from patients with either therapy-related acute myeloid leukemia (t-AML) or therapy-related myelodysplastic syndrome (t-MDS). T-AML/t-MDS occurs in 1–10% of individuals who receive alkylator- or epipodophyllotoxin-based chemotherapy or radiation to treat a primary malignancy.[6] For the seven individuals surveyed in this study, matched leukemia/normal whole-genome sequencing identified the t-AML/t-MDS-specific somatic mutations present at diagnosis. We applied our method for ECS to identify leukemia-specific mutations in four individuals from DNA extracted from blood and bone marrow samples collected years before diagnosis. In a separate study into the role of TP53 mutations in t-AML/t-MDS leukemogenesis, this method was used to identify leukemia-associated mutations at low frequency in samples banked years before diagnosis.[7] In two cases, subclones were identified below the 1% threshold of detection governed by conventional NGS. These results highlight the ability of targeted ECS to identify clinically silent single-nucleotide variations (SNVs). We employed ECS by tagging individual DNA molecules with adapters containing 16 bp random oligonucleotide molecular indexes in a manner similar to other reports.[4, 5, 8] Our implementation of ECS easily targets loci of interest by single or multiplex PCR and inserts seamlessly into the standard NGS library preparation (Supplementary Figure 1, Supplementary Methods). Our only deviations from the standard protocol are ligation of customized adapters containing random indexes instead of the manufacturer's supplied adapters and a quantitative PCR (qPCR) quantification step before sequencing (Supplementary Table 1). Following sequencing, sequence reads containing the same index and originating from the same molecule are grouped into read families. Sequencing errors are identified by comparing reads within a read family and removed to create an error-corrected consensus sequence (ECCS). We performed a dilution series experiment to assess bias during library preparation and determine the limit of detection for ECS. For this experiment, we spiked DNA from a t-AML sample into control human DNA, which was serially diluted over five orders of magnitude. The experiment was comprised of two technical replicates targeting two separate mutations (20 total independent libraries). The results demonstrate that ECS is quantitative to a VAF of 1:10 000 molecules and provides a highly reproducible digital readout of tumor DNA prevalence in a heterogeneous DNA sample (r2 of 0.9999 and 0.9991, Figures 1a and b). We next characterized the error profile based on the wild-type nucleotides included in the dilution series experiment. Variant identification using the ECCSs was 99% specific at a VAF of 0.0016 versus 0.0140 for deep sequencing alone (Figure 1c). We noticed that ECCS errors were heavily biased towards G to T transversions and to a lesser degree C to T transitions (Figure 1d, Supplementary Figure 2), as previously observed.[4, 9] When separated by substitution type, variants identified from the ECCSs were 99% specific at a VAF of 0.0034 for G to T (C to A) mutations, 0.00020 for C to T (G to A) mutations and 0.000079 for the other eight possible substitutions. Although excess G to T mutations are a known consequence of DNA oxidation leading to 8-oxo-guanine conversion,[4] the pre-treatment of samples with formamidopyrimidine-DNA glycosylase before PCR amplification did not appreciably improve the error profile of G to T mutations (Supplementary Figure 3).
Figure 1

Benchmarking for ECS and the identification of rare pre-leukemic mutations. (a, b) DNA extracted from a diagnostic leukemia sample with known mutations in RUNX1 (a) and IDH2 (b) was serially diluted into non-cancer, unrelated human DNA. Two replicates were run per sample/dilution. The coefficient of determination (r2) between diluted tumor concentration in the sample and VAF in the generated read families was 0.9999 and 0.9991 for RUNX1 and IDH2, respectively. (c) The VAF at every nucleotide not expected to contain mutations in the dilution series experiment were analyzed to determine the error profile of the error-corrected consensus sequences compared with conventional deep sequencing. A cumulative distribution function of VAF demonstrated a reduced error profile in read families relative to conventional deep sequenced reads. (d) The most frequent class of substitution seen in read families was in G to T (C to A) transversions, which was consistent with oxidative conversion of guanine to 8-oxo-guanine. (e, f) The leukemia-specific variants identified in ASXL1 and U2AF1 at diagnosis (circled) were not distinguishable from sequencing errors in the same substitution class by conventional deep sequencing. (g, h) Targeted error-corrected sequencing identified the ASXL1 variant in the 2002 banked sample at 0.004 VAF and the U2AF1 variant in the 2004 banked sample at 0.009 VAF.

As proof of principle, we applied ECS to study rare pre-leukemic clonal hematopoiesis in seven individuals who later developed t-AML/t-MDS. Leukemia/normal whole-genome sequencing at diagnosis was used to identify the leukemia-specific somatic mutations in each patient's malignancy (Supplementary Table 2). We applied targeted ECS to query these 18 different loci in 10 cryopreserved or formalin-fixed paraffin-embedded blood and bone marrow samples that were 9–22-year old and banked up to 12 years before diagnosis (Supplementary Table 3). We generated ~25 Gb of 150 bp paired-end reads from six Illumina (San Diego, CA, USA) MiSeq runs. We targeted 1–7 somatic mutations per individual (25 mutations spanning 5.5 kb from 15 genes in total) and identified leukemia-specific subclonal populations in four individuals up to 12 years before diagnosis (Table 1). For each sequencing library, we tagged ~2.5 million locus-specific amplicons generated from genomic DNA using high-fidelity PCR with randomly indexed custom adapters. Sequencing errors were removed to create ECCSs as described above. Each ECCS was then aligned to the reference genome for variant calling (Supplementary Figure 1).
Table 1

Patient-specific leukemia-associated somatic mutations identified by ECS

UPNSample IDYears priorGeneChrPositionMutAmino-acid changeVariant RFsReference RFsVAF
44629475.021OBSCN1228461129A to GH1857R61 238156 9860.2806
   TP53177578271T to AH193L220 551110 0470.6671
49925824.062RUNX12136252865C to GR139P2486 1960
57421426.047DMDX32827676G to AR187*7199 9450
64300680.0112ASXL12031022448G to TG645C785 7810.0001
   ASXL12031022442del GG645fs2 89882 2450.034
   GATA23128200135del CTTK390in_fr_del04 1870
   U2AF12144524456G to TS34Y85414 6130.0002
68494991.015ASXL12031023112T to GL866*3 583853 5980.0042
   U2AF12144524456G to TS34Y545514 4100.0011
 92.024ASXL12031023112T to GL866*54 074535 9760.0916
   U2AF12144524456G to TS34Y11 195355 2760.0305
 93.013ASXL12031023112T to GL866*17 319573 6290.0293
   U2AF12144524456G to TS34Y82792 1040.0089
85602430.021S100A41153517192A to GF27L0211 5120
   IGSF81160062252G to AP516S022 6140
   PLA2R12160798389A to GL1431P2338 6160
   POU3F2699282794C to AS15R8201 2400
   ANKRD18B933524645G to AC53Y7214 8360
   ESR21464701847G to AA416V10135 8610.0001
   FBN3198155081G to AP2029L0152 3040
94200833.049IDH21590631934C to TR88Q23 170236 5870.0892
   RUNX12136231791T to CD171G40253 1680.0002
 107.01<1IDH21590631934C to TR88Q138 180161 3710.4613
   RUNX12136231791T to CD171G368 43850 7960.8788

Abbreviations: ECS, error-corrected sequencing; RFs, read families; VAF, variant allele fraction. Two to seven mutations were queried per individual and the number of read families containing the variant allele or reference allele were reported and used to calculate the variant allele fraction.

Using conventional deep sequencing, we detected t-AML/t-MDS-specific mutations in prior banked samples at variant allele fractions between 0.03 and 0.87 (data not shown). In one individual (UPN 684949), deep sequencing alone was insufficient to distinguish known ASXL1 and U2AF1 mutations from the sequencing errors in samples banked 5 and 3 years before t-MDS diagnosis, respectively (Figures 1e and f). However, ECS identified the L866* nonsense mutation in ASXL1 at a VAF of 0.004 (Figure 1g) and the S34Y missense mutation in U2AF1 at a VAF of 0.009 (Figure 1h). In addition, ECS was able to temporally quantify these mutations from three pre-t-MDS samples banked yearly from 3 to 5 years before diagnosis (Supplementary Figures 4 and 5). In two cases (UPN643006 and UPN942008), only a subset of the variants identified at diagnosis were present in the prior banked sample (Table 1). Specifically, in the UPN643006 sample, banked 12 years before diagnosis, a single-nucleotide deletion in ASXL1 was present at VAF 0.03. But, the G to T substitution in ASXL1, CTT deletion in GATA2 and G to T substitution in U2AF1 were not detectable in this prior banked sample. Here we present a practical and clinically oriented application for targeted error-corrected NGS utilizing single molecule indexing. This method easily integrates into existing NGS library preparation protocols and enables the quantification of previously undetectable mutations in heterogeneous DNA samples. The only modification to the standard NGS library preparation is the replacement of the stock adapters with our randomly indexed adapters and the addition of a qPCR step before sequencing. The qPCR step limits the number of molecules sequenced, ensuring adequate coverage for each read family. With these two modifications, we achieve highly specific detection for rare mutations. The bioinformatics analysis is straightforward and does not require proprietary algorithms or tools (Supplementary Methods). Our results highlight the ability of this method to identify rare subclonal populations in a heterogeneous biological sample. As applied to t-AML/t-MDS, we show these previously undetectable mutations are present years before diagnosis and fluctuate in prevalence over time. A clinical application of ECS is to quantify minimal residual disease (MRD). As the genomic characterization of leukemia becomes more readily available, identifying causative genetic lesions and rare therapy-resistant subclones will become increasingly useful for risk stratification, therapeutic selection and disease monitoring. Already, whole-genome sequencing of AML has demonstrated that nearly every case of AML harbors one or more somatic SNVs.[10] These SNVs are more reliable clonal markers of malignancy than cell surface markers, which can change over time. Leveraging this information, conventional NGS was implemented retrospectively to detect MRD harboring leukemia-specific insertions/deletions (indels) as rare as 0.00001 VAF in NPM1[11] and 0.0001 VAF in RUNX1.[12] This was possible because indels are only rarely generated erroneously by NGS. Unfortunately, measuring rare leukemia-associated substitutions is limited owing to the relatively high error profile of conventional NGS.[13] However, ECS can achieve the 1:10 000 limit of detection featured by conventional MRD platforms.[14] For patients whose leukemia lacks suitable markers for conventional MRD, ECS could offer an alternative with comparable sensitivity and specificity that is easy to implement in a clinical sequencing lab. Furthermore, the ability to multiplex targets for ECS enables the surveillance of known mutations and the simultaneous discovery of new somatic mutations. Ongoing work will directly compare gold-standard MRD methods with targeted ECS in patients with and without relapsed leukemia.
  14 in total

1.  High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing.

Authors:  Dianne I Lou; Jeffrey A Hussmann; Ross M McBee; Ashley Acevedo; Raul Andino; William H Press; Sara L Sawyer
Journal:  Proc Natl Acad Sci U S A       Date:  2013-11-15       Impact factor: 11.205

2.  Detection of ultra-rare mutations by next-generation sequencing.

Authors:  Michael W Schmitt; Scott R Kennedy; Jesse J Salk; Edward J Fox; Joseph B Hiatt; Lawrence A Loeb
Journal:  Proc Natl Acad Sci U S A       Date:  2012-08-01       Impact factor: 11.205

3.  The origin and evolution of mutations in acute myeloid leukemia.

Authors:  John S Welch; Timothy J Ley; Daniel C Link; Christopher A Miller; David E Larson; Daniel C Koboldt; Lukas D Wartman; Tamara L Lamprecht; Fulu Liu; Jun Xia; Cyriac Kandoth; Robert S Fulton; Michael D McLellan; David J Dooling; John W Wallis; Ken Chen; Christopher C Harris; Heather K Schmidt; Joelle M Kalicki-Veizer; Charles Lu; Qunyuan Zhang; Ling Lin; Michelle D O'Laughlin; Joshua F McMichael; Kim D Delehaunty; Lucinda A Fulton; Vincent J Magrini; Sean D McGrath; Ryan T Demeter; Tammi L Vickery; Jasreet Hundal; Lisa L Cook; Gary W Swift; Jerry P Reed; Patricia A Alldredge; Todd N Wylie; Jason R Walker; Mark A Watson; Sharon E Heath; William D Shannon; Nobish Varghese; Rakesh Nagarajan; Jacqueline E Payton; Jack D Baty; Shashikant Kulkarni; Jeffery M Klco; Michael H Tomasson; Peter Westervelt; Matthew J Walter; Timothy A Graubert; John F DiPersio; Li Ding; Elaine R Mardis; Richard K Wilson
Journal:  Cell       Date:  2012-07-20       Impact factor: 41.582

4.  Molecular indexing enables quantitative targeted RNA sequencing and reveals poor efficiencies in standard library preparations.

Authors:  Glenn K Fu; Weihong Xu; Julie Wilhelmy; Michael N Mindrinos; Ronald W Davis; Wenzhong Xiao; Stephen P A Fodor
Journal:  Proc Natl Acad Sci U S A       Date:  2014-01-21       Impact factor: 11.205

5.  Monitoring of residual disease by next-generation deep-sequencing of RUNX1 mutations can identify acute myeloid leukemia patients with resistant disease.

Authors:  A Kohlmann; N Nadarajah; T Alpermann; V Grossmann; S Schindela; F Dicker; A Roller; W Kern; C Haferlach; S Schnittger; T Haferlach
Journal:  Leukemia       Date:  2013-08-20       Impact factor: 11.528

Review 6.  Minimal residual disease in acute myeloid leukaemia.

Authors:  Christopher S Hourigan; Judith E Karp
Journal:  Nat Rev Clin Oncol       Date:  2013-06-25       Impact factor: 66.675

7.  Genomic and epigenomic landscapes of adult de novo acute myeloid leukemia.

Authors:  Timothy J Ley; Christopher Miller; Li Ding; Benjamin J Raphael; Andrew J Mungall; A Gordon Robertson; Katherine Hoadley; Timothy J Triche; Peter W Laird; Jack D Baty; Lucinda L Fulton; Robert Fulton; Sharon E Heath; Joelle Kalicki-Veizer; Cyriac Kandoth; Jeffery M Klco; Daniel C Koboldt; Krishna-Latha Kanchi; Shashikant Kulkarni; Tamara L Lamprecht; David E Larson; Ling Lin; Charles Lu; Michael D McLellan; Joshua F McMichael; Jacqueline Payton; Heather Schmidt; David H Spencer; Michael H Tomasson; John W Wallis; Lukas D Wartman; Mark A Watson; John Welch; Michael C Wendl; Adrian Ally; Miruna Balasundaram; Inanc Birol; Yaron Butterfield; Readman Chiu; Andy Chu; Eric Chuah; Hye-Jung Chun; Richard Corbett; Noreen Dhalla; Ranabir Guin; An He; Carrie Hirst; Martin Hirst; Robert A Holt; Steven Jones; Aly Karsan; Darlene Lee; Haiyan I Li; Marco A Marra; Michael Mayo; Richard A Moore; Karen Mungall; Jeremy Parker; Erin Pleasance; Patrick Plettner; Jacquie Schein; Dominik Stoll; Lucas Swanson; Angela Tam; Nina Thiessen; Richard Varhol; Natasja Wye; Yongjun Zhao; Stacey Gabriel; Gad Getz; Carrie Sougnez; Lihua Zou; Mark D M Leiserson; Fabio Vandin; Hsin-Ta Wu; Frederick Applebaum; Stephen B Baylin; Rehan Akbani; Bradley M Broom; Ken Chen; Thomas C Motter; Khanh Nguyen; John N Weinstein; Nianziang Zhang; Martin L Ferguson; Christopher Adams; Aaron Black; Jay Bowen; Julie Gastier-Foster; Thomas Grossman; Tara Lichtenberg; Lisa Wise; Tanja Davidsen; John A Demchok; Kenna R Mills Shaw; Margi Sheth; Heidi J Sofia; Liming Yang; James R Downing; Greg Eley
Journal:  N Engl J Med       Date:  2013-05-01       Impact factor: 91.245

8.  Somatic mutations found in the healthy blood compartment of a 115-yr-old woman demonstrate oligoclonal hematopoiesis.

Authors:  Henne Holstege; Wayne Pfeiffer; Daoud Sie; Marc Hulsman; Thomas J Nicholas; Clarence C Lee; Tristen Ross; Jue Lin; Mark A Miller; Bauke Ylstra; Hanne Meijers-Heijboer; Martijn H Brugman; Frank J T Staal; Gert Holstege; Marcel J T Reinders; Timothy T Harkins; Samuel Levy; Erik A Sistermans
Journal:  Genome Res       Date:  2014-04-23       Impact factor: 9.043

9.  Detection of minimal residual disease in NPM1-mutated acute myeloid leukemia by next-generation sequencing.

Authors:  Stephen J Salipante; Jonathan R Fromm; Jay Shendure; Brent L Wood; David Wu
Journal:  Mod Pathol       Date:  2014-04-18       Impact factor: 7.842

10.  Role of TP53 mutations in the origin and evolution of therapy-related acute myeloid leukaemia.

Authors:  Terrence N Wong; Giridharan Ramsingh; Andrew L Young; Christopher A Miller; Waseem Touma; John S Welch; Tamara L Lamprecht; Dong Shen; Jasreet Hundal; Robert S Fulton; Sharon Heath; Jack D Baty; Jeffery M Klco; Li Ding; Elaine R Mardis; Peter Westervelt; John F DiPersio; Matthew J Walter; Timothy A Graubert; Timothy J Ley; Todd Druley; Daniel C Link; Richard K Wilson
Journal:  Nature       Date:  2014-12-08       Impact factor: 49.962

View more
  30 in total

1.  Measurable residual disease monitoring by NGS before allogeneic hematopoietic cell transplantation in AML.

Authors:  Felicitas Thol; Razif Gabdoulline; Alessandro Liebich; Piroska Klement; Johannes Schiller; Christian Kandziora; Lothar Hambach; Michael Stadler; Christian Koenecke; Madita Flintrop; Mira Pankratz; Martin Wichmann; Blerina Neziri; Konstantin Büttner; Bennet Heida; Sabrina Klesse; Anuhar Chaturvedi; Arnold Kloos; Gudrun Göhring; Brigitte Schlegelberger; Verena I Gaidzik; Lars Bullinger; Walter Fiedler; Albert Heim; Iyas Hamwi; Matthias Eder; Jürgen Krauter; Richard F Schlenk; Peter Paschka; Konstanze Döhner; Hartmut Döhner; Arnold Ganser; Michael Heuser
Journal:  Blood       Date:  2018-09-06       Impact factor: 22.113

Review 2.  Methods of Detection of Measurable Residual Disease in AML.

Authors:  Yi Zhou; Brent L Wood
Journal:  Curr Hematol Malig Rep       Date:  2017-12       Impact factor: 3.952

3.  Rational "Error Elimination" Approach to Evaluating Molecular Barcoded Next-Generation Sequencing Data Identifies Low-Frequency Mutations in Hematologic Malignancies.

Authors:  Saradhi Mallampati; Dzifa Y Duose; Michael A Harmon; Meenakshi Mehrotra; Rashmi Kanagal-Shamanna; Stephanie Zalles; Ignacio I Wistuba; Xiaoping Sun; Rajyalakshmi Luthra
Journal:  J Mol Diagn       Date:  2019-02-20       Impact factor: 5.568

Review 4.  Advancing the Minimal Residual Disease Concept in Acute Myeloid Leukemia.

Authors:  Peter Hokland; Hans B Ommen; Matthew P Mulé; Christopher S Hourigan
Journal:  Semin Hematol       Date:  2015-04-07       Impact factor: 3.851

Review 5.  Measurable residual disease testing in acute myeloid leukaemia.

Authors:  C S Hourigan; R P Gale; N J Gormley; G J Ossenkoppele; R B Walter
Journal:  Leukemia       Date:  2017-04-07       Impact factor: 11.528

6.  Measurable Residual Disease Assessment as a Surrogate Marker in New Drug Development in Acute Myeloid Leukemia.

Authors:  Gege Gui; Christopher S Hourigan
Journal:  Cancer J       Date:  2022 Jan-Feb 01       Impact factor: 3.360

7.  Engraftment of rare, pathogenic donor hematopoietic mutations in unrelated hematopoietic stem cell transplantation.

Authors:  Wing Hing Wong; Sima Bhatt; Kathryn Trinkaus; Iskra Pusic; Kevin Elliott; Nitin Mahajan; Fei Wan; Galen E Switzer; Dennis L Confer; John DiPersio; Michael A Pulsipher; Nirali N Shah; Jennifer Sees; Amelia Bystry; Jamie R Blundell; Bronwen E Shaw; Todd E Druley
Journal:  Sci Transl Med       Date:  2020-01-15       Impact factor: 17.956

Review 8.  The Prognostic Significance of Measurable ("Minimal") Residual Disease in Acute Myeloid Leukemia.

Authors:  Francesco Buccisano; Christopher S Hourigan; Roland B Walter
Journal:  Curr Hematol Malig Rep       Date:  2017-12       Impact factor: 3.952

Review 9.  Importance of clonal hematopoiesis in heart failure.

Authors:  Nicholas W Chavkin; Kyung-Duk Min; Kenneth Walsh
Journal:  Trends Cardiovasc Med       Date:  2021-04-20       Impact factor: 6.677

10.  Accurate Detection and Quantification of FLT3 Internal Tandem Duplications in Clinical Hybrid Capture Next-Generation Sequencing Data.

Authors:  Jack K Tung; Carlos J Suarez; Tsoyu Chiang; James L Zehnder; Henning Stehr
Journal:  J Mol Diagn       Date:  2021-08-05       Impact factor: 5.341

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.