Literature DB >> 34547772

CBFB-MYH11 fusion transcripts distinguish acute myeloid leukemias with distinct molecular landscapes and outcomes.

Benjamin J Huang1,2, Jenny L Smith3, Yi-Cheng Wang4, Kassra Taghizadeh3, Amanda R Leonti3, Rhonda E Ries3, Yanling Liu5, Pandurang Kolekar5, Katherine Tarlock3,6, Robert Gerbing4, Erin Crowgey7, Scott N Furlan3,6, Timothy I Shaw5, Kohei Hagiwara5, Lisa Wei8, Todd M Cooper6, Alan S Gamis9, Richard Aplenc10, E Anders Kolb7, Jason E Farrar11, Timothy Triche12, Todd A Alonzo13, Xiaotu Ma5, Soheil Meshinchi3,6.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34547772      PMCID: PMC9153007          DOI: 10.1182/bloodadvances.2021004965

Source DB:  PubMed          Journal:  Blood Adv        ISSN: 2473-9529


× No keyword cloud information.

Introduction

Although the impacts of CBFB-MYH11 transcript variants on acute myeloid leukemia (AML) survival outcomes have been reported in adult patients with AML,[1] this finding has not yet been tested in cohorts that include children and adolescents. Furthermore, the underlying reasons for these outcome differences have not been well studied. Therefore, we harnessed transcriptome sequencing performed on diagnostic AML samples of 1776 pediatric patients enrolled on front-line Children’s Oncology Group (COG) clinical trials. This comprehensive dataset includes 186 patients with CBFB-MYH11 fusions (supplemental Table 1; Figure 1A). Our analysis of these transcriptomes demonstrates that fusion transcript subtype predicts relapse in pediatric CBFB-MYH11 AML and deepens our understanding of how CBFB-MYH11 fusion transcripts may impact underlying leukemia biology and outcomes.

Methods

Patient samples and RNA-sequencing

AML samples were collected with informed consent from patients (0-28 years of age) enrolled on COG trials CCG-2961 (#NCT00002798),[2] AAML03P1 (#NCT00070174),[3] AAML0531 (#NCT00372593),[4] and AAML1031 (#NCT01371981).[5] Institutional review board approval for each protocol was obtained at each participating institution and submitted to the Cancer Trials Support Unit (CTSU) regulatory office. Each protocol was conducted in accordance with the Declaration of Helsinki. Total RNA derived from peripheral blood or bone marrow diagnostic specimens was purified using AllPrep DNA/RNA/miRNA Universal Kits. Purified RNA samples were then prepared for either strand-specific polyadenylated enriched messenger RNA libraries or strand-specific ribosome RNA-depleted libraries by the British Columbia Genome Sciences Center. Paired-end sequencing was performed on Illumina HiSeq 2000/2500 platforms, and sequence reads were aligned to the GRCh37 reference genome using BWA (v0.5.7).[6] Reads were discarded based on mapping quality or if they failed the Illumina chastity filter, and duplicate reads were marked using Picard (v1.11). Gene level coverage analysis was performed using the British Columbia Genome Sciences Center pipeline v1.1 with Ensembl v69 annotations and was normalized based on RPKM or TPM.

Fusion calling

Fusion calls were made using CICERO, STAR-Fusion, and Trans-ABySS as previously described.[7-9] In order to maximize detection sensitivity, CBFB-MYH11 transcript breakpoints called by one or more fusion callers were accepted after manual examination using Bambino.[10] Overlapping calls between CICERO, STAR-Fusion, and Trans-ABySS for CBFB-MYH11 fusion transcripts were concordant at the level of exon transcript calling.

Variant calling

Variant calling was performed based on RNA sequencing and targeted DNA sequencing studies. RNA-seq libraries were constructed and sequenced as described above. All computational analyses were performed on a dedicated compute and storage infrastructure designed and implemented at St. Jude as previously described.[11] Briefly, RNA reads were mapped using the StrongARM pipeline.[12] Paired-end reads were aligned to 5 reference human genome databases (including GRCh37) using BWA,[6] and the final BAM file was constructed by selecting the best of the 5 alignments. SNVs and Indels were called using Bambino[10] and RNAIndel,[13] followed by validation and pathogenicity classification as previously described.[14,15]

Statistical analysis

Event-free survival (EFS) and overall survival (OS) analysis was performed using Kaplan-Meier estimates. EFS was defined as the time from enrollment to first event (relapse, induction failure, or death) or last follow-up. OS was defined as the time from study enrollment to death or last follow-up. Relapse risk (RR) was defined as the time since end of Induction I to relapse or last follow-up. Cox proportional hazard regression models were employed to estimate hazard ratios for univariable and multivariable analyses of EFS, OS, and RR. Differences in EFS, OS, and RR between groups were determined using log-rank testing. The χ2 test was used to test the significance of observed differences in proportions, and Fisher’s exact test was used when data were sparse. Differences in medians were compared by the Mann-Whitney U test. A P value <.05 was considered statistically significant.

Gene expression analysis

Uniform manifold approximation and projection (UMAP) was performed using the uwot (v0.1.10) R package. Gene set enrichment analysis (GSEA) was performed using command line tools.[16] In order to ensure a more uniform RNA-seq cohort, only ribosomal RNA-depleted sequenced libraries were included in the gene expression analysis.

Results and discussion

CBFB-MYH11 fusions in the 186 pediatric AML cases analyzed are predominantly composed of transcribed products from the fusion of exon 5 of CBFB with exon 33 of MYH11 (Figure 1A), hereafter referred to as E5-E33 CBFB-MYH11. To assess the prognostic significance of the E5-E33 transcript, we evaluated the probability of EFS, OS, and RR in our cohort stratified based on fusion transcript location (E5-E33 vs all others) (Figure 1B). Patients with E5-E33 had an EFS of 52% ± 8.1% (n = 156) at 5 years from diagnosis compared with an EFS of 80% ± 14.7% (n = 30) for those with other fusion transcripts (P value = .01).
Figure 1.

(A) CBFB-MYH11 fusion transcripts within our cohort with line weights that correspond to the associated prevalence. (B) Kaplan-Meier estimates for the probability of EFS, OS, and RR in patients with CBFB-MYH11 AML stratified based on fusion transcript (E5-E33 vs all others). (C) Oncoplot with associated somatic driver mutations based on fusion transcript status. KIT mutations are more prevalent in E5-E33 CBFB-MYH11 AMLs compared with others (Fisher’s exact test, P value = .0003). (D) Kaplan-Meier estimates for the probability of EFS, OS, and RR in patients with CBFB-MYH11 AML stratified based on fusion transcript and E17 KIT mutation status. (E) Age distribution comparing Schwind et al (adults) and patients from our cohort (pediatrics).

(A) CBFB-MYH11 fusion transcripts within our cohort with line weights that correspond to the associated prevalence. (B) Kaplan-Meier estimates for the probability of EFS, OS, and RR in patients with CBFB-MYH11 AML stratified based on fusion transcript (E5-E33 vs all others). (C) Oncoplot with associated somatic driver mutations based on fusion transcript status. KIT mutations are more prevalent in E5-E33 CBFB-MYH11 AMLs compared with others (Fisher’s exact test, P value = .0003). (D) Kaplan-Meier estimates for the probability of EFS, OS, and RR in patients with CBFB-MYH11 AML stratified based on fusion transcript and E17 KIT mutation status. (E) Age distribution comparing Schwind et al (adults) and patients from our cohort (pediatrics). We next harnessed variant calls on the same cohort of patients (n = 186) to elucidate the mutational landscape of CBFB-MYH11 AMLs. KIT mutations were significantly enriched in E5-E33 CBFB-MYH11 AMLs with a prevalence of 42% in the E5-E33 cohort vs 7% in the Other cohort (P value = .0003; Figure 1C; supplemental Table 1). In contrast, the prevalence of FLT3, NRAS, KRAS, and WT1 mutations was not significantly different between the 2 groups (Figure 1C). We have previously demonstrated that exon 17 (E17) KIT mutations are associated with adverse outcomes in patients with core-binding factor AML.[1,17] Therefore, we assessed for their impact on survival within our E5-E33 cohort and found that they were strongly associated with a significantly worse EFS of 32% ± 16.8% for E17 KIT+ at 5 years from diagnosis compared with an EFS of 56% ± 10.5% for E17 KIT− patients (P value = .042) (Figure 1D) and was notably independent of E17 KIT variant allele frequency (supplemental Figure 1). Conversely, exon 8 (E8) KIT mutations were associated with no difference in survival (supplemental Figure 2). The prognostic impact of E5-E33 transcripts on EFS was maintained in multivariable survival analysis that included age, white blood cell count, and E17 KIT mutations (supplemental Tables 2 and 3) and in survival analysis that excluded KIT mutant AMLs (supplemental Figure 3). Furthermore, the inferior outcomes associated with E5-E33 CBFB-MYH11 AMLs that are significantly worse in the presence of an additional E17 KIT mutation are particularly relevant because they address a gap in knowledge based on previously studied demographic age groups (Figure 1E) and further support the hypothesis that CBFB-MYH11 transcript subtypes impact leukemia biology and/or chemoresistance. Interestingly, and in contrast to Schwind et al,[1] despite adverse EFS in E5-E33 patients, OS did not differ based on fusion transcript (Figure 1B,D) suggesting that relapses in children, adolescents, and young adults with E5-E33 CBFB-MYH11 AMLs ± E17 KIT mutations are highly salvageable. Because hematopoietic stem cell transplantation (HSCT) in first complete remission is deferred to after an initial relapse in favorable risk pediatric patients (with only 1 of 186 patients within our cohort undergoing HSCT in first complete remission), we hypothesize that leukemia cells with CBFB-MYH11 fusion oncogene are highly susceptible to the allogeneic effect of the HSCT. Next, we performed unbiased GSEA comparing E5-E33 CBFB-MYH11 AMLs with all others with alternative breakpoints across the Molecular Signature Database[18,19] (specifically, the hallmark, curated, and oncogenic signature gene sets). We excluded KIT mutant AMLs because they have been shown to be associated with a distinct gene expression profile.[20] Intriguingly, 3 previously published gene sets that established and defined CBFB-MYH11 AML transcriptional signatures are significantly enriched in E5-E33 CBFB-MYH11 AMLs compared with non–E5-E33 CBFB-MYH11 AMLs (Figure 2A), suggesting that non–E5-E33 CBFB-MYH11 AMLs represent a distinct transcriptional subtype. Indeed, performing UMAP analysis revealed that E5-E33 CBFB-MYH11 AMLs occupy a distinct cluster from most other CBFB-MYH11 AMLs (P < .001), which are more heterogeneous in their transcriptional clustering (Figure 2B).
Figure 2.

(A) GSEA comparing E5-E33 CBFB-MYH11 AMLs with all others, using gene sets available through the Molecular Signature Database (hallmark, curated, and oncogenic signature gene sets). Three gene sets that define CBFB-MYH11 transcriptional signatures are significantly enriched in E5-E33 CBFB-MYH11 AMLs. Specifically, the Ross et al gene set is associated with a normalized enrichment score of 2.42 (P value <.0001) with the displayed heatmap of enriched genes. (B) UMAP analysis revealed that E5-E33 CBFB-MYH11 AMLs occupy a different cluster than most other CBFB-MYH11 AMLs. (C) GSEA comparing E5-E33 CBFB-MYH11 AMLs with all others and using a gene set derived by Mandoli et al based on CBFB-MYH11 chromatin immunoprecipitation sequencing (ChIP-seq) occupancy reveals significant enrichment (normalized enrichment score of 1.37, P value <.0001) in E5-E33 CBFB-MYH11 AMLs (100 most enriched genes shown). Violin plots of representative genes within the Mandoli et al gene set that are associated with hematopoietic stem cells or early hematopoiesis (Mann-Whitney U test).

(A) GSEA comparing E5-E33 CBFB-MYH11 AMLs with all others, using gene sets available through the Molecular Signature Database (hallmark, curated, and oncogenic signature gene sets). Three gene sets that define CBFB-MYH11 transcriptional signatures are significantly enriched in E5-E33 CBFB-MYH11 AMLs. Specifically, the Ross et al gene set is associated with a normalized enrichment score of 2.42 (P value <.0001) with the displayed heatmap of enriched genes. (B) UMAP analysis revealed that E5-E33 CBFB-MYH11 AMLs occupy a different cluster than most other CBFB-MYH11 AMLs. (C) GSEA comparing E5-E33 CBFB-MYH11 AMLs with all others and using a gene set derived by Mandoli et al based on CBFB-MYH11 chromatin immunoprecipitation sequencing (ChIP-seq) occupancy reveals significant enrichment (normalized enrichment score of 1.37, P value <.0001) in E5-E33 CBFB-MYH11 AMLs (100 most enriched genes shown). Violin plots of representative genes within the Mandoli et al gene set that are associated with hematopoietic stem cells or early hematopoiesis (Mann-Whitney U test). To uncover candidate mechanisms to explain the differences in survival outcomes and transcriptional landscapes between E5-E33 CBFB-MYH11 AMLs and others, we analyzed our transcriptome data in the context of established CBFB-MYH11 ChIP-seq experimental data.[21] We found that E5-E33 CBFB-MYH11 AMLs were enriched in the gene set that encompasses all genes located near high-confidence CBFB-MYH11 ChIP-seq sites (Figure 2C), which included genes implicated in hematopoietic stem cell self-renewal, such as LRRC33,[22] JAG1,[23] MKRN2,[24] BCOR,[25] DEGS1,[26] ID1,[27] KEAP1,[28] IL10RA,[29] IL1RAP,[30] and CD34.[31-35] On the basis of their locations in close proximity with experimentally validated CBFB-MYH11 ChIP-seq sites and their differential expression based on CBFB-MYH11 transcript subtypes, these genes represent candidate transcriptional markers associated with inferior survival outcomes and present an opportunity for further functional validation and/or therapeutic targeting studies.

Supplementary Material

The full-text version of this article contains a data supplement. Click here for additional data file.
  35 in total

1.  Functional and morphological characterization of immunomagnetically selected CD34+ hematopoietic progenitor cells.

Authors:  F Servida; D Soligo; L Caneva; F Bertolini; E de Harven; S Campiglio; C Corsini; G L Deliliers
Journal:  Stem Cells       Date:  1996-07       Impact factor: 6.277

2.  Bambino: a variant detector and alignment viewer for next-generation sequencing data in the SAM/BAM format.

Authors:  Michael N Edmonson; Jinghui Zhang; Chunhua Yan; Richard P Finney; Daoud M Meerzaman; Kenneth H Buetow
Journal:  Bioinformatics       Date:  2011-01-28       Impact factor: 6.937

3.  Functional Properties of KIT Mutations Are Associated with Differential Clinical Outcomes and Response to Targeted Therapeutics in CBF Acute Myeloid Leukemia.

Authors:  Soheil Meshinchi; Jessica A Pollard; Katherine Tarlock; Todd A Alonzo; Yi-Cheng Wang; Robert B Gerbing; Rhonda Ries; Michael R Loken; Laura Pardo; Tiffany Hylkema; Jason Joaquin; Leela Sarukkai; Susana C Raimondi; Betsy Hirsch; Lillian Sung; Richard Aplenc; Irwin Bernstein; Alan S Gamis
Journal:  Clin Cancer Res       Date:  2019-06-10       Impact factor: 12.531

4.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

5.  RNAIndel: discovering somatic coding indels from tumor RNA-Seq data.

Authors:  Kohei Hagiwara; Liang Ding; Michael N Edmonson; Stephen V Rice; Scott Newman; John Easton; Juncheng Dai; Soheil Meshinchi; Rhonda E Ries; Michael Rusch; Jinghui Zhang
Journal:  Bioinformatics       Date:  2020-03-01       Impact factor: 6.937

6.  Cicero Predicts cis-Regulatory DNA Interactions from Single-Cell Chromatin Accessibility Data.

Authors:  Hannah A Pliner; Jonathan S Packer; José L McFaline-Figueroa; Darren A Cusanovich; Riza M Daza; Delasa Aghamirzaie; Sanjay Srivatsan; Xiaojie Qiu; Dana Jackson; Anna Minkina; Andrew C Adey; Frank J Steemers; Jay Shendure; Cole Trapnell
Journal:  Mol Cell       Date:  2018-08-02       Impact factor: 17.970

7.  The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Authors:  Arthur Liberzon; Chet Birger; Helga Thorvaldsdóttir; Mahmoud Ghandi; Jill P Mesirov; Pablo Tamayo
Journal:  Cell Syst       Date:  2015-12-23       Impact factor: 10.304

8.  NRF2 Activation Impairs Quiescence and Bone Marrow Reconstitution Capacity of Hematopoietic Stem Cells.

Authors:  Shohei Murakami; Takuma Suzuki; Hideo Harigae; Paul-Henri Romeo; Masayuki Yamamoto; Hozumi Motohashi
Journal:  Mol Cell Biol       Date:  2017-09-12       Impact factor: 4.272

9.  SLAM family markers resolve functionally distinct subpopulations of hematopoietic stem cells and multipotent progenitors.

Authors:  Hideyuki Oguro; Lei Ding; Sean J Morrison
Journal:  Cell Stem Cell       Date:  2013-07-03       Impact factor: 24.633

10.  CICERO: a versatile method for detecting complex and diverse driver fusions using cancer RNA sequencing data.

Authors:  Liqing Tian; Yongjin Li; Michael N Edmonson; Xin Zhou; Scott Newman; Clay McLeod; Andrew Thrasher; Yu Liu; Bo Tang; Michael C Rusch; John Easton; Jing Ma; Eric Davis; Austyn Trull; J Robert Michael; Karol Szlachta; Charles Mullighan; Suzanne J Baker; James R Downing; David W Ellison; Jinghui Zhang
Journal:  Genome Biol       Date:  2020-05-28       Impact factor: 13.583

View more
  1 in total

1.  Integrated stem cell signature and cytomolecular risk determination in pediatric acute myeloid leukemia.

Authors:  Benjamin J Huang; Jenny L Smith; Jason E Farrar; Yi-Cheng Wang; Masayuki Umeda; Rhonda E Ries; Amanda R Leonti; Erin Crowgey; Scott N Furlan; Katherine Tarlock; Marcos Armendariz; Yanling Liu; Timothy I Shaw; Lisa Wei; Robert B Gerbing; Todd M Cooper; Alan S Gamis; Richard Aplenc; E Anders Kolb; Jeffrey Rubnitz; Jing Ma; Jeffery M Klco; Xiaotu Ma; Todd A Alonzo; Timothy Triche; Soheil Meshinchi
Journal:  Nat Commun       Date:  2022-09-19       Impact factor: 17.694

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.