Literature DB >> 33987955

RAG1 co-expression signature identifies ETV6-RUNX1-like B-cell precursor acute lymphoblastic leukemia in children.

Dongfeng Chen1,2, Alessandro Camponeschi2, Jessica Nordlund3, Yanara Marincevic-Zuniga3, Jonas Abrahamsson4, Gudmar Lönnerholm5, Linda Fogelstrand6,7, Inga-Lill Mårtensson2.   

Abstract

B-cell precursor acute lymphoblastic leukemia (BCP-ALL) can be classified into subtypes according to the genetic aberrations they display. For instance, the translocation t(12;21)(p13;q22), representing the ETV6-RUNX1 fusion gene (ER), is present in a quarter of BCP-ALL cases. However, around 10% of the cases lack classifying chromosomal abnormalities (B-other). In pediatric ER BCP-ALL, rearrangement mediated by RAG (recombination-activating genes) has been proposed as the predominant driver of oncogenic rearrangement. Herein we analyzed almost 1600 pediatric BCP-ALL samples to determine which subtypes express RAG. We demonstrate that RAG1 mRNA levels are especially high in the ETV6-RUNX1 (ER) subtype and in a subset of B-other samples. We also define 31 genes that are co-expressed with RAG1 (RAG1-signature) in the ER subtype, a signature that also identifies this subset of B-other samples. Moreover, this subset also shares leukemia and pro-B gene expression signatures as well as high levels of the ETV6 target genes (BIRC7, WBP1L, CLIC5, ANGPTL2) with the ER subtype, indicating that these B-other cases are the recently identified ER-like subtype. We validated our results in a cohort where ER-like has been defined, which confirmed expression of the RAG1-signature in this recently described subtype. Taken together, our results demonstrate that the RAG1-signature identifies the ER-like subtype. As there are no definitive genetic markers to identify this novel subtype, the RAG1-signature represents a means to screen for this leukemia in children.
© 2021 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

Entities:  

Keywords:  BCP-ALL; ETV6-RUNX1; ETV6-RUNX1-like; RAG1; leukemia

Mesh:

Substances:

Year:  2021        PMID: 33987955      PMCID: PMC8209579          DOI: 10.1002/cam4.3928

Source DB:  PubMed          Journal:  Cancer Med        ISSN: 2045-7634            Impact factor:   4.452


INTRODUCTION

Acquired chromosomal aberrations have been linked to the overall survival of patients with B‐cell precursor acute lymphoblastic leukemia (BCP‐ALL), which is the most common cancer in children. The many different subtypes of BCP‐ALL have been classified according to the genetic aberrations they display, allowing correlations between disease type and prognosis to be made. For instance, the translocation t(12;21)(p13;q22), representing the ETV6‐RUNX1 fusion gene (ER), is present in a quarter of BCP‐ALL cases. Unclassified cases form a heterogeneous group referred to as B‐other, which research efforts have reduced over the last 10 years from 25% to 5% of all cases. The characterization of disease genotype in such cases remains a priority, as it provides the means for diagnosis, prognosis, risk assessment, and targeted treatment. During the early stages of B cell development, B‐cell precursors (BCP) undergo immunoglobulin (Ig) gene rearrangement that is necessary for the production of a membrane‐bound B‐cell antigen receptor (BCR) on more mature B cells. Mouse knockout models, , have shown the recombination‐activating genes (RAG) to be essential for the rearrangement process, in which Ig V(D)J gene segments are rearranged to provide instructions for a unique BCR. RAG activity may also play a role in leukemogenesis, as has been proposed for the ER subtype, where it appears that RAG introduces mutations and aberrant rearrangements in non‐Ig loci. However, RAG is likely not responsible for the ER translocation, but rather for introducing additional genetic modifications that drive leukemogenesis. Herein, we examined 1582 BCP‐ALL cases, both microarray (DS1‐6‐M) and RNAseq (DS7‐8‐R) data sets (Table 1, Table S1), to determine on a large scale which subtypes express RAG and whether we could find any co‐expressed genes that would allow us to identify new subtypes.
TABLE 1

Data sets used in this study

GEO accessionDataset#CountryPlatformPatient #References
Healthy (Fetal BM)
GSE45460 DS0‐MSouth KoreaGPL62448[7]
BCP‐ALL (Pediatric)
GSE26281 DS1‐MUSAGPL96127[8]
GSE28497 DS2‐MUSAGPL96239[9]
GSE47051 DS3‐MSwedenGPL57075[10]
GSE12995 DS4‐MUSAGPL96175[11]
GSE33315 DS5‐MUSAGPL96483[12]
GSE26366 DS6‐MUSAGPL96172[13]
RNA‐seq1DS7‐RSwedenHiseq2000/2500116[14]
RNA‐seq2DS8‐RSwedenNextSeq 500195[15]
BCP‐ALL (Adult)
GSE34861 DS9‐MUSAGPL15088194[16]
B95DS10‐MUSAGPL830095[17]
Data sets used in this study

MATERIALS AND METHODS

Gene expression microarray data

Gene expression data from BCP‐ALL and healthy B‐cells were gathered from published studies (Table 1, Table S1). All gene expression microarray data were log2 transformed and normalized using the Robust Multichip Average (RMA) method.

Gene expression RNA‐sequencing data

Strand‐specific RNA sequencing libraries were constructed from rRNA‐depleted RNA using the ScriptSeq V2 Kit (Epicentre) and sequenced paired‐end on a HiSeq or MiSeq sequencer (Illumina Inc). The reads were mapped to the human 1000 Genomes build 37 (GRCh37) using Tophat 2. Gene expression counts were summarized at the exon‐level per gene using feature counts. Additional details can be found in Ref. [7].

Data analysis

All the gene expression microarray and RNA‐seq data were analyzed using Qlucore Omics Explorer 3.5 (Qlucore AB). Genes co‐expressed with RAG1 were identified by using Pearson's correlation coefficient analysis, and the corr. value was set as 60%. To assess the similarity of molecular signatures between pro‐B cells (pro‐B signature) and ETV6‐RUNX1 or ETV6‐RUNX1 like BCP‐ALL, gene set enrichment analysis (GSEA) was performed. To compare the sample clusters, principal components analysis (PCA) was run. Where needed, data were analyzed by one‐way analysis of variance (ANOVA) or unpaired two‐tailed t‐tests using Graph‐Pad Prism version 9, and statistical significance was set as: *p < 0.05, **p < 0.01, ***p < 0.001 and ****p < 0.0001.

RESULTS AND DISCUSSION

We have previously shown that the components of the pre‐BCR complex, assembled from Ig heavy chain and surrogate light chain, are differentially expressed in the ETV6‐RUNX1 and TCF3‐PBX1 BCP‐ALL subtypes. To determine the expression pattern of RAG1 and RAG2 that regulate Ig gene rearrangements, we first analyzed the expression of the RAG1 and RAG2 genes in the microarray dataset DS1‐M that was used in the aforementioned study, which includes 127 BCP‐ALL samples (Table 1, Table S1). We found both genes expressed in the ER subtype in this (Figure 1A) and the additional five microarray datasets (DS2‐6‐M) analyzed (Figure S1A). In particular, the expression of RAG1 was consistently higher in ER compared to all other genetic subtypes except B‐other. Next, we performed a genome‐wide screen for genes co‐expressed with RAG1 and RAG2 in DS1‐M using Pearson co‐efficiency correlation analysis. Although none consistently appeared with RAG2, we identified a set of 31 genes that were the highest‐ranked co‐expressed genes with RAG1, henceforth referred to as the RAG1‐signature (Figure 1B, Tables S2 and S3). Using the RAG1‐signature as an identifier in the DS1‐M data set distinguished the ER from all other BCP‐ALL subtypes, except for four B‐other and one CRLF2 samples that also expressed the RAG1‐signature (Figure 1C). By contrast, although pro‐ and pre‐B cells from healthy donors (DS0‐M, Table 1) express RAG1 and RAG2, they do not express the RAG1‐signature (Figure S2), demonstrating that this signature is specific for certain BCP‐ALL.
FIGURE 1

(A) Box plot shows the expression patterns of RAG1 and RAG2 in 127 BCP‐ALL samples from DS1‐M (GSE26281), one‐way ANOVA was used as statistical analysis, (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). (B) PCA chart shows that 31 genes co‐expressed with RAG1 (RAG1‐signature) are identified in DS1‐M. Corr Value = 60%, using Pearson's Correlation Coefficient analysis. (C) Heatmap shows the new cluster distribution of samples with different genetic subtypes based on the levels of the RAG1‐signature in DS1‐M using hierarchical clustering analysis. PCA plot shows the new cluster pattern based on the RAG1‐signature in DS1‐M. Four B‐other and one CRLF2 samples express the RAG1‐signature. (D) Unsupervised PCA analyses in DS1‐M show ER and ER‐like BCP‐ALL sharing similar gene expression profiling (p = 1.8E‐6, 1240 genes)

(A) Box plot shows the expression patterns of RAG1 and RAG2 in 127 BCP‐ALL samples from DS1‐M (GSE26281), one‐way ANOVA was used as statistical analysis, (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). (B) PCA chart shows that 31 genes co‐expressed with RAG1 (RAG1‐signature) are identified in DS1‐M. Corr Value = 60%, using Pearson's Correlation Coefficient analysis. (C) Heatmap shows the new cluster distribution of samples with different genetic subtypes based on the levels of the RAG1‐signature in DS1‐M using hierarchical clustering analysis. PCA plot shows the new cluster pattern based on the RAG1‐signature in DS1‐M. Four B‐other and one CRLF2 samples express the RAG1‐signature. (D) Unsupervised PCA analyses in DS1‐M show ER and ER‐like BCP‐ALL sharing similar gene expression profiling (p = 1.8E‐6, 1240 genes) Based on the above results, we hypothesized that the B‐other samples with a RAG1‐signature could represent the recently defined new subtype termed ER‐like, , , which usually carry ETV6 fusions and IKZF1 aberrations. However, they vary and hence lack the definitive ER fusion gene, predicting a similar gene expression pattern to the ER subtype. Therefore, to pinpoint the relationships between genetic subtypes, we performed an unsupervised PCA analysis based on all genes expressed in DS1‐M. This showed that the four B‐other samples with the RAG1‐signature clustered together with the ER samples with a unique leukemia signature (Figure 1D, labeled ER‐like to distinguish them from the remaining B‐other samples). We could confirm these results by analyzing the other five microarray data sets (DS2‐M to DS6‐M) with a total of 1145 samples, where the RAG1‐signature distinguished the ER from the other subtypes, and in each data set a few B‐other samples clustered with the ER samples (Figure S3). Moreover, we found that these samples expressed a leukemia‐signature and clustered with the ER samples also in these microarray data sets (Figure S4). To validate our observations based on microarray datasets, we analyzed RNA‐seq data from a cohort of 116 BCP‐ALL samples (DS7‐R). Here we found not only RAG1 but also RAG2 expressed at higher levels in the ER compared to the other subtypes (Figure S1B). Moreover, as in the microarray data sets, a few samples belonging to the B‐other group also expressed higher levels of RAG1 and RAG2. Further, using the RAG1‐signature as an identifier in this RNAseq data set distinguished the ER from the other subtypes (Figure 2A and Table S3). We found also four B‐other and one hyperdiploid (HH) samples that clustered with the ER samples (Figure 2A). This would be consistent with our previous analysis of the DNA methylation pattern of the patient samples in DS7‐R in which three of the four samples identified here showed a pattern similar to that of the ER samples. Thereafter, we performed unsupervised PCA analyses based on gene expression, which confirmed that the same B‐other samples (labeled ER‐like) clustered with the ER samples (Figure 2B). These results support the notion that the B‐other samples expressing the RAG1‐signature represent the ER‐like subtype. To gain further support for this hypothesis, we analyzed one more RNAseq data set comprising 195 samples (DS8‐R) in which the ER‐like subtype was recently identified. Also in this dataset did we find the RAG1‐signature in the ER samples as well as in those defined as ER‐like (Figure S5). Taken together, we conclude that the RAG1‐signature identifies both the ER and the ER‐like subtypes.
FIGURE 2

(A) Heatmap and PCA chart show the new cluster pattern based on the RAG1‐signature in the validating RNA‐seq data set (n = 116, DS7‐R). Four B‐other and one HH patients express the RAG1‐signature. (B) Unsupervised PCA analyses in DS7‐R show ER and ER‐like BCP‐ALL sharing similar gene expression profiling (p = 2.8E‐6, 1640 genes). Gene Set Enrichment Analyses (GSEA) show the enrichment of the pro‐B signature in (C) ER and (D) ER‐like BCP‐ALL in DS2‐M. (E) Box plots show the expression patterns of ETV6 target genes in DS2‐M and DS7‐R, unpaired two‐tailed t‐tests were used as statistical analysis, (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). (F) Pie charts show ER‐like frequency among BCP‐ALL in DS1‐M, DS7‐R, and in all data sets (DS1‐8). The numbers in the center of the pie charts represent the number of samples. (B) Kaplan–Meier survival analysis was used to estimate the survival of patients in the indicated data sets. Survival in clusters was compared using the log‐rank test

(A) Heatmap and PCA chart show the new cluster pattern based on the RAG1‐signature in the validating RNA‐seq data set (n = 116, DS7‐R). Four B‐other and one HH patients express the RAG1‐signature. (B) Unsupervised PCA analyses in DS7‐R show ER and ER‐like BCP‐ALL sharing similar gene expression profiling (p = 2.8E‐6, 1640 genes). Gene Set Enrichment Analyses (GSEA) show the enrichment of the pro‐B signature in (C) ER and (D) ER‐like BCP‐ALL in DS2‐M. (E) Box plots show the expression patterns of ETV6 target genes in DS2‐M and DS7‐R, unpaired two‐tailed t‐tests were used as statistical analysis, (*p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001). (F) Pie charts show ER‐like frequency among BCP‐ALL in DS1‐M, DS7‐R, and in all data sets (DS1‐8). The numbers in the center of the pie charts represent the number of samples. (B) Kaplan–Meier survival analysis was used to estimate the survival of patients in the indicated data sets. Survival in clusters was compared using the log‐rank test We have previously shown that common lymphoid progenitors (CLP), pro‐B, pre‐B, and immature B cells from healthy donors display unique gene expression signatures. , Moreover, the ER subtype displays a pro‐B signature, whereas the t(1;19) TCF3/PBX1 ALL subtype resembles pre‐B cells. Here, we confirmed the pro‐B signature in ER samples in DS2‐M with 239 BCP‐ALL samples (Figure 2C). Considering its similarities to the ER subtype, we asked whether also ER‐like patient samples displayed the pro‐B signature. To reduce the dominant effect of the ER samples, they were excluded from this analysis. Our analyses showed that the pro‐B signature was present in the ER‐like samples as well (Figure 2D), an observation we could confirm in the two additional datasets analyzed (Figure S6). ETV6 encodes a transcription factor that suppresses the expression of genes such as WBP1L and CLIC5, which are both found in the RAG1‐signature. In the ER subtype, ETV6 is translocated on one allele and in some samples deleted on the other, resulting in a dysfunctional protein and/or reduced levels. Moreover, the HH patient sample in DS7‐R that clustered with the ER samples (Figure 2A) harbors a t(7;12) CBX3‐ETV6 fusion gene supporting the ER‐like phenotype. Also, the ER‐like samples in DS8‐R have been found to harbor deletions and in‐frame fusions that involve ETV6. We hypothesized, therefore, that aberrant levels or a dysfunctional ETV6, might result in elevated levels of its target genes. We asked, therefore, whether its target genes were expressed in the ER and ER‐like samples in both the microarray and RNAseq data sets. Consistent with the aforementioned notion, the expression levels of the ETV6‐target genes WBP1L, CLIC5, BIRC7, and ANGPTL2 were expressed at very high levels in both ER and ER‐like samples, with a consistent pattern and levels not observed in any of the other subtypes (Figure 2E and Figure S7). Thus, gaining further support that also the ER‐like subtype is deficient in the ETV6 transcription factor. Our results suggest that the ER‐like subtype is infrequent among BCP‐ALL. Analyzing the frequency showed that it was 3.1% in DS1‐M, 3.5% in DS7‐R, and taken all data sets (DS1‐8) together, hence 1582 BCP‐ALL samples, demonstrated that an average of 2.7% correspond to the ER‐like subtype (Figure 2F, Table S4). In adults BCP‐ALL of the ER subtype are infrequent. We asked therefore whether we could find any ER‐like samples based on the RAG1‐signature. However, among a total of 285 samples (DS9‐10‐M) with only one ER sample (Table 1, Table S1), which we could clearly distinguish, we were unable to define any ER‐like samples (Figure S8). Thus, this indicates that not only ER but also ER‐like subtypes are infrequent in adult BCP‐ALL. In this study, we show that a subset of BCP‐ALL patient samples with unclassified chromosomal abnormalities (B‐other) can be defined by the expression of RAG1 in conjunction with an additional 31 genes, the RAG1‐signature. This signature as well as leukemia and pro‐B gene expression signatures and the high levels of ETV6 target genes were all shared with the ER subtype, and suggested that these B‐other cases belong to the ER‐like subtype. Validating this in samples previously defined as ER‐like, we could confirm this notion. Taken together, our results demonstrate that the RAG1‐signature identifies ER‐like BCP‐ALL in children. As there are no consistent translocations or other definitive genetic markers, using the RAG1‐signature could represent a means to screen for the ER‐like subtype.

CONFLICT OF INTEREST

The authors declare no conflict of interest.

ETHICAL CONSIDERATIONS

Not applicable, meta‐analyses, all ethics linked to respective data set, see Table 1. Figure S1 Click here for additional data file. Figure S2 Click here for additional data file. Figure S3 Click here for additional data file. Figure S4 Click here for additional data file. Figure S5 Click here for additional data file. Figure S6 Click here for additional data file. Figure S7 Click here for additional data file. Figure S8 Click here for additional data file. Table S1‐S2 and S4 Click here for additional data file. Table S3 Click here for additional data file.
  22 in total

1.  RAG-mediated recombination is the predominant driver of oncogenic rearrangement in ETV6-RUNX1 acute lymphoblastic leukemia.

Authors:  Elli Papaemmanuil; Inmaculada Rapado; Yilong Li; Nicola E Potter; David C Wedge; Jose Tubio; Ludmil B Alexandrov; Peter Van Loo; Susanna L Cooke; John Marshall; Inigo Martincorena; Jonathan Hinton; Gunes Gundem; Frederik W van Delft; Serena Nik-Zainal; David R Jones; Manasa Ramakrishna; Ian Titley; Lucy Stebbings; Catherine Leroy; Andrew Menzies; John Gamble; Ben Robinson; Laura Mudie; Keiran Raine; Sarah O'Meara; Jon W Teague; Adam P Butler; Giovanni Cazzaniga; Andrea Biondi; Jan Zuna; Helena Kempski; Markus Muschen; Anthony M Ford; Michael R Stratton; Mel Greaves; Peter J Campbell
Journal:  Nat Genet       Date:  2014-01-12       Impact factor: 38.330

Review 2.  Molecular genetics of B-precursor acute lymphoblastic leukemia.

Authors:  Charles G Mullighan
Journal:  J Clin Invest       Date:  2012-10-01       Impact factor: 14.808

3.  Notch/HES1-mediated PARP1 activation: a cell type-specific mechanism for tumor suppression.

Authors:  Sankaranarayanan Kannan; Wendy Fang; Guangchun Song; Charles G Mullighan; Richard Hammitt; John McMurray; Patrick A Zweidler-McKay
Journal:  Blood       Date:  2011-01-11       Impact factor: 22.113

4.  Gene expression profiles of B-lineage adult acute lymphocytic leukemia reveal genetic patterns that identify lineage derivation and distinct mechanisms of transformation.

Authors:  Sabina Chiaretti; Xiaochun Li; Robert Gentleman; Antonella Vitale; Kathy S Wang; Franco Mandelli; Robin Foà; Jerome Ritz
Journal:  Clin Cancer Res       Date:  2005-10-15       Impact factor: 12.531

5.  RAG-1-deficient mice have no mature B and T lymphocytes.

Authors:  P Mombaerts; J Iacomini; R S Johnson; K Herrup; S Tonegawa; V E Papaioannou
Journal:  Cell       Date:  1992-03-06       Impact factor: 41.582

6.  RAG-2-deficient mice lack mature lymphocytes owing to inability to initiate V(D)J rearrangement.

Authors:  Y Shinkai; G Rathbun; K P Lam; E M Oltz; V Stewart; M Mendelsohn; J Charron; M Datta; F Young; A M Stall
Journal:  Cell       Date:  1992-03-06       Impact factor: 41.582

7.  Integrative epigenomic analysis identifies biomarkers and therapeutic targets in adult B-acute lymphoblastic leukemia.

Authors:  Huimin Geng; Sarah Brennan; Thomas A Milne; Wei-Yi Chen; Yushan Li; Christian Hurtz; Soo-Mi Kweon; Lynette Zickl; Seyedmehdi Shojaee; Donna Neuberg; Chuanxin Huang; Debabrata Biswas; Yuan Xin; Janis Racevskis; Rhett P Ketterling; Selina M Luger; Hillard Lazarus; Martin S Tallman; Jacob M Rowe; Mark R Litzow; Monica L Guzman; C David Allis; Robert G Roeder; Markus Müschen; Elisabeth Paietta; Olivier Elemento; Ari M Melnick
Journal:  Cancer Discov       Date:  2012-10-29       Impact factor: 39.397

8.  DNA methylation-based subtype prediction for pediatric acute lymphoblastic leukemia.

Authors:  Mats G Gustafsson; Gudmar Lönnerholm; Erik Forestier; Ann-Christine Syvänen; Jessica Nordlund; Christofer L Bäcklin; Vasilios Zachariadis; Lucia Cavelier; Johan Dahlberg; Ingegerd Öfverholm; Gisela Barbany; Ann Nordgren; Elin Övernäs; Jonas Abrahamsson; Trond Flaegstad; Mats M Heyman; Ólafur G Jónsson; Jukka Kanerva; Rolf Larsson; Josefine Palle; Kjeld Schmiegelow
Journal:  Clin Epigenetics       Date:  2015-02-17       Impact factor: 6.551

9.  CD27 expression and its association with clinical outcome in children and adults with pro-B acute lymphoblastic leukemia.

Authors:  D Chen; N Gerasimčik; A Camponeschi; Y Tan; Q Wu; S Brynjolfsson; J Zheng; J Abrahamsson; J Nordlund; G Lönnerholm; L Fogelstrand; I-L Mårtensson
Journal:  Blood Cancer J       Date:  2017-06-23       Impact factor: 11.037

10.  A global DNA methylation and gene expression analysis of early human B-cell development reveals a demethylation signature and transcription factor network.

Authors:  Seung-Tae Lee; Yuanyuan Xiao; Marcus O Muench; Jianqiao Xiao; Marina E Fomin; John K Wiencke; Shichun Zheng; Xiaoqin Dou; Adam de Smith; Anand Chokkalingam; Patricia Buffler; Xiaomei Ma; Joseph L Wiemels
Journal:  Nucleic Acids Res       Date:  2012-10-16       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.