| Literature DB >> 35734412 |
Bruno A Lopes1, Caroline Pires Poubel1,2, Cristiane Esteves Teixeira2, Aurélie Caye-Eude3,4, Hélène Cavé3,4, Claus Meyer5, Rolf Marschalek5, Mariana Boroni2, Mariana Emerenciano1.
Abstract
The KMT2A (MLL) gene rearrangements (KMT2A-r) are associated with a diverse spectrum of acute leukemias. Although most KMT2A-r are restricted to nine partner genes, we have recently revealed that KMT2A-USP2 fusions are often missed during FISH screening of these genetic alterations. Therefore, complementary methods are important for appropriate detection of any KMT2A-r. Here we use a machine learning model to unravel the most appropriate markers for prediction of KMT2A-r in various types of acute leukemia. A Random Forest and LightGBM classifier was trained to predict KMT2A-r in patients with acute leukemia. Our results revealed a set of 20 genes capable of accurately estimating KMT2A-r. The SKIDA1 (AUC: 0.839; CI: 0.799-0.879) and LAMP5 (AUC: 0.746; CI: 0.685-0.806) overexpression were the better markers associated with KMT2A-r compared to CSPG4 (also named NG2; AUC: 0.722; CI: 0.659-0.784), regardless of the type of acute leukemia. Of importance, high expression levels of LAMP5 estimated the occurrence of all KMT2A-USP2 fusions. Also, we performed drug sensitivity analysis using IC50 data from 345 drugs available in the GDSC database to identify which ones could be used to treat KMT2A-r leukemia. We observed that KMT2A-r cell lines were more sensitive to 5-Fluorouracil (5FU), Gemcitabine (both antimetabolite chemotherapy drugs), WHI-P97 (JAK-3 inhibitor), Foretinib (MET/VEGFR inhibitor), SNX-2112 (Hsp90 inhibitor), AZD6482 (PI3Kβ inhibitor), KU-60019 (ATM kinase inhibitor), and Pevonedistat (NEDD8-activating enzyme (NAE) inhibitor). Moreover, IC50 data from analyses of ex-vivo drug sensitivity to small-molecule inhibitors reveals that Foretinib is a promising drug option for AML patients carrying FLT3 activating mutations. Thus, we provide novel and accurate options for the diagnostic screening and therapy of KMT2A-r leukemia, regardless of leukemia subtype.Entities:
Keywords: KMT2A; MLL; acute leukemia; biomarker; machine learning; therapy
Year: 2022 PMID: 35734412 PMCID: PMC9208280 DOI: 10.3389/fphar.2022.749472
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.988
Cohort characterization.
| Variables | Overall | Acute leukemias |
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| B-ALL | T-ALL | AML | ALAL |
|
|
| |||||||||
| Age group |
| ||||||||||||||
| Pediatric | 1019 | (61.4) | 414 | (99.8) | 270 | (98.9) | 263 | (29.3) | 72 | (100.0) | 92 | (76.0) | 927 | (60.3) | |
| Younger adult | 321 | (19.3) | 1 | (0.2) | 3 | (1.1) | 317 | (35.3) | 0 | (0.0) | 23 | (19.0) | 298 | (19.4) | |
| Older adult | 319 | (19.2) | 0 | (0.0) | 0 | (0.0) | 319 | (35.5) | 0 | (0.0) | 6 | (5.0) | 313 | (20.4) | |
| Sex | 0.1388 | ||||||||||||||
| Female | 696 | (42.0) | 190 | (45.8) | 78 | (28.6) | 399 | (44.4) | 29 | (40.3) | 59 | (48.8) | 637 | (41.4) | |
| Male | 963 | (58.0) | 225 | (54.2) | 195 | (71.4) | 500 | (55.6) | 43 | (59.7) | 62 | (51.2) | 901 | (58.6) | |
| Sample type | — | ||||||||||||||
| Diagnosis | 1498 | (90.3) | 395 | (95.2) | 271 | (99.3) | 760 | (84.5) | 72 | (100.0) | 118 | (97.5) | 1380 | (89.7) | |
| Refractory | 93 | (5.6) | 0 | (0.0) | 0 | (0.0) | 93 | (10.3) | 0 | (0.0) | 1 | (0.8) | 92 | (6.0) | |
| Relapse | 45 | (2.7) | 20 | (4.8) | 2 | (0.7) | 23 | (2.6) | 0 | (0.0) | 2 | (1.7) | 43 | (2.8) | |
| Remission | 18 | (1.1) | 0 | (0.0) | 0 | (0.0) | 18 | (2.0) | 0 | (0.0) | 0 | (0.0) | 18 | (1.2) | |
| Unknown | 5 | (0.3) | 0 | (0.0) | 0 | (0.0) | 5 | (0.6) | 0 | (0.0) | 0 | (0.0) | 5 | (0.3) | |
|
| — | ||||||||||||||
| | 121 | (7.3) | 22 | (5.3) | 14 | (5.1) | 77 | (8.6) | 8 | (11.1) | — | — | |||
| | 1538 | (92.7) | 393 | (94.7) | 259 | (94.9) | 822 | (91.4) | 64 | (88.9) | — | — | |||
| Total | 1659 | (100.0) | 415 | (100.0) | 273 | (100.0) | 899 | (100.0) | 72 | (100.0) | 121 | (100.0) | 1538 | (100.0) | |
Pearson’s Chi-squared test.
Pediatric (<22 years); Adult—younger adults (≥22 years and <60 years) and older adults (≥60 years).
bold value, p < 0.05.
FIGURE 1Machine learning performance. (A) ROC curve of the selected machine learning model with 247 genes, and the confusion matrix with performance measures of testing data. The darker the quadrant color, the greater the number of patients. Respective performance measures are below the matrix. (B) Feature contributions according to the shapley values. On the x-axis, positive and negative values are correlated with prediction of KMT2A-r and KMT2A-WT, respectively. Each dot represents one patient, colored according to the feature value (red fading to blue means high to low values).
FIGURE 2Testing and validation of potential KMT2A-r predictors. (A) ROC curve and confusion matrix of the test cohort for the machine learning model with top 20 genes. The darker the quadrant color, the greater the number of patients. The machine learning performance measures are beside the matrix. (B) Feature contributions of each gene included in this model according to the shapley values. On the x-axis, positive and negative values are correlated with prediction of KMT2A-r and KMT2A-WT, respectively. Each dot represents one patient, colored according to the feature value (red fading to blue means high to low values). (C) The performance measures of our model in an independent cohort (TCGA dataset including AML patients), represented by the confusion matrix, and a (D) heatmap illustrating the expression of 11 KMT2A-r predictor genes as indicated by the machine learning analysis. The clusterization was performed in the French dataset with ALL patients.
FIGURE 3Relationship between SKIDA1, LAMP5, and CSPG4 expression and KMT2A-r in acute leukemia. (A) ROC curves and AUC values of each gene transcript according to acute leukemia subtypes. (B) Transcript expression among varied KMT2A fusions.
FIGURE 4Identification of therapeutic drugs for KMT2A-r leukemia. (A) Comparison of the IC50 data, available in the GDSC database, to several drugs between KMT2A-WT and KMT2A-r leukemia cell lines. The KG-1 (red point) is a human AML cell line, with a variant, KG-1a, known to be resistant to chemotherapy. Comparison of the sensitivity to foretinib between (B) AML molecular subgroups and KMT2A-r, as well as (C) FLT3 wild-type (red boxplot) and FLT3 mutated (yellow boxplot). (D) Transcript expression compared between KMT2A-WT and KMT2A-r in acute leukemia samples included in this study.