| Literature DB >> 34858811 |
Akanksha Farswan1, Anubha Gupta1, Krishnamachari Sriram2, Atul Sharma3, Lalit Kumar3, Ritu Gupta4.
Abstract
INTRODUCTION: Current risk predictors of multiple myeloma do not integrate ethnicity-specific information. However, the impact of ethnicity on disease biology cannot be overlooked. In this study, we have investigated the impact of ethnicity in multiple myeloma risk prediction. In addition, an efficient and robust artificial intelligence (AI)-enabled risk-stratification system is developed for newly diagnosed multiple myeloma (NDMM) patients that utilizes ethnicity-specific cutoffs of key prognostic parameters.Entities:
Keywords: AI in cancer research; GMM clustering in cancer; ML in cancer survival; consensus clustering in cancer; hematological malignancy; risk stratification of multiple myeloma
Year: 2021 PMID: 34858811 PMCID: PMC8630746 DOI: 10.3389/fonc.2021.720932
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Workflow for the development of the Consensus-based Risk-Stratification System (CRSS) for newly diagnosed multiple myeloma patients.
Comparison of established and proposed cutoffs for clinical and laboratory parameters for the stratification of patients for progression-free survival (PFS) and overall survival (OS) in MMIn and MMRF using Kaplan–Meier analysis.
| Parameter | Established cutoff value | Proposed cutoff value | PFS | OS | ||
|---|---|---|---|---|---|---|
|
|
|
|
| |||
|
| ||||||
| Age (years) | >65 |
| 0.11 |
| 5.84e-5 |
|
| Albumin (g/dl) | ≤3.5 | ≤3.5 | 0.115 | 0.115 | 7.0e-4 | 7.0e-4 |
| β2M (mg/L) | ≥5.5 |
|
| 9.32e-10 | 4.13e-10 |
|
| Calcium (mg/dl) | ≥11 | ≥11 | 0.0078 | 0.0078 | 0.0037 | 0.0037 |
| eGFR (ml/min/1.73m2) | ≤40 |
| 0.16 |
| 0.005 |
|
| Hb (g/dl) | ≤10 |
| 0.0019 |
| 0.0014 |
|
|
| ||||||
| Age (years) | >65 |
| 3.23e-05 |
| 1.06e-05 |
|
| Albumin (g/dl) | ≤3.5 | ≤3.5 | 0.00017 | 0.00017 | 8.47e-07 | 8.47e-07 |
| β2M (mg/L) | ≥5.5 | ≥5.5 | 1.22e-10 | 1.22e-10 | 9.25e-13 | 9.25e-13 |
| Calcium (mg/dl) | ≥11 |
| 0.0077 |
| 5.88e-06 |
|
| eGFR (ml/min/1.73m2) | ≤40 |
| 4.5e-05 |
| 7.48e-06 |
|
| Hb (g/dl) | ≤10 |
| 2.82e-06 |
| 6.77e-06 |
|
The proposed cutoffs were found using complete data of MMIn (n = 1,070) and MMRF (n = 900). Less than or equal to cutoff reveals the increased risk in the patient. “>65” shows that a patient with age greater than 65 years is at greater risk than a patient less than 65 years. “≤3.5” shows that a patient with albumin levels less than equal to 3.5 is at a greater risk than a patient with albumin levels greater than 3.5. It holds true for other parameters also in a similar manner. Bold values of the column “proposed cutoff value” signify the change in the value of the parameters from the existing cut-offs. p-values in bold signify that p-values became more significant with the proposed changes in cutoffs.
Figure 2(A, B) Progression-free survival in patients with multiple myeloma (MM) from the MMIn cohort (n = 1,070) stratified by the Revised International Staging System (R-ISS) (n = 355) and the proposed CRSS (n = 384), respectively. R-ISS1 is the low-risk stage, R-ISS2 is the intermediate-risk stage, and R-ISS3 is the high-risk stage. Median progression-free survival (PFS) for R-ISS1, R-ISS2, and R-ISS3 are 196, 160, and 105 weeks, respectively. The observed p-value obtained after performing a log-rank test on R-ISS is 9.47e-3. Similarly, CRSS-1 is the low-risk stage, CRSS-2 is the intermediate-risk stage, and CRSS-3 is the high-risk stage. Median PFS for CRSS-1, CRSS-2, and CRSS-3 are 213, 138, and 100 weeks, respectively. The observed p-value obtained after performing a log-rank test on CRSS is 5.60e-8. (C, D) Overall survival in patients with MM from the MMIn cohort (n = 1,070) stratified by the R-ISS (n = 355) and CRSS (n = 384), respectively. Median overall survival (OS) for R-ISS1, R-ISS2, and R-ISS3 are 478, 337, and 168 weeks, respectively. The observed p-value obtained after performing a log-rank test on R-ISS is 1.00e-6. Median OS for CRSS-1, CRSS-2, and CRSS-3 are 495, 249, and 182 weeks, respectively. The observed p-value obtained after performing a log-rank test on CRSS is 4.96e-11. (E, F) Univariate Cox hazard analysis on the prognostic factors—age, albumin, beta-2 microglobulin (β2M), calcium, estimated glomerular filtration rate (eGFR), hemoglobin, and high-risk cytogenetic abnormalities (HRCA)—for PFS and OS, respectively. Hazard ratios for all the parameters except HRCA were calculated on complete data (n = 1,070) for the MMIn dataset. Hazard ratio for HRCA and the risk-staging models were found using the data for which HRCA information was present (n = 384 for the MMIn dataset).
Comparison of different models devised for the risk stratification of patients in the MMIn and MMRF cohorts with the R-ISS.
| PFS | OS | ||||||
|---|---|---|---|---|---|---|---|
| Hazard ratio |
| C-index | Hazard ratio |
| C-index | ||
|
| |||||||
| R-ISS ( | 1.42 | 0.004 | 0.57 | 2.32 | <5e-6 | 0.636 | |
| 2vs1 | 1.24 | 0.33 | 2.31 | 0.04 | |||
| 3vs1 | 1.92 | 0.009 | 5.37 | 0.00013 | |||
| Model A1 | 1.5 | 1.00e-5 | 0.594 | 2.03 | <5e-6 | 0.646 | |
| 2vs1 | 1.53 | 0.007 | 2.13 | 0.0013 | |||
| 3vs1 | 2.26 | 2.00e-5 | 4.16 | <5e-6 | |||
| Model A2 | 1.4 | 0.0001 | 0.579 | 1.74 | 1.00e-5 | 0.616 | |
| 2vs1 | 1.42 | 0.056 | 1.9 | 0.02 | |||
| 3vs1 | 1.98 | 0.00013 | 3.13 | 2.00e-5 | |||
| Model A3 (CRSS) |
| <5e-6 |
|
| <5e-6 |
| |
| 2vs1 |
| 3.00e-4 |
| <5e-6 | |||
| 3vs1 |
| <5e-6 |
| <5e-6 | |||
|
| |||||||
| R-ISS ( | 1.61 | 0.00001 | 0.578 | 2.26 | <5e-6 | 0.618 | |
| 2vs1 | 1.49 | 0.015 | 1.79 | 0.03 | |||
| 3vs1 | 2.6 | 0.00001 | 4.66 | <5e-6 | |||
| Model M1 | 1.55 | <5e-6 | 0.6 | 2.07 | <5e-6 | 0.656 | |
| 2vs1 | 1.55 | 0.00042 | 2.06 | 0.00067 | |||
| 3vs1 | 2.4 | <5e-6 | 4.3 | <5e-6 | |||
| Model M2 | 1.62 | <5e-6 | 0.6 | 2.36 | <5e-6 | 0.657 | |
| 2vs1 | 1.44 | 0.01 | 2.12 | 0.0081 | |||
| 3vs1 | 2.54 | <5e-6 | 5.22 | <5e-6 | |||
| Model M3 | 1.54 | <5e-6 | 0.604 | 2.2 | <5e-6 | 0.679 | |
| 2vs1 | 1.87 | <5e-6 | 2.95 | <5e-6 | |||
| 3vs1 | 2.32 | <5e-6 | 5.11 | <5e-6 | |||
| Model M4 (CRSS) |
| <5e-6 |
|
| <5e-6 |
| |
| 2vs1 |
| 8.10e-4 |
| 3.40e-4 | |||
| 3vs1 |
| <5e-6 |
| <5e-6 | |||
Models were built using data for which high-risk cytogenetic information (HRCA) was available (n = 384 for MMIn and n = 800 for MMRF). R-ISS information was available for only 355 out of 384 patients in the MMIn dataset and 658 out of 800 patients in the MMRF dataset. The model with the best performance was A3 and M4 (in bold).
Model A1: beta-2 microglobulin (β2M), albumin, LDH, and CA [del17, t(4;14), t(14;16)] at existing cutoffs. Model A2: age, β2M, albumin, calcium, estimated glomerular filtration rate (eGFR), Hb, and HRCA using existing cutoffs. Model A3: age, β2M, albumin, calcium, eGFR, Hb, and HRCA using proposed cutoffs for MMIn data. Model M1: β2M, albumin, LDH, and HRCA at existing cutoffs. Model M2: age, β2M, albumin, calcium, eGFR, Hb, and HRCA using existing cutoffs. Model M3: age, β2M, albumin, calcium, eGFR, Hb, and HRCA using proposed cutoffs for MMIn data. Model M4: age, β2M, albumin, calcium, eGFR, Hb, and HRCA using proposed cutoffs for MMRF data.
Prediction of progression-free survival and overall survival (in %) for CRSS and R-ISS at 1, 2, 3, 4, and 5 years in the MMIn (n = 384) and MMRF datasets (n = 800).
| MMIn data | |||||||
|---|---|---|---|---|---|---|---|
| R-ISS ( | CRSS ( | ||||||
| Year | 1 | 2 | 3 | 1 | 2 | 3 | |
| PFS | 1 | 0.9318 | 0.8305 | 0.6967 | 0.8966 | 0.7812 | 0.7196 |
| 2 | 0.8606 | 0.6601 | 0.5223 | 0.7709 | 0.6265 | 0.4472 | |
| 3 | 0.6404 | 0.5124 | 0.3632 | 0.6449 | 0.4729 | 0.2515 | |
| 4 | 0.3422 | 0.4179 | 0.2810 | 0.5251 | 0.3624 | 0.0587 | |
| 5 | 0.2738 | 0.2856 | 0.2342 | 0.4014 | 0.2679 | 0.0587 | |
| OS | 1 | 0.9773 | 0.9387 | 0.7784 | 0.9630 | 0.8938 | 0.7976 |
| 2 | 0.9540 | 0.8415 | 0.6393 | 0.9466 | 0.7679 | 0.6155 | |
| 3 | 0.9282 | 0.7764 | 0.5342 | 0.9098 | 0.6702 | 0.5831 | |
| 4 | 0.8895 | 0.6790 | 0.4953 | 0.8979 | 0.5691 | 0.4574 | |
| 5 | 0.8895 | 0.6422 | 0.3698 | 0.8979 | 0.4791 | 0.3136 | |
|
| |||||||
|
|
| ||||||
| Year | 1 | 2 | 3 | 1 | 2 | 3 | |
| PFS | 1 | 0.9033 | 0.8132 | 0.6358 | 0.9325 | 0.8367 | 0.6611 |
| 2 | 0.7957 | 0.6261 | 0.4040 | 0.8162 | 0.6734 | 0.4423 | |
| 3 | 0.6295 | 0.4862 | 0.3059 | 0.7008 | 0.5084 | 0.3129 | |
| 4 | 0.4641 | 0.3414 | 0.2781 | 0.5151 | 0.3711 | 0.2249 | |
| 5 | 0.2769 | 0.2450 | 0.2781 | 0.4121 | 0.2637 | 0.1799 | |
| OS | 1 | 0.9807 | 0.9092 | 0.8559 | 0.9869 | 0.9379 | 0.8231 |
| 2 | 0.9612 | 0.8372 | 0.6460 | 0.9689 | 0.8772 | 0.6780 | |
| 3 | 0.9286 | 0.7799 | 0.5211 | 0.9478 | 0.8217 | 0.5814 | |
| 4 | 0.8833 | 0.7461 | 0.4904 | 0.9478 | 0.7844 | 0.5293 | |
| 5 | 0.5748 | 0.7108 | 0.3678 | 0.9478 | 0.6569 | 0.4691 | |
Figure 3Model interpretation using SHAP (SHapley Additive exPlanations). SHAP summary plots for different risk stages inferred from MMIn data showing the relative impact of different parameters (top to bottom) contributing to a particular risk stage prediction. (A, B) CRSS-1: Normal levels of β2M and hemoglobin are the key contributors to the low-risk stage prediction. Furthermore, high values of age on the left side of the summary plot are pushing the model away from the low-risk prediction and are indicative of either intermediate or high risk. Overall, β2M has the highest impact and calcium has the lowest impact on the low-risk stage prediction. (C, D) CRSS-2: β2M and hemoglobin are the key contributors to the intermediate-risk stage. Elevated levels of β2M with lower levels of hemoglobin are indicative of intermediate risk. (E, F) CRSS-3: Presence of HRCA is contributing the most to the high-risk stage. Elevated values of β2M and calcium and lower levels of albumin, hemoglobin, and eGFR are contributing toward the high-risk stage prediction.
Figure 7Model interpretation using SHAP. SHAP summary plots for different risk stages inferred in MMRF data showing the impact of different parameters used in the model. (A, B) CRSS-1: albumin, HRCA, and β2M have the highest impact on the low-risk stage. Normal levels of albumin, absence of HRCA, and lower values of β2M are contributing to low risk (CRSS-1) in myeloma patients. (C, D) CRSS-2: β2M, albumin, and HRCA are the key contributors to the intermediate-risk stage. (E, F) CRSS-3: β2M and hemoglobin have the highest impact on the high-risk stage. Elevated levels of β2M and lower values of hemoglobin are contributing toward the high-risk stage in the patient. Lower values of albumin and eGFR are further promoting high-risk stage prediction.
Figure 4SHAP waterfall plots for the randomly chosen four patients in low-risk stage (CRSS-1) from the MMIn dataset. The pink color shows the positive impact of the feature, while the blue color shows the negative impact of the feature. Features with a positive impact contributed to the class of low-risk stage prediction, while features with a negative impact contributed to class opposite to low risk. β2M, hemoglobin, age, and HRCA have the highest overall impact on low-risk stage prediction in the MMIn dataset. However, this ranking itself differs from patient to patient as can be seen in (A–D). (A) β2M has the highest impact followed by hemoglobin, age, and HRCA. (B) Hemoglobin has the highest impact followed by β2M and age. (C, D) β2M has the highest impact followed by age and HRCA.
Figure 6SHAP waterfall plots for randomly chosen patients in high-risk stage (CRSS-3) from the MMIn dataset. The pink color shows the positive impact of the feature, while the blue color shows the negative impact of the feature. Features with a positive impact contributed to the class of high-risk stage prediction, while features with a negative impact contributed to class opposite to highest risk. HRCA, β2M, age, and albumin have the highest overall impact on high-risk stage prediction. However, this ranking differs from patient to patient as can be seen in (A–C). (A) HRCA has the highest impact. (B) β2M has the highest impact. (C, D) Age and albumin have the highest impact.