Jianbin Sun1, Xiaoyan Ke2, Mingzhi Zhang3, Yuhui Wang1, Fengyang An1, Yulin Zhao4, Li Zhu1. 1. Department of Otorhinolaryngology, Head and Neck Surgery, Peking University Third Hospital, Beijing, China. 2. Department of Hematology, Peking University Third Hospital, Beijing, China. 3. Department of Oncology, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. 4. Department of Rhinology, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China.
Abstract
BACKGROUND: The prognostic index of natural killer lymphoma (PINK) is recommended for use as a prognostic model for determining the best non-anthracycline-based treatment for extranodal natural killer T-cell lymphoma, nasal-type (ENKTL). However, this model does not provide an accurate individual risk estimation for patients; therefore, our retrospective study was conducted to determine this risk. METHODS: Clinical data from 250 patients with ENKTL treated with non-anthracycline-based regimens were analyzed. The statistically significant clinical characteristics were selected as the parameters for our models. The patient data from 250 patients were randomly divided into 5 groups for 5-fold cross validation before final models were established on all of the patients' data. A statistical model nomogram based on a Cox proportional hazards model, and a machine learning model based on the lightGBM algorithm, were constructed. Concordance index (C-index) and calibration curve, areas under the curve (AUC) values, and binary error were used to evaluate two models. RESULTS: Five variables [age, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL (CA) staging system, Eastern Cooperative Oncology Group (ECOG) score, B symptoms, and lactate dehydrogenase (LDH)] were significant and were selected as parameters for creating the statistical model nomogram, while lesion sites (anatomical regions, lymph nodes and primary lesion site) took place of CA staging system in machine learning model. During cross validation, the mean C-indices of training cohort and validation cohort for statistical model nomogram and PINK were 0.851±0.008, 0.843±0.029, 0.758±0.019 and 0.757±0.080, respectively, while the mean 3-year AUC for machine learning model were 0.920±0.010 and 0.865±0.035, respectively. The calibration curves and binary errors showed a good correlation between the predicted result and the reality. CONCLUSIONS: These two models could provide ENKTL patients with an accurate individual risk estimation in the era of non-anthracycline-based treatment. 2021 Translational Cancer Research. All rights reserved.
BACKGROUND: The prognostic index of natural killer lymphoma (PINK) is recommended for use as a prognostic model for determining the best non-anthracycline-based treatment for extranodal natural killer T-cell lymphoma, nasal-type (ENKTL). However, this model does not provide an accurate individual risk estimation for patients; therefore, our retrospective study was conducted to determine this risk. METHODS: Clinical data from 250 patients with ENKTL treated with non-anthracycline-based regimens were analyzed. The statistically significant clinical characteristics were selected as the parameters for our models. The patient data from 250 patients were randomly divided into 5 groups for 5-fold cross validation before final models were established on all of the patients' data. A statistical model nomogram based on a Cox proportional hazards model, and a machine learning model based on the lightGBM algorithm, were constructed. Concordance index (C-index) and calibration curve, areas under the curve (AUC) values, and binary error were used to evaluate two models. RESULTS: Five variables [age, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL (CA) staging system, Eastern Cooperative Oncology Group (ECOG) score, B symptoms, and lactate dehydrogenase (LDH)] were significant and were selected as parameters for creating the statistical model nomogram, while lesion sites (anatomical regions, lymph nodes and primary lesion site) took place of CA staging system in machine learning model. During cross validation, the mean C-indices of training cohort and validation cohort for statistical model nomogram and PINK were 0.851±0.008, 0.843±0.029, 0.758±0.019 and 0.757±0.080, respectively, while the mean 3-year AUC for machine learning model were 0.920±0.010 and 0.865±0.035, respectively. The calibration curves and binary errors showed a good correlation between the predicted result and the reality. CONCLUSIONS: These two models could provide ENKTL patients with an accurate individual risk estimation in the era of non-anthracycline-based treatment. 2021 Translational Cancer Research. All rights reserved.
Extranodal natural killer T-cell lymphoma, nasal-type (ENKTL) is recognized as a distinct subtype of lymphoid neoplasms by the World Health Organization (WHO) (1). It has the characteristics of having a low global morbidity rate and a poor prognosis (2-4). L-asparaginase has been determined to be effective in inhibiting the proliferation of natural killer cells since patients with ENKTL respond poorly to conventional CHOP (cyclophosphamide, doxorubicin, vincristine, and prednisone) and CHOP-like regimens because of drug resistance (5-8). New chemotherapies incorporating L-asparaginase and its analogues were gradually introduced into clinical practice, with an increasing number of patients deriving benefit from these newer non-anthracycline–based treatments.However, many studies (9-11) have shown that prognostic models based on conventional CHOP or CHOP-like chemotherapies, including the International prognostic index (IPI) (12) and Korean prognostic index (KPI) (13), were found no longer applicable under this new mode of treatment. As a result, a new prognostic model prognostic index of natural killer lymphoma (PINK) (14) was proposed in 2016. PINK incorporates four risk factors: age over 60 years, non-nasal type disease, stage III or IV disease in the Ann Arbor staging system and involvement of distant lymph nodes. It classifies patients into three groups: group 1 (risk factors ≤1), group 2 (risk factors =2), and group 3 (risk factors =3). These risks groups are significantly associated with different 3-year overall survival (OS). PINK has thus been validated for clinical use (15) and has been recommended by the latest National Comprehensive Cancer Network (NCCN) guideline (V.1, 2020); however, it classifies patients into just 3 groups. As a result, only a rough assessment of risk can be provided to a given patient: low risk, intermediate risk, or high risk. This does not provide an individualized or precise assessment of risk. Therefore, we have conducted this retrospective research to develop improved models. We present the following article in accordance with the TRIPOD reporting checklist (available at: http://dx.doi.org/10.21037/tcr-20-3017).
Methods
Participants
The study recruited 262 patients with ENKTL initially treated in the Peking University Third Hospital and the First Affiliated Hospital of Zhengzhou University from 2010 to 2020. Individuals were included in the study if they met the following criteria: (I) diagnosed as ENKTL according to histological and immunophenotypic features (including AE1/AE3, CD3, CD7, CD20, CD43, CD56, TIA-1, GranzymeB, EBER and Ki-67) on the basis of the WHO classification (16); (II) staged according to an endoscopic and imaging examination, including computed tomography (CT), magnetic resonance imaging (MRI) and/or ultrasound; (III) treated with non-anthracycline-based regimens. Patients were excluded if (I) they suffered from a previous malignancy or second primary tumor; (II) if they were not initially treated or their pretreatment information could not be accessed. Of the 262 patients, 5 suffered from previous malignancy or second primary tumor, 7 were not initially treated, or their pretreatment information could not be accessed. Finally, 250 patients were eligible for inclusion in our study. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). This study was approved by our institutional review board (IRB00006761-M2020191). As our study was retrospective in nature with no personally identifiable information, the need for informed consent was waived.
Evaluation, definition and treatment
The data of participants’ medical history and physical examinations were gathered. Data from the endoscopic examination of upper aerodigestive tract; imaging examination (CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound) of involved regions and of the chest, abdomen, pelvis, limbs, and other regions; and bone marrow aspiration were collected. Each participant was staged using the Ann Arbor staging system and stratified using the PINK model. They were also staged by the newest staging system, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL (CA) system.The Ann Arbor staging system was originally developed for Hodgkin’s lymphoma (17), However, this system was reported to be less accurate in assessing aggressive non-Hodgkin’s lymphoma due to disease spread patterns differing from those of non-Hodgkin’s lymphoma (18). ENKTL is particularly disparate, as it primarily occurs in the upper aerodigestive tract (19,20). A new staging system: the CA staging system, was first proposed in 2014, and in 2019 underwent prospective study and validation both in China and internationally. It has demonstrated superior evaluative ability for ENKTL compared to the Ann Arbor system (21), and thus patients were also staged by the CA staging system in the present study.The CA staging system is consist of 4 stages: stage I, lesions located in nasal cavity or nasopharynx without local invasiveness or involvement of lymph node; stage II, lesions located in nasal cavity or nasopharynx with local invasiveness but without involvement of lymph node or lesions primarily located outside upper aerodigestive tract ; stage III, lesions combined with involvement of regional lymph node; stage IV, disseminated lesions or involvement of non-regional lymph nodes or lymph nodes involved on both sides of the diaphragm (21).To facilitate machine learning, all involved regions are defined according to their anatomy, which included the nasal cavity, sinus, nasopharynx, oropharynx or tonsil, larynx, palate, oral cavity and tongue, parapharyngeal space or parotid gland, orbital cavity, skin of face, skin and soft tissue of body, gastrointestinal tract, lung, pericardium, spleen, liver, kidney or adrenal gland, testis, uterus or ovary, bone marrow and central nervous system; The involved lymph nodes are also defined according to the classification and delineation of systemic lymph nodes, which included the head, cervical, supraclavicular, axillary, mediastinal, abdominal, retroperitoneal, pelvic, and limb lymph nodes. Primary site could be classified as upper aerodigestive tract or extra upper aerodigestive tract.All participants received non-anthracycline-based treatment according to their risk-based stratification. Comprehensive treatment included surgery, radiotherapy, chemotherapy, targeted therapy, stem cell transplantation, and maintenance therapy. Surgery treatment referred to resection of the lesion during the biopsy, and follow-up examinations returned negative results, usually followed by radiotherapy or chemotherapy. Radiotherapy was performed using the standard scheme of a radical dose of 50–52 Gy/25–26 Gy. Chemotherapy was performed using non-anthracycline-based regimens, mainly incorporating P-GEMOX (gemcitabine, pegaspargase and oxaliplatin), DDGP (dexamethasone, cisplatin, gemcitabine and pegaspargase) and Modified-SMILE (dexamethasone, methotrexate, ifosfamide, pegaspargase, Mesna and etoposide), which are the preferred regimens of the NCCN guideline (v.1, 2020).It should be noted that 7 patients received conventional CHOP or CHOP-like regimens. However, 1 of them changed to conventional therapy because the expected clinical effects were not achieved after administration of non-anthracycline-based treatment, 2 received as consolidation therapy after being relieved, and 4 received it as pretreatment before transplantation. Since none of them were initially or mainly treated with conventional therapies, they were still included in the study. Targeted therapy was used when disease progressed, and mainly consisted of programmed cell death protein 1 (PD-1) receptor inhibitors and chidamide. Transplantation referred to autologous stem cell transplantation after complete remission (CR) or unconfirmed complete remission (CRu). Maintenance therapy was mainly used for patients in terminal stage, and included steroids, thalidomide, and other forms for palliative treatment.
The construction and validation of the statistical model nomogram
Fifteen variables associated with OS that were reported in previous studies were selected including sex, age, Ann Arbor stage, CA stage, lymph node involvement, the interval from primary symptoms to diagnosis, primary site, Eastern Cooperative Oncology Group (ECOG) score, B symptoms, PINK model stratification, and lactate dehydrogenase (LDH) level, hemoglobin, platelet, and monocyte and prognostic nutritional index [PNI, defined as albumin g/L + 5 × lymphocyte count (×109/L)]. We separated continuous variables into low and high groups either using the well-known cutoff (age) or on the basis of the usual cutoff value for the normal range (such as LDH and hemoglobin) or the cutoffs of which prognostic value had been validated in previous studies (platelet, monocyte, PNI) (22,23). Univariate analysis was then performed using the Kaplan–Meier method. Apart from gender and intervals from primary symptoms to diagnosis, the other thirteen variables were significant for OS and underwent multivariate analysis by Cox regression method. Five variables, including age, CA stage, ECOG, B symptoms, and LDH score, were found to be statistically significant and were finally selected to construct the statistical model nomogram. For a better evaluation of this method of constructing a model, we used 5-fold cross validation. All patients were stratified randomly divided and divided into 5 groups according to CA stage; in turn, each group served as the validation cohort with the other 4 serving as the training cohort, and the final model was constructed on all 250 patients’ data (). Discrimination and calibration powers were evaluated by concordance index (C-index) and calibration curve, respectively, which as is were recommended for the evaluation of survival regression analysis.
Figure 1
Flow chart for participants selection, cross-validation and establishment of models.
Flow chart for participants selection, cross-validation and establishment of models.
The construction and validation of the machine learning model
As CA stage consists of the involved regions, lymph nodes, and the primary site, to facilitate machine learning, these 3 were analyzed instead of the CA stage itself. Age, ECOG score, B symptoms, and LDH score were also included in the machine learning model. LightGBM, recognized as an excellent gradient boosting decision tree (GBDT) algorithm in machine learning, was used for data analysis. A 5-fold cross-validation for the machine learning model was used for the same groups used by statistical model nomogram. For 3-year OS prediction, patients were first labeled as live or dead according to whether their survival time was over 3 years or not, and were then included in the analysis. AUC and binary errors were used to evaluate the discrimination and calibration powers due to binary classification nature of the LightGBM algorithm.
Statistical analysis
The OS rate was defined as the period from the initial diagnosis to ENKTL-related death or the last follow-up. Survival curves were depicted by the life table method, and univariate analysis was conducted with the Kaplan-Meier method. Multivariate analysis was conducted with Cox regression method by entering selection process, and the nomogram was constructed using multivariate Cox proportional hazards model method. Two missed values from PNI and intervals between primary symptoms to diagnosis were not included in univariate analysis and multivariate analysis. C-indices were calculated by calculating the area under the receiver operating characteristic (ROC) curve. Calibration curves comprised the predicted probability of OS curve and actual OS curve, and were plotted by bootstrap resampling (1,000 resamples for the training cohort and 500 for the validation cohort). AUC was defined as true positives (TP): TP + false positives (FP) and binary error was defined as FP + FN/all samples. Statistical analyses were performed in SPSS 20.0 (RRID:SCR_002865). The C-index, nomogram, and calibration curve were developed in R language using RStudio (RRID:SCR_000432). Machine learning was performed using Python in PyCharm 2020.1.2 (RRID:SCR_018221). Graphs were processed in GraphPad Prism 8 (RRID:SCR_002798) and Adobe Illustrator CC 2014 (RRID:SCR_010279). A P value <0.05 was considered statistically significant.
Results
Participant characteristics
The clinical characteristics of 250 participants are shown in . The involved regions and lymph nodes are outlined in . Out of all 250 patients analyzed, 195 (78%) patients received whole-body PET-CT imaging before initial treatment. In 55 patients who did not receive PET-CT, 38 were in early stage (Ann Arbor stage I or II) and 17 were in advanced stage (Ann Arbor stage III or IV). The mean follow-up time for survival patients was 32 months, and the 1- and 3-year OS were 80.9% and 69.6%, respectively (). In the univariate analysis, the following 13 variables were associated with OS: age, Ann Arbor stage, CA stage, lymph node involvement, primary site, ECOG score, B symptoms, PINK, LDH level, hemoglobin, platelet, monocyte, and PNI. In the multivariate analysis, 5 variables were significant: age, CA stage, ECOG score, B symptoms, and LDH level (). The grouping by CA stage for cross-validation is shown in .
Table 1
Clinical characteristics
Characteristics
n (%)
Gender
Male
182 (72.8)
Female
68 (27.2)
Age (years)
<60
209 (83.6)
≥60
41 (16.4)
Ann arbor stage
I
67 (26.8)
II
101 (40.4)
III
20 (8.0)
IV
62 (24.8)
CA stage
I
72 (28.8)
II
53 (21.2)
III
44 (17.6)
IV
81 (32.4)
Lymph node involvement
Yes
95 (38.0)
No
155 (62.0)
Interval from primary symptoms to diagnosis (months)
<12
214 (85.6)
≥12
34 (13.6)
Unknown
2(0.8)
Primary site
UADT
196 (78.4)
Extra-UADT
54 (21.6)
ECOG score
0
59 (23.6)
1
141 (56.4)
2
42 (16.8)
≥3
8 (3.2)
B symptoms
Yes
133 (53.2)
No
117 (46.8)
PINK
Group1
133 (53.2)
Group2
53 (21.2)
Group3
64 (25.6)
LDH level (U/L)
<245
143 (57.2)
≥245
107 (42.8)
Hemoglobin (g/L)
<120
86 (34.4)
≥120
164 (65.6)
Platelet (109/L)
<200
119 (47.6)
≥200
131 (52.4)
Monocyte (109/L)
<0.5
115 (46.0)
≥0.5
135 (54.0)
PNI
<48
169 (67.6)
≥48
79 (31.6)
Unknown
2(0.8)
Treatment
Surgery
14 (5.6)
Radiotherapy
113 (45.2)
Chemotherapy
P-GEMOX
27 (10.8)
DDGP
158 (63.2)
Modified-SMILE
39 (15.6)
Others
24 (9.6)
CHOP or CHOP-like regimens
7 (2.8)
Targeted therapy
17 (6.8)
Transplantation
4 (1.6)
Maintenance therapy
7 (2.8)
CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; PINK, prognostic index of natural killer lymphoma; LDH, lactate dehydrogenase concentrations; PNI, prognostic nutritional index; P-GEMOX, gemcitabine, pegaspargase and oxaliplatin; DDGP, dexamethasone, cisplatin, gemcitabine and pegaspargase; Modified-SMILE, dexamethasone, methotrexate, ifosfamide, pegaspargase, mesna and etoposide; CHOP, cyclophosphamide, doxorubicin, vincristine, and prednisone.
Table 2
Involved regions and lymph nodes
n (%)
Regions
None
3 (1.2)
Nasal cavity
173 (69.2)
Sinus
40 (16.0)
Nasopharynx
82 (32.8)
Oropharynx or tonsil
38 (15.2)
Larynx
12 (4.8)
Palate
19 (7.6)
Oral cavity and tongue
7 (2.8)
Parapharyngeal space or parotid gland
26 (10.4)
Orbital cavity
11 (4.4)
Skin of face
23(9.2)
Skin and soft tissue of body
39 (15.6)
Gastrointestinal tract
28 (11.2)
Lung
22 (8.8)
Pericardium
1 (0.4)
Spleen
13 (5.2)
Liver
6 (2.4)
Kidney or adrenal gland
12 (4.8)
Testis
9 (3.6)
Uterus or ovary
7 (2.8)
Bone marrow
7 (2.8)
Central nervous system
5 (2.0)
Lymph nodes
None
156 (62.4)
Head
7 (2.8)
Neck
76 (30.4)
Supraclavicular
7 (2.8)
Axilla
19 (7.6)
Mediastinum
16 (6.4)
Abdominal cavity
16 (6.4)
Retroperitoneal
16 (6.4)
Pelvic cavity
30(12.0)
Limb
2 (0.8)
Involved regions and lymph nodes are determined by CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound, 195/250 (78%) received whole-body PET-CT. In 55 patients who did not received PET-CT, 38 were in early stage (Ann Arbor stage I or II), and 17 were in advanced stage (Ann Arbor stage III or IV).
Figure 2
Kaplan-Meier overall survival (OS) curve for all 250 patients included in the analyses.
Table 3
Univariate analysis and multivariable analysis
Characteristics
Univariate analysis
Multivariable analysis
P value
HR (95% CI)
P value
Gender (male vs. female)
0.598
–
–
Ann Arbor Stage (I, II, III, IV)
<0.001
–
0.983
Lymph node involvement (yes vs. no)
<0.001
–
0.624
Interval from primary symptoms to diagnosis (<12 vs. ≥12 months; n=248)
0.962
–
–
Primary site (UADT vs. Extra-UADT)
<0.001
–
0.595
PINK (Group1, Group2, Group3)
<0.001
–
0.477
Hemoglobin (<120 vs. ≥120 g/L)
<0.001
–
0.1
Platelet (<200 vs. ≥200, 109/L)
0.018
–
0.819
Monocyte (<0.5 vs. ≥0.5, 109/L)
0.008
–
0.498
PNI (<48 vs. ≥48; n=248)
0.001
–
0.355
Age (<60 vs. ≥60, years)
0.039
2.011 (1.043–3.878)
0.037
CA Stage (I, II, III, IV)
<0.001
2.341 (1.199–4.569)
0.013
ECOG score (0,1,2 vs. ≥3)
<0.001
2.4 (1.607–3.583)
<0.001
B symptoms (Yes vs. No)
0.007
0.325 (0.161–0.657)
0.002
LDH level (<245 vs. ≥245 U/L)
<0.001
3.746 (1.070–3.746)
0.03
Two missed values in Interval from primary symptoms to diagnosis and PNI were not included in univariate analysis. Fifteen variables in 250 patients first received univariate analysis and 13 statistically significant ones received multivariable analysis. In multivariable analysis, 5 variables were statistically significant. UADT, upper aerodigestive tract; PINK, prognostic index of natural killer lymphoma; PNI, prognostic nutritional index; CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.
Table 4
Grouping for cross-validation
CA stage
Group1
Group2
Group3
Group4
Group5
Total
I
Censored
14
14
14
14
13
69
Not censored
1
1
1
0
0
3
II
Censored
9
9
9
9
10
46
Not censored
0
1
2
2
2
7
III
Censored
6
7
7
7
7
34
Not censored
2
1
2
2
3
10
IV
Censored
7
7
7
7
8
36
Not censored
11
10
8
9
7
45
Total
50
50
50
50
50
250
CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; PINK, prognostic index of natural killer lymphoma; LDH, lactate dehydrogenase concentrations; PNI, prognostic nutritional index; P-GEMOX, gemcitabine, pegaspargase and oxaliplatin; DDGP, dexamethasone, cisplatin, gemcitabine and pegaspargase; Modified-SMILE, dexamethasone, methotrexate, ifosfamide, pegaspargase, mesna and etoposide; CHOP, cyclophosphamide, doxorubicin, vincristine, and prednisone.Involved regions and lymph nodes are determined by CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound, 195/250 (78%) received whole-body PET-CT. In 55 patients who did not received PET-CT, 38 were in early stage (Ann Arbor stage I or II), and 17 were in advanced stage (Ann Arbor stage III or IV).Kaplan-Meier overall survival (OS) curve for all 250 patients included in the analyses.Two missed values in Interval from primary symptoms to diagnosis and PNI were not included in univariate analysis. Fifteen variables in 250 patients first received univariate analysis and 13 statistically significant ones received multivariable analysis. In multivariable analysis, 5 variables were statistically significant. UADT, upper aerodigestive tract; PINK, prognostic index of natural killer lymphoma; PNI, prognostic nutritional index; CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.According to the Ann Arbor staging system, the distribution for stages I-IV was 67 (26.8%), 101 (40.4%), 20 (8.0%) and 62 (24.8%), respectively. Stage III had the lowest proportion (8.0%), and its survival curve overlapped with that of stage IV, implying poor discrimination (). These results were in line with previous studies (19-26). In contrast, the distribution of the CA staging system for stages I–IV was 72 (28.8%), 53 (21.2%), 44 (17.6%), 81 (32.4%), respectively, which is more balanced, and there was less overlap between the survival curves (). The survival curves of the PINK model also showed good discrimination ().
Figure 3
Kaplan-Meier survival curves according to the Ann Arbor stage (A), CA Stage (B) and PINK (C). CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; PINK, prognostic index of natural killer lymphoma.
Kaplan-Meier survival curves according to the Ann Arbor stage (A), CA Stage (B) and PINK (C). CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; PINK, prognostic index of natural killer lymphoma.
New models and comparison with the PINK model
The statistical model nomogram and machine learning model predicting the 3-year OS are illustrated in .
Figure 4
Statistical model nomogram predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. To use the nomogram, for an individual patient, the value is located on each variable axis, and a line drawn upwards to the Points axis to determines points received for each variable value. The sum of five points is located on the Total Points axis, and lines drawn downwards to the 3-year OS determine the likelihood of 3-year OS rates. OS, overall survival.
Figure 5
Machine learning model predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. A patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age. e.g., Illustrated is a 53-year-old patient who had a lifespan of 15 months after diagnosis, with nasal cavity, sinus, skin of face, skin and soft tissue of body, kidney or adrenal gland involved, upper aerodigestive tract of primary site, ECOG score 1, LDH ≥245 U/L and B symptom. ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.
Statistical model nomogram predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. To use the nomogram, for an individual patient, the value is located on each variable axis, and a line drawn upwards to the Points axis to determines points received for each variable value. The sum of five points is located on the Total Points axis, and lines drawn downwards to the 3-year OS determine the likelihood of 3-year OS rates. OS, overall survival.Machine learning model predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. A patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age. e.g., Illustrated is a 53-year-old patient who had a lifespan of 15 months after diagnosis, with nasal cavity, sinus, skin of face, skin and soft tissue of body, kidney or adrenal gland involved, upper aerodigestive tract of primary site, ECOG score 1, LDH ≥245 U/L and B symptom. ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.To use the statistical model nomogram, for an individual patient, the value of each variable is located on the corresponding variable axis, and a line drawn upwards to the Points axis to determines the point received for each variable. The sum of 3 points is located on the Total Points axis, and the lines drawn downwards to the 3-year OS axis determine the likelihood of 3-year OS.To use machine learning model, a patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age.For discrimination power, C-indices for the statistical model nomogram were 0.840, 0.846, 0.855, 0.859, and 0.853 in training cohort, and 0.875, 0.872, 0.819, 0.815, and 0.834 in the validation cohort, with mean C-indices of 0.851±0.008 and 0.843±0.029, respectively. Accordingly, the C-indices for PINK were 0.734, 0.751, 0.783, 0.750, and 0.770 in the training cohort, and 0.862, 0.783, 0.652, 0.779, and 0.710 in the validation cohort, with mean C-indices 0.758±0.019 and 0.757±0.080, respectively. In addition, the 3-year AUCs for machine learning model were 0.929, 0.919, 0.904, 0.923, and 0.926 in the training cohort, and 0.911, 0.841, 0.821, 0.874, and 0.878 in the validation cohort, with mean AUCs 0.902±0.010 and 0.865±0.035.To demonstrate the calibration power, calibration curves of 5 rounds cross-validation for statistical model nomogram are plotted in ; they show a good correlation between the predicted outcomes and the reality. Finally, binary errors in the machine learning model were 0.167, 0.175, 0.194, 0.148, and 0.162 in the training cohort, and 0.179, 0.250, 0.235, 0.222, and 0.120 in the validation cohort, with mean binary errors of 0.169±0.017 and 0.201±0.053, indicating good calibration power.
Figure 6
Calibration curves for the prediction of 3-year OS in 5 rounds cross-validation for nomogram; A, B, C, D and E are calibration curves of training cohort and validation cohort in Round1-Round5. OS, overall survival.
Calibration curves for the prediction of 3-year OS in 5 rounds cross-validation for nomogram; A, B, C, D and E are calibration curves of training cohort and validation cohort in Round1-Round5. OS, overall survival.
Discussion
At present, risk-adapted treatment is widely accepted, and thus improving the risk stratification of patients is a rational goal. In the past, the IPI, KPI, and other models were used (24,27); however, they were either not specially designed for ENKTL or based on conventional anthracycline-based chemotherapy. The treatment for ENKTL has changed from anthracycline-based treatment to non-anthracycline-based treatment, but the current NCCN guidelines do not recommend these models. PINK is applicable in the era of non-anthracycline–based treatment; however, patients cannot be assessed individually. Our study ultimately identified age, CA stage, ECOG, B symptoms, and LDH as significant factors in constructing statistical model nomograms, and replaced the CA stage with the involved regions, lymph nodes, and primary site in a machine learning model.We found that our models' discriminatory power was superior to that of PINK, and this is perhaps due to two mains reasons. First, we used the CA staging system instead of the Ann Arbor system adopted by PINK. The CA staging system is specially developed for ENKTL, has been proven to be superior in stratifying ENKTL patients, and incorporates more clinical information. In PINK, there are 4 parameters, age, stage III or IV disease in Ann Arbor staging system, non-nasal type disease, and involvement of distant lymph nodes; however, the CA staging system itself contains non-nasal type disease and involvement of lymph nodes, so we use fewer variables while analyzing more information. Second, compared with the PINK statistical grouping method, less information is lost during modeling when using the Cox regression method. As for machine learning algorithm, lightGBM, unlike traditional statistical method, which assume a linear combination of risk factors, lightGBM is essentially a decision tree of machine learning that uses more details and manages more complex relationships.Apart from improving discrimination power, our models’ variables are common and relevant to clinical practice, enhancing their practical utility.First, age is statistically significant and included because older adults generally have more complications and are presumed to be less tolerance to chemotherapy than young patients (28,29), which makes inadequate treatment a common problem. Furthermore, gene aberrations are more likely to accumulate in older patients leading to more aggressive tumors (30,31).The CA staging system has advantages over the Ann Arbor staging system. For unbalanced stratification in stage III and overlapping survival curves between stage III and IV in the Ann Arbor staging system, PINK incorporates stage III and IV as a single parameter. The CA staging system has a more balanced stratification across the 4 stages, and it is statistically significant in multivariate analysis, while the Ann Arbor staging system is not. The CA staging system is specially designed for ENKTL, and there is a high probability it will replace the Ann Arbor staging system in the future; our model is thus intended to better assist clinicians in this regard. We also discarded the staging system in the machine learning model, which allows it to analyze more clinical information and be more adaptable.We found that the ECOG score in our models demonstrated greater weight than in previous models; in particular, when the ECOG score exceeded 3, it had more weight than stage IV of the CA staging system. There are two plausible explanations for this. First, non-anthracycline-based treatments are considered more intense than conventional CHOP or CHOP-like regimens, and thus patients with a poor performance status could be more vulnerable to treatment-related side effects (14). Second, with the effect of non-anthracycline-based treatments exceeding conventional therapies, advanced stage patients who are tolerant to newer regimens generally benefit more than before, making a poor performance status the most dangerous risk factor.Last, unlike the 3 variables mentioned above, which mainly reflect the patient’s ability to tolerate treatment and invasive potential of the tumor (32), B symptoms and elevated LDH can indicate the burden and replication capacity, and thus correlate with a worse OS (33,34); therefore, these 2 variables could further assist in identifying those at a higher risk.Although our models reached acceptable levels of discrimination and calibration in the cross-validation, they still have not undergone further validation from other centers’ data. Furthermore, although the 3 parameters we adopted could easily be applied to clinical practice and enhance practical utility, other risk factors, such as Epstein–Barr (EB) virus DNA, could not be assessed. This is mainly due to the different definition of high titer and the measurement procedures in our and other centers, which make the wider application of these factors challenging (14,35). Although the CA staging system has undergone prospective studies and has proven to be more suitable for ENKTL, the power to discriminate between CA stage II and III was not statistically significant in our data. This may be because regional lymph nodes lie in the radiation field and radiotherapy improves the response rate in stage III, or simply because there was bias arising from the limited data of our study. We assume it might because regional lymph nodes are in the radiation field and radiotherapy hence improve the response rate in stage III, or it is just due to bias because we are not able to include many centers’ data. As the machine learning model did not include a staging system, it does not have the same issues; however, machine leaning is usually used for the analysis of a vast amount of multidimensional data, and, due to low incidence of ENKTL, the amount of data in our research was limited.
Conclusions
In conclusion, we developed two prognostic models that could provide an individual estimation of risk for patients with ENKLT in the era of non-anthracycline-based treatment. The models were preliminarily validated to have good discriminatory power; however, this still needs to be verified by prospective study.
Authors: U Scherf; D T Ross; M Waltham; L H Smith; J K Lee; L Tanabe; K W Kohn; W C Reinhold; T G Myers; D T Andrews; D A Scudiero; M B Eisen; E A Sausville; Y Pommier; D Botstein; P O Brown; J N Weinstein Journal: Nat Genet Date: 2000-03 Impact factor: 38.330
Authors: Steven H Swerdlow; Elias Campo; Stefano A Pileri; Nancy Lee Harris; Harald Stein; Reiner Siebert; Ranjana Advani; Michele Ghielmini; Gilles A Salles; Andrew D Zelenetz; Elaine S Jaffe Journal: Blood Date: 2016-03-15 Impact factor: 22.113
Authors: T M Kim; S-Y Lee; Y K Jeon; B-Y Ryoo; G J Cho; Y S Hong; H J Kim; S-Y Kim; C S Kim; S Kim; J S Kim; S K Sohn; H H Song; J L Lee; Y K Kang; C Y Yim; W S Lee; Y J Yuh; C W Kim; D S Heo Journal: Ann Oncol Date: 2008-04-02 Impact factor: 32.976