Literature DB >> 35116395

New prognostic models for extranodal natural killer T-cell lymphoma, nasal-type using Cox regression and machine learning.

Jianbin Sun¹, Xiaoyan Ke², Mingzhi Zhang³, Yuhui Wang¹, Fengyang An¹, Yulin Zhao⁴, Li Zhu¹.

Abstract

BACKGROUND: The prognostic index of natural killer lymphoma (PINK) is recommended for use as a prognostic model for determining the best non-anthracycline-based treatment for extranodal natural killer T-cell lymphoma, nasal-type (ENKTL). However, this model does not provide an accurate individual risk estimation for patients; therefore, our retrospective study was conducted to determine this risk.
METHODS: Clinical data from 250 patients with ENKTL treated with non-anthracycline-based regimens were analyzed. The statistically significant clinical characteristics were selected as the parameters for our models. The patient data from 250 patients were randomly divided into 5 groups for 5-fold cross validation before final models were established on all of the patients' data. A statistical model nomogram based on a Cox proportional hazards model, and a machine learning model based on the lightGBM algorithm, were constructed. Concordance index (C-index) and calibration curve, areas under the curve (AUC) values, and binary error were used to evaluate two models.
RESULTS: Five variables [age, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL (CA) staging system, Eastern Cooperative Oncology Group (ECOG) score, B symptoms, and lactate dehydrogenase (LDH)] were significant and were selected as parameters for creating the statistical model nomogram, while lesion sites (anatomical regions, lymph nodes and primary lesion site) took place of CA staging system in machine learning model. During cross validation, the mean C-indices of training cohort and validation cohort for statistical model nomogram and PINK were 0.851±0.008, 0.843±0.029, 0.758±0.019 and 0.757±0.080, respectively, while the mean 3-year AUC for machine learning model were 0.920±0.010 and 0.865±0.035, respectively. The calibration curves and binary errors showed a good correlation between the predicted result and the reality.
CONCLUSIONS: These two models could provide ENKTL patients with an accurate individual risk estimation in the era of non-anthracycline-based treatment. 2021 Translational Cancer Research. All rights reserved.

Entities: Chemical

Keywords: CA staging system; Extranodal natural killer T-cell lymphoma, nasal-type (ENKTL); machine learning; nomogram; non-anthracycline-based treatment

Year: 2021 PMID： 35116395 PMCID： PMC8798130 DOI： 10.21037/tcr-20-3017

Source DB: PubMed Journal: Transl Cancer Res ISSN： 2218-676X Impact factor: 1.241

Introduction

Extranodal natural killer T-cell lymphoma, nasal-type (ENKTL) is recognized as a distinct subtype of lymphoid neoplasms by the World Health Organization (WHO) (1). It has the characteristics of having a low global morbidity rate and a poor prognosis (2-4). L-asparaginase has been determined to be effective in inhibiting the proliferation of natural killer cells since patients with ENKTL respond poorly to conventional CHOP (cyclophosphamide, doxorubicin, vincristine, and prednisone) and CHOP-like regimens because of drug resistance (5-8). New chemotherapies incorporating L-asparaginase and its analogues were gradually introduced into clinical practice, with an increasing number of patients deriving benefit from these newer non-anthracycline–based treatments. However, many studies (9-11) have shown that prognostic models based on conventional CHOP or CHOP-like chemotherapies, including the International prognostic index (IPI) (12) and Korean prognostic index (KPI) (13), were found no longer applicable under this new mode of treatment. As a result, a new prognostic model prognostic index of natural killer lymphoma (PINK) (14) was proposed in 2016. PINK incorporates four risk factors: age over 60 years, non-nasal type disease, stage III or IV disease in the Ann Arbor staging system and involvement of distant lymph nodes. It classifies patients into three groups: group 1 (risk factors ≤1), group 2 (risk factors =2), and group 3 (risk factors =3). These risks groups are significantly associated with different 3-year overall survival (OS). PINK has thus been validated for clinical use (15) and has been recommended by the latest National Comprehensive Cancer Network (NCCN) guideline (V.1, 2020); however, it classifies patients into just 3 groups. As a result, only a rough assessment of risk can be provided to a given patient: low risk, intermediate risk, or high risk. This does not provide an individualized or precise assessment of risk. Therefore, we have conducted this retrospective research to develop improved models. We present the following article in accordance with the TRIPOD reporting checklist (available at: http://dx.doi.org/10.21037/tcr-20-3017).

Methods

Participants

The study recruited 262 patients with ENKTL initially treated in the Peking University Third Hospital and the First Affiliated Hospital of Zhengzhou University from 2010 to 2020. Individuals were included in the study if they met the following criteria: (I) diagnosed as ENKTL according to histological and immunophenotypic features (including AE1/AE3, CD3, CD7, CD20, CD43, CD56, TIA-1, GranzymeB, EBER and Ki-67) on the basis of the WHO classification (16); (II) staged according to an endoscopic and imaging examination, including computed tomography (CT), magnetic resonance imaging (MRI) and/or ultrasound; (III) treated with non-anthracycline-based regimens. Patients were excluded if (I) they suffered from a previous malignancy or second primary tumor; (II) if they were not initially treated or their pretreatment information could not be accessed. Of the 262 patients, 5 suffered from previous malignancy or second primary tumor, 7 were not initially treated, or their pretreatment information could not be accessed. Finally, 250 patients were eligible for inclusion in our study. The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013). This study was approved by our institutional review board (IRB00006761-M2020191). As our study was retrospective in nature with no personally identifiable information, the need for informed consent was waived.

Evaluation, deﬁnition and treatment

The data of participants’ medical history and physical examinations were gathered. Data from the endoscopic examination of upper aerodigestive tract; imaging examination (CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound) of involved regions and of the chest, abdomen, pelvis, limbs, and other regions; and bone marrow aspiration were collected. Each participant was staged using the Ann Arbor staging system and stratiﬁed using the PINK model. They were also staged by the newest staging system, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL (CA) system. The Ann Arbor staging system was originally developed for Hodgkin’s lymphoma (17), However, this system was reported to be less accurate in assessing aggressive non-Hodgkin’s lymphoma due to disease spread patterns differing from those of non-Hodgkin’s lymphoma (18). ENKTL is particularly disparate, as it primarily occurs in the upper aerodigestive tract (19,20). A new staging system: the CA staging system, was first proposed in 2014, and in 2019 underwent prospective study and validation both in China and internationally. It has demonstrated superior evaluative ability for ENKTL compared to the Ann Arbor system (21), and thus patients were also staged by the CA staging system in the present study. The CA staging system is consist of 4 stages: stage I, lesions located in nasal cavity or nasopharynx without local invasiveness or involvement of lymph node; stage II, lesions located in nasal cavity or nasopharynx with local invasiveness but without involvement of lymph node or lesions primarily located outside upper aerodigestive tract ; stage III, lesions combined with involvement of regional lymph node; stage IV, disseminated lesions or involvement of non-regional lymph nodes or lymph nodes involved on both sides of the diaphragm (21). To facilitate machine learning, all involved regions are deﬁned according to their anatomy, which included the nasal cavity, sinus, nasopharynx, oropharynx or tonsil, larynx, palate, oral cavity and tongue, parapharyngeal space or parotid gland, orbital cavity, skin of face, skin and soft tissue of body, gastrointestinal tract, lung, pericardium, spleen, liver, kidney or adrenal gland, testis, uterus or ovary, bone marrow and central nervous system; The involved lymph nodes are also deﬁned according to the classification and delineation of systemic lymph nodes, which included the head, cervical, supraclavicular, axillary, mediastinal, abdominal, retroperitoneal, pelvic, and limb lymph nodes. Primary site could be classified as upper aerodigestive tract or extra upper aerodigestive tract. All participants received non-anthracycline-based treatment according to their risk-based stratification. Comprehensive treatment included surgery, radiotherapy, chemotherapy, targeted therapy, stem cell transplantation, and maintenance therapy. Surgery treatment referred to resection of the lesion during the biopsy, and follow-up examinations returned negative results, usually followed by radiotherapy or chemotherapy. Radiotherapy was performed using the standard scheme of a radical dose of 50–52 Gy/25–26 Gy. Chemotherapy was performed using non-anthracycline-based regimens, mainly incorporating P-GEMOX (gemcitabine, pegaspargase and oxaliplatin), DDGP (dexamethasone, cisplatin, gemcitabine and pegaspargase) and Modified-SMILE (dexamethasone, methotrexate, ifosfamide, pegaspargase, Mesna and etoposide), which are the preferred regimens of the NCCN guideline (v.1, 2020). It should be noted that 7 patients received conventional CHOP or CHOP-like regimens. However, 1 of them changed to conventional therapy because the expected clinical effects were not achieved after administration of non-anthracycline-based treatment, 2 received as consolidation therapy after being relieved, and 4 received it as pretreatment before transplantation. Since none of them were initially or mainly treated with conventional therapies, they were still included in the study. Targeted therapy was used when disease progressed, and mainly consisted of programmed cell death protein 1 (PD-1) receptor inhibitors and chidamide. Transplantation referred to autologous stem cell transplantation after complete remission (CR) or unconfirmed complete remission (CRu). Maintenance therapy was mainly used for patients in terminal stage, and included steroids, thalidomide, and other forms for palliative treatment.

The construction and validation of the statistical model nomogram

Fifteen variables associated with OS that were reported in previous studies were selected including sex, age, Ann Arbor stage, CA stage, lymph node involvement, the interval from primary symptoms to diagnosis, primary site, Eastern Cooperative Oncology Group (ECOG) score, B symptoms, PINK model stratification, and lactate dehydrogenase (LDH) level, hemoglobin, platelet, and monocyte and prognostic nutritional index [PNI, defined as albumin g/L + 5 × lymphocyte count (×109/L)]. We separated continuous variables into low and high groups either using the well-known cutoff (age) or on the basis of the usual cutoff value for the normal range (such as LDH and hemoglobin) or the cutoffs of which prognostic value had been validated in previous studies (platelet, monocyte, PNI) (22,23). Univariate analysis was then performed using the Kaplan–Meier method. Apart from gender and intervals from primary symptoms to diagnosis, the other thirteen variables were significant for OS and underwent multivariate analysis by Cox regression method. Five variables, including age, CA stage, ECOG, B symptoms, and LDH score, were found to be statistically significant and were finally selected to construct the statistical model nomogram. For a better evaluation of this method of constructing a model, we used 5-fold cross validation. All patients were stratified randomly divided and divided into 5 groups according to CA stage; in turn, each group served as the validation cohort with the other 4 serving as the training cohort, and the final model was constructed on all 250 patients’ data (). Discrimination and calibration powers were evaluated by concordance index (C-index) and calibration curve, respectively, which as is were recommended for the evaluation of survival regression analysis.

Figure 1

Flow chart for participants selection, cross-validation and establishment of models.

The construction and validation of the machine learning model

As CA stage consists of the involved regions, lymph nodes, and the primary site, to facilitate machine learning, these 3 were analyzed instead of the CA stage itself. Age, ECOG score, B symptoms, and LDH score were also included in the machine learning model. LightGBM, recognized as an excellent gradient boosting decision tree (GBDT) algorithm in machine learning, was used for data analysis. A 5-fold cross-validation for the machine learning model was used for the same groups used by statistical model nomogram. For 3-year OS prediction, patients were first labeled as live or dead according to whether their survival time was over 3 years or not, and were then included in the analysis. AUC and binary errors were used to evaluate the discrimination and calibration powers due to binary classification nature of the LightGBM algorithm.

Statistical analysis

The OS rate was defined as the period from the initial diagnosis to ENKTL-related death or the last follow-up. Survival curves were depicted by the life table method, and univariate analysis was conducted with the Kaplan-Meier method. Multivariate analysis was conducted with Cox regression method by entering selection process, and the nomogram was constructed using multivariate Cox proportional hazards model method. Two missed values from PNI and intervals between primary symptoms to diagnosis were not included in univariate analysis and multivariate analysis. C-indices were calculated by calculating the area under the receiver operating characteristic (ROC) curve. Calibration curves comprised the predicted probability of OS curve and actual OS curve, and were plotted by bootstrap resampling (1,000 resamples for the training cohort and 500 for the validation cohort). AUC was defined as true positives (TP): TP + false positives (FP) and binary error was defined as FP + FN/all samples. Statistical analyses were performed in SPSS 20.0 (RRID:SCR_002865). The C-index, nomogram, and calibration curve were developed in R language using RStudio (RRID:SCR_000432). Machine learning was performed using Python in PyCharm 2020.1.2 (RRID:SCR_018221). Graphs were processed in GraphPad Prism 8 (RRID:SCR_002798) and Adobe Illustrator CC 2014 (RRID:SCR_010279). A P value <0.05 was considered statistically significant.

Results

Participant characteristics

The clinical characteristics of 250 participants are shown in . The involved regions and lymph nodes are outlined in . Out of all 250 patients analyzed, 195 (78%) patients received whole-body PET-CT imaging before initial treatment. In 55 patients who did not receive PET-CT, 38 were in early stage (Ann Arbor stage I or II) and 17 were in advanced stage (Ann Arbor stage III or IV). The mean follow-up time for survival patients was 32 months, and the 1- and 3-year OS were 80.9% and 69.6%, respectively (). In the univariate analysis, the following 13 variables were associated with OS: age, Ann Arbor stage, CA stage, lymph node involvement, primary site, ECOG score, B symptoms, PINK, LDH level, hemoglobin, platelet, monocyte, and PNI. In the multivariate analysis, 5 variables were significant: age, CA stage, ECOG score, B symptoms, and LDH level (). The grouping by CA stage for cross-validation is shown in .

Table 1

Clinical characteristics

Characteristics	n (%)
Gender
Male	182 (72.8)
Female	68 (27.2)
Age (years)
<60	209 (83.6)
≥60	41 (16.4)
Ann arbor stage
I	67 (26.8)
II	101 (40.4)
III	20 (8.0)
IV	62 (24.8)
CA stage
I	72 (28.8)
II	53 (21.2)
III	44 (17.6)
IV	81 (32.4)
Lymph node involvement
Yes	95 (38.0)
No	155 (62.0)
Interval from primary symptoms to diagnosis (months)
<12	214 (85.6)
≥12	34 (13.6)
Unknown	2(0.8)
Primary site
UADT	196 (78.4)
Extra-UADT	54 (21.6)
ECOG score
0	59 (23.6)
1	141 (56.4)
2	42 (16.8)
≥3	8 (3.2)
B symptoms
Yes	133 (53.2)
No	117 (46.8)
PINK
Group1	133 (53.2)
Group2	53 (21.2)
Group3	64 (25.6)
LDH level (U/L)
<245	143 (57.2)
≥245	107 (42.8)
Hemoglobin (g/L)
<120	86 (34.4)
≥120	164 (65.6)
Platelet (10⁹/L)
<200	119 (47.6)
≥200	131 (52.4)
Monocyte (10⁹/L)
<0.5	115 (46.0)
≥0.5	135 (54.0)
PNI
<48	169 (67.6)
≥48	79 (31.6)
Unknown	2(0.8)
Treatment
Surgery	14 (5.6)
Radiotherapy	113 (45.2)
Chemotherapy
P-GEMOX	27 (10.8)
DDGP	158 (63.2)
Modified-SMILE	39 (15.6)
Others	24 (9.6)
CHOP or CHOP-like regimens	7 (2.8)
Targeted therapy	17 (6.8)
Transplantation	4 (1.6)
Maintenance therapy	7 (2.8)

Table 2

Involved regions and lymph nodes

	n (%)
Regions
None	3 (1.2)
Nasal cavity	173 (69.2)
Sinus	40 (16.0)
Nasopharynx	82 (32.8)
Oropharynx or tonsil	38 (15.2)
Larynx	12 (4.8)
Palate	19 (7.6)
Oral cavity and tongue	7 (2.8)
Parapharyngeal space or parotid gland	26 (10.4)
Orbital cavity	11 (4.4)
Skin of face	23(9.2)
Skin and soft tissue of body	39 (15.6)
Gastrointestinal tract	28 (11.2)
Lung	22 (8.8)
Pericardium	1 (0.4)
Spleen	13 (5.2)
Liver	6 (2.4)
Kidney or adrenal gland	12 (4.8)
Testis	9 (3.6)
Uterus or ovary	7 (2.8)
Bone marrow	7 (2.8)
Central nervous system	5 (2.0)
Lymph nodes
None	156 (62.4)
Head	7 (2.8)
Neck	76 (30.4)
Supraclavicular	7 (2.8)
Axilla	19 (7.6)
Mediastinum	16 (6.4)
Abdominal cavity	16 (6.4)
Retroperitoneal	16 (6.4)
Pelvic cavity	30(12.0)
Limb	2 (0.8)

Involved regions and lymph nodes are determined by CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound, 195/250 (78%) received whole-body PET-CT. In 55 patients who did not received PET-CT, 38 were in early stage (Ann Arbor stage I or II), and 17 were in advanced stage (Ann Arbor stage III or IV).

Figure 2

Kaplan-Meier overall survival (OS) curve for all 250 patients included in the analyses.

Table 3

Univariate analysis and multivariable analysis

Characteristics	Univariate analysis	Multivariable analysis
Characteristics	P value	HR (95% CI)	P value
Gender (male vs. female)	0.598	–	–
Ann Arbor Stage (I, II, III, IV)	<0.001	–	0.983
Lymph node involvement (yes vs. no)	<0.001	–	0.624
Interval from primary symptoms to diagnosis (<12 vs. ≥12 months; n=248)	0.962	–	–
Primary site (UADT vs. Extra-UADT)	<0.001	–	0.595
PINK (Group1, Group2, Group3)	<0.001	–	0.477
Hemoglobin (<120 vs. ≥120 g/L)	<0.001	–	0.1
Platelet (<200 vs. ≥200, 10⁹/L)	0.018	–	0.819
Monocyte (<0.5 vs. ≥0.5, 10⁹/L)	0.008	–	0.498
PNI (<48 vs. ≥48; n=248)	0.001	–	0.355
Age (<60 vs. ≥60, years)	0.039	2.011 (1.043–3.878)	0.037
CA Stage (I, II, III, IV)	<0.001	2.341 (1.199–4.569)	0.013
ECOG score (0,1,2 vs. ≥3)	<0.001	2.4 (1.607–3.583)	<0.001
B symptoms (Yes vs. No)	0.007	0.325 (0.161–0.657)	0.002
LDH level (<245 vs. ≥245 U/L)	<0.001	3.746 (1.070–3.746)	0.03

Two missed values in Interval from primary symptoms to diagnosis and PNI were not included in univariate analysis. Fifteen variables in 250 patients first received univariate analysis and 13 statistically significant ones received multivariable analysis. In multivariable analysis, 5 variables were statistically significant. UADT, upper aerodigestive tract; PINK, prognostic index of natural killer lymphoma; PNI, prognostic nutritional index; CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.

Table 4

Grouping for cross-validation

CA stage	Group1	Group2	Group3	Group4	Group5	Total
I
Censored	14	14	14	14	13	69
Not censored	1	1	1	0	0	3
II
Censored	9	9	9	9	10	46
Not censored	0	1	2	2	2	7
III
Censored	6	7	7	7	7	34
Not censored	2	1	2	2	3	10
IV
Censored	7	7	7	7	8	36
Not censored	11	10	8	9	7	45
Total	50	50	50	50	50	250

CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; PINK, prognostic index of natural killer lymphoma; LDH, lactate dehydrogenase concentrations; PNI, prognostic nutritional index; P-GEMOX, gemcitabine, pegaspargase and oxaliplatin; DDGP, dexamethasone, cisplatin, gemcitabine and pegaspargase; Modified-SMILE, dexamethasone, methotrexate, ifosfamide, pegaspargase, mesna and etoposide; CHOP, cyclophosphamide, doxorubicin, vincristine, and prednisone. Involved regions and lymph nodes are determined by CT/enhanced CT/PET-CT, MR/enhanced MR and ultrasound, 195/250 (78%) received whole-body PET-CT. In 55 patients who did not received PET-CT, 38 were in early stage (Ann Arbor stage I or II), and 17 were in advanced stage (Ann Arbor stage III or IV). Kaplan-Meier overall survival (OS) curve for all 250 patients included in the analyses. Two missed values in Interval from primary symptoms to diagnosis and PNI were not included in univariate analysis. Fifteen variables in 250 patients first received univariate analysis and 13 statistically significant ones received multivariable analysis. In multivariable analysis, 5 variables were statistically significant. UADT, upper aerodigestive tract; PINK, prognostic index of natural killer lymphoma; PNI, prognostic nutritional index; CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations. According to the Ann Arbor staging system, the distribution for stages I-IV was 67 (26.8%), 101 (40.4%), 20 (8.0%) and 62 (24.8%), respectively. Stage III had the lowest proportion (8.0%), and its survival curve overlapped with that of stage IV, implying poor discrimination (). These results were in line with previous studies (19-26). In contrast, the distribution of the CA staging system for stages I–IV was 72 (28.8%), 53 (21.2%), 44 (17.6%), 81 (32.4%), respectively, which is more balanced, and there was less overlap between the survival curves (). The survival curves of the PINK model also showed good discrimination ().

Figure 3

Kaplan-Meier survival curves according to the Ann Arbor stage (A), CA Stage (B) and PINK (C). CA, the Chinese Southwest Oncology Group and Asia Lymphoma Study Group ENKTL system; PINK, prognostic index of natural killer lymphoma.

New models and comparison with the PINK model

The statistical model nomogram and machine learning model predicting the 3-year OS are illustrated in .

Figure 4

Figure 5

Machine learning model predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. A patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age. e.g., Illustrated is a 53-year-old patient who had a lifespan of 15 months after diagnosis, with nasal cavity, sinus, skin of face, skin and soft tissue of body, kidney or adrenal gland involved, upper aerodigestive tract of primary site, ECOG score 1, LDH ≥245 U/L and B symptom. ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations.

Statistical model nomogram predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. To use the nomogram, for an individual patient, the value is located on each variable axis, and a line drawn upwards to the Points axis to determines points received for each variable value. The sum of five points is located on the Total Points axis, and lines drawn downwards to the 3-year OS determine the likelihood of 3-year OS rates. OS, overall survival. Machine learning model predicting 3-year OS for patients with extranodal NK/T-cell lymphoma, nasal-type. A patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age. e.g., Illustrated is a 53-year-old patient who had a lifespan of 15 months after diagnosis, with nasal cavity, sinus, skin of face, skin and soft tissue of body, kidney or adrenal gland involved, upper aerodigestive tract of primary site, ECOG score 1, LDH ≥245 U/L and B symptom. ECOG, Eastern Cooperative Oncology Group; LDH, lactate dehydrogenase concentrations. To use the statistical model nomogram, for an individual patient, the value of each variable is located on the corresponding variable axis, and a line drawn upwards to the Points axis to determines the point received for each variable. The sum of 3 points is located on the Total Points axis, and the lines drawn downwards to the 3-year OS axis determine the likelihood of 3-year OS. To use machine learning model, a patient’s 3-year OS likelihood can be output by checking the involved regions and lymph nodes, primary site, ECOG, LDH, and entering age. For discrimination power, C-indices for the statistical model nomogram were 0.840, 0.846, 0.855, 0.859, and 0.853 in training cohort, and 0.875, 0.872, 0.819, 0.815, and 0.834 in the validation cohort, with mean C-indices of 0.851±0.008 and 0.843±0.029, respectively. Accordingly, the C-indices for PINK were 0.734, 0.751, 0.783, 0.750, and 0.770 in the training cohort, and 0.862, 0.783, 0.652, 0.779, and 0.710 in the validation cohort, with mean C-indices 0.758±0.019 and 0.757±0.080, respectively. In addition, the 3-year AUCs for machine learning model were 0.929, 0.919, 0.904, 0.923, and 0.926 in the training cohort, and 0.911, 0.841, 0.821, 0.874, and 0.878 in the validation cohort, with mean AUCs 0.902±0.010 and 0.865±0.035. To demonstrate the calibration power, calibration curves of 5 rounds cross-validation for statistical model nomogram are plotted in ; they show a good correlation between the predicted outcomes and the reality. Finally, binary errors in the machine learning model were 0.167, 0.175, 0.194, 0.148, and 0.162 in the training cohort, and 0.179, 0.250, 0.235, 0.222, and 0.120 in the validation cohort, with mean binary errors of 0.169±0.017 and 0.201±0.053, indicating good calibration power.

Figure 6

Calibration curves for the prediction of 3-year OS in 5 rounds cross-validation for nomogram; A, B, C, D and E are calibration curves of training cohort and validation cohort in Round1-Round5. OS, overall survival.

Discussion

At present, risk-adapted treatment is widely accepted, and thus improving the risk stratification of patients is a rational goal. In the past, the IPI, KPI, and other models were used (24,27); however, they were either not specially designed for ENKTL or based on conventional anthracycline-based chemotherapy. The treatment for ENKTL has changed from anthracycline-based treatment to non-anthracycline-based treatment, but the current NCCN guidelines do not recommend these models. PINK is applicable in the era of non-anthracycline–based treatment; however, patients cannot be assessed individually. Our study ultimately identified age, CA stage, ECOG, B symptoms, and LDH as significant factors in constructing statistical model nomograms, and replaced the CA stage with the involved regions, lymph nodes, and primary site in a machine learning model. We found that our models' discriminatory power was superior to that of PINK, and this is perhaps due to two mains reasons. First, we used the CA staging system instead of the Ann Arbor system adopted by PINK. The CA staging system is specially developed for ENKTL, has been proven to be superior in stratifying ENKTL patients, and incorporates more clinical information. In PINK, there are 4 parameters, age, stage III or IV disease in Ann Arbor staging system, non-nasal type disease, and involvement of distant lymph nodes; however, the CA staging system itself contains non-nasal type disease and involvement of lymph nodes, so we use fewer variables while analyzing more information. Second, compared with the PINK statistical grouping method, less information is lost during modeling when using the Cox regression method. As for machine learning algorithm, lightGBM, unlike traditional statistical method, which assume a linear combination of risk factors, lightGBM is essentially a decision tree of machine learning that uses more details and manages more complex relationships. Apart from improving discrimination power, our models’ variables are common and relevant to clinical practice, enhancing their practical utility. First, age is statistically significant and included because older adults generally have more complications and are presumed to be less tolerance to chemotherapy than young patients (28,29), which makes inadequate treatment a common problem. Furthermore, gene aberrations are more likely to accumulate in older patients leading to more aggressive tumors (30,31). The CA staging system has advantages over the Ann Arbor staging system. For unbalanced stratification in stage III and overlapping survival curves between stage III and IV in the Ann Arbor staging system, PINK incorporates stage III and IV as a single parameter. The CA staging system has a more balanced stratification across the 4 stages, and it is statistically significant in multivariate analysis, while the Ann Arbor staging system is not. The CA staging system is specially designed for ENKTL, and there is a high probability it will replace the Ann Arbor staging system in the future; our model is thus intended to better assist clinicians in this regard. We also discarded the staging system in the machine learning model, which allows it to analyze more clinical information and be more adaptable. We found that the ECOG score in our models demonstrated greater weight than in previous models; in particular, when the ECOG score exceeded 3, it had more weight than stage IV of the CA staging system. There are two plausible explanations for this. First, non-anthracycline-based treatments are considered more intense than conventional CHOP or CHOP-like regimens, and thus patients with a poor performance status could be more vulnerable to treatment-related side effects (14). Second, with the effect of non-anthracycline-based treatments exceeding conventional therapies, advanced stage patients who are tolerant to newer regimens generally benefit more than before, making a poor performance status the most dangerous risk factor. Last, unlike the 3 variables mentioned above, which mainly reflect the patient’s ability to tolerate treatment and invasive potential of the tumor (32), B symptoms and elevated LDH can indicate the burden and replication capacity, and thus correlate with a worse OS (33,34); therefore, these 2 variables could further assist in identifying those at a higher risk. Although our models reached acceptable levels of discrimination and calibration in the cross-validation, they still have not undergone further validation from other centers’ data. Furthermore, although the 3 parameters we adopted could easily be applied to clinical practice and enhance practical utility, other risk factors, such as Epstein–Barr (EB) virus DNA, could not be assessed. This is mainly due to the different deﬁnition of high titer and the measurement procedures in our and other centers, which make the wider application of these factors challenging (14,35). Although the CA staging system has undergone prospective studies and has proven to be more suitable for ENKTL, the power to discriminate between CA stage II and III was not statistically significant in our data. This may be because regional lymph nodes lie in the radiation ﬁeld and radiotherapy improves the response rate in stage III, or simply because there was bias arising from the limited data of our study. We assume it might because regional lymph nodes are in the radiation ﬁeld and radiotherapy hence improve the response rate in stage III, or it is just due to bias because we are not able to include many centers’ data. As the machine learning model did not include a staging system, it does not have the same issues; however, machine leaning is usually used for the analysis of a vast amount of multidimensional data, and, due to low incidence of ENKTL, the amount of data in our research was limited.

Conclusions

In conclusion, we developed two prognostic models that could provide an individual estimation of risk for patients with ENKLT in the era of non-anthracycline-based treatment. The models were preliminarily validated to have good discriminatory power; however, this still needs to be veriﬁed by prospective study.

33 in total

1. A gene expression database for the molecular pharmacology of cancer.

Authors: U Scherf; D T Ross; M Waltham; L H Smith; J K Lee; L Tanabe; K W Kohn; W C Reinhold; T G Myers; D T Andrews; D A Scudiero; M B Eisen; E A Sausville; Y Pommier; D Botstein; P O Brown; J N Weinstein
Journal: Nat Genet Date: 2000-03 Impact factor: 38.330

2. BCL2 expression in de novo diffuse large B-cell lymphoma partly reflects normal differences in age distribution.

Authors: Ulf Thunberg; Rose-Marie Amini; Johan Linderoth; Göran Roos; Gunilla Enblad; Mattias Berglund
Journal: Br J Haematol Date: 2009-06-08 Impact factor: 6.998

3. Validity of the Ann Arbor staging classification for the non-Hodgkin's lymphomas.

Authors: S A Rosenberg
Journal: Cancer Treat Rep Date: 1977-09

4. Report of the Committee on Hodgkin's Disease Staging Classification.

Authors: P P Carbone; H S Kaplan; K Musshoff; D W Smithers; M Tubiana
Journal: Cancer Res Date: 1971-11 Impact factor: 12.701

5. A modified international prognostic index including pretreatment hemoglobin level for early stage extranodal natural killer/T cell lymphoma.

Authors: Liang Wang; Zhong-jun Xia; Yue Lu; Qiao-xuan Wang; Shao-qing Niu; Hui-qiang Huang; Yu-jing Zhang
Journal: Leuk Lymphoma Date: 2015-03-28

6. The effect of pre-irradiation dose intense CHOP on anthracyline resistance in localized nasal NK/T-cell lymphoma.

Authors: Se-Hoon Lee; Yong Chan Ahn; Won Seog Kim; Young H Ko; Kihyun Kim; Keunchil Park
Journal: Haematologica Date: 2006-02-17 Impact factor: 9.941

Review 7. The 2016 revision of the World Health Organization classification of lymphoid neoplasms.

Authors: Steven H Swerdlow; Elias Campo; Stefano A Pileri; Nancy Lee Harris; Harald Stein; Reiner Siebert; Ranjana Advani; Michele Ghielmini; Gilles A Salles; Andrew D Zelenetz; Elaine S Jaffe
Journal: Blood Date: 2016-03-15 Impact factor: 22.113

8. Clinical heterogeneity of extranodal NK/T-cell lymphoma, nasal type: a national survey of the Korean Cancer Study Group.

Authors: T M Kim; S-Y Lee; Y K Jeon; B-Y Ryoo; G J Cho; Y S Hong; H J Kim; S-Y Kim; C S Kim; S Kim; J S Kim; S K Sohn; H H Song; J L Lee; Y K Kang; C Y Yim; W S Lee; Y J Yuh; C W Kim; D S Heo
Journal: Ann Oncol Date: 2008-04-02 Impact factor: 32.976

9. Frequent expression of P-glycoprotein/MDR1 by nasal T-cell lymphoma cells.

Authors: M Yamaguchi; K Kita; H Miwa; K Nishii; K Oka; T Ohno; S Shirakawa; M Fukumoto
Journal: Cancer Date: 1995-12-01 Impact factor: 6.860

10. Validation of nomogram-revised risk index and comparison with other models for extranodal nasal-type NK/T-cell lymphoma in the modern chemotherapy era: indication for prognostication and clinical decision-making.

Authors: Si-Ye Chen; Yong Yang; Shu-Nan Qi; Ying Wang; Chen Hu; Xia He; Li-Ling Zhang; Gang Wu; Bao-Lin Qu; Li-Ting Qian; Xiao-Rong Hou; Fu-Quan Zhang; Xue-Ying Qiao; Hua Wang; Gao-Feng Li; Yu-Jing Zhang; Yuan Zhu; Jian-Zhong Cao; Sheng-Min Lan; Jun-Xin Wu; Tao Wu; Su-Yu Zhu; Mei Shi; Li-Ming Xu; Zhi-Yong Yuan; Joachim Yahalom; Richard Tsang; Yu-Qin Song; Jun Zhu; Hang Su; Ye-Xiong Li
Journal: Leukemia Date: 2020-03-09 Impact factor: 11.528