Jielun Lu1, Huo Tan2, Bo Li1, Shuyi Chen2, Lihua Xu2, Yawei Zou1. 1. Department of Pediatrics, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510000, P.R. China. 2. Department of Hematology, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, Guangdong 510000, P.R. China.
Burkitt lymphoma (BL) is a highly aggressive B-cell non-Hodgkin lymphoma that was discovered by Denis Burkitt in 1958 in Africa (1). This was the first tumor shown to have a chromosomal translocation activating an oncogene (c-MYC) (2), as well as being associated with Epstein-Barr virus (3). Owing to the introduction of dose-intense multi-drug combination chemotherapy, prophylaxis for central nervous system disease and improvements in supportive care, BL has become the aggressive lymphoma subset that is associated with the highest cure rate in both adults and children (4).However, compared with published single or multi-center clinical studies, the 3-year survival rate according to the 2002–2008 Surveillance, Epidemiology and End Results (SEER) US study, including 3,691 patients with BL, was notably low (56%) (5). In addition, current guidelines are still recommending uniform, aggressive treatment for all patients without considering prognosis and risk categorization. As with other malignancies, identifying the prognostic factors of BL is critical, as these can guide treatment selection. Castillo et al (6) reported that age, race and stage have independent prognostic roles in survival, by analyzing data from 1,428 patients recorded between 1998 and 2009 from the SEER program database. Nevertheless, systematically identifying prognostic factors to arrive at a more precise estimate of prognosis is of importance, as survival is undoubtedly multifactorial. Nomograms, a simple graphical presentation of a multivariate predictive model, show the impact of each included variable on an outcome of interest, thereby providing a numerical probability of the outcome (7). One of the advantages of nomograms is the ability to integrate multiple prognostic factors into the individual survival numerical estimate of a single patient, providing individualized survival predictions. Nomograms have gradually become a useful tool in the field of oncology, for predicting cancer prognosis (8–10). However, to the best of our knowledge, a nomogram for survival prediction in BL has not been reported.The present study aimed to use data from a large US population dataset, the SEER database, to provide an update on the incidence and evaluate treatment outcomes in patients with BL over the past three decades, between 1983 and 2015. Furthermore, nomograms were developed and validated to predict the overall survival (OS) and cancer-specific survival (CSS) of patients with BL with different values of prognostic factors.
Patients and methods
Data sources and patient selection
The SEER Program that was used to obtain the data for the present study was created in 1973 to collect cancer statistics in the US, to reduce the cancer burden among the US population (http://seer.cancer.gov). The data on patient demographics, primary tumor site, tumor morphology and stage at diagnosis, first course of treatment, and follow-up for vital status (through linkage with the National Center for Health Statistics) were routinely collected in all 18 registries spread across the US. Currently, SEER registries cover 28% of the US population; coverage includes 25% of white Americans, 26% of African Americans, 38% of Hispanics, 44% of Native Americans and Alaska natives, 50% of Asians and 67% of Hawaiian/Pacific islanders. The SEER database was accessed via SEER*Stat software (version 8.3.5; National Cancer Institute).The inclusion criteria of the study were as follows: i) Diagnosed with BL as the primary malignancy, according to the 3rd edition (ICD-O-3) histology code 9687 (Burkitt lymphoma) (earlier patients coded as ICD-O-1 and ICD-O-2 were previously re-coded as ICD-O-3); ii) only one malignant tumor present, in order to exclude interference from other tumors; and iii) followed up for vital status until December 31st, 2015, to ensure a minimal follow-up length of two years. The exclusion criteria were as follows: i) Diagnosed by autopsy or reported only on a death certificate; and ii) one or more statistical factors unknown. In total, 4,600 patients were included in the analysis.
Definition of variables
Patients were divided into three groups according to the time of diagnosis (1983–1993, 1994–2004 and 2005–2015), in order to evaluate the development of the diagnosis and treatment of BL. The patients were further stratified by other clinicopathological features including age, sex (male or female), race (black, white, American Indian/Alaska native, Chinese and other Asian/Pacific islander) marital status (married, single, or widowed/divorced/separated), Ann Arbor stage (I, II, or IV), primary site (lymph nodes, digestive system, oral cavity and pharynx, bone marrow, nervous system and other), percentage of families with incomes below the poverty line of the county (% families below poverty; ≤5, 5–10, 10–20 or ≥20), median household income, chemotherapy (yes, or no/unknown), outcome at study cutoff (alive or succumbed), and survival time.To assess the prognostic significance of age and to avoid arbitrary predetermined cutoff points, X-tile plots were constructed to assess age using four groups, and the most effective cutoff points were identified following correction for the use of minimum P statistics by Miller-Siegmund P-value correction, and dividing into four parts. The X-tile program (Yale University) was first introduced to identify the best cutoff values of several prognostic factors in cohorts of patients with breast cancer (11).
Incidence and survival analysis
To assess changes in the incidence of BL, the overall population incidence was estimated by calculating from the patient data in the SEER database. Incidence rates were expressed per 1,000,000 in the population and were age-adjusted to the 2000 US standard population (12).As improvements in diagnosis and treatment are made over time, the present study intended to assess changes in survival rates, including OS, meaning the time from diagnosis to mortality for any reason, and CSS, defined as the time from diagnosis to mortality attributed to BL, in patients with BL grouped by different diagnostic time periods (1983–1993, 1994–2004 and 2005–2015).
Construction and validation of the nomograms
To improve predictions for the construction of the nomogram plots for BL, 2,669 patients diagnosed during 2005–2015, identified from the SEER database, were randomly assigned to the training cohort (n=1,762, for the construction and internal validation of the nomograms) and validation cohort (n=907, for external validation of the nomograms). Potential prognostic factors including age, sex, race, marital status, % families below poverty, chemotherapy, primary site and stage, that were associated with OS or CSS were included in the Kaplan-Meier analyses and log-rank tests. To eliminate the interactions between various factors, a multivariate Cox proportional hazard model was used to identify independent prognostic factors and, furthermore, hazard ratios were calculated for the significant prognostic factors. To predict whether a patient will experience the event (mortality for any reason or due to BL) at 3 or 5 years, nomograms incorporate prognostic factors and present them in an intuitive way. To validate the nomogram plots internally and externally, bootstrap validation was used with 1,000-iteration resampling. The data of the training cohort were used for internal validation and validation cohort was used for external validation. Harrell's concordance index (C-index), was used to estimate the predictive accuracy of the nomogram, for the consistency of the predicted and actual values (13). The range of the C-index is 0.5 to 1; 0.5 is completely inconsistent, indicating that the model has no predictive effect, while 1 is completely consistent, indicating that the model prediction results are completely consistent with the actual results. It is generally believed that the C-index is less accurate at 0.50–0.70, has moderate accuracy between 0.71–0.90 and has high accuracy at >0.90. In addition, calibration plots were also used to validate the nomograms, comparing predicted values with actual outcomes.
Statistical analysis
OS was defined as the time from diagnosis to death from all possible causes. CSS was defined as the time from diagnosis to death attributed to BL. Patients who were alive at the time of last follow-up were counted as censored observations. Baseline clinicopathological characteristics of the patients were compared between the training and validation cohort and among the different groups of diagnostic time periods (1983–1993, 1994–2004 and 2005–2015). A Chi-square test was used for counting data (age group, sex, race, marital status, % families below poverty, chemotherapy, primary site and stage). An analysis of variance test (followed by Least Significant Difference) was used for analysis of measurement data (age, median household income and survival time). Potential prognostic factors including age, sex, race, marital status, % families below poverty, chemotherapy, primary site and stage were incorporated in the univariate analysis using Kaplan-Meier analyses and log-rank tests for OS and CSS, respectively. Significant variables were further assessed using multivariate Cox proportional hazards model hazard ratios of each variable with corresponding 95% confidence intervals (CI) were calculated regarding OS and CSS.All statistical analysis was conducted using SPSS version 23.0 (IBM Corp.). R version 3.3.5 (http://www.r-project.org/) was used to construct and validate nomograms. P<0.05 (two-tailed) was considered to indicate a statistically significant difference.
Results
Patient characteristics
Followed the inclusion and exclusion criteria, 4,600 patients diagnosed with BL between 1983 and 2015 were identified. All the patients were classified by their characteristics, including age, sex, race, marital status, Ann Arbor stage, primary site, % families below poverty, median household income, chemotherapy, radiotherapy, outcome at study cutoff and survival time (Table I).
Table I.
Demographics and clinical characteristics of patients with Burkitt lymphoma.
Characteristics
No. (%) of patients (n=4,600)
Age, years, mean ± SD
39.71±22.86
Sex
Male
3,490 (75.9)
Female
1,110 (24.1)
Ethnicity
White
3,738 (81.3)
Black
469 (10.2)
Chinese
65 (1.4)
Other Asian/Pacific islander
299 (6.5)
American Indian/Alaska native
29 (0.6)
Marital status
Single (never married)
2,552 (55.5)
Married (including common law)
1,548 (33.7)
Widowed/divorced/separated
500 (10.9)
% Families below poverty
≤5
616 (13.4)
5–10
1,911 (41.5)
10–20
1,953 (42.5)
≥20
120 (2.6)
Median household income (in tens) ± SD
6,073.37±1,360.855
Primary site
Lymph nodes
3,192 (69.4)
Digestive system
707 (15.4)
Oral cavity and pharynx
204 (4.4)
Bone marrow and nervous system
126 (2.7)
Other
371 (8.1)
Stage
I
939 (20.4)
II
728 (15.8)
III
528 (11.5)
IV
2,405 (52.3)
Year of diagnosis
1983–1993
440 (9.6)
1994–2004
1,491 (32.4)
2005–2015
2,669 (58.0)
Chemotherapy
Yes
4,090 (88.9)
No/unknown
510 (11.1)
Outcome at study cut-off
Alive
2,432 (52.9)
Dead
2,168 (47.1)
Survival time, months, mean ± SD
59.74±75.241
SD, standard deviation.
Using the X-tile program, it was possible to confirm the best cutoff values of age as a prognostic factor for BL. The X-tile program divided age into four subgroups by choosing the grouping scheme that valued the highest χ2 record. The X-tile program identified the most appropriate cutoff values of age as 20, 40 and 56 years based on OS (Fig. 1). Thus, age was divided into the following categories: ≤20, 21–40, 41–55 and ≥56 years. The age grouping profile of the patients with BL is described in Table II.
Figure 1.
Optimal cutoff values of age identified by X-tile analysis of the patients with BL. (A) The optimal cutoff points calculated and selected by X-tile software evaluating the χ2 log-rank value. The x-axis represents all potential cut-points from low to high (left to right) that define a low subset, whereas the y-axis represents cut-points from high to low (top to bottom), that define a high subset. The arrows represent the direction in which the low subset (x-axis) and the high subset (y-axis) increase in size. Red coloration of cut-points indicates an inverse correlation with survival, whereas green coloration represents direct associations. The optimal cut-point occurs at the brightest pixel (green or red), which was recognized by computer software. The optimal cutoff point occurs at the brightest pixel with red coloration, indicating an inverse correlation with survival, highlighted by the black circle. (B) Histogram based on the optimal cutoff point. (C) Kaplan-Meier analysis based on the optimal cutoff points. The optimal cutoff points for age were 20.0, 40.0 and 56.0 years based on the survival analysis (P<0.001). BL, Burkitt lymphoma.
Table II.
Age grouping profile of patients with Burkitt lymphoma.
Characteristics
No. (%) of patients (n=4,600)
Age, group, years
≤20
1,183 (25.7)
21–40
1,147 (24.9)
41–55
1,148 (25.0)
≥56
1,122 (24.4)
The incidence of BL between 1983 and 2015 was calculated by updating the SEER data to explore the recent development trends of the disease (Fig. 2), although the incidence between 1973 and 2008 has been previously reported (5). Incidence gradually increased from 1983 and reached its peak in 2009, with a rate of 0.491 per 100,000 (95% CI, 0.412–0.581). From 2009, the incidence slowly declined year by year and dropped to 0.28 per 100,000 (95% CI, 0.224–0.346).
Figure 2.
Estimated incidence of Burkitt lymphoma during 1983–2015. Incidence gradually increased from 1983 and reached its peak in 2009, with a rate of 0.491 per 100,000 (95% CI, 0.412–0.581). From 2009, the incidence slowly declined year by year and dropped to 0.280 per 100,000 (95% CI, 0.224–0.346). CI, confidence interval.
Among the different groups of diagnostic time periods (1983–1993, 1994–2004 and 2005–2015), the aforementioned characteristics of the patients significantly differed (P<0.01; Table III), revealing the differences in patient composition. The univariate survival analysis showed that an earlier diagnostic time was associated with worse outcomes, while a more recent diagnosis time was associated with better outcomes (P<0.0001; Fig. 3). The OS rates of patients diagnosed during 2005–2015 were increased, with a 3-year OS of 62.6% and a 5-year OS of 61.2%, in contrast to the 3-year OS of 35.2% and 5-year OS of 33.8% of patients diagnosed in the period 1983–1993 (Fig. 4A). Similarly, the CSS rates of patients diagnosed in 2005–2015 were 65.3% for 3-year CSS and 64.4% for 5-year CSS, while the 3-year CSS was 37.0% and the 5-year CSS was 38.0% for patients diagnosed during 1983–1993 (Fig. 4B).
Table III.
Demographics and clinical characteristics of Burkitt lymphoma patients with different diagnostic time.
No. (%) of patients
Characteristics
1983–1993
1994–2004
2005–2015
P-value[a]
Age, years, mean ± SD
32.02±20.74
39.56±23.50
41.07±22.58
<0.001
Age, group, years
<0.001
≤20
153 (34.77)
401 (26.89)
629 (23.57)
21–40
150 (34.09)
363 (24.35)
634 (23.75)
41–55
78 (17.73)
357 (23.94)
713 (26.71)
≥56
59 (13.41)
370 (24.82)
693 (25.96)
Sex
0.002
Male
364 (82.73)
1,125 (75.45)
2,001 (74.97)
Female
76 (17.27)
366 (24.55)
668 (25.03)
Race
0.003
White
386 (87.73)
1,216 (81.56)
2,136 (80.03)
Black
29 (6.59)
147 (9.86)
293 (10.98)
Chinese
2 (0.45)
22 (1.48)
41 (1.54)
Other Asian/Pacific islander
21 (4.77)
103 (6.91)
175 (6.56)
American Indian/Alaska native
2 (0.45)
3 (0.20)
24 (0.90)
Marital status
<0.001
Single (never married)
303 (68.86)
832 (55.80)
1,417 (53.09)
Married (including common law)
94 (21.36)
494 (33.13)
960 (35.97)
Widowed/divorced/separated
43 (9.77)
165 (11.06)
292 (10.94)
% Families below poverty
<0.001
≤5
79 (17.95)
284 (19.05)
253 (9.48)
5–10
261 (59.32)
565 (37.89)
1,085 (40.65)
10–20
95 (21.59)
619 (41.52)
1,239 (46.42)
≥20
5 (1.14)
23 (1.54)
92 (3.45)
Median household income (in tens) ± SD
6,485.27±1,233.39
6,071.02±1,324.73
6,006.77±1,389.06
<0.001
Primary site
0.004
Lymph nodes
323 (73.41)
1,070 (71.76)
1,799 (67.40)
Digestive system
67 (15.23)
228 (15.29)
412 (15.44)
Oral cavity and pharynx
12 (2.73)
61 (4.09)
131 (4.91)
Bone marrow and nervous system
13 (2.95)
35 (2.35)
78 (2.92)
Other
25 (5.68)
97 (6.51)
249 (9.33)
Stage
<0.001
I
92 (20.91)
330 (22.13)
517 (19.37)
II
43 (9.77)
241 (16.16)
444 (16.64)
III
34 (7.73)
170 (11.40)
324 (12.14)
IV
271 (61.59)
750 (50.30)
1,384 (51.85)
Chemotherapy
0.001
Yes
376 (85.45)
1,303 (87.39)
2,411 (90.33)
No/Unknown
64 (14.55)
188 (12.61)
258 (9.67)
Outcome at study cut-off
<0.001
Alive
126 (28.64)
638 (42.79)
1,668 (62.50)
Dead
314 (71.36)
853 (57.21)
1,001 (37.50)
Survival time, months, mean ± SD
100.96±140.57
81.80±84.29
40.62±40.28
<0.001
Baseline clinicopathological characteristics of the patients were compared among the different groups of diagnostic time periods (1983–1993, 1994–2004 and 2005–2015). Chi-square test was used for counting data (age group, sex, race, marital status, % families below poverty, chemotherapy, primary site and stage). Analysis of variance test was used for analysis of measurement data (age, median household income and survival time). SD, standard deviation.
Figure 3.
Kaplan-Meier overall survival of the independent validation set according to three groups as determined by diagnostic year. The results showed that Burkitt lymphoma patients in early diagnostic time group have poorer overall survival than those in late diagnostic time group (log-rank P<0.0001).
Figure 4.
Trends in 5-year survival rates for patients with BL from the SEER database between 1983 and 2015. Data are shown with different groups classified by different diagnostic years (1983–1993, 1994–2004 and 2005–2015). (A) The OS rates for patients with BL. (B) The CSS rates for patients with BL. OS, overall survival; CSS, cancer-specific survival; BL, Burkitt lymphoma.
Demographics and clinical characteristics of the training and validation cohorts of BLpatients were collated (Table IV). Using univariate and multivariate analysis, four variables (age, race, chemotherapy and stage) were found to be significant factors associated with OS (Table V) and CSS (Table VI). Though primary site did not produce meaningful results in the multivariate analysis, it was still included in the nomogram construction to verify its precise clinical significance. The nomograms for 3- and 5-year OS and CSS prediction were constructed by integrating the impact of five independent predictors (Figs. 5 and 6). The predicted probabilities of each covariate were mapped on a scale from 0 to 100, where 100 represented the highest effect and 0 the lowest. The total points accumulated were used to predict the probability for the patients. Each definition of the predictors has an accurate value displayed in the digital axis. The exact score of each point of the various covariates was calculated (Table VII). For the practical application of the nomograms, an example is provided: The total number of points of a 25-year old black patient diagnosed with stage II BL, with a lymph node primary site, would be 60 if treated with chemotherapy and the estimated 3-year OS rate would be 85%.
Table IV.
Demographics and clinical characteristics of training and validation cohort for BL patients.
No. of patients[b]
Characteristics
Training cohort (n=1,762)
Validation cohort (n=907)
P-value[a]
Age, years, mean ± SD
41.45±22.50
40.33±22.73
0.913
Age, group, years
0.137
≤20
398 (22.59)
231 (25.47)
21–40
426 (24.18)
208 (22.93)
41–55
461 (26.16)
252 (27.78)
≥56
477 (10.05)
216 (23.81)
Sex
0.741
Male
1,317 (74.74)
684 (75.41)
Female
445 (25.26)
223 (24.59)
Ethnicity
0.677
Black
185 (10.50)
108 (11.91)
White
1,418 (80.48)
718 (79.16)
American Indian/Alaska native
18 (1.02)
6 (0.66)
Chinese
24 (1.36)
17 (1.87)
Other Asian/Pacific islander
117 (6.64)
58 (6.39)
Marital status
0.354
Single (never married)
919 (52.16)
498 (54.91)
Married (including common law)
650 (36.89)
310 (34.18)
Widowed/divorced/separated
193 (10.95)
99 (10.92)
% Families below poverty
0.974
≤5
172 (9.76)
81 (8.93)
5–10
715 (40.58)
370 (40.79)
10–20
822 (46.65)
417 (45.98)
≥20
53 (3.01)
39 (4.30)
Median household income (in tens) ± SD
6,022.31±1,374.65
5,976.59±1,416.92
0.845
Primary site
0.812
Lymphonodus
1,201 (68.16)
598 (65.93)
Digestive system
266 (15.10)
146 (16.10)
Oral cavity and pharynx
86 (4.88)
45 (4.96)
Bone marrow and nervous system
51 (2.89)
27 (2.98)
Other
158 (8.97)
91 (10.03)
Stage
0.140
I
320 (18.16)
197 (21.72)
II
298 (16.40)
146 (16.10)
III
211 (11.98)
113 (12.46)
IV
933 (52.95)
451 (49.72)
Chemotherapy
0.356
Yes
1,585 (89.95)
826 (91.07)
No/Unknown
177 (10.05)
81 (8.93)
Outcome at study cut-off
0.933
Alive
1,100 (62.43)
568 (62.62)
Dead
662 (37.57)
339 (37.38)
Survival time, Months, mean ± SD
40.33±40.43
41.16±40.00
0.352
Baseline clinicopathological characteristics of the patients were compared between the training and validation cohort. Chi-square test was used for counting data (age group, sex, race, marital status, % families below poverty, chemotherapy, primary site and Stage). Analysis of variance test was used for analysis of measurement data (age, median household income and survival time).
2,669 patients diagnosed BL during 2005–2015 were randomly assigned to the training cohort and validation cohort by SPSS version 23.0. SD, standard deviation; BL, Burkitt lymphoma.
Table V.
Univariate analysis and multivariate analysis of overall survival in the training cohort.
Multivariate analysis
Characteristics
Univariate analysis P-value
HR (95% CI)
P-value
Age, group, years
0.000
≤20
0.106 (0.071–0.157)
<0.001<0.001<0.001
21–40
0.389 (0.308–0.492)
41–55
0.589 (0.487–0.711)
≥56
Reference
Sex
0.108
NI
Male
NI
Female
Ethnicity
0.000
White
1.432 (0.995–2.062)
0.054
Black
0.911 (0.673–1.233)
0.545
American Indian/Alaska native
2.942 (1.546–5.597)
0.001
Chinese
1.272 (0.570–2.838)
0.557
Other Asian/Pacific islander
Reference
Marital status
0.000
Single (never married)
1.027 (0.803–0.312)
0.834
Married (including common law)
0.952 (0.761–1.190)
0.665
Widowed/divorced/separated
Reference
% Families below poverty
0.001
≤5
Reference
5–10
0.989 (0.838–1.167)
0.895
10–20
1.012 (0.859–1.192)
0.886
≥20
0.956 (0.669–1.367)
0.806
Primary site
0.000
Oral cavity and pharynx
0.82 (0.563–1.197)
0.305
Lymphonodus
0.744 (0.488–1.132)
0.168
Digestive system
0.876 (0.469–1.635)
0.678
Bone marrow and nervous system
Reference
Other
0.957 (0.619–1.480)
0.845
Stage
0.000
I
0.415 (0.322–0.535)
<0.001<0.0010.004
II
0.461 (0.356–0.596)
III
0.688 (0.532–0.889)
IV
Reference
Chemotherapy
0.000
Yes
4.895 (3.982–6.017)
<0.001
No/unknown
CI, confidence interval; NI, ; HR, hazard ratio.
Table VI.
Univariate analysis and multivariate analysis of cancer-specific survival in the training cohort.
Multivariate analysis
Characteristics
Univariate analysis P-value
HR (95% CI)
P-value
Age, group, years
0.000
<0.001<0.001<0.001
≤20
0.106 (0.071–0.157)
21–40
0.389 (0.308–0.492)
41–55
0.589 (0.487–0.711)
≥56
Reference
Sex
0.182
NI
NI
Male
Female
Ethnicity
0.000
White
1.432 (0.995–2.062)
0.054
Black
0.911 (0.673–1.233)
0.545
American Indian/Alaska native
2.942 (1.546–5.597)
0.001
Chinese
1.272 (0.570–2.838)
0.557
Other Asian/Pacific islander
Reference
Marital status
0.000
Single (never married)
1.027 (0.803–1.312)
0.834
Married (including common law)
0.952 (0.761–1.190)
0.665
Widowed/divorced/separated
Reference
% Families below poverty
0.002
≤5
1.011 (0.847–1.207)
0.707
5–10
1.035 (0.869–1.233)
0.879
10–20
1.014 (0.698–1.474)
0.449
≥20
Reference
Primary site
0.005
Oral cavity and pharynx
0.821 (0.563–1.197)
0.305
Lymphonodus
0.744 (0.488–1.132)
0.168
Digestive system
0.876 (0.469–1.635)
0.678
Bone marrow and nervous system
0.957 (0.619–1.480)
0.845
Other
Reference
Stage
0.000
<0.001<0.0010.004
I
Reference
II
0.415 (0.322–0.535)
III
0.461 (0.356–0.596)
IV
0.688 (0.532–0.889)
Chemotherapy
0.000
Yes
3.898 (3.358–4.525)
<0.001
No/Unknown
CI, confidence interval; NI, ; HR, hazard ratio.
Figure 5.
Nomogram predicting the 3- and 5-year OS of patients with BL. PI, Pacific islander; AI, American Indian; AN, Alaska Native; 1, lymph nodes; 2, digestive system; 3, oral cavity and pharynx; 4, other; 5, bone marrow and nervous system; OS, overall survival.
Figure 6.
Nomogram predicting the 3- and 5-year CSS of patients with BL. PI, Pacific islander; AI, American Indian; CSS, cancer-specific survival; AN, Alaska native; 1, lymph nodes; 2, digestive system; 3, oral cavity and pharynx; 4, other; 5, bone marrow and nervous system.
Table VII.
Detailed scores of each predictor in the nomograms.
Characteristics
Overall survival nomogram
Cancer-specific survival nomogram
Age, group, years
≤20
0
0
20–40
33
33
41–56
67
65
≥56
100
98
Ethnicity
White
11
7
Black
8
5
American Indian/Alaska native
5
3
Chinese
3
2
Other Asian/Pacific islander
0
0
Primary site
Lymphonodus
0
0
Digestive system
2
1
Oral cavity and pharynx
3
2
Bone marrow and nervous system
6
3
Other
5
4
Stage
I
0
0
II
19
21
III
39
42
IV
58
62
Chemotherapy
Yes
0
0
No/unknown
90
100
The nomograms were validated both internally and externally. The C-indexes of the nomograms predicting 3- and 5-year OS in the internal and external validation were 0.777 (95% CI, 0.758–0.796) and 0.76 (95% CI, 0.735–0.786), respectively. The C-indexes of the nomograms predicting 3- and 5-year CSS in the internal and external validation were 0.777 (95% CI, 0.757–0.797) and 0.755 (95% CI, 0.728–0.782), respectively. In conclusion, the nomograms constructed in the present study were quite accurate. The internal and external calibration plots for 3- and 5-year OS and CSS showed good consistency between the nomogram predictions and the observed outcomes, in that the calculated points were close to the diagonal line (Figs. 7 and 8).
Figure 7.
Internal calibration and external calibration of the nomograms predicting 3 and 5-year OS for patients with BL. (A) Internal calibration for 3-year OS. (B) Internal calibration for 5-year OS. (C) External calibration for 3-year OS. (D) External calibration for 5-year OS. The x-axis of each plot represents the nomogram prediction and the y-axis represents the observed outcomes. Each cohort equality divided into 5 and the distance between the points was calculated; the diagonal lines on the plots indicate the consistency between predictions and observed outcomes. OS, overall survival; CSS, cancer-specific survival.
Figure 8.
Internal calibration and external calibration of the nomograms predicting 3 and 5-year CSS for patients with BL patients. (A) Internal calibration for 3-year CSS. (B) the Internal calibration for 5-year CSS. (C) External calibration for 3-year CSS. (D) External calibration for 5-year CSS. The x-axis of each plot represents the nomogram prediction and the y-axis represents the observed outcomes. Each cohort equality divided into 5 and the distance between the points was calculated; the diagonal lines on the plots indicate the consistency between predictions and observed outcomes. OS, overall survival; CSS, cancer-specific survival.
Discussion
As a highly malignant disease with a demonstrated propensity for dissemination, the incidence rates of BL had gradually increased and reached 4.91 per 1,000,000 (95% CI, 4.12–5.81) in 2009. Based on the relationship between the occurrence of BL and acquired immune deficiency syndrome (AIDS), it is possible that this incidence peak is related to the AIDS epidemic (14). The incidence has slowly declined year by year and dropped to 2.80 per 1,000,000 (95% CI, 2.24–3.46) by 2015.For different diagnostic times, the present study showed that the OS and CSS within 5 years of patients being diagnosed in 2005–2015 were increased compared with patients diagnosed in 1994–2004 and 1983–1993. This is likely to be related to improvements in the therapeutic regimens. In the 1990s, the introduction of dose-intense multi-drug combination chemotherapy and prophylaxis for central nervous system disease led to the high cure rate of BL. It has been reported that complete response and OS reached 67–95 and 54–74%, respectively (15–22). Furthermore, the addition of rituximab resulted in another prognostic improvement in BL, both in children and adults. The 5-year survival rates were reportedly >90% in children (23,24) and the 3-year survival rate was close to 80% in adults (25–27). The 5-year OS rate was improved from 60 to 70% when the Hyper-CVAD regimen with rituximab was compared with the Hyper-CVAD alone. Nevertheless, possibly due to selection bias and therapeutic regimens, the OS rate of the population was decreased.To estimate the prognosis of diverse patient groups, numerous factors, including age, race, advanced stage, poor performance status, central nervous system or bone marrow involvement, anemia, the presence of circulating blasts and elevated lactate dehydrogenase (LDH), were proven to be individual risk factors in previous studies (6,28–31). Nevertheless, no nomograms to comprehensively consider prognostic factors and predict the outcomes of patients with BL had yet been developed. Therefore, in the present study, nomograms for predicting the 3- and 5-year OS and CSS of patients with BL were constructed using data obtained from the SEER database. Nomograms for BL may allow clinicians to implement differential therapy for patients in different situations, similar to practices in other malignancy such as osteosarcoma, prostate cancer, gastric cancer, lung cancer and breast cancer. As mentioned above, the cure rate of Burkitt leukemia has increased considerably in the past three decades (1983–2015). In order to improve the predictive level of nomogram, the patient data diagnosed during 2005–2015 was selected for modeling. By analyzing the clinicopathological features with univariate and multivariate analysis, four of these, including age, race, chemotherapy and stage, were eventually regarded as independent prognostic factors for OS and CSS. In particular, race was identified as a prognostic factor and this may be related to genotypic milieu and gene specificity. Lymphoma at different primary sites has different clinical features and prognosis (32,33), even though primary sites is not significant in the Cox multivariate analysis, it was still imported into the nomogram prediction. The nomograms of OS and CSS were constructed and were used to predict prognosis with exact scores corresponding to different survival rates, which was validated both internally and externally via C-indexes and calibration plots. American Indian single patients of older age, whose primary sites were in the bone marrow or nervous system and who had disease of stage IV without chemotherapy, were identified to have the worst survival outcomes.There are still several limitations that should be considered with respect to the nomograms for BL. Firstly, it is true, as the reviewer suggests, that lab data, especially that of LDH, is an important prognostic factor in lymphoma. It is regrettable that the data obtained from SEER do not cover lab data, so the present study was unable to assess the potential prognostic factors and incorporate these into the prediction models to make the nomograms more accurate. Secondly, the nomograms predicted the outcome of future patients via SEER data collected previously and these predictions will remain constant over time. This may lead to a lack of accuracy of the nomogram as time passes, due to improvements in the detection and treatment of BL. Thirdly, the data obtained from SEER do not cover specific chemotherapy regimens, as explained by an official statement and the classification is not completely accurate in that ‘No’ and ‘Unknown’ are bracketed together. The authors of the present study may possibly further investigate the effect of different chemotherapy regimens on prognosis in a single-center study at The First Affiliated Hospital of Guangzhou Medical University (Guangzho, China).Moreover, the present study constructed and validated the nomograms using data from the same retrospective database. Another non-overlapping dataset or a new study is recommended for stricter validation of the predictive ability of the nomograms. Fourthly, even though multivariate analysis was used to control confounding variables, it is still difficult to completely avoid influences among correlating variables, such as the relationship between primary site and stage. Finally, though the present study has already begun to collect cases of diagnosed BL; it is unfortunate that the limited number of cases is insufficient to verify the accuracy of model. In a follow-up study, the authors will solve this problem by collecting more virtual data.The results of the present study demonstrated that the nomogram plots for BL may be a useful tool for clinicians to establish treatment options that vary according to the patient's situation and to identify different categories of patients for scientific research.
Authors: Deborah A Thomas; Stefan Faderl; Susan O'Brien; Carlos Bueso-Ramos; Jorge Cortes; Guillermo Garcia-Manero; Francis J Giles; Srdan Verstovsek; William G Wierda; Sherry A Pierce; Jianqin Shan; Mark Brandt; Fredrick B Hagemeister; Michael J Keating; Fernando Cabanillas; Hagop Kantarjian Journal: Cancer Date: 2006-04-01 Impact factor: 6.860
Authors: C Patte; A Auperin; J Michon; H Behrendt; G Leverger; D Frappaz; P Lutz; C Coze; Y Perel; M Raphaël; M J Terrier-Lacombe Journal: Blood Date: 2001-06-01 Impact factor: 22.113
Authors: S Smeland; A K Blystad; S O Kvaløy; I M Ikonomou; J Delabie; G Kvalheim; J Hammerstrøm; G F Lauritzsen; H Holte Journal: Ann Oncol Date: 2004-07 Impact factor: 32.976
Authors: M Diviné; E Lepage; J Brière; P Pautier; B Dupriez; P Lederlin; P Mineur; H Tilly; M Blanc; B Audhuy; R Herbrecht; B Coiffier; F Reyes Journal: J Clin Oncol Date: 1996-01 Impact factor: 44.544
Authors: G M Mead; M R Sydes; J Walewski; A Grigg; C S Hatton; N Pescosta; C Guarnaccia; M S Lewis; J McKendrick; S P Stenning; D Wright; P Norbert Journal: Ann Oncol Date: 2002-08 Impact factor: 32.976
Authors: David A Rizzieri; Jeffrey L Johnson; Donna Niedzwiecki; Edward J Lee; James W Vardiman; Bayard L Powell; Maurice Barcos; Clara D Bloomfield; Charles A Schiffer; Bruce A Peterson; George P Canellos; Richard A Larson Journal: Cancer Date: 2004-04-01 Impact factor: 6.860