| Literature DB >> 25060659 |
Arif N Ali1, Jeffrey M Switchenko, Sungjin Kim, Jeanne Kowalski, Mark W El-Deiry, Jonathan J Beitler.
Abstract
BACKGROUND: The current study was conducted to develop a multifactorial statistical model to predict the specific head and neck (H&N) tumor site origin in cases of squamous cell carcinoma confined to the cervical lymph nodes ("unknown primaries").Entities:
Keywords: End Results (SEER); Epidemiology; Surveillance; cervical lymph nodes; predictive model; radiation; unknown primary
Mesh:
Year: 2014 PMID: 25060659 PMCID: PMC4232899 DOI: 10.1002/cncr.28901
Source DB: PubMed Journal: Cancer ISSN: 0008-543X Impact factor: 6.860
SEER Data Descriptive Statistics
| Variable | Level | N = 20,011 | % |
|---|---|---|---|
| Sex | Female | 3799 | 19.0 |
| Male | 16,212 | 81.0 | |
| Primary tumor site | Oropharynx | 12,829 | 64.1 |
| Nasopharynx | 1650 | 8.2 | |
| Hypopharynx | 1854 | 9.3 | |
| Larynx | 3678 | 18.4 | |
| Race | Asian | 1307 | 6.5 |
| Black | 2458 | 12.3 | |
| Hispanic | 1289 | 6.4 | |
| White | 14,957 | 74.7 | |
| Level 1 lymph nodes | Not involved | 15,294 | 76.4 |
| Involved | 4717 | 23.6 | |
| Level 2 lymph nodes | Not involved | 5612 | 28.0 |
| Involved | 14,399 | 72.0 | |
| Level 3 lymph nodes | Not involved | 12,723 | 63.6 |
| Involved | 7288 | 36.4 | |
| Level 4 lymph nodes | Not involved | 16,587 | 82.9 |
| Involved | 3424 | 17.1 | |
| Level 5 lymph nodes | Not involved | 17,210 | 86.0 |
| Involved | 2801 | 14.0 | |
| Retropharyngeal lymph nodes | Not involved | 19,456 | 97.2 |
| Involved | 555 | 2.8 | |
| Age, y | Mean | 59.37 | — |
| Median | 59 | — | |
| Minimum | 1 | — | |
| Maximum | 100 | — | |
| SD | 11.34 | — |
Abbreviation: SD, standard deviation; SEER, Surveillance, Epidemiology, and End Results.
Relationship of Head and Neck Primary Tumor Site With SEER Variables
| Covariate | Statistics | Level | Oropharynx N=12,829 | Nasopharynx N=1650 | Hypopharynx N=1854 | Larynx N=3678 | Parametric |
|---|---|---|---|---|---|---|---|
| Sex | No. (row %) | Female | 2108 (55.49) | 460 (12.11) | 332 (8.74) | 899 (23.66) | |
| No. (row %) | Male | 10,721 (66.13) | 1190 (7.34) | 1522 (9.39) | 2779 (17.14) | ||
| Race | No. (row %) | Asian | 382 (29.23) | 682 (52.18) | 116 (8.88) | 127 (9.72) | |
| No. (row %) | Black | 1240 (50.45) | 210 (8.54) | 319 (12.98) | 689 (28.03) | ||
| No. (row %) | Hispanic | 758 (58.81) | 141 (10.94) | 129 (10.01) | 261 (20.25) | ||
| No. (row %) | White | 10,449 (69.86) | 617 (4.13) | 1290 (8.62) | 2601 (17.39) | ||
| Level 1 lymph nodes | No. (row %) | Not involved | 9680 (63.29) | 1240 (8.11) | 1471 (9.62) | 2903 (18.98) | |
| No. (row %) | Involved | 3149 (66.76) | 410 (8.69) | 383 (8.12) | 775 (16.43) | ||
| Level 2 lymph nodes | No. (row %) | Not involved | 3230 (57.56) | 510 (9.09) | 650 (11.58) | 1222 (21.77) | |
| No. (row %) | Involved | 9599 (66.66) | 1140 (7.92) | 1204 (8.36) | 2456 (17.06) | ||
| Level 3 lymph nodes | No. (row %) | Not involved | 8534 (67.08) | 1107 (8.7) | 1021 (8.02) | 2061 (16.2) | |
| No. (row %) | Involved | 4295 (58.93) | 543 (7.45) | 833 (11.43) | 1617 (22.19) | ||
| Level 4 lymph nodes | No. (row %) | Not involved | 10,934 (65.92) | 1309 (7.89) | 1436 (8.66) | 2908 (17.53) | |
| No. (row %) | Involved | 1895 (55.34) | 341 (9.96) | 418 (12.21) | 770 (22.49) | ||
| Level 5 lymph nodes | No. (row %) | Not involved | 11,396 (66.22) | 1112 (6.46) | 1539 (8.94) | 3163 (18.38) | |
| No. (row %) | Involved | 1433 (51.16) | 538 (19.21) | 315 (11.25) | 515 (18.39) | ||
| Retropharyngeal lymph nodes | No. (row %) | Not involved | 12,586 (64.69) | 1475 (7.58) | 1784 (9.17) | 3611 (18.56) | |
| No. (row %) | Involved | 243 (43.78) | 175 (31.53) | 70 (12.61) | 67 (12.07) | ||
| Age, y | No. | 12,829 | 1650 | 1854 | 3678 | ||
| Mean | 59.1 | 52.23 | 63.29 | 61.52 | |||
| Median | 58 | 53 | 62 | 61 |
Abbreviation: SEER, Surveillance, Epidemiology, and End Results.
The parametric P value was calculated using the analysis of variance for numerical covariates and the chi-square test for categorical covariates.
Bold values indicate statistical significance less than 0.05.
Multivariate Analysis of SEER Variables
| Oropharynx | Nasopharynx | Hypopharynx | Larynx | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Covariate | Level | OR (95% CI) | OR (95% CI) | OR (95% CI) | OR (95% CI) | ||||
| Sex | Male | 1.54 (1.39-1.71) | 0.51 (0.42-0.61) | 1.15 (0.96-1.38) | .126 | 0.72 (0.63-0.81) | |||
| Female | — | — | — | — | — | — | — | — | |
| Race | Asian | 0.16 (0.13-0.19) | 27.6 (22.4-33.9) | 1.00 (0.74-1.34) | .996 | 0.48 (0.37-0.64) | |||
| Black | 0.44 (0.38-0.49) | 2.09 (1.66-2.64) | 1.60 (1.32-1.93) | 1.86 (1.61-2.15) | |||||
| Hispanic | 0.61 (0.52-0.72) | 2.74 (2.08-3.61) | 1.29 (0.99-1.68) | .056 | 1.15 (0.94-1.41) | .175 | |||
| White | — | — | — | — | — | — | — | — | |
| Age | 0.99 (0.99-0.99) | 0.95 (0.94-0.96) | 1.04 (1.03-1.04) | 1.02 (1.02-1.03) | |||||
| Level 1 lymph nodes | Involved | 1.34 (1.20-1.49) | 0.96 (0.79-1.17) | .687 | 0.79 (0.66-0.94) | 0.77 (0.68-0.88) | |||
| Level 2 lymph nodes | Involved | 1.46 (1.32-1.61) | 0.97 (0.80-1.18) | .771 | 0.71 (0.61-0.83) | 0.74 (0.66-0.83) | |||
| Level 3 lymph nodes | Involved | 0.77 (0.70-0.84) | 0.71 (0.59-0.86) | 1.37 (1.19-1.59) | 1.38 (1.24-1.54) | ||||
| Level 4 lymph nodes | Involved | 0.79 (0.70-0.89) | 1.07 (0.86-1.33) | .567 | 1.22 (1.03-1.45) | 1.24 (1.08-1.42) | |||
| Level 5 lymph nodes | Involved | 0.71 (0.63-0.80) | 2.42 (1.98-2.96) | 1.22 (1.01-1.47) | 0.92 (0.79-1.07) | .281 | |||
| Retropharyngeal lymph nodes | Involved | 0.59 (0.46-0.76) | 3.46 (2.50-4.81) | 1.20 (0.81-1.78) | .352 | 0.60 (0.41-0.87) | |||
Abbreviations: 95% CI, 95% confidence interval; OR, odds ratio; SEER, Surveillance, Epidemiology, and End Results.
Bold values indicate statistical significance less than 0.05.
Figure 1Predictive nomograms are shown for (A) nasopharynx primary site, (B) oropharynx primary site, (C) hypopharynx primary site, and (D) larynx primary site. Prob indicates probability.
Figure 2Validation plots for assessing the predictive ability of each primary site model through split-sample internal validation are shown.
RMSE Estimates for Validation and Training Sets Using Validation Model Estimates
| Site | Validation RMSE | Training RMSE | Absolute Difference | % Difference |
|---|---|---|---|---|
| Nasopharynx | 0.235 | 0.233 | 0.002 | 0.9% |
| Oropharynx | 0.457 | 0.459 | 0.002 | 0.4% |
| Hypopharynx | 0.285 | 0.287 | 0.002 | 0.7% |
| Larynx | 0.378 | 0.383 | 0.005 | 1.3% |
Abbreviation: RMSE, root mean square error.