| Literature DB >> 35416990 |
Nathan Farrokhian1, Andrew J Holcomb2, Erin Dimon1, Omar Karadaghy1, Christina Ward1, Erin Whiteford2, Claire Tolan2, Elyse K Hanly3, Marisa R Buchakjian3, Brette Harding4, Laura Dooley4, Justin Shinn5, C Burton Wood5, Sarah L Rohde5, Sobia Khaja6, Anuraag Parikh7, Mustafa G Bulbul7, Joseph Penn1, Sara Goodwin1, Andrés M Bur1.
Abstract
Importance: Given that early-stage oral cavity squamous cell carcinoma (OCSCC) has a high propensity for subclinical nodal metastasis, elective neck dissection has become standard practice for many patients with clinically negative nodes. Unfortunately, for most patients without regional metastasis, this risk-averse treatment paradigm results in unnecessary morbidity.Entities:
Mesh:
Year: 2022 PMID: 35416990 PMCID: PMC9008495 DOI: 10.1001/jamanetworkopen.2022.7226
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Demographic and Clinical Differences in Pooled Cohort Between Patients With and Without Occult Nodal Metastasis
|
| All, No. (%) (n = 634) | Occult nodes, No. (%) | ||
|---|---|---|---|---|
| Without (n = 520) | With (n = 114) | |||
| Sex | ||||
| Female | 290 (45.7) | 241 (46.3) | 49 (43.0) | .58 |
| Male | 344 (54.3) | 279 (53.7) | 65 (57.0) | |
| Age, mean (SD), y | 61.2 (13.6) | 60.9 (13.5) | 62.7 (13.7) | .19 |
| BMI, mean (SD) | 27.7 (6.3) | 28.0 (6.3) | 26.7 (6.2) | .06 |
| White race and ethnicity | 589 (92.9) | 486 (93.5) | 103 (90.4) | .22 |
| Smoking | ||||
| Never smoker | 233 (36.8) | 185 (35.6) | 48 (42.1) | .10 |
| <10 pack-years | 41 (6.5) | 30 (5.8) | 11 (9.6) | |
| ≥10 pack-years | 223 (35.2) | 190 (36.5) | 33 (28.9) | |
| LVI | 72 (11.4) | 42 (8.1) | 30 (26.3) | <.001 |
| PNI | 142 (22.4) | 96 (18.5) | 46 (40.4) | <.001 |
| Margins involved | 47 (7.4) | 33 (6.3) | 14 (12.3) | .046 |
| DOI, mean (SD), mm | 5.7 (4.3) | 5.4 (4.3) | 7.0 (4.3) | <.001 |
| Largest diameter, mean (SD), mm | 15.9 (9.4) | 15.5 (9.3) | 17.4 (9.7) | .05 |
| Grade | ||||
| I: Well differentiated | 242 (38.2) | 224 (43.1) | 18 (15.8) | <.001 |
| II: Moderately differentiated | 327 (51.6) | 256 (49.2) | 71 (62.3) | |
| III: Poorly differentiated | 55 (8.7) | 32 (6.2) | 23 (20.2) | |
| Subsite | ||||
| Tongue | 449 (70.8) | 358 (68.8) | 91 (79.8) | .49 |
| FOM | 79 (12.5) | 69 (13.3) | 10 (8.8) | |
| Gum | 29 (4.6) | 26 (5.0) | 3 (2.6) | |
| Buccal | 24 (3.8) | 20 (3.8) | 4 (3.5) | |
| RMT | 20 (3.2) | 16 (3.1) | 4 (3.5) | |
| Palate | 14 (2.2) | 13 (2.5) | 1 (0.9) | |
| Lip | 3 (0.5) | 3 (0.6) | 0 (0.0) | |
| Oral, NOS | 16 (2.5) | 15 (2.9) | 1 (0.9) | |
Abbreviations: BMI, body mass index (calculated as weight in kilograms divided by height in meters squared); DOI, depth of invasion; FOM, floor of mouth; LVI, lymphovascular invasion; NOS, not otherwise specified; PNI, perineural invasion; RMT, retromolar trigone.
Race and ethnicity data were reported in the electronic health records of the participating institutions. The race and ethnic categories included Asian, Black, Hispanic, Native American, and White. More than 90% of the patients were identified as White individuals.
Figure 1. Predictive Ability of Depth of Invasion and the Machine Learning Models for Occult Nodal Metastasis in the External Validation Set
A, Area under the receiver operating characteristic curves are shown for all 4 machine learning models (logistic regression [LR], random forest [RF], support vector machine classifier [SVC], and XGBoost) vs a tumor depth threshold. B, Precision recall curves for each of the models are shown with associated area under the curve (AUC).
Performance of Each Predictive Model on the External Validation Cohort
| Depth | LR model | SVC model | RF model | XGBoost model | |
|---|---|---|---|---|---|
| ROC AUC | 0.619 | 0.783 | 0.806 | 0.805 | 0.838 |
| NA | <.001 | <.001 | <.001 | <.001 | |
| Sensitivity, % | 62.5 | 83.3 | 83.3 | 91.7 | 91.7 |
| Specificity, % | 61.3 | 64.5 | 68.6 | 60.5 | 72.6 |
| PPV, % | 23.8 | 31.3 | 33.9 | 31.0 | 39.3 |
| NPV, % | 89.4 | 95.2 | 95.5 | 97.4 | 97.8 |
| Accuracy, % | 61.5 | 67.6 | 71.0 | 65.5 | 75.7 |
| Misclassified pN positive, % | 37.5 | 16.7 | 16.7 | 8.3 | 8.3 |
| NNS to identify pN positive, % | NA | 29.6 | 29.6 | 21.0 | 21.0 |
| Misclassified pN negative, % | 38.7 | 35.5 | 31.5 | 39.5 | 27.4 |
| NNS to avoid END | NA | 37.0 | 16.5 | NA | 10.6 |
Abbreviations: AUC, area under curve; END, elective neck dissection; LR, logistic regression; NA, not applicable; NNS, number needed to screen; NPV, negative predictive value; pN, pathological node; PPV, positive predictive value; RF, random forest; ROC, receiver operating characteristic; SVC, support vector machine classifier.
Decision thresholds were optimized using the Youden index.
Figure 2. Relative Distance of Each Patient in the External Validation Cohort From the Decision Threshold for the Developed XGBoost Model
The decision threshold was normalized to 0. Probability values were normalized as a relative distance from the largest absolute probability and the decision boundary.
Figure 3. Relative Feature Importance of the Top 10 Features for Each Model Developed
For the ensemble models, importance was determined by the number of times each feature was split. For the logistic regression and support vector machine classifier models, coefficient magnitude was used as a proxy for feature importance. Feature importance values were normalized against the highest performing feature for each model. BMI indicates body mass index (calculated as weight in kilograms divided by height in meters squared); DOI, depth of invasion; LVI, lymphovascular invasion; and PNI, perineural invasion.