| Literature DB >> 30232350 |
Benjamin H Kann1, Sanjay Aneja2, Gokoulakrichenane V Loganadane2, Jacqueline R Kelly2, Stephen M Smith3, Roy H Decker2, James B Yu2, Henry S Park2, Wendell G Yarbrough4, Ajay Malhotra5, Barbara A Burtness6, Zain A Husain2.
Abstract
Identification of nodal metastasis and tumor extranodal extension (ENE) is crucial for head and neck cancer management, but currently only can be diagnosed via postoperative pathology. Pretreatment, radiographic identification of ENE, in particular, has proven extremely difficult for clinicians, but would be greatly influential in guiding patient management. Here, we show that a deep learning convolutional neural network can be trained to identify nodal metastasis and ENE with excellent performance that surpasses what human clinicians have historically achieved. We trained a 3-dimensional convolutional neural network using a dataset of 2,875 CT-segmented lymph node samples with correlating pathology labels, cross-validated and fine-tuned on 124 samples, and conducted testing on a blinded test set of 131 samples. On the blinded test set, the model predicted ENE and nodal metastasis each with area under the receiver operating characteristic curve (AUC) of 0.91 (95%CI: 0.85-0.97). The model has the potential for use as a clinical decision-making tool to help guide head and neck cancer patient management.Entities:
Mesh:
Year: 2018 PMID: 30232350 PMCID: PMC6145900 DOI: 10.1038/s41598-018-32441-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Study Patient and Lymph Node Characteristics.
| Patient Cohort | Patients (N = 270) | Lymph Nodes (N = 653) |
|---|---|---|
| Primary Cancer Site | n (%) | n (%) |
| Oropharynx | 72 (26.7) | 178 (27.3) |
| Oral Cavity | 106 (39.3) | 251 (38.4) |
| Larynx/Hypopharynx/Nasopharynx | 48 (17.8) | 126 (19.3) |
| Salivary Gland | 18 (6.7) | 36 (5.5) |
| Unknown/Other | 26 (9.6) | 62 (9.5) |
| Clinical T-stage | ||
| T0 | 5 (1.9) | 17 (2.6) |
| T1 | 36 (13.3) | 91 (13.9) |
| T2 | 72 (26.7) | 172 (26.3) |
| T3 | 37 (13.7) | 94 (14.4) |
| T4 | 44 (16.3) | 107 (16.4) |
| Unknown | 76 (28.2) | 172 (26.3) |
| Clinical N-stage | ||
| N0 | 83 (30.7) | 185 (28.3) |
| N1 | 38 (14.1) | 82 (12.6) |
| N2 | 76 (28.2) | 209 (32.0) |
| N3 | 9 (3.3) | 33 (5.1) |
| Unknown | 64 (23.7) | 144 (22.0) |
| HPV/p16 Status* | ||
| Negative | 188 (69.6) | 454 (69.5) |
| Positive | 76 (28.2) | 185 (28.3) |
| Unknown | 6 (2.2) | 14 (2.2) |
| Lymph Node Pathology | ||
| Negative | 380 (58.2) | |
| Nodal Metastasis, ENE(−) | 153 (23.4) | |
| Node Metastasis, ENE(+) | 120 (18.4) | |
*Patients with non-oropharyngeal carcinoma who did not undergo HPV or p16 testing were coded as negative, given the very low incidence of HPV/p16 positive tumors in these disease sites. Abbreviations: ENE = extranodal extension.
Figure 1(A,B) Lymph Node Region of Interest Preprocessing. (A) 2D representation of 3D lymph node segmentation preprocessing resulting in a dimension-preserving input (1) and a size-invariant, “zoomed-in” input (2). (B) Representation of actual 3D input arrays for dual-input deep learning neural network.
Figure 2Deep Learning Neural Network (DLNN) Model Architecture. Dimension-preserving input “BoxNet” DLNN (A) is merged with size-invariant “SmallNet” DLNN (B) to form the dual-input, “DualNet” DLNN (C) with model output. The model has the capability to merge HPV/p16-status (D). Note: dropout, batch normalization, and activation layers not shown.
DLNN Performance Metric Comparisons for Model Selection.
| DLNN Version | Training Set (n = 2875) | Validation Set (n = 124) | Test Set (n = 98*) | |||
|---|---|---|---|---|---|---|
| AUC | Loss | AUC | Loss | AUC | Loss | |
| BoxNet | 0.95 | 0.443 | 0.87 | 0.522 | 0.90 | 0.565 |
| SmallNet | 0.95 | 0.406 | 0.86 | 0.467 | 0.88 | 0.573 |
| DualNet | 0.99 | 0.182 | 0.87 | 0.403 | 0.91 | 0.483 |
| BoxNet + HPV status | 0.97 | 0.407 | 0.83 | 0.521 | 0.88 | 0.554 |
| SmallNet + HPV status | 0.96 | 0.431 | 0.89 | 0.450 | 0.88 | 0.519 |
| DualNet + HPV status | 0.99 | 0.165 | 0.90 | 0.417 | 0.87 | 0.547 |
Loss represents overall cross-entropy loss of the multilabel prediction for extranodal extension (ENE) and nodal metastasis. AUC calculated from ENE classification predictions. *Test set for ENE includes lymph nodes with region of interest diameters ≥1 cm. Abbreviations: AUC = area under the curve.
Figure 3(A,B) Test Set ROC Curve Comparisons for Extranodal Extension (ENE) and Nodal Metastasis Prediction for Deep Learning Neural Network (DLNN), Radiomic Feature Random Forest, and Benchmark Logistic Model.
Model Performance and Benchmark Comparisons on Independent Test Set By Lymph Node Feature.
| Performance Metric | Extranodal Extension (ENE) | Nodal Metastasis (NM) | ||||
|---|---|---|---|---|---|---|
| ENE Test Set (n = 98*) | Test Set (n = 131) | |||||
| DualNet DLNN | Random Forest | Benchmark Logistic | DualNet DLNN | Random Forest | Benchmark Logistic | |
| AUC | 0.91 | 0.88 | 0.81 | 0.91 | 0.91 | 0.86 |
| Accuracy | 85.7% | 82.6% | 77.7% | 85.5% | 84.7% | 76.1% |
| Sensitivity | 0.88 | 0.79 | 0.72 | 0.84 | 0.75 | 0.79 |
| Specificity | 0.85 | 0.84 | 0.80 | 0.87 | 0.92 | 0.74 |
| PPV | 0.66 | 0.61 | 0.54 | 0.88 | 0.87 | 0.69 |
| NPV | 0.95 | 0.93 | 0.89 | 0.82 | 0.83 | 0.83 |
| Youden Index | 0.73 | 0.63 | 0.51 | 0.71 | 0.67 | 0.53 |
*Test set for ENE includes lymph nodes with region of interest diameters ≥ 1 cm. Abbreviations: AUC = area under the curve; PPV = positive predictive value; NPV = negative predictive value. Youden index = Sensitivity + Specificity − 1.