| Literature DB >> 33684314 |
Laia Humbert-Vidan1,2, Vinod Patel1, Ilkay Oksuz2,3, Andrew Peter King2, Teresa Guerrero Urbano1,4.
Abstract
OBJECTIVES: Mandible osteoradionecrosis (ORN) is one of the most severe toxicities in patients with head and neck cancer (HNC) undergoing radiotherapy (RT). The existing literature focuses on the correlation of mandible ORN and clinical and dosimetric factors. This study proposes the use of machine learning (ML) methods as prediction models for mandible ORN incidence.Entities:
Mesh:
Year: 2021 PMID: 33684314 PMCID: PMC8010531 DOI: 10.1259/bjr.20200026
Source DB: PubMed Journal: Br J Radiol ISSN: 0007-1285 Impact factor: 3.039
Summary of demographic and clinical characteristics of the two patient groups (ORN and non-ORN) in the study cohort
| ORNa | non-ORN | |
|---|---|---|
| Gender (male) | 34 (71%) | 38 (79%) |
| Age (median) | 64 | 59 |
| Smoking (at RT start) | 25 (52%) | 19 (40%) |
| Smoking (previous) | 14 (29%) | 19 (40%) |
| Alcohol (at RTb start) | 33 (69%) | 33 (69%) |
| Alcohol (previous) | 5 (10%) | 4 (8%) |
| Chemotherapy | 33 (69%) | 33 (69%) |
| Dental extractions (pre-RT) | 30 (63%) | 31 (65%) |
| Dental extractions (post-RT) | 5 (10%) | 2 (4%) |
| Surgery (pre-RT) | 10 (21%) | 18 (38%) |
| Surgery (post-RT) | 1 (2%) | 3 (6%) |
| Tumour site | ||
| Oropharynx | 28 (58%) | 28 (58%) |
| Larynx | 3 (6%) | 7 (15%) |
| Oral cavity | 13 (27%) | 9 (19%) |
| Hypopharynx | 0 (0%) | 2 (4%) |
| Paranasal sinus | 1 (2%) | 1 (2%) |
| Unknown | 3 (6%) | 1 (2%) |
Osteoradionecrosis.
Radiotherapy.
Comparison of prediction performance obtained with the five models considered: multivariate LR, SVM, RF, AdaBoost and ANNa
| Model | Variables | Performance measure | |||||
|---|---|---|---|---|---|---|---|
| Training accuracy | Testing accuracy | TPRb | TNRc | PPVd | NPVe | ||
| LR | All | 0.76 | 0.73 | 0.98 | 0.47 | 0.66 | 0.97 |
| Selected | 0.77 | 0.75 | 0.90 | 0.60 | 0.71 | 0.88 | |
| SVM | All | 0.88 | 0.67 | 0.75 | 0.60 | 0.65 | 0.71 |
| Selected | 0.84 | 0.76 | 0.96 | 0.56 | 0.68 | 0.94 | |
| RF | All | 0.77 | 0.60 | 0.63 | 0.56 | 0.59 | 0.61 |
| Selected | 0.78 | 0.71 | 0.77 | 0.66 | 0.70 | 0.76 | |
| AdaBoost | All | 0.78 | 0.74 | 0.96 | 0.52 | 0.67 | 0.92 |
| Selected | 0.78 | 0.75 | 0.93 | 0.56 | 0.68 | 0.91 | |
| ANN | All | 0.71 | 0.66 | 0.75 | 0.57 | 0.66 | 0.68 |
| Selected | 0.88 | 0.77 | 0.90 | 0.64 | 0.72 | 0.90 | |
ANN, artificial neural network; AdaBoost, Adaptive Boosting; LR, logistic regression; RF, random forest; SVM, support vector machine.
Results including all variables and only the selected variables (Dmax, Dmean, extractions post-RT and surgery pre-RT) are shown.
The figures shown correspond to the average over the five stratified cross-validation folds.
True positive rate (sensitivity or recall).
True negative rate (specificity).
Positive predictive value (precision).
Negative predictive value.
Results from the McNemar’s statistical hypothesis test on all model pair combinations
| χ2 ( | LR | SVM | RF | AdaBoost |
|---|---|---|---|---|
| SVM | 4.000 (0.754) | |||
| RF | 5.000 (0.424) | 7.000 (0.263) | ||
| AdaBoost | 5.000 (1.000) | 3.000 (1.000) | 4.000 (0.267) | |
| ANN | 6.000 (0.607) | 3.000 (1.000) | 7.000 (0.189) | 6.000 (0.791) |
The first number is the McNemar’s test statistic (χ2) and the number in brackets is the corresponding p value (probability of observing this, or a larger, χ2 value). After a Bonferroni correction for multiple tests, a significance threshold of 0.005 was used. In all model pairs, the tests failed to reject the null hypothesis that the performance of the compared models is equal.
Logistic Regression (LR)
Support Vector Machine (SVM)
Random Forest (RF)
Adaptive Boosting (AdaBoost)
Artificial Neural Network (ANN)