| Literature DB >> 32346085 |
Dongchul Cha1, Seung Ho Shin1, Sung Huhn Kim1, Jae Young Choi1, In Seok Moon2.
Abstract
In vestibular schwannoma patients with functional hearing status, surgical resection while preserving the hearing is feasible. Hearing levels, tumor size, and location of the tumor have been known to be candidates of predictors. We used a machine learning approach to predict hearing outcomes in vestibular schwannoma patients who underwent hearing preservation surgery: middle cranial fossa, or retrosigmoid approach. After reviewing the medical records of 52 patients with a pathologically confirmed vestibular schwannoma, we included 50 patient's records in the study. Hearing preservation was regarded as positive if the postoperative hearing was within serviceable hearing (50/50 rule). The categorical variable included the surgical approach, and the continuous variable covered audiometric and vestibular function tests, and the largest diameter of the tumor. Four different algorithms were lined up for comparison of accuracy: support vector machine(SVM), gradient boosting machine(GBM), deep neural network(DNN), and diffuse random forest(DRF). The average accuracy of predicting hearing preservation ranged from 62% (SVM) to 90% (DNN). The current study is the first to incorporate machine learning methodology into a prediction of successful hearing preservation surgery. Although a larger population may be needed for better generalization, this study could aid the surgeon's decision to perform a hearing preservation approach for vestibular schwannoma surgery.Entities:
Mesh:
Year: 2020 PMID: 32346085 PMCID: PMC7188896 DOI: 10.1038/s41598-020-64175-1
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Clinical characteristics of patients (N = 50). Preservation of hearing is classified as positive if PTA < 50 dB HL and WRS > 50% (50/50 rule).
| Characteristics | Number of patients (N = 50) | |
|---|---|---|
| Age, mean ± SD | 47.42 ± 11.46 | |
| Sex, male:female | 19:31 | |
| Side, left:right | 29:21 | |
| Approach, MCFA:RSA | 27:23 | |
| Preservation of hearing | 28(56%) | |
| PTA, mean ± SD (dB HL) | 26.61 ± 15.64 | 62.53 ± 41.71 |
| SRT, mean ± SD (dB HL) | 31.00 ± 20.05 | 71.04 ± 44.69 |
| WRS, mean ± SD (%) | 81.00 ± 22.90 | 50.08 ± 42.33 |
| MCL, mean ± SD (dB HL) | 63.50 ± 11.44 | 84.46 ± 26.09 |
| I-V ABR Latency(ms) | 5.62 ± 2.45 | |
| VEMP asymmetry, mean ± SD(%) | 27.45 ± 45.50 | |
| Caloric CP, mean ± SD (%) | 29.92 ± 31.64 | |
| Tumor size, mean ± SD(mm) | 13.11 ± 6.23 |
PTA: Pure-tone average, SRT: Speech reception threshold, WRS: word recognition score MCL: most comfortable level, SD: standard deviation, dB HL: decibel hearing level.
mm: millimeter, ms: milliseconds.
Results of four machine learning models.
| Model | Accuracy | Sensitivity | Specificity | PPV | NPV | F1 score |
|---|---|---|---|---|---|---|
| DNN | 0.90 | 0.93 | 0.86 | 0.90 | 0.90 | 0.91 |
| GBM | 0.88 | 0.86 | 0.91 | 0.92 | 0.83 | 0.89 |
| DRF | 0.86 | 0.89 | 0.82 | 0.86 | 0.86 | 0.88 |
| SVM | 0.62 | 0.92 | 0.23 | 0.60 | 0.71 | 0.73 |
Upper section: Confusion matrix of DNN (left) and SVM (right) model.
Predicted Y and Actual Y indicates true positive (machine predicted as hearing preservable, postoperative hearing were actually preserved).
Lower section: Detailed results of four models, sorted by accuracy and F1 score.
PPV: positive predictive value, NPV: negative predictive value.
Top five feature importance among the three models.
| Rank | DNN | GBM | DRF |
|---|---|---|---|
| 1st | WRS | WRS | PTA(3 K) |
| 2nd | VEMP | PTA(3 K) | PTA(8 K) |
| 3rd | Tumor Size | SRT | MCL |
| 4th | Caloric-CP | PTA(8 K) | WRS |
| 5th | I-V Interval | I-V Interval | SRT |
Detailed characteristics of three learning models.
| Model | Characteristics | |||
|---|---|---|---|---|
| Minimum | Maximum | Mean | ||
| GBM Number of trees: 63 | Depth | 5 | 6 | 5.96 |
| Leaves | 9 | 25 | 17.68 | |
| DRF Number of trees: 45 | Depth | 3 | 6 | 4.49 |
| Leaves | 4 | 11 | 8.13 | |
| DNN | Structure | Input: 23, Hidden: 50 neurons (first layer), 20 neurons (second layer), Output: 2, In-place ReLU and Batch Normalization each after hidden layer | ||
| Learning strategy | Discriminative learning rates (Initial learning rate of 0.003 for 15 epochs, followed by rate of 0.00001 for 25 epochs) Batch size = 25, Mixed precision training of FP16 and FP32 | |||
| All models | Continuous variables | Pure-tone (250, 500, 1 K, 2 K, 3 K, 4 K, 8 K Hertz, decibels), SRT(decibels), WRS(percent), MCL(decibels), ABR I-V interval (milliseconds), Tumor size (largest diameters in millimeters) VEMP-asymmetry (percent),Caloric-CP (percent) | ||
| Categorical variable | Approach (RSA or MCFA) | |||
| Training, Validation | Training set: 80%, Validation set: 20% Five-fold cross-validation for assessment of accuracy | |||
GBM: gradient boosting machine, DNN: Deep neural network, DRF: diffuse random forest.
FP16: Half precision floating-point, FP32: Single-precision floating-point format.
ReLU: Rectified Linear Unit.
*The feature values were not scaled and used in as is.