| Literature DB >> 30947300 |
Zamir G Merali1, Christopher D Witiw1, Jetan H Badhiwala1, Jefferson R Wilson1,2, Michael G Fehlings1,3.
Abstract
Degenerative cervical myelopathy (DCM) is a spinal cord condition that results in progressive non-traumatic compression of the cervical spinal cord. Spine surgeons must consider a large quantity of information relating to disease presentation, imaging features, and patient characteristics to determine if a patient will benefit from surgery for DCM. We applied a supervised machine learning approach to develop a classification model to predict individual patient outcome after surgery for DCM. Patients undergoing surgery for DCM as a part of the AOSpine CSM-NA or CSM-I prospective, multi-centre studies were included in the analysis. Out of 757 patients 605, 583, and 539 patients had complete follow-up information at 6, 12, and 24 months respectively and were included in the analysis. The primary outcome was improvement in the SF-6D quality of life indicator score by the minimum clinically important difference (MCID). The secondary outcome was improvement in the modified Japanese Orthopedic Association (mJOA) score by the MCID. Predictor variables reflected information about pre-operative disease severity, disease presentation, patient demographics, and comorbidities. A machine learning approach of feature engineering, data pre-processing, and model optimization was used to create the most accurate predictive model of outcome after surgery for DCM. Following data pre-processing 48, 108, and 101 features were chosen for model training at 6, 12, and 24 months respectively. The best performing predictive model used a random forest structure and had an average area under the curve (AUC) of 0.70, classification accuracy of 77%, and sensitivity of 78% when evaluated on a testing cohort that was not used for model training. Worse pre-operative disease severity, longer duration of DCM symptoms, older age, higher body weight, and current smoking status were associated with worse surgical outcomes. We developed a model that predicted positive surgical outcome for DCM with good accuracy at the individual patient level on an independent testing cohort. Our analysis demonstrates the applicability of machine-learning to predictive modeling in spine surgery.Entities:
Mesh:
Year: 2019 PMID: 30947300 PMCID: PMC6448910 DOI: 10.1371/journal.pone.0215133
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Baseline characteristics of combined training, validation, and testing dataset.
| n = 605 | |
|---|---|
| 56 (48,64) | |
| 62.7% | |
| 26.8% | |
| 3.7% | |
| 6.7% | |
| 0.9% | |
| 2.2% | |
| 38.6% | |
| 1.5% | |
| 9.1% | |
| 2.2% | |
| 12.4% | |
| 0% | |
| 13.3% | |
| 11.0% | |
| 4.5% | |
| 2.0% | |
| 2.1% | |
| 71.7% | |
| 76.9% | |
| 21.0% | |
| 24.4% | |
| 5.7% | |
| 88.8% | |
| 74.1% | |
| 75.2% | |
| 56.5% | |
| 26.6% | |
| 82.3% | |
| 62.4% | |
| 35.8% | |
| 77.4% | |
| 62.0% | |
| 35.3% | |
| 46.6% | |
| 58.4% |
Comparison of model performance when predicting improvement in SF6D score.
| AUC | Accuracy | |||||
|---|---|---|---|---|---|---|
| 6 months | 12 months | 24 months | 6 months | 12 months | 24 months | |
| Random Forest | 0.64 | 0.68 | 0.7 | 0.70 | 0.71 | 0.69 |
| Support Vector Machine | 0.65 | 0.62 | 0.7 | 0.64 | 0.67 | 0.68 |
| Logistic Regression | 0.58 | 0.63 | 0.67 | 0.62 | 0.60 | 0.65 |
| Decision Tree | 0.65 | 0.63 | 0.67 | 0.64 | 0.49 | 0.65 |
| Artificial Neural Network | 0.59 | 0.52 | 0.53 | 0.56 | 0.52 | 0.51 |
Fig 1Results of the recursive feature elimination algorithm applied to 6-month follow-up (A), 12-month follow-up (B), and 24-month follow-up (C). The figures demonstrate the change in root mean squared error (RMSE) as features were iteratively added to the model. As greater number of features were added to the model the RMSE decreased to a minimum value, demonstrating best model fit, then began to increase as greater numbers of ‘distracting’ features were added. The set of features that achieved the minimum RMSE were used for model training (shown by vertical black line).
Fig 2Receiver operating characteristic curves for the random forest model at all follow-up points on the training/validation dataset.
The blue lines represent each cross validation fold.
Predictive performance of the random forest model on the testing dataset.
| Samples | Features | Accuracy | Sensitivity | Specificity | PPV | NPV | AUC | |
|---|---|---|---|---|---|---|---|---|
| SF-6D | ||||||||
| 6 months | 181 | 41 | 71.8% | 0.75 | 0.50 | 0.90 | 0.25 | 0.71 |
| 12 months | 181 | 108 | 77.0% | 0.78 | 0.63 | 0.98 | 0.12 | 0.70 |
| 24 months | 181 | 101 | 70.8% | 0.74 | 0.47 | 0.92 | 0.17 | 0.73 |
| mJOA | ||||||||
| 6 months | 195 | 41 | 66.7% | 0.70 | 0.59 | 0.82 | 0.43 | 0.73 |
| 12 months | 188 | 108 | 71.3% | 0.72 | 0.69 | 0.91 | 0.36 | 0.73 |
| 24 months | 168 | 101 | 64.9% | 0.63 | 0.80 | 0.96 | 0.23 | 0.67 |
Confusion matrix showing the random forest model predictions for the independent testing dataset at 6, 12, and 24 months.
| 6 months | |||
| Prediction | |||
| Not Improved | 13 | 13 | 26 |
| Improved | 38 | 117 | 156 |
| Totals | 51 | 130 | |
| 12 months | |||
| Prediction | |||
| Not Improved | 5 | 3 | 8 |
| Improved | 37 | 129 | 166 |
| Totals | 42 | 132 | |
| 24 months | |||
| Prediction | |||
| Not Improved | 8 | 9 | 17 |
| Improved | 38 | 106 | 144 |
| Totals | 46 | 115 |
Fig 3Density plots for the top 6 most important predictive features selected by the random forest model.
These density plots demonstrate the distribution of the key features between the patients who did (blue) and did not (red) show improvement in SF6D at 1-year follow-up. In all key features there is overlap of the curves, demonstrating that there is no one singe feature that can alone predict if a patient with DCM will improve with surgery.