| Literature DB >> 34632117 |
Pawel Siciarz1,2, Salem Alfaifi3, Eric Van Uytven3, Shrinivas Rathod3,4, Rashmi Koul4,5, Boyd McCurdy4,6,2.
Abstract
PURPOSE: To create and investigate a novel, clinical decision-support system using machine learning (ML). METHODS AND MATERIALS: The ML model was developed based on 79 radiotherapy plans of brain tumor patients that were prescribed a total dose of 60 Gy delivered with volumetric-modulated arc therapy (VMAT). Structures considered for analysis included planning target volume (PTV), brainstem, cochleae, and optic chiasm. The model aimed to classify the target variable that included class-0 corresponding to plans for which the PTV treatment planning objective was met and class-1 that was associated with plans for which the PTV objective was not met due to the priority trade-off to meet one or more organs-at-risk constraints. Several models were evaluated using double-nested cross-validation and an area-under-the-curve (AUC) metric, with the highest performing one selected for further investigation. The model predictions were explained with Shapely additive explanation (SHAP) interaction values.Entities:
Year: 2021 PMID: 34632117 PMCID: PMC8487981 DOI: 10.1016/j.ctro.2021.09.001
Source DB: PubMed Journal: Clin Transl Radiat Oncol ISSN: 2405-6308
Fig. 1a) Percentage deviations from treatment planning objectives ΔD for all 79 patients and associated structures, where the red dashed line indicates the boundary between positive and negative ΔD; b) percentage deviations from treatment planning objectives for plans which did not meet a specified objective. The percentage values of ΔD were used as dosimetric features for training the machine learning model. correspond to the mean deviation for n plans. c) Cumulative distribution of the 5th percentile of Hausdorff distances between the PTV and organs-at-risk. d) The absolute PTV volume for each plan. The size and the colors of the markers are proportionate and correspond to the PTV volume measure. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Confusion matrices a) and Logistic Regression model evaluation metrics b) for fourth cross-validation fold. The meaning behind each metric was briefly summarized in Appendix 2 (Table A2); c) Receiver Operating Characteristic for five cross-validation folds created based on the testing data and the performance of the Logistic Regression model. The dashed diagonal line (‘Chance’ in the legend) represents the random assignment of classes.
Fig. 3a) Feature importance represented by the impact of directional SHAP values and particular feature values on the output of the model; b) The average feature contribution to the model output measured by mean absolute SHAP values.
Fig. 4Partial dependency charts for a) geometric features and b) dosimetric features. The color bars associated with each chart indicate the feature with which the evaluated feature (on the x-axis) has the strongest interaction. Specifically, the interaction indicates the influence those two features have on the model prediction. For example, if we consider a feature on the x-axis, then another feature on the color bar will be automatically selected in order to maximize the mutual impact of those two features on the model prediction. Partial dependency charts for all cross-validation folds are included in Appendix 3.
Fig. 5Local explanations of the logistic regression model in the form of features importance and, generated by the model, binary class probabilities. For illustrative purposes, two randomly selected samples belonging to each class (including one incorrectly classified sample) were selected. The threshold for the class assignment was determined by the probability of 50%.