| Literature DB >> 33458359 |
Samsara Terparia1, Romaana Mir2, Yat Tsang1,2, Catharine H Clark2,3,4, Rushil Patel2.
Abstract
BACKGROUND ANDEntities:
Keywords: Conformity index; Delineation; Interobserver variation; Machine learning; Quality assurance; SABR
Year: 2020 PMID: 33458359 PMCID: PMC7807884 DOI: 10.1016/j.phro.2020.10.008
Source DB: PubMed Journal: Phys Imaging Radiat Oncol ISSN: 2405-6316
Table of benchmarks, structures and imaging provided to clinicians to aid delineation in the pre-trial outlining benchmark exercises.
| SABR Benchmark | Spine | Pelvic Lymph Node | Adrenal Gland | Liver |
|---|---|---|---|---|
| Number of Cases | 71 | 85 | 44 | 53 |
| Imaging Modalities Provided to aid Delineation | CT, MRI (T1 and T2 weighted) | Contrast enhanced CT, MRI (T2 weighted) | Contrast enhanced CT | 4DCT, 3DCT, MRI (T2 weighted) |
| Structures Considered | GTV | GTV | Liver | GTV |
| Stomach | Liver | |||
| Stomach | ||||
| Oesophagus | ||||
| Heart |
Table showing resulting 5-fold predictive accuracies, sensitivities and specificities of trained models by structure and algorithm type.
| Structure Type (number of “Pass” contours/total number of contours) | Machine Learning Model accuracy (%) (Sensitivity %/Specificity %) | |||||
|---|---|---|---|---|---|---|
| Tree | Logistic Regression | Support Vector Machine | K- Nearest Neighbour | Ensemble | ||
| All (242/393–62%) | 77 | 72 | 80 | 78 | 78 | |
| TV (148/209–71%) | 84 | 80 | 80 | 80 | 79 | |
| All OAR (94/184–51%) | 78 | 71 | 78 | 79 | 82 | |
| TV | Liver GTV | 68 | 76 | 70 | 68 | 68 |
| Node GTV | 81 | 81 | 85 | 86 | 82 | |
| Spine GTV | 79 | 83 | 86 | 87 | 83 | |
| OAR | Liver | 78 | 78 | 84 | 85 | 81 |
| Stomach | 93 | 92 | 96 | 91 | 96 | |
| Oesophagus | 81 | 72 | 75 | 84 | 84 | |
| Heart | 71 | 88 | 94 | 88 | 82 | |
Fig. 1Bar chart demonstrating the median predictive accuracies, sensitivities and specificities obtained across all trained ML models.
Fig. 2Receiver Operator Characteristic (ROC) curves for Liver GTV Logistic Regression Model (Top) and Stomach SVM Model (Bottom). The Area Under Curve (AUC) represents the model’s overall ability to correctly classify structures into each category.
Fig. 3Scatter plots showing Liver GTV Logistic Regression Model (Top) and Stomach SVM Model (Bottom), fitted with DSC against Hausdorff distance. Blue crosses indicate where the model has predicted an investigator contour as a “fail”, but was visually scored as a “pass”. Blue dots indicate where the model has correctly predicted an investigator contour as a “fail”. Red crosses indicate where the model has predicted an investigator contour as a “pass”, but was visually scored as a “fail”. Red dots indicate where the model has correctly predicted an investigator contour as a “pass”.
Fig. 4Comparison of passed (green) and failed (red) Liver GTV (top) and Stomach contours (bottom) in transverse, coronal and sagittal views (left to right). The Gold Standard is outlined in blue.