| Literature DB >> 35600347 |
James Thomas Patrick Decourcy Hallinan1,2, Lei Zhu3, Wenqiao Zhang4, Desmond Shi Wei Lim1,2, Sangeetha Baskar1, Xi Zhen Low1,2, Kuan Yuen Yeong5, Ee Chin Teo1, Nesaretnam Barr Kumarakulasinghe6, Qai Ven Yap7, Yiong Huak Chan7, Shuxun Lin8, Jiong Hao Tan9, Naresh Kumar9, Balamurugan A Vellayappan10, Beng Chin Ooi4, Swee Tian Quek1,2, Andrew Makmur1,2.
Abstract
Background: Metastatic epidural spinal cord compression (MESCC) is a devastating complication of advanced cancer. A deep learning (DL) model for automated MESCC classification on MRI could aid earlier diagnosis and referral. Purpose: To develop a DL model for automated classification of MESCC on MRI. Materials andEntities:
Keywords: Bilsky classification; MRI; deep learning model; epidural spinal cord compression; metastatic epidural spinal cord compression; spinal metastasis classification; spinal metastatic disease
Year: 2022 PMID: 35600347 PMCID: PMC9114468 DOI: 10.3389/fonc.2022.849447
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Bilsky classification of metastatic epidural spinal cord compression on MRI of the thoracic spine. Axial T2-weighted (repetition time ms/echo time ms, 5,300/100) images were used. Training of the deep learning model was performed by a radiologist by placing a bounding box around the region of interest at each T2-weighted image. A bounding box example is included for a low-grade Bilsky 1b lesion (1b). CSF, cerebrospinal fluid.
Patient demographics and clinical characteristics for the internal and external test sets.
| Characteristics | Internal training set (n = 129) | Internal test set (n = 35) | External test set (n = 32) |
|---|---|---|---|
| Age (years)* | 61 ± 13 (18–93) | 61 ± 12 (39–87) | 60 ± 13 (19–85) |
| Women | 55 (42.6) | 18 (51.4) | 12 (37.5) |
| Men | 74 (57.4) | 17 (48.6) | 20 (62.5) |
| Ethnicity | |||
| Chinese | 93 (72.1) | 28 (80) | 23 (71.9) |
| Malay | 21 (16.3) | 3 (8.6) | 7 (21.9) |
| Indian | 7 (5.4) | 2 (5.7) | 0 (0) |
| Others | 8 (6.2) | 2 (5.7) | 2 (6.2) |
| Cancer subtype | |||
| Breast | 23 (17.8) | 8 (22.9) | 3 (9.4) |
| Lung | 21 (16.3) | 11 (31.4) | 13 (40.6) |
| Prostate | 19 (14.7) | 5 (14.3) | 4 (12.5) |
| Colon | 15 (11.6) | 3 (8.6) | 3 (9.4) |
| Renal cell carcinoma | 10 (7.8) | 2 (5.7) | 1 (3.1) |
| Nasopharyngeal carcinoma | 9 (7) | 3 (8.6) | 1 (3.1) |
| Others | 32 (24.8) | 3 (8.6) | 7 (21.9) |
| No. of MRI thoracic spines | 177/215 (82.3) | 38/215 (17.6) | 32 |
| MESCC location | |||
| Diffuse thoracic# | 30 (23.3) | 8 (22.9) | 3 (9.4) |
| C7–T2 | 13 (10.1) | 3 (8.6) | 6 (18.8) |
| T3–T10 | 55 (42.6) | 18 (51.4) | 15 (46.9) |
| T11–L3 | 31 (24.0) | 6 (17.1) | 8 (25) |
MESCC, malignant epidural spinal cord compression.
*Values are mean ± SD (range) for numerical variables and n (%) for categorical variables.
#Two or more sites of thoracic epidural disease.
Figure 2Flow chart of the study design for the internal training/validation and test sets. The deep learning model performance was compared with an expert musculoskeletal radiologist (reference standard) and three specialist readers.
Reference standards for the internal (training and test) and external (test) sets showing the number of Bilsky MESCC grades.
| Bilsky MESCC grade | Internal training/validation set | Internal test set | External test set |
|---|---|---|---|
| 0 | 4,508 (76.9) | 849 (79.6) | 454 (60.2) |
| 1a | 424 (7.2) | 82 (7.7) | 48 (6.4) |
| 1b | 469 (8.0) | 51 (4.8) | 83 (11) |
| 1c | 216 (3.7) | 35 (3.3) | 51 (6.7) |
| 2 | 105 (1.8) | 26 (2.4) | 39 (5.2) |
| 3 | 141 (2.4) | 23 (2.2) | 79 (10.5) |
|
| 5,863 | 1,066 | 754 |
Values are n (%). A region of interest (bounding box) for Bilsky grade was drawn at each axial T2-weighted image.
MESCC, malignant epidural spinal cord compression.
Internal and external test set classifications using dichotomous Bilsky gradings (low versus high grade) on MRI.
| Reader | Internal test set | External test set | ||
|---|---|---|---|---|
| Kappa (95% CI) | p-Value | Kappa (95% CI) | p-Value | |
| DL model | 0.92 (0.91–0.94) | <0.001 | 0.94 (0.92–0.96) | <0.001 |
| Neuroradiologist | 0.96 (0.95–0.98) | <0.001 | 0.95 (0.93–0.97) | <0.001 |
| Radiation oncologist | 0.97 (0.96–0.98) | <0.001 | 0.94 (0.92–0.96) | <0.001 |
| Spine surgeon | 0.98 (0.97–0.99) | <0.001 | 0.94 (0.91–0.96) | <0.001 |
Gwet’s kappa was used.
DL, deep learning model.
Internal and external test set sensitivity and specificity for the deep learning model and specialist readers using dichotomous Bilsky gradings (low versus high grade) on MRI.
| Reader | Internal test set | External test set | ||
|---|---|---|---|---|
| Sens (95% CI) | Spec (95% CI) | Sens (95% CI) | Spec (95% CI) | |
| DL model | 97.6 (91.7–99.7) | 93.6 (91.9–95.0) | 89.9 (84.4–94.0) | 98.1 (96.7–99.1) |
| Neuroradiologist | 84.5 (75.0–91.5) | 98.1 (97.0–98.8) | 92.9 (87.9–96.2) | 97.9 (96.4–98.9) |
| Radiation oncologist | 94.0 (86.7–98.0) | 97.9 (96.7–98.7) | 88.8 (83.0–93.1) | 98.5 (97.1–99.3) |
| Spine surgeon | 79.8 (69.6–87.7) | 99.5 (98.8–99.8) | 83.4 (77.0–88.7) | 99.3 (98.3–99.8) |
DL, deep learning model; Sens, sensitivity; Spec, specificity.