Wen Chen1,2, Yimin Li3, Brandon A Dyer2,4, Xue Feng5, Shyam Rao2, Stanley H Benedict2, Quan Chen6,7, Yi Rong8.
Abstract
BACKGROUND: Impaired function of masticatory muscles will lead to trismus. Routine delineation of these muscles during planning may improve dose tracking and facilitate dose reduction, resulting in decreased radiation-related trismus. This study aimed to compare a deep learning model with a commercial atlas-based model for fast auto-segmentation of the masticatory muscles on head and neck computed tomography (CT) images.
Keywords: Auto-segmentation; Deep learning model; Masticatory muscles
Year: 2020 PMID: 32690103 PMCID: PMC7372849 DOI: 10.1186/s13014-020-01617-0
Source DB: PubMed Journal: Radiat Oncol ISSN: 1748-717X Impact factor: 3.481
Patient characteristics
| Characteristics | Training group (n = 27) | Validation group (n = 29) |
|---|---|---|
| Primary site | | |
| Oropharynx | 16 (59.3%) | 20 (69.0%) |
| Larynx | 2 (7.4%) | 4 (13.8%) |
| Nasopharynx and Sinonasal | 4 (14.8%) | 2 (6.9%) |
| Other sites | 5 (18.5%) | 3 (10.3%) |
| Stage | | |
| I | 3 (11.1%) | 2 (6.9%) |
| II | 3 (11.1%) | 3 (10.3%) |
| III | 5 (18.5%) | 6 (20.7%) |
| IV | 16 (59.3%) | 17 (58.6%) |
| N/X | 0 (0%) | 1 (3.5%) |
| Primary tumor surgery | | |
| Yes | 15 | 17 |
| No | 12 | 12 |
Fig. 1 Transverse view of different contours for one representative patient. (a) Manual contours (green lines, reference standard) vs. DLAS (red lines); (b) manual contours (green lines) vs. ABAS (blue lines)
Fig. 2 Metrics of geometric and spatial similarity for all muscles manually delineated by three clinicians (interobserver variation). In each box, the central mark is the median, the edges are the 25th and 75th percentiles, and the upper and lower whiskers represent the highest and lowest values. The overall values (mean ± SD) for each metric are shown in the upper right corner of each subfigure. “+” in the box represents the mean value
Mean values and standard deviations (mean ± SD) of the 6 metrics across all organs contoured using three methods: A, DLAS; B, ABAS; C, interobserver variation (baseline)
| Metrics | DLAS (A) | ABAS (B) | Interobserver variation (C) | P* (A vs B) | P* (A vs C) | P* (B vs C) |
|---|---|---|---|---|---|---|
| DSC | 0.86 ± 0.03 | 0.83 ± 0.04 | 0.86 ± 0.05 | 0.00 | 0.26 | 0.00 |
| Recall | 0.86 ± 0.05 | 0.81 ± 0.07 | 0.81 ± 0.07 | 0.00 | 0.00 | 0.91 |
| Precision | 0.85 ± 0.05 | 0.85 ± 0.07 | 0.92 ± 0.04 | 0.97 | 0.00 | 0.00 |
| HD95 | 0.30 ± 0.09 | 0.37 ± 0.13 | 0.31 ± 0.13 | 0.00 | 0.20 | 0.00 |
| HD | 0.73 ± 0.31 | 0.83 ± 0.37 | 0.82 ± 0.53 | 0.00 | 0.84 | 0.03 |
| MSD | 0.08 ± 0.02 | 0.11 ± 0.03 | 0.08 ± 0.04 | 0.00 | 0.20 | 0.00 |
* P values from t tests comparing the three methods
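The record does not spell out how the overlap metrics in this table are computed. As a minimal sketch, assuming the usual voxel-wise definitions (DSC = 2TP / (2TP + FP + FN), recall = TP / (TP + FN), precision = TP / (TP + FP)) with the manual contour as reference, they could be obtained from two binary masks as follows. The function name `overlap_metrics` and the toy masks are hypothetical and are not taken from the study; the distance metrics (HD95, HD, MSD) are not covered here.

```python
def overlap_metrics(reference, prediction):
    """Return (DSC, recall, precision) for two binary masks.

    Both masks are flat sequences of 0/1 voxel labels of equal length.
    Assumption: standard voxel-wise definitions; the study's own
    implementation is not described in this record.
    """
    if len(reference) != len(prediction):
        raise ValueError("masks must have the same number of voxels")
    pairs = list(zip(reference, prediction))
    tp = sum(1 for r, p in pairs if r and p)        # voxels in both contours
    fp = sum(1 for r, p in pairs if not r and p)    # auto-segmented only
    fn = sum(1 for r, p in pairs if r and not p)    # manual (reference) only
    if tp + fn == 0 or tp + fp == 0:
        raise ValueError("each mask needs at least one foreground voxel")
    dsc = 2 * tp / (2 * tp + fp + fn)
    recall = tp / (tp + fn)
    precision = tp / (tp + fp)
    return dsc, recall, precision


# Hypothetical toy masks: 4 reference voxels, 4 predicted voxels, 3 shared.
manual = [0, 1, 1, 1, 1, 0, 0, 0]
auto = [0, 0, 1, 1, 1, 1, 0, 0]
dsc, recall, precision = overlap_metrics(manual, auto)
print(f"DSC={dsc:.2f} recall={recall:.2f} precision={precision:.2f}")
```

Under these definitions, high precision with lower recall means the contour stays inside the reference but misses part of it, and the reverse pattern means the contour extends beyond the reference.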
Fig. 3 Comparison of DLAS and ABAS performance. Performance was evaluated with (a) DSC, (b) recall, (c) precision, (d) HD95, (e) HD, and (f) MSD. In each box, the central mark is the median, the edges are the 25th and 75th percentiles, and the upper and lower whiskers represent the highest and lowest values. A paired t test was used for analysis. *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001; ns, no significance
Fig. 4 Overall scores achieved by DLAS and ABAS for all pairs of muscles. *P < 0.05. In each box, the central mark is the median, the edges are the 25th and 75th percentiles, and the upper and lower whiskers represent the highest and lowest values
Percentage (%) of cases, for each muscle, in which auto-segmentation by DLAS or ABAS was worse than that achieved by physicians (compared using mean DSC)
| | M-R | M-L | T-R | T-L | LP-R | LP-L | MP-R | MP-L |
|---|---|---|---|---|---|---|---|---|
| DLAS | 62.1% (18/29) | 51.7% (15/29) | 20.7% (6/29) | 24.1% (7/29) | 65.5% (19/29) | 65.5% (19/29) | 44.8% (13/29) | 37.9% (11/29) |
| ABAS | 96.6% (28/29) | 89.7% (26/29) | 48.3% (14/29) | 41.4% (12/29) | 96.6% (28/29) | 82.8% (24/29) | 79.3% (23/29) | 69.0% (20/29) |
| P* | 0.02 | 0.03 | 0.05 | 0.26 | 0.01 | 0.23 | 0.01 | 0.03 |
* P values from chi-square tests comparing DLAS and ABAS
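To illustrate the kind of comparison behind the P row, the sketch below runs a Pearson chi-square test on a 2 × 2 table of worse / not-worse counts for the two methods. It assumes the uncorrected Pearson statistic with one degree of freedom. The record does not state which variant was used (for example, with or without a continuity correction), so the result need not match the reported P values. The function name `chi_square_2x2` is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d):
    """Pearson chi-square test (no continuity correction) for the table

        [[a, b],
         [c, d]]

    Returns (statistic, p_value) with 1 degree of freedom.
    Assumption: this variant is chosen for illustration only.
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column total must be non-zero")
    stat = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom the chi-square survival function
    # equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# M-R counts from the table: DLAS worse in 18 of 29 cases, ABAS in 28 of 29.
dlas_worse, abas_worse, total = 18, 28, 29
stat, p_value = chi_square_2x2(
    dlas_worse, total - dlas_worse,
    abas_worse, total - abas_worse,
)
print(f"chi-square = {stat:.3f}, P = {p_value:.4f}")
```

With small cell counts such as the single not-worse ABAS case here, a continuity-corrected or exact test is a common alternative, which is one reason results from this sketch may differ from the table.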
Fig. 5 Comparison of ∆dose for DLAS vs ABAS. A paired t test was used for analysis. *P < 0.05. In each box, the central mark is the median, the edges are the 25th and 75th percentiles, and the upper and lower whiskers represent the highest and lowest values