| Literature DB >> 34490103 |
Zhikai Liu1, Wanqi Chen2, Hui Guan1, Hongnan Zhen1, Jing Shen1, Xia Liu1, An Liu3, Richard Li3, Jianhao Geng4, Jing You4, Weihu Wang4, Zhouyu Li5, Yongfeng Zhang6, Yuanyuan Chen7, Junjie Du8, Qi Chen9, Yu Chen9, Shaobin Wang9, Fuquan Zhang1, Jie Qiu1.
Abstract
PURPOSE: To propose a novel deep-learning-based auto-segmentation model for CTV delineation in cervical cancer and to evaluate whether it can perform comparably well to manual delineation by a three-stage multicenter evaluation framework.Entities:
Keywords: auto-segmentation; cervical cancer; clinical target volume; deep-learning; evaluation; radiotherapy
Year: 2021 PMID: 34490103 PMCID: PMC8417437 DOI: 10.3389/fonc.2021.702270
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1The overall architecture of the proposed model.
Figure 2The flowchart of the three-stage multicenter randomized controlled evaluation.
Criteria for the radiation oncologist evaluation.
| Score | Grade | Criteria |
|---|---|---|
|
| No revision | The segmentation is perfect and completely acceptable for treatment. |
|
| Minor revision | The segmentation needs a few minor edits but has no significant clinical impact without correction. |
|
| Major revision | The segmentation needs significant revision. Treatment planning should not proceed without contour correction. |
|
| Rejection | The segmentation is unacceptable and needs to be redrawn. |
The comparison of DSC and 95HD value of our proposed model and DpnUNet.
| Test | Patient (No.) | Proposed Model | DpnUNet | ||
|---|---|---|---|---|---|
| DSC | 95HD (mm) | DSC | 95HD (mm) | ||
|
| 1 | 0.9 | 1.95 | 0.84 | 2.09 |
| 2 | 0.91 | 2.34 | 0.84 | 2.38 | |
| 3 | 0.9 | 3.68 | 0.89 | 3.61 | |
| 4 | 0.9 | 1.95 | 0.90 | 1.85 | |
| 5 | 0.83 | 7.68 | 0.75 | 8,84 | |
| 6 | 0.88 | 2.98 | 0.81 | 3.10 | |
| 7 | 0.84 | 7.07 | 0.80 | 8.10 | |
| 8 | 0.9 | 2.55 | 0.93 | 2.45 | |
| 9 | 0.89 | 2.83 | 0.83 | 3.85 | |
| 10 | 0.88 | 3.35 | 0.86 | 3.41 | |
|
| 11 | 0.85 | 5.1 | 0.75 | 6.17 |
| 12 | 0.91 | 2.83 | 0.90 | 3.48 | |
| 13 | 0.81 | 7.76 | 0.84 | 7.92 | |
| 14 | 0.91 | 2.24 | 0.89 | 2.33 | |
| 15 | 0.91 | 2.21 | 0.94 | 1.97 | |
| 16 | 0.89 | 2.24 | 0.87 | 2.06 | |
| 17 | 0.9 | 2.83 | 0.82 | 2.49 | |
| 18 | 0.89 | 2.45 | 0.92 | 2.88 | |
| 19 | 0.93 | 2.25 | 0.94 | 2.26 | |
| 20 | 0.85 | 2.93 | 0.84 | 2.25 | |
|
| 0.88 ± 0.03 | 3.46 ± 1.88 | 0.86 ± 0.06 | 3.67 ± 2.22 | |
Graded oncologist evaluation for AI and GT contours.
| Week 0 | ||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Oncologist | A | B | C | D | E | F | G | H | I | |||||||||
| Score | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT |
|
| 89% | 97% | 93% | 95% | 30% | 37% | 71% | 74% | 54% | 57% | 75% | 85% | 94% | 94% | 82% | 77% | 45% | 37% |
|
| 11% | 3% | 7% | 5% | 61% | 56% | 28% | 25% | 46% | 42% | 25% | 15% | 6% | 6% | 18% | 23% | 45% | 57% |
|
| 0 | 0 | 0 | 0 | 9% | 6% | 1% | 1% | 0 | 1% | 0 | 0 | 0 | 0 | 0 | 0 | 10% | 6% |
|
| 0 | 0 | 0 | 0 | 0 | 1% | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0% |
|
| 2.89 | 2.96 | 2.93 | 2.95 | 2.21 | 2.29 | 2.70 | 2.73 | 2.54 | 2.56 | 2.75 | 2.85 | 2.94 | 2.94 | 2.82 | 2.77 | 2.35 | 2.31 |
|
| 0.061 | 0.553 | 0.282 | 0.640 | 0.719 | 0.078 | 1.000 | 0.382 | 0.494 | |||||||||
|
| ||||||||||||||||||
|
| 93% | 92% | 88% | 93% | 29% | 37% | 78% | 77% | 42% | 50% | 78% | 69% | 94% | 96% | 33% | 33% | 50% | 42& |
|
| 7% | 8% | 12% | 7% | 63% | 54% | 22% | 21% | 57% | 50% | 22% | 31% | 5% | 4% | 62% | 64% | 39% | 56% |
|
| 0 | 0 | 0 | 0 | 8% | 9% | 0 | 1% | 1% | 0 | 0 | 0 | 1% | 0 | 5% | 3% | 7% | 2% |
|
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4% | 0 |
|
| 2.93 | 2.92 | 2.88 | 2.93 | 2.21 | 2.28 | 2.78 | 2.74 | 2.41 | 2.5 | 2.78 | 2.69 | 2.93 | 2.96 | 2.28 | 2.3 | 2.35 | 2.40 |
|
| 0.789 | 0.229 | 0.352 | 0.940 | 0.230 | 0.150 | 0.509 | 0.846 | 0.728 | |||||||||
|
| 0.782 | 0.108 | 0.907 | 0.064 |
| 0.118 | 0.491 |
| 0.170 | |||||||||
P < 0.05, the results are statistically significant.
Figure 3Average scores for AI and GT by the nine oncologists. (A) Week 0. (B) Week 2.
The results of the Turing-like imitation test.
| Oncologist | Week 0 | Week 2 | Consistency ( | ||
|---|---|---|---|---|---|
| Positive | Negative | Positive | Negative | ||
| A | 130 (65%) | 70 (35%) | 137 (68.5%) | 63 (31.5%) | 0.296 |
| B | 92 (46%) | 108 (54%) | 100 (50%) | 100 (50%) | 0.461 |
| C | 106 (53%) | 94 (47%) | 116 (58%) | 84 (42%) | 0.134 |
| D | 107 (53.5%) | 93 (46.5%) | 100 (50%) | 100 (50%) | 0.510 |
| E | 98 (49%) | 102 (51%) | 114 (57%) | 86 (43%) |
|
| F | 111 (55.5%) | 89 (44.5%) | 102 (51%) | 98 (49%) | 0.508 |
| G | 122 (61%) | 78 (39%) | 117 (58.5%) | 83 (41.5%) | 0.712 |
| H | 119 (59.5%) | 81 (40.5%) | 95 (47.5%) | 105 (52.5%) | 0.101 |
| I | 90 (45%) | 110 (55%) | 89 (44.5%) | 111(55.5%) | 0.815 |
|
| 0.139 | 0.128 | |||
P < 0.05, results are statistically significant.
Figure 4The distribution map of the positive results.
Figure 5(A) Sample CTV where the AI contour was approved by all the oncologists. AI contours in green line. GT contours in red line. (B) Sample CTV where the GT contour was approved by all the oncologists. AI contours in green line. GT contours in red line.