| Literature DB >> 34811957 |
Yijun Wu1, Kai Kang1, Chang Han1, Shaobin Wang2, Qi Chen2, Yu Chen2, Fuquan Zhang1, Zhikai Liu1.
Abstract
BACKGROUND: Delineation of clinical target volume (CTV) for radiotherapy is a time-consuming and labor-intensive work. This study aims to propose a novel convolutional neural network (CNN)-based model for fast auto-segmentation of CTV. To evaluate its performance and clinical utility, a blind randomized validation method was used.Entities:
Keywords: clinical evaluation; convolutional neural network; deep learning; neoadjuvant radiotherapy; rectal cancer
Mesh:
Year: 2021 PMID: 34811957 PMCID: PMC8704175 DOI: 10.1002/cam4.4441
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
FIGURE 1Development of our U‐Net architecture. En is encoder block. Down and Conv are down sample and convolution layers. WEN is connection weight responding to a specific connection
FIGURE 2Flow chart for clinicians' scoring and Turing test. AI, artificial intelligence; GT, ground truth
FIGURE 3Sample contours for clinicians' scoring in patients with rectal cancer receiving neoadjuvant radiotherapy. Rejected (0 point): (A, B); Major revision (1 point): (C, D); Minor revision (2 points): (E–H); and Totally accepted (3 points): (I–L)
FIGURE 4Sample slices for Turing test in patients with rectal cancer receiving neoadjuvant radiotherapy. Red: artificial intelligence (AI) contour; Green: ground truth (GT) contour. (A–D): AI performs better than GT; (E–H): AT performs worse than GT
DSC and 95HD values of our proposed model in the validation dataset of patients for clinicians' scoring and Turing test
| Evaluation | Patient | DSC | 95HD |
|---|---|---|---|
| Clinicians' evaluation | 1 | 0.91 | 9.51 |
| 2 | 0.89 | 8.13 | |
| 3 | 0.89 | 9.02 | |
| 4 | 0.93 | 5.94 | |
| 5 | 0.93 | 7.05 | |
| 6 | 0.93 | 5.97 | |
| 7 | 0.88 | 11.68 | |
| 8 | 0.89 | 10.46 | |
| 9 | 0.93 | 7.60 | |
| 10 | 0.90 | 10.50 | |
| Mean ± SD | − | 0.91 ± 0.02 | 8.59 ± 1.98 |
| Turing test | 11 | 0.88 | 9.62 |
| 12 | 0.84 | 10.08 | |
| 13 | 0.89 | 8.46 | |
| 14 | 0.88 | 9.79 | |
| 15 | 0.92 | 5.02 | |
| 16 | 0.91 | 5.47 | |
| 17 | 0.89 | 8.11 | |
| 18 | 0.90 | 6.98 | |
| 19 | 0.90 | 7.09 | |
| 20 | 0.91 | 5.76 | |
| Mean ± SD | − | 0.89 ± 0.02 | 8.97 ± 1.86 |
|
| − | 0.113 | 0.284 |
Abbreviations: 95HD, 95th percentile Hausdorff distance; DSC, Dice similarity coefficient.
Clinicians' scoring for AI and GT contours
| Clinician | A | B | C | D | E | F | G | H | I | J | Mean (%) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Score | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT | AI | GT |
| Week 0 | ||||||||||||||||||||||
| 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0.1 (0.2) | 0.1 (0.2) |
| 1 | 1 | 1 | 0 | 0 | 7 | 11 | 0 | 0 | 0 | 3 | 0 | 0 | 1 | 1 | 17 | 13 | 0 | 0 | 0 | 0 | 2.6 (5.2) | 2.9 (5.8) |
| 2 | 6 | 13 | 25 | 22 | 27 | 27 | 3 | 11 | 37 | 36 | 3 | 8 | 12 | 15 | 11 | 20 | 18 | 23 | 6 | 5 | 14.8 (29.6) | 18.0 (36.0) |
| 3 | 43 | 36 | 25 | 28 | 15 | 11 | 47 | 39 | 13 | 11 | 47 | 42 | 37 | 34 | 22 | 17 | 32 | 27 | 44 | 45 | 32.5 (65.0) | 29.0 (58.0) |
| Mean score | 2.84 | 2.70 | 2.50 | 2.56 | 2.12 | 1.96 | 2.94 | 2.78 | 2.26 | 2.16 | 2.94 | 2.84 | 2.72 | 2.66 | 2.10 | 2.08 | 2.64 | 2.54 | 2.88 | 2.90 | 2.59 | 2.52 |
|
| 0.095 | 0.55 | 0.247 | 0.022 | 0.347 | 0.112 | 0.521 | 0.846 | 0.312 | 0.75 | 0.086 | |||||||||||
| Week 2 | ||||||||||||||||||||||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 | 10 | 0 | 0 | 0 | 0 | 1.5 (3.0) | 1.0 (2.0) |
| 1 | 0 | 1 | 0 | 0 | 1 | 4 | 0 | 0 | 4 | 8 | 0 | 0 | 0 | 0 | 8 | 16 | 0 | 0 | 0 | 0 | 1.3 (2.6) | 2.9 (5.8) |
| 2 | 7 | 7 | 27 | 32 | 25 | 34 | 4 | 5 | 37 | 36 | 3 | 13 | 11 | 13 | 11 | 10 | 15 | 14 | 12 | 12 | 15.2 (30.4) | 17.6(35.2) |
| 3 | 43 | 42 | 23 | 18 | 24 | 12 | 46 | 45 | 9 | 6 | 47 | 37 | 39 | 37 | 16 | 14 | 35 | 36 | 38 | 38 | 32.0 (64.0) | 28.5 (57.0) |
| Mean score | 2.86 | 2.82 | 2.46 | 2.36 | 2.46 | 2.16 | 2.92 | 2.90 | 2.10 | 1.96 | 2.94 | 2.74 | 2.78 | 2.74 | 1.56 | 1.56 | 2.70 | 2.72 | 2.76 | 2.76 | 2.55 | 2.47 |
|
| 0.751 | 0.312 | 0.008 | 0.728 | 0.181 | 0.007 | 0.641 | 0.989 | 0.826 | 1.000 | 0.115 | |||||||||||
| Kappa value | 0.613 | −0.029 | 0.240 | 0.463 | 0.158 | 0.361 | 0.115 | 0.155 | 0.264 | 0.495 | – | – | ||||||||||
|
| <0.001 | 0.766 | 0.001 | <0.001 | 0.04 | <0.001 | 0.229 | 0.004 | 0.006 | <0.001 | – | – | ||||||||||
Abbreviations: AI, artificial intelligence; GT, ground truth.
Kappa test.
FIGURE 5(A, B) Mean score of 10 clinicians for each contour. (C, D) Mean score of 10 clinicians for each slice in Turing test. AI, artificial intelligence; GT, ground truth
Turing test
| Clinician | A | B | C | D | E | F | G | H | I | J | Mean positive rate |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Week 0 | |||||||||||
| Positive | 39 | 41 | 30 | 32 | 42 | 50 | 37 | 42 | 39 | 53 | 40.5% |
| Negative | 61 | 59 | 70 | 68 | 58 | 50 | 63 | 58 | 61 | 47 | |
| Week 2 | |||||||||||
| Positive | 41 | 48 | 41 | 49 | 44 | 46 | 42 | 46 | 52 | 43 | 45.2% |
| Negative | 59 | 52 | 59 | 51 | 56 | 54 | 58 | 54 | 48 | 57 | |
|
| 0.059 | 0.334 | 0.008 | 0.096 | 0.198 | 0.76 | 0.05 | 0.281 | 0.452 | 0.752 | – |