| Literature DB >> 35633448 |
Esben Andreas Carlsen1,2, Kristian Lindholm1,2, Andreas Kjaer3,4, Flemming Littrup Andersen1,2, Amalie Hindsholm1,2, Mathias Gæde1,2, Claes Nøhr Ladefoged1,2, Mathias Loft1,2, Camilla Bardram Johnbeck1,2, Seppo Wang Langer2,5,6, Peter Oturai1,2, Ulrich Knigge2,7.
Abstract
BACKGROUND: Segmentation of neuroendocrine neoplasms (NENs) in [64Cu]Cu-DOTATATE positron emission tomography makes it possible to extract quantitative measures useable for prognostication of patients. However, manual tumor segmentation is cumbersome and time-consuming. Therefore, we aimed to implement and test an artificial intelligence (AI) network for tumor segmentation. Patients with gastroenteropancreatic or lung NEN with [64Cu]Cu-DOTATATE PET/CT performed were included in our training (n = 117) and test cohort (n = 41). Further, 10 patients with no signs of NEN were included as negative controls. Ground truth segmentations were obtained by a standardized semiautomatic method for tumor segmentation by a physician. The nnU-Net framework was used to set up a deep learning U-net architecture. Dice score, sensitivity and precision were used for selection of the final model. AI segmentations were implemented in a clinical imaging viewer where a physician evaluated performance and performed manual adjustments.Entities:
Keywords: Artificial intelligence; Neuroendocrine neoplasms; Prognostication; Tumor segmentation; [64Cu]Cu-DOTATATE PET
Year: 2022 PMID: 35633448 PMCID: PMC9148347 DOI: 10.1186/s13550-022-00901-2
Source DB: PubMed Journal: EJNMMI Res ISSN: 2191-219X Impact factor: 3.434
Fig. 1Total tumor segmentation of neuroendocrine neoplasms by a convolutional neural network
Fig. 2Illustration of data used for training and test. The models and the ensemble hereof were inferred on the test cohort. Boxes are not drawn to scale
Evaluation scheme of AI segmentations
| Rating | Criteria |
|---|---|
| 1. Perfect/optimal | The segmentation is as good as manual segmentation, that is, no false positive or false negative segmentations |
| 2. Optimal with minor adjustments | The segmentation contains all lesions and only minor* false positives or false negatives |
| 3. Acceptable with minor adjustments | The segmentation contains the majority of the lesions (at least 1 and ≤ 3 missing) and ≤ 2 false positive segmentations. Additionally only minor* false positives or false negatives |
| 4. Acceptable with major adjustments | The segmentation contains most of the lesions (at least 1 and ≤ 6 missing) and ≤ 4 false positive segmentations. Additionally only minor* false positives or false negatives |
| 5. Non-usable | The segmentation does not contain enough of the lesions (≥ 7 lesions missing or no lesions segmented if less than 7 lesions present) or too many false positives (≥ 5) for correction to be meaningful |
* Minor is defined as only parts of a predicted lesion are wrong
Demographic data for patients with neuroendocrine neoplasms
| Training and test datasets | Training | Test | Overall |
|---|---|---|---|
| ( | ( | ( | |
| Mean age, year | 62 (SD, 11) | 65 (SD, 10) | 63 (SD, 10) |
| Gender | |||
| Male | 63 (54) | 24 (59) | 87 (55) |
| Female | 54 (46) | 17 (42) | 71 (45) |
| Site of primary | |||
| Small intestine | 67 (57) | 23 (56) | 90 (57) |
| Pancreas | 23 (20) | 9 (22) | 32 (20) |
| Lung | 7 (6) | 5 (12) | 12 (8) |
| Cecum | 6 (5) | 2 (5) | 8 (5) |
| Extrahepatic biliary tract | 2 (2) | 0 (0) | 2 (1) |
| Esophagus | 1 (1) | 0 (0) | 1 (1) |
| Gastric | 1 (1) | 0 (0) | 1 (1) |
| Unknown primary NEN | 10 (9) | 2 (5) | 12 (8) |
| WHO* | |||
| Grade 1 | 29 (25) | 3 (7) | 32 (20) |
| Grade 2 | 75 (64) | 32 (78) | 107 (68) |
| Grade 3 | 6 (5) | 6 (15) | 12 (8) |
| Missing | 7 (6) | 0 (0) | 7 (4) |
| Mean Ki67, % | 8 (SD, 14) | 11 (SD, 8) | 9 (SD, 12) |
*Denotes statistically significant difference in distribution of WHO grades between training and test data (p = 0.013). Data are number followed by percentage in parentheses, unless otherwise indicated. Percentage were rounded and may not add up to 100%
Metrics for AI segmentations without manual adjustments applied to the test cohort (n = 41)
| Pixel-wise | AI model | ||||
|---|---|---|---|---|---|
| Model 1 | Model 2 | Model 3 | Model 4 | Ensemble | |
| Dice | 0.801 (0.206) | 0.768 (0.234) | 0.763 (0.233) | 0.801 (0.196) | |
| Precision | 0.772 (0.258) | 0.752 (0.279) | 0.787 (0.258) | 0.786 (0.250) | |
| Sensitivity | 0.860 (0.180) | 0.869 (0.182) | 0.821 (0.231)* | 0.872 (0.177) | |
All values calculated as mean of the 41 patients of the test cohort with standard deviation in parentheses. Bold numbers mark the highest value across the models/ensemble in each evaluation metric. *Denotes statistically significant difference in sensitivity between Model 4 and Model 1 (p = 0.017)
Fig. 3Representative examples of the segmentations from the AI model for two patients. Maximum intensity projection [64Cu]Cu-DOTATATE PET without tumor segmentation (A, D). Ground truth segmentation of tumor (B, E). AI predicted segmentations—no manual adjustments performed (C, F). In the AI output, all separate segmentations are given a unique color, e.g., red, blue, green, making manual adjustment with deletion of a segmentation easy and fast (e.g., part of the bladder was erroneously segmented in C)
Evaluation of number of false-positive/false-negative segmentations by AI without manual adjustments
| Evaluation score* | |
|---|---|
| 1 (Perfect/optimal) | 7 (17%) |
| 2 (Optimal with minor adjustments) | 3 (1%) |
| 3 (Acceptable with minor adjustments) | 19 (46%) |
| 4 (Acceptable with major adjustments) | 6 (15%) |
| 5 (Non-usable) | 6 (15%) |
*Defined in Table 1
Fig. 4Boxplot depicting time spent on manual correction to obtain the final total tumor segmentation