| Literature DB >> 30825711 |
M Visser1, D M J Müller2, R J M van Duijn3, M Smits4, N Verburg2, E J Hendriks3, R J A Nabuurs2, J C J Bot3, R S Eijgelaar5, M Witte5, M B van Herk6, F Barkhof7, P C de Witt Hamer8, J C de Munck3.
Abstract
BACKGROUND: Tumor segmentation of glioma on MRI is a technique to monitor, quantify and report disease progression. Manual MRI segmentation is the gold standard but very labor intensive. At present the quality of this gold standard is not known for different stages of the disease, and prior work has mainly focused on treatment-naive glioblastoma. In this paper we studied the inter-rater agreement of manual MRI segmentation of glioblastoma and WHO grade II-III glioma for novices and experts at three stages of disease. We also studied the impact of inter-observer variation on extent of resection and growth rate.Entities:
Keywords: Glioblastoma; Glioma; Inter-rater agreement; Low-grade glioma; MRI; Manual segmentation
Mesh:
Year: 2019 PMID: 30825711 PMCID: PMC6396436 DOI: 10.1016/j.nicl.2019.101727
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Patient characteristics.
| Glioblastoma | Non-glioblastoma | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pat | Path | Sex | Age | T1 | T2 | T3 | Pat | Path | Sex | Age | T1 | T2 | T3 |
| 1 | F | 67,1 | 13 | 0 | 415 | 21 | F | 53,7 | 26 | 91 | 1746 | ||
| 2 | F | 72,1 | 6 | 1 | 229 | 22 | M | 44,7 | 1 | 111 | 1033 | ||
| 3 | M | 65,3 | 2 | 0 | 920 | 23 | F | 23,1 | 67 | 111 | 188 | ||
| 4 | F | 66,1 | 2 | 1 | 310 | 24 | M | 30,1 | 53 | 77 | 861 | ||
| 5 | M | 66,7 | 1 | 1 | 474 | 25 | M | 18,6 | 1 | 184 | 1477 | ||
| 6 | F | 64,0 | 15 | 1 | 274 | 26 | F | 21,8 | 9 | 92 | 1538 | ||
| 7 | M | 45,4 | 4 | 3 | 591 | 27 | M | 52,6 | 67 | 143 | 1595 | ||
| 8 | M | 52,8 | 9 | 3 | 255 | 28 | F | 35,5 | 46 | 108 | 1820 | ||
| 9 | M | 61,3 | 7 | 0 | 279 | 29 | M | 30,8 | 255 | 102 | 686 | ||
| 10 | M | 70,5 | 2 | 0 | 184 | 30 | F | 28,6 | 111 | 127 | 207 | ||
| 11 | M | 75,5 | 1 | 1 | 188 | 31 | M | 34,8 | 109 | 1 | 191 | ||
| 12 | F | 66,2 | 8 | 2 | 540 | 32 | F | 48,2 | 2 | 99 | 573 | ||
| 13 | M | 71,6 | 10 | 1 | 825 | 33 | M | 29,1 | 12 | 101 | 1438 | ||
| 14 | M | 55,1 | 2 | 3 | 770 | 34 | M | 23.0 | 60 | 145 | 965 | ||
| 15 | F | 42,3 | 5 | 1 | 732 | 35 | M | 41,6 | 1 | 90 | 903 | ||
| 16 | M | 73,0 | 18 | 1 | 329 | 36 | F | 39,5 | 1 | 61 | 183 | ||
| 17 | F | 47,2 | 3 | 1 | 168 | 37 | M | 52,8 | 8 | 170 | 306 | ||
| 18 | F | 41,8 | 21 | 1 | 267 | 38 | M | 37,8 | 52 | 161 | 186 | ||
| 19 | F | 72,6 | 8 | 1 | 204 | 39 | F | 44,3 | 68 | 380 | 402 | ||
| 20 | M | 51,4 | 19 | 2 | 278 | 40 | M | 46,7 | 126 | 112 | 1038 | ||
T1: time of preoperative scans (days before surgery), T2: time of postoperative scans (days after surgery), T3: time of progression, GB: glioblastoma, A2: astrocytoma grade II, O2: Oligodendroglioma grade II, OA2: oligoastrocytoma grade II, A3: anaplastic astrocytoma grade III.
Fig. 1Bar plots of the number of patients with corresponding number of expert (EX) and novice (NO) raters detecting any enhancing tumor and any non-enhancing tumor for glioblastoma and non-glioblastoma in MRIs preoperative, postoperative and at progression.
Intra-class coefficient with 95% confidence intervals for experts and novices.
| Histology group | Contrast | Rater | Preoperative | Postoperative | Progression |
|---|---|---|---|---|---|
| GB | Enhancing | Experts | 0.99 (0.98–1.00) | 0.92 (0.85–0.97) | 0.91 (0.82–0.96) |
| GB | Enhancing | Novices | 0.98 (0.96–1.00) | 0.60 (0.39–0.78) | 0.97 (0.95–0.99) |
| GB | Non-enhancing | Experts | 0.61 (0.41–0.79) | 0.25 (0.05–0.52) | 0.53 (0.24–0.76) |
| GB | Non-enhancing | Novices | 0.55 (0.24–0.78) | 0.15 (0.00–0.38) | 0.40 (0.09–0.67) |
| Non-GB | Enhancing | Experts | 0.28 (0.07–0.55) | 1.00 (1.00–1.00) | |
| Non-GB | Enhancing | Novices | 0.57 (0.35–0.77) | 0.66 (0.47–0.83) | |
| Non-GB | Non-enhancing | Experts | 0.92 (0.81–0.97) | 0.84 (0.70–0.93) | 0.80 (0.65–0.91) |
| Non-GB | Non-enhancing | Novices | 0.73 (0.40–0.89) | 0.55 (0.32–0.76) | 0.73 (0.46–0.88) |
GB: glioblastoma.
No enhancing elements were identified for non-glioblastomas in the postoperative MRI, with the exception of 2 disjoint residual volumes each by a different rater.
Fig. 2Box plots of the spatial overlap among experts (EX) and novices (NO) measured as generalized conformity index for enhancing tumor and non-enhancing tumor segmentations of 20 glioblastoma and 20 non-glioblastoma patients in MRIs taken at preoperative, postoperative and progression time points. Each dot represents the agreement among raters for one patient's MRI. Indices above 0.7 are considered excellent. The median of measurements and interquartile distances are plotted as boxes, which were omitted when fewer than five data points were present. Few data points were available for enhancing tumor segmentations in non-glioblastoma, because the generalized conformity index could not be calculated when fewer than two observers detected tumor.
Fig. 3Spatial overlap agreement as generalized conformity index versus tumor volume (average over experts) of enhancing tumor (A) and non-enhancing tumor (B) segmentations for glioblastomas and non-glioblastomas at subsequent MRI timings. Each dot represents the agreement of spatial overlap among experts on one patient's MRI. For enhancing tumor at postoperative phase it is shown that spatial overlap increases after artificial dilation of segmentation (grey dots), however not to the level of progression segmentation of the same volume.
Fig. 4Box plots of agreement between majority vote of all eight raters and each of the individual raters, as Jaccard index for enhancing tumor and non-enhancing tumor segmentations in glioblastoma and non-glioblastoma at the three MRI time points. Each dot represents the agreement between the consensus and the individual rater for one patient's segmentation. The first four subplots represent the experts, the second four refer to the novices. The median of measurements and interquartile distances are plotted as boxes, which were omitted when fewer than five data points were measured.
Fig. 5Boxplots of agreement between rater and majority vote consensus of experts and novices combined measured as Jaccard index for enhancing and non-enhancing tumor segmentations in glioblastoma and non-glioblastoma at three MRI timings. Each dot represents the agreement between a rater's segmentations and the majority vote consensus of all raters for one patient's segmentation. Indices above 0.7 are considered excellent. The median of measurements and interquartile distances are plotted as boxes, which were omitted when fewer than five data points were measured.
Fig. 6The variation in extent of resection and growth rate for glioblastoma and non-glioblastoma between eight raters per patient. In each plot patients are sorted by median extent of resection and growth rate, respectively. Each dot represents the calculation for one patient of one rater. Experts and novices are labelled according to the legend. The median of measurements and interquartile distances are plotted as boxes. The quartile coefficients of dispersion are plotted below the boxplots.
An overview of previous studies on inter-rater agreement.
| Authors | Year | Low grade | High grade | #Exp | #Nov | Context | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Pre | Post | Prog | Pre | Post | Prog | |||||
| 2001 | 4 | 6 | 3 | Added value of MRI to CT for segmentation. | ||||||
| 2005 | 7 | 5⁎ | idem | |||||||
| 2009 | 22⁎⁎ | 8 | Reproducibility of 2D tumor dimensions. | |||||||
| 2010 | 8 | 8 | 2 | 1 | Manual PreOp/PostOp glioblastoma segmentation | |||||
| 2012 | 10 | 2 | GLISTR | |||||||
| 2012 | 5 | 5 | 3 | 4 | Reproducibility of 2D tumor dimensions. | |||||
| 2014 | 37 | 37 | 1 | 2 | Semi-automatic segmentation. | |||||
| 2014 | 25 | 1 | 1 | BraTumIA | ||||||
| 2015 | 14 | 51 | 4 | BRATS | ||||||
| 2015 | 5 | 5 | 4 | 8 | Evaluation of inter-rater variability | |||||
| 2016 | 9 | 3 | 13 | Idem | ||||||
| 2016 | 19 | 4 | BraTumIA | |||||||
| 2016 | 15 | 15 | 2 | Semi-automatic segmentation | ||||||
| 2016 | 14 | 14 | 14 | 1 | 1 | BraTumIA (longitudinal) | ||||
| 2017 | 23 | 1 | Intra-rater assessment | |||||||
| 2018 | 4 | 2 | Semi-automatic segmentation | |||||||
| 2018 | 4 | 2 | Quantification of progression | |||||||
| This Study | 2018 | 20 | 20 | 20 | 20 | 20 | 20 | 4 | 4 | Evaluation of inter-rater variability |
Unspecified type of rater.
Moment after surgery not specified.
Supervised by expert neuro-radiologist.
This study has multiple longitudinal moments after postoperative.
Expert used as ground truth, novices test semi-automated method.