| Literature DB >> 33191265 |
MinJae Woo1, Moonseong Heo1, A Michael Devane2, Steven C Lowe2, Ronald W Gimbel3.
Abstract
BACKGROUND: A growing number of research studies have reported inter-observer variability in sizes of tumours measured from CT scans. It remains unclear whether the conventional statistical measures correctly evaluate the CT measurement consistency for optimal treatment management and decision-making. We compared and evaluated the existing measures for evaluating inter-observer variability in CT measurement of cancer lesions.Entities:
Keywords: adult oncology; computed tomography; protocols & guidelines; quality in health care
Mesh:
Year: 2020 PMID: 33191265 PMCID: PMC7668356 DOI: 10.1136/bmjopen-2020-040096
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Example of increased and decrease inter-observer variability from observed data. To generate a case with increased inter-observer variability, the difference between each measurement and the median value was increased by 40% (right). The difference between each measurement and the median value was decreased by 40% in the case with decreased inter-observer variability (left).
Figure 2Visualisation of measurement distribution for each case. Each vertical line in the graphs represent different CT case and each point represent per cent difference between a measurement and the corresponding median value. The light blue area represents plus and minus 10% interval from the median value.
Figure 3Visualisation of pairwise bias from Bland-Altman analysis. The systematic discrepancy (bias) was calculated using average per cent differences and presented in decimal format. Darker red colours represent larger per cent measurement differences. The positive values indicate that the radiologist on y-axis over-estimated compared with the radiologist on x-axis. The negative values indicate that the radiologist on y-axis under-estimated compared with the radiologist on x-axis.
Descriptive statistics for the original observed data
| CT image sets | Number of image slices | Median measurements (SD) | Range | Min–Max |
| Hepatic metastasis 1 | 9 | 4.46 (0.38) | (3.81 to 5.19) | 30.70% |
| Hepatic metastasis 2 | 5 | 2.68 (0.22) | (2.31 to 3.03) | 27.00% |
| Hepatic metastasis 3 | 5 | 1.91 (0.18) | (1.68 to 2.21) | 27.20% |
| Hepatic metastasis 4 | 13 | 6.14 (0.48) | (5.32 to 6.72) | 23.30% |
| Hepatic metastasis 5 | 6 | 2.68 (0.29) | (2.24 to 3.13) | 33.10% |
| Lung lesion 1 | 8 | 3.46 (0.24) | (3.10 to 3.86) | 21.80% |
| Lung lesion 2 | 10 | 4.18 (0.23) | (3.90 to 4.51) | 14.50% |
| Lung lesion 3 | 6 | 2.00 (0.17) | (1.71 to 2.37) | 32.40% |
| Lung lesion 4 | 10 | 4.29 (0.36) | (3.69 to 5.02) | 30.50% |
| Lung lesion 5 | 4 | 1.56 (0.11) | (1.27 to 1.68) | 27.80% |
Note: Average measurement and range are in centimetres (cm). SD denotes standard deviation. Min denotes minimum measurement for each lesion. Max denotes maximum measurement for each lesion. Per cent difference between minimum and maximum values was calculated using the following formula: difference (min, max)/average (min, max). Range consists of (minimum observed value to maximum observed value).
Figure 4Responsiveness comparison of intraclass correlation coefficient (ICC) and Bland-Altman outlier scores. Scaling factor d represents per cent increase in the deviation of each measurement from the corresponding median. Horizontal axis corresponds to scaling factor d used to decrease or increase the inter-observer variability. Vertical axis represents ICC and Bland-Altman scores. Vertical dotted lines in red represent different data sets.
Figure 5Standard Bland-Altman plotting for the selected pairs. The upper and lower limit of agreement (LOA) were calculated using 2SD. The dotted and solid lines represent LOA and mean difference, respectively. Different colours represent different radiologist pairs. While there were a total 78 possible pairs, the plotting included six selected pairs for visualisation purposes. The total number of outliers was unchanging across the different cases, regardless of the number of pairs in the plotting.