| Literature DB >> 32548296 |
Lubomir M Hadjiiski1, Kenny H Cha1, Richard H Cohan1, Heang-Ping Chan1, Elaine M Caoili1, Matthew S Davenport1,2, Ravi K Samala1, Alon Z Weizer2, Ajjai Alva3, Galina Kirova-Nedyalkova4, Kimberly Shampain1, Nathaniel Meyer1, Daniel Barkmeier1, Sean A Woolen5, Prasad R Shankar1, Isaac R Francis1, Phillip L Palmbos3.
Abstract
We evaluated the intraobserver variability of physicians aided by a computerized decision-support system for treatment response assessment (CDSS-T) to identify patients who show complete response to neoadjuvant chemotherapy for bladder cancer, and the effects of the intraobserver variability on physicians' assessment accuracy. A CDSS-T tool was developed that uses a combination of deep learning neural network and radiomic features from computed tomography (CT) scans to detect bladder cancers that have fully responded to neoadjuvant treatment. Pre- and postchemotherapy CT scans of 157 bladder cancers from 123 patients were collected. In a multireader, multicase observer study, physician-observers estimated the likelihood of pathologic T0 disease by viewing paired pre/posttreatment CT scans placed side by side on an in-house-developed graphical user interface. Five abdominal radiologists, 4 diagnostic radiology residents, 2 oncologists, and 1 urologist participated as observers. They first provided an estimate without CDSS-T and then with CDSS-T. A subset of cases was evaluated twice to study the intraobserver variability and its effects on observer consistency. The mean areas under the curves for assessment of pathologic T0 disease were 0.85 for CDSS-T alone, 0.76 for physicians without CDSS-T and improved to 0.80 for physicians with CDSS-T (P = .001) in the original evaluation, and 0.78 for physicians without CDSS-T and improved to 0.81 for physicians with CDSS-T (P = .010) in the repeated evaluation. The intraobserver variability was significantly reduced with CDSS-T (P < .0001). The CDSS-T can significantly reduce physicians' variability and improve their accuracy for identifying complete response of muscle-invasive bladder cancer to neoadjuvant chemotherapy.Entities:
Keywords: Bladder cancer; decision support systems; intraobserver variability; observer performance study; radiomics; treatment response assessment
Mesh:
Year: 2020 PMID: 32548296 PMCID: PMC7289252 DOI: 10.18383/j.tom.2020.00013
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Figure 1.Image analysis pipeline of the computerized decision-support system for treatment response assessment (CDSS-T system). AI-CALS: auto-initialized cascaded level sets. DL-CNN: deep-learning convolutional neural network.
Figure 2.Graphical user interface (GUI) for reading with and without the computer-aided diagnosis (CAD) system designed for supporting treatment response assessment (CDSS-T). (A) The pre- and posttreatment scans are shown side by side, and (B) the observer estimates the treatment response, recording the estimate in the interface indicated by the arrow. (C) The observer is shown the CDSS-T score and the score distribution of the 2 classes is displayed for reference, as indicated by the middle arrow and bottom arrow, respectively. The observer may revise their treatment response assessment after considering the CDSS-T score using the interface pointed to by the top arrow.
Diagnostic Performance in Terms of AUC of Physicians Without and With the Use of CDSS-T for the Assessment of Complete Response to Neoadjuvant Chemotherapy on the Entire Data Set of 157 Bladder Cancers
| Observer | AUC Without CDSS-T | AUC With CDSS-T |
|---|---|---|
| Physician 1 | 0.74 ± 0.04 | 0.77 ± 0.04 |
| Physician 2 | 0.74 ± 0.04 | 0.77 ± 0.04 |
| Physician 3 | 0.74 ± 0.04 | 0.76 ± 0.04 |
| Physician 4 | 0.76 ± 0.04 | 0.79 ± 0.04 |
| Physician 5 | 0.74 ± 0.04 | 0.74 ± 0.04 |
| Physician 6 | 0.76 ± 0.04 | 0.77 ± 0.04 |
| Physician 7 | 0.66 ± 0.05 | 0.73 ± 0.04 |
| Physician 8 | 0.73 ± 0.04 | 0.75 ± 0.04 |
| Physician 9 | 0.78 ± 0.04 | 0.81 ± 0.04 |
| Physician 10 | 0.73 ± 0.04 | 0.76 ± 0.04 |
| Physician 11 | 0.72 ± 0.04 | 0.76 ± 0.04 |
| Physician 12 | 0.75 ± 0.04 | 0.78 ± 0.04 |
| Mean AUC | 0.74 | 0.77 |
Diagnostic Performance in Terms of AUC of Physicians Without and With the Use of CDSS-T for the Assessment of Complete Response to Neoadjuvant Chemotherapy on the First 51 Cases in Each Observer's Individually Randomized Reading List
| Observer | AUC | AUC | ||
|---|---|---|---|---|
| Without CDSS-T | With CDSS-T | Without CDSS-T | With CDSS-T | |
| Physician 1 | 0.76 ± 0.07 | 0.76 ± 0.07 | 0.79 ± 0.07 | 0.77 ± 0.07 |
| Physician 2 | 0.88 ± 0.05 | 0.90 ± 0.04 | 0.88 ± 0.05 | 0.93 ± 0.04 |
| Physician 3 | 0.69 ± 0.08 | 0.70 ± 0.08 | 0.66 ± 0.08 | 0.70 ± 0.08 |
| Physician 4 | 0.70 ± 0.07 | 0.78 ± 0.06 | 0.83 ± 0.06 | 0.83 ± 0.06 |
| Physician 5 | 0.83 ± 0.06 | 0.86 ± 0.06 | 0.81 ± 0.07 | 0.82 ± 0.07 |
| Physician 6 | 0.75 ± 0.08 | 0.76 ± 0.08 | 0.83 ± 0.06 | 0.87 ± 0.05 |
| Physician 7 | 0.65 ± 0.08 | 0.74 ± 0.07 | 0.73 ± 0.07 | 0.77 ± 0.06 |
| Physician 8 | 0.75 ± 0.08 | 0.78 ± 0.08 | 0.81 ± 0.08 | 0.82 ± 0.07 |
| Physician 9 | 0.81 ± 0.06 | 0.86 ± 0.05 | 0.79 ± 0.06 | 0.85 ± 0.05 |
| Physician 10 | 0.80 ± 0.08 | 0.85 ± 0.07 | 0.81 ± 0.07 | 0.88 ± 0.06 |
| Physician 11 | 0.65 ± 0.10 | 0.71 ± 0.09 | 0.65 ± 0.09 | 0.70 ± 0.09 |
| Physician 12 | 0.82 ± 0.07 | 0.85 ± 0.06 | 0.76 ± 0.07 | 0.77 ± 0.07 |
| Mean AUC | 0.76 | 0.80 | 0.78 | 0.81 |
| Mean standard deviation | 0.073 | 0.069 | 0.069 | 0.064 |
In the 12 groups of 51 cases, each group contained different cases for each observer, were evaluated 2 times, shown as original evaluation and repeated evaluation.
Statistical significance in the difference:
AUC:
AUC(orig, with) versus AUC(orig, without): P = .001.
AUC(repeat, with) versus AUC(repeat, without): P = .010.
AUC(orig, without) versus AUC(repeat, without): P = .083.
AUC(orig, with) versus AUC(repeat, with): P = .222.
Standard deviation of AUC (SDAUC):
SDAUC(orig, with) versus SDAUC(orig, without): P < .0002.
SDAUC(repeat, with) versus SDAUC(repeat, without): P < .004.
SDAUC(orig, without) versus SDAUC(repeat, without): P = .112.
SDAUC(orig, with) versus SDAUC(repeat, with): P = .066.
Figure 3.AUC values for the 12 observers with and without CDSS-T and the corresponding CDSS-T alone of the first 51 cases in each observer's individually randomized reading list for the original evaluation. The performance of all but 1 (physician 1) of the physicians increased using CDSS-T. Two physicians (physicians 3 and 9) with CDSS-T performed better than the CDSS-T alone.
Figure 4.AUC values for the 12 observers with and without CDSS-T and the corresponding CDSS-T alone of the first 51 cases in each observer's individually randomized reading list for the repeated evaluation. The performance of all but one (physician 1) of the physicians increased using CDSS-T. Five physicians (physicians 2, 3, 4, 6, and 9) with CDSST performed better than the CDSS-T alone.
Intraobserver Variability Assessment Based on the Standard Deviation (SD) of the Differences of the Observer's Original Evaluation Likelihood Estimates and the Observer's Corresponding Repeated Evaluation Likelihood Estimates
| Observer | SD of the differences in the observers' likelihood estimates between the original and repeated evaluation | |
|---|---|---|
| Without CDSS-T | With CDSS-T | |
| Physician 1 | 26.34 | 20.70 |
| Physician 2 | 15.35 | 13.89 |
| Physician 3 | 23.75 | 20.95 |
| Physician 4 | 18.87 | 14.46 |
| Physician 5 | 33.59 | 28.24 |
| Physician 6 | 32.28 | 22.92 |
| Physician 7 | 30.46 | 21.57 |
| Physician 8 | 15.99 | 13.40 |
| Physician 9 | 25.42 | 18.99 |
| Physician 10 | 24.01 | 17.95 |
| Physician 11 | 27.84 | 24.09 |
| Physician 12 | 44.42 | 41.92 |
| Mean | 26.53 | 21.59 |
The intraobserver variability assessments were performed for the observer's evaluations without CDSS-T and with CDSS-T.
Mean SD with CDSS-T (21.59) was significantly smaller than the mean SD without CDSS-T (26.53), (P < .0001).