| Literature DB >> 35314631 |
Di Sun1, Lubomir Hadjiiski1, Ajjai Alva2, Yousef Zakharia3, Monika Joshi4, Heang-Ping Chan1, Rohan Garje3, Lauren Pomerantz4, Dean Elhag3, Richard H Cohan1, Elaine M Caoili1, Wesley T Kerr5, Kenny H Cha6, Galina Kirova-Nedyalkova7, Matthew S Davenport1,8, Prasad R Shankar1, Isaac R Francis1, Kimberly Shampain1, Nathaniel Meyer1, Daniel Barkmeier1, Sean Woolen1, Phillip L Palmbos2, Alon Z Weizer8, Ravi K Samala1, Chuan Zhou1, Martha Matuszak9.
Abstract
This observer study investigates the effect of computerized artificial intelligence (AI)-based decision support system (CDSS-T) on physicians' diagnostic accuracy in assessing bladder cancer treatment response. The performance of 17 observers was evaluated when assessing bladder cancer treatment response without and with CDSS-T using pre- and post-chemotherapy CTU scans in 123 patients having 157 pre- and post-treatment cancer pairs. The impact of cancer case difficulty, observers' clinical experience, institution affiliation, specialty, and the assessment times on the observers' diagnostic performance with and without using CDSS-T were analyzed. It was found that the average performance of the 17 observers was significantly improved (p = 0.002) when aided by the CDSS-T. The cancer case difficulty, institution affiliation, specialty, and the assessment times influenced the observers' performance without CDSS-T. The AI-based decision support system has the potential to improve the diagnostic accuracy in assessing bladder cancer treatment response and result in more consistent performance among all physicians.Entities:
Keywords: bladder cancer; computer-aided diagnosis; observer study; treatment response
Mesh:
Year: 2022 PMID: 35314631 PMCID: PMC8938803 DOI: 10.3390/tomography8020054
Source DB: PubMed Journal: Tomography ISSN: 2379-1381
Characteristics of the patients and cancers used in the observer study.
| Characteristics | Notes | Detail | Total Number | |
|---|---|---|---|---|
| Patient gender and age | 100 males | Mean: 63 years | 123 patients | |
| 23 females | Mean: 63 years | |||
| Average maximum diameter (mm) | Completely responding cancers | Pre-treatment: 30.1 | 157 cancer pairs | |
| Incompletely responding cancers | Pre-treatment: 43.0 | |||
| Cancer stage | Pre-treatment | Post-treatment | 157 cancer pairs | |
| T0 | 0 | 40 | ||
| T1 | 8 | 37 | ||
| T2 | 76 | 23 | ||
| T3 | 63 | 38 | ||
| T4 | 10 | 19 | ||
Figure 1The graphical user interface for reading with and without our computerized decision support system (CDSS-T) for bladder cancer treatment response assessment. (a,b) The pre- and post-treatment CTU scans are shown side-by-side. (c) The observer estimates the treatment response. (d) The observer is shown the CAD score and the score distribution of the two classes as reference. (e) The observer may revise their treatment response assessment after considering the CAD score.
Observers of different specialties and from different institutions.
| Specialty | Observer Number | Proficiency | Institution | |||
|---|---|---|---|---|---|---|
| UM | UI | PSU | TH | |||
| Abdominal Radiologist | 5 | Experienced | 4 | - | - | 1 |
| Diagnostic Radiology Resident | 4 | Inexperienced | 4 | - | - | - |
| Urologist | 1 | Experienced | 1 | - | - | - |
| Oncologist | 5 | Experienced | 2 | 2 | 1 | - |
| Medical Student | 1 | Inexperienced | - | - | 1 | - |
| Neurology Fellow | 1 | Inexperienced | 1 | - | - | - |
UM: University of Michigan, UI: University of Iowa, PSU: Pennsylvania State University, TH: Tokuda Hospital, Sofia.
Figure 2Results of the observer performance study. (a) AUCs of the 17 observers with and without CDSS-T. The performance of the CDSS-T system is shown with the dashed line. The performance of all but one observer (#6) increased with using CDSS-T. (b) Average ROC curves with and without CDSS-T.
AUCs of observers with and without CDSS-T. The standard deviation of the AUCs of 17 observers was calculated for both the without CDSS-T and with CDSS-T conditions.
| Observer # | AUC without CDSS-T | AUC with CDSS-T | Individual |
|---|---|---|---|
| 1 | 0.74 | 0.77 | 0.155 |
| 2 | 0.75 | 0.77 | 0.260 |
| 3 | 0.73 | 0.76 | 0.013 * |
| 4 | 0.74 | 0.77 | 0.128 |
| 5 | 0.76 | 0.80 | 0.010 * |
| 6 | 0.74 | 0.74 | 0.861 |
| 7 | 0.76 | 0.77 | 0.541 |
| 8 | 0.73 | 0.75 | 0.135 |
| 9 | 0.78 | 0.81 | 0.191 |
| 10 | 0.74 | 0.76 | 0.244 |
| 11 | 0.67 | 0.73 | 0.014 * |
| 12 | 0.75 | 0.79 | 0.095 |
| 13 | 0.72 | 0.77 | 0.027 * |
| 14 | 0.65 | 0.76 | 0.020 * |
| 15 | 0.67 | 0.78 | 0.003 * |
| 16 | 0.76 | 0.79 | 0.026 * |
| 17 | 0.73 | 0.76 | 0.083 |
| Mean AUC | 0.73 | 0.77 | 0.002 *,$ |
| Standard Deviation | 0.04 | 0.02 | - |
* Statistically significant difference at p < 0.05 level. $ Obtained from MRMC analysis.
Performance comparisons with and without CDSS-T for easy and difficult subsets.
| AUC of CDSS-T | Average AUC without CDSS-T | Average AUC with CDSS-T | # of Physicians | ||
|---|---|---|---|---|---|
| Easy Subset | 0.88 | 0.80 | 0.84 | 0.016 * | 17 physicians |
| Difficult Subset | 0.67 | 0.58 | 0.62 | 0.148 | |
| Easy Subset | 0.88 | 0.83 | 0.85 | 0.033 * | 9 radiologists |
| Difficult Subset | 0.67 | 0.59 | 0.61 | 0.379 | |
| Easy Subset | 0.88 | 0.78 | 0.84 | 0.051 | 5 oncologists |
| Difficult Subset | 0.67 | 0.57 | 0.63 | 0.009 * |
* Statistically significant difference at p < 0.05 level.
Performance comparison between experienced and inexperienced observers for the total of 157 lesion pairs.
| AUC of CDSS-T | Average AUC without CDSS-T | Average AUC with CDSS-T | # of Physicians | ||
|---|---|---|---|---|---|
| Experienced Physicians | 0.80 | 0.73 | 0.77 | 0.007 * | 5 abdominal radiologists, 1 urologist, and 5 oncologists |
| Inexperienced Physicians | 0.73 | 0.77 | 0.019 * | 5 residents and 1 medical student | |
| Experienced Radiologists | 0.75 | 0.77 | 0.060 | 5 abdominal radiologists | |
| Inexperienced Radiologists | 0.74 | 0.77 | 0.007 * | 4 radiology residents | |
| UM Experienced Radiologists | 0.75 | 0.77 | 0.018 * | 4 abdominal radiologists from UM | |
| UM Inexperienced Radiologists | 0.74 | 0.77 | 0.007 * | 4 radiology residents from UM |
* Statistically significant difference at p < 0.05 level.
Performance comparison between observers from different specialties for the total of 157 lesion pairs.
| AUC of CDSS-T | Average AUC without CDSS-T | Average AUC with CDSS-T | # of Physicians | ||
|---|---|---|---|---|---|
| Radiologists | 0.80 | 0.75 | 0.77 | 0.014 * | 9 |
| Urologist | 0.74 | 0.76 | 0.244 | 1 | |
| Oncologists | 0.71 | 0.77 | 0.011 * | 5 | |
| Medical Student | 0.65 | 0.76 | 0.020 * | 1 | |
| Neurology Fellow | 0.73 | 0.76 | 0.083 | 1 |
* Statistically significant difference at p < 0.05 level.
Figure 3ROC curves comparison for observers of different specialties.
Performance comparison between observers from different institutions for the total of 157 lesion pairs.
| AUC of CDSS-T | Average AUC without CDSS-T | Average AUC with CDSS-T | # of Physicians | ||
|---|---|---|---|---|---|
| UM Physicians | 0.8 | 0.74 | 0.77 | 0.002 * | 12 |
| TH Physician | 0.74 | 0.74 | 0.861 | 1 | |
| PSU Physicians | 0.69 | 0.76 | 0.117 | 2 | |
| UI Physicians | 0.72 | 0.78 | 0.326 | 2 | |
| UM Oncologists | 0.71 | 0.76 | 0.071 | 2 | |
| PSU Oncologist | 0.72 | 0.77 | 0.027 * | 1 | |
| UI Oncologists | 0.72 | 0.78 | 0.326 | 2 |
* Statistically significant difference at p < 0.05 level.
Figure 4ROC curves comparison for observers from different institutions. (a) Performance of observers from four institutions. (b) Performance of oncologists from three institutions.
Diagnostic performance in terms of AUC of physicians without and with the CDSS-T aid for the assessment of complete response to neoadjuvant chemotherapy on the first 51 cases in each observer’s individually randomized reading list. The iMRMC package provided standard deviation value along with each AUC. The mean and standard deviations of AUC values for the 17 observers with and without CDSS-T for both the original and the repeated evaluations were calculated.
| Observer # | AUC Original Evaluation | AUC Repeated Evaluation | ||
|---|---|---|---|---|
| Without CDSS-T | With CDSS-T | Without CDSS-T | With CDSS-T | |
| 1 | 0.75 ± 0.08 | 0.76 ± 0.08 | 0.8 ± 0.07 | 0.79 ± 0.07 |
| 2 | 0.88 ± 0.05 | 0.91 ± 0.04 | 0.88 ± 0.05 | 0.92 ± 0.03 |
| 3 | 0.65 ± 0.10 | 0.72 ± 0.10 | 0.67 ± 0.10 | 0.72 ± 0.09 |
| 4 | 0.71 ± 0.09 | 0.71 ± 0.09 | 0.69 ± 0.09 | 0.71 ± 0.08 |
| 5 | 0.70 ± 0.07 | 0.78 ± 0.06 | 0.82 ± 0.06 | 0.83 ± 0.05 |
| 6 | 0.82 ± 0.07 | 0.85 ± 0.07 | 0.81 ± 0.07 | 0.81 ± 0.07 |
| 7 | 0.75 ± 0.08 | 0.77 ± 0.08 | 0.84 ± 0.05 | 0.87 ± 0.05 |
| 8 | 0.74 ± 0.09 | 0.77 ± 0.08 | 0.81 ± 0.08 | 0.8 ± 0.08 |
| 9 | 0.81 ± 0.06 | 0.85 ± 0.05 | 0.8 ± 0.06 | 0.85 ± 0.05 |
| 10 | 0.79 ± 0.08 | 0.84 ± 0.07 | 0.8 ± 0.07 | 0.87 ± 0.07 |
| 11 | 0.65 ± 0.08 | 0.75 ± 0.08 | 0.73 ± 0.07 | 0.78 ± 0.07 |
| 12 | 0.81 ± 0.07 | 0.85 ± 0.07 | 0.75 ± 0.08 | 0.76 ± 0.07 |
| 13 | 0.81 ± 0.06 | 0.89 ± 0.04 | 0.77 ± 0.07 | 0.83 ± 0.06 |
| 14 | 0.59 ± 0.10 | 0.82 ± 0.07 | 0.69 ± 0.10 | 0.81 ± 0.07 |
| 15 | 0.73 ± 0.07 | 0.88 ± 0.06 | 0.64 ± 0.10 | 0.83 ± 0.07 |
| 16 | 0.86 ± 0.05 | 0.93 ± 0.03 | 0.87 ± 0.05 | 0.87 ± 0.05 |
| 17 | 0.63 ± 0.08 | 0.69 ± 0.08 | 0.68 ± 0.10 | 0.76 ± 0.09 |
| Mean AUC | 0.75 | 0.81 | 0.77 | 0.81 |
| Standard Deviation | 0.08 | 0.07 | 0.07 | 0.06 |
|
| ||||
| AUC (orig.without) versus AUC (orig.with): | ||||
| AUC (repeat.without) versus AUC (repeat.with): | ||||
| AUC (orig.without) versus AUC (repeat.without): | ||||
| AUC (orig.with) versus AUC (repeat.with): | ||||
* Statistically significant difference at p < 0.05 level.