| Literature DB >> 35202417 |
Eun Young Kim1, Young Jae Kim2, Won-Jun Choi3, Ji Soo Jeon2, Moon Young Kim4,5, Dong Hyun Oh6,7, Kwang Nam Jin4,5, Young Jun Cho6,7.
Abstract
PURPOSE: Lunit INSIGHT CXR (Lunit) is a commercially available deep-learning algorithm-based decision support system for chest radiography (CXR). This retrospective study aimed to evaluate the concordance rate of radiologists and Lunit for thoracic abnormalities in a multicenter health screening cohort. METHODS AND MATERIALS: We retrospectively evaluated the radiology reports and Lunit results for CXR at several health screening centers in August 2020. Lunit was adopted as a clinical decision support system (CDSS) in routine clinical practice. Subsequently, radiologists completed their reports after reviewing the Lunit results. The DLA result was provided as a color map with an abnormality score (%) for thoracic lesions when the score was greater than the predefined cutoff value of 15%. Concordance was achieved when (a) the radiology reports were consistent with the DLA results ("accept"), (b) the radiology reports were partially consistent with the DLA results ("edit") or had additional lesions compared with the DLA results ("add"). There was discordance when the DLA results were rejected in the radiology report. In addition, we compared the reading times before and after Lunit was introduced. Finally, we evaluated systemic usability scale questionnaire for radiologists and physicians who had experienced Lunit.Entities:
Mesh:
Year: 2022 PMID: 35202417 PMCID: PMC8870572 DOI: 10.1371/journal.pone.0264383
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flow chart of the study population.
CXR = chest radiography.
Demographic information.
| After adoption of Lunit (n = 3,113) | Before adoption of Lunit (n = 3,284) | |||
|---|---|---|---|---|
| Sex, men | 1,157 (37.2%) | 1,338 (40.7%) | 0.003 | |
| Age (years) | 49±15 | 52±15 | < 0.001 | |
| High-risk of lung cancer† | 129 (4.1%) | 160 (4.9%) | 0.161 | |
| CXR | normal | 2,770 (89.0%) | 2,998 (91.3%) | 0.017** |
| Inactive | 226 (7.3%) | 186 (5.7%) | ||
| insignificant abnormal | 27 (0.9%) | 19 (0.6%) | ||
| significant abnormal | 90 (2.9%) | 81 (2.5%) | ||
| Further study recommendation | 37 (1.2%) | 49 (1.5%) | 0.292 | |
| Reading time, median (IQR) †† | 19s (36s) | 14s (23s) | < 0.001* | |
Note: Except where indicated, data are the mean (± SD) or number (%). SD = standard deviation. IQR = interquartile range. Comparisons of means and proportions of the two groups for demographic information were performed using Student’s t-test (*Mann-Whitney U test) and chi-squared tests.
†High-risk lung cancer patients: age: 55–74 years and a smoking history of 30 pack-years or more.
††Except for missing/unreliable reading time information (n = 479, 7.5%).
**at multiple testing, normal CXR was less frequent at Lunit group compared to control (89.0% vs. 91.3% adjusted P-value = 0.015).
Concordance according to radiology report and Lunit for chest radiograph (CXR).
| CXR positive (11%) | CXR negative (89%) | |
|---|---|---|
| Lunit positive (20.1%) | Reject (n = 343,11.0%) | |
| Lunit negative (79.9%) | Reject (n = 59, 1.9%) | Accept (n = 2,427, 78.0%) |
*when the lesions described on CXR report and Lunit result were in coincide (“accept”), when the lesions in the radiology reports were in partial agreement with those detected by Lunit (“edit”), when the radiology reports had additional lesions compared with those detected by Lunit (“add”). When the lesion described on CXR report was totally different from the Lunit lesion, the case was designated as “reject”.
Fig 2Sunburst chart for concordance and discordance between Lunit and radiology reports.
Concordance and discordance were based on the agreement of the Lunit results and the radiology report. Concordance was achieved when the radiology reports were consistent with the DLA results (“accept”), the radiology reports were partially consistent with the DLA findings (“edit”) or they had additional lesions compared with the DLA findings (“add”). There was discordance when the DLA results were rejected in the radiology report.
Concordance and discordance of chest radiograph (CXR) report and Lunit result stratified by the CXR lesion category.
| Concordance (n = 2,701) | Discordance (n = 412) | ||
|---|---|---|---|
| CXR_ normal (n = 2,770; 89.0%) | 2,427 (89.9%) | 343 (83.3%) | < .001 |
| CXR_ inactive (n = 266; 7.3%) | 186 (6.9%) | 40 (9.7%) | |
| CXR_ insignificant abnormal (n = 27; 0.9%) | 21 (0.8%) | 6 (1.5%) | |
| CXR_ significant abnormal (n = 90; 2.9%) | 67 (2.5%) | 23 (5.6%) |
*at multiple testing, concordance was more frequent for normal CXR than for significantly abnormal CXR (87.6% vs. 74.4% Bonferroni-adjusted P = 0.003).
Fig 3Concordance rate according to chest radiograph (CXR) lesion categories.
Multiple testing showed that the concordance rate was significantly higher for normal (87.6%) than for significantly abnormal (74.4%, Bonferroni-adjusted P = 0.003) cases.
Reading times stratified by the reading condition (before vs. after the clinical integration of Lunit assist), radiologists, and chest radiograph (CXR) lesion categories.
| For all cases (n = 5,918) | |||
|---|---|---|---|
| Parameters | Estimated | 95% confidence intervals | |
| Intercept | 3.48 | 3.36, 3.60 | < .001 |
| Lunit aided | 0.19 | 0.15, 0.22 | < .001 |
| Lunit unaided | 0.00 | 0.00 | |
| Radiologist 1 | 1.09 | 1.04, 1.14 | < .001 |
| Radiologist 2 | -0.39 | -0.44, -0.34 | < .001 |
| Radiologist 3 | 0.00 | 0.00 | |
| CXR_normal | -1.13 | -1.24, -1.01 | < .001 |
| CXR_inactive | -0.50 | -0.63, -0.36 | < .001 |
| CXR_insignificant abnormal | -0.12 | -0.35, 0.12 | 0.334 |
| CXR_significant abnormal | 0.00 | 0.00 | |
|
| |||
| Intercept | 2.31 | 2.27, 2.36 | < .001 |
| Lunit aided | 0.20 | 0.16, 0.23 | < .001 |
| Lunit unaided | 0.00 | 0.00 | |
| Radiologist 1 | 1.17 | 1.12, 1.22 | < .001 |
| Radiologist 2 | -0.43 | -0.48, -0.37 | < .001 |
| Radiologist 3 | 0.00 | 0.00 | |
|
| |||
| Intercept | 3.91 | 3.80, 4.03 | < .001 |
| Lunit aided | -0.16 | -0.25, -0.07 | < .001 |
| Lunit unaided | 0.00 | 0.00 | |
| Radiologist 1 | 0.18 | 0.08, 0.28 | < .001 |
| Radiologist 2 | 0.21 | 0.08, 0.34 | < .001 |
| Radiologist 3 | 0.00 | 0.00 | |
| CXR_inactive | -0.58 | -0.68, -0.48 | < .001 |
| CXR_insignificant abnormal | -0.23 | -0.40, -0.04 | 0.010 |
| CXR_significant abnormal | 0.00 | 0.00 | |
System usability scale (SUS) for Lunit.
| Group | SUS score |
|---|---|
| All (n = 23) | 77.8 ± 11.9 |
| Radiologists (n = 14) | 75.7 ± 13.8 |
| Radiology residents (n = 3) | 81.7 ± 8.0 |
| Physicians (n = 6) | 80.8 ± 8.5 |
*data are the mean (± standard deviation).