| Literature DB >> 34943442 |
Dana Li1,2, Lea Marie Pehrson1, Carsten Ammitzbøl Lauridsen1,3, Lea Tøttrup4, Marco Fraccaro4, Desmond Elliott5, Hubert Dariusz Zając5, Sune Darkner5, Jonathan Frederik Carlsen1, Michael Bachmann Nielsen1,2.
Abstract
Our systematic review investigated the additional effect of artificial intelligence-based devices on human observers when diagnosing and/or detecting thoracic pathologies using different diagnostic imaging modalities, such as chest X-ray and CT. Peer-reviewed, original research articles from EMBASE, PubMed, Cochrane library, SCOPUS, and Web of Science were retrieved. Included articles were published within the last 20 years and used a device based on artificial intelligence (AI) technology to detect or diagnose pulmonary findings. The AI-based device had to be used in an observer test where the performance of human observers with and without addition of the device was measured as sensitivity, specificity, accuracy, AUC, or time spent on image reading. A total of 38 studies were included for final assessment. The quality assessment tool for diagnostic accuracy studies (QUADAS-2) was used for bias assessment. The average sensitivity increased from 67.8% to 74.6%; specificity from 82.2% to 85.4%; accuracy from 75.4% to 81.7%; and Area Under the ROC Curve (AUC) from 0.75 to 0.80. Generally, a faster reading time was reported when radiologists were aided by AI-based devices. Our systematic review showed that performance generally improved for the physicians when assisted by AI-based devices compared to unaided interpretation.Entities:
Keywords: CT; artificial intelligence; chest X-ray; computer-based devices; deep learning; observer tests; performance; radiology; thoracic diagnostic imaging
Year: 2021 PMID: 34943442 PMCID: PMC8700414 DOI: 10.3390/diagnostics11122206
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1The QUADAS-2 tool for evaluating risk of bias and assess quality of research.
Figure 2Preferred reporting items for systematic reviews and meta-analyses (PRISMA) flowchart of the literature search and study selection.
(a) Included studies with artificial intelligence-based devices as concurrent readers in the observer test. (b) Included studies with artificial intelligence-based devices in an observer test with a sequential test design.
|
|
|
|
|
|
|
|
|
|
| |||||||
| Bai et al. [ | 2021 | RT-PCR | EfficientNet-B3 Convolutional Neural Network | COVID-19 pneumonia | 119 | 6 radiologists (10–20 years of chest CT experience) | CT |
| Beyer et al. [ | 2007 | Radiologist identified and consensus vote | Commercially available | Pulmonary nodules | 50 | 4 radiologists (2–11 years experience) | CT |
| de Hoop et al. [ | 2010 | Histologically confirmed | Commercially available (OnGuard 5.0; Riverain Medical, Miamisburg, OH, USA) | Pulmonary nodules | 111 | 1 general radiologist, 1 chest radiologist, and 4 residents | Chest X-ray |
| Dorr et al. [ | 2020 | RT-PCR | DenseNet 121 architecture | COVID-19 pneumonia | 60 | 23 radiologists and 31 emergency care physicians | Chest X-ray |
| Kim et al. [ | 2020 | Bacterial culture and RT-PCR for viruses | Commercially available (Lunit INSIGHT for chest radiography, version 4.7.2; Lunit, Seoul, South Korea) | Pneumonia | 387 | 3 emergency department physicians (6–7 years experience) | Chest X-ray |
| Koo et al. [ | 2020 | Pathologically confirmed | Commercially available (Lunit Insight CXR, ver. 1.00; Lunit, Seoul, South Korea) | Pulmonary nodules | 434 | 2 thoracic radiologists and 2 residents | Chest X-ray |
| Kozuka et al. [ | 2020 | Radiologist identified and majority vote | Faster Region-Convolutional Neural Network | Pulmonary nodules | 120 | 2 radiologists (1–4 years experience) | CT |
| Lee et al. [ | 2012 | Pathologically confirmed | Commercially available (IQQA-Chest, EDDA Technology, Princeton Junction, NJ, USA) | Pulmonary nodules malignant/benign | 200 | 5 chest radiologists and 5 residents | Chest X-ray |
| Li et al. [ | 2011 | CT | Commercially available (SoftView, version 2.0; Riverrain Medical, Miamisburg, OH, USA-Image normalization, feature extraction and regression networks) | Pulmonary nodules | 151 | 3 radiologists (10–25 years experience) | Chest X-ray |
| Li et al. [ | 2011 | Pathologically confirmed and radiology assessed | Commercially available (SoftView, version 2.0; Riverain Medical) | Pulmonary nodules | 80 | 2 chest radiologists, 4 general radiologists, and 4 residents | Chest X-ray |
| Liu et al. [ | 2020 | - | Segmentation model with class attention map including a residual convolutional block | COVID-19 pneumonia | 643 | - | Chest X-ray |
| Liu et al. [ | 2019 | Radiologist identified and majority vote | DenseNet and Faster Region-Convolutional Neural Network | Pulmonary nodule | 271 | 2 radiologists (10 years experience) | CT |
| Martini et al. [ | 2021 | Radiologist consensus | Commercially available (ClearRead-CT, Riverrain Technologies, Miamisburg, OH, USA) | Pulmonary consolidations/nodules | 100 | 2 senior radiologists, 2 final-year residents, and 2 inexperienced residents | MDCT |
| Nam et al. [ | 2021 | RT-PCR and CT | Deep learning-based algorithm (Deep convolutional neural network) | Pneumonia, pulmonary edema, active tuberculosis, interstitial lung disease, nodule/mass, pleural effusion, acute aortic syndrome, pneumoperitoneum, rib fracture, pneumothorax, mediastinal mass. | 202 | 2 thoracic radiologists, 2 board-certified radiologists, and 2 residents | Chest X-ray |
| Rajpurkar et al. [ | 2020 | Positive culture or Xpert MTB/RIF test | Convolutional Neural Network | Tuberculosis | 114 | 13 physicians (6 months–25 years of experience) | Chest X-ray |
| Singh et al. [ | 2021 | Radiologically reviewed | Commercially available (ClearRead CT Vessel Suppression and Detect, Riverain Technologies TM) | Subsolid nodules (Incl ground-glass and/or part-solid) | 123 | 2 radiologists (5–10 years experience) | CT |
| Sung et al. [ | 2021 | CT and clinical information | Commercially available (Med-Chest X-ray system (version 1.0.0, VUNO, Seoul, South Korea) | Nodules, consolidation, interstitial opacity, pleural effusion, pneumothorax | 128 | 2 thoracic radiologists, 2 board-certified radiologists, 1 radiology resident, and 1 non-radiology resident | Chest X-ray |
| Yang et al. [ | 2021 | RT-PCR | Deep Neural Network | COVID-19 pneumonia | 60 | 3 radiologists (5–20 years experience) | CT |
| Zhang et al. [ | 2021 | RT-PCR | Deep Neural Network using the blur processing method to improve the image enhancement algorithm | COVID-19 pneumonia | 15 | 2 physicians (13–15 years experience) | CT |
|
|
|
|
|
|
|
|
|
|
| |||||||
| Abe et al. [ | 2004 | Radiological review and clinical correlation | Single three-layer, feed-forward Artificial Neural Network with a back-propagation algorithm | Sarcoidosis, miliary tuberculosis, lymphangitic carcinomatosis, interstitial pulmonary edema, silicosis, scleroderma, P. Carinii pneumonia, Langerhals cell histiocytosis, idiopathic pulmonary fibrosis, viral pneumonia, pulmonary drug toxicity | 30 | 5 radiologists (6–18 years experience) | Chest X-ray |
| Abe et al. [ | 2003 | Radiology consensus | Fourier transformation and Artificial Neural Network | Detection of interstitial lung disease | 20 | 8 chest radiologists, 13 other radiologists, and 7 residents | Chest X-ray |
| Clinical correlation and bacteriological | Artificial Neural Network | Differential diagnosis of 11 types of interstitial lung disease | 28 | 16 chest radiologists, 25 other radiologists, and 12 residents | Chest X-ray | ||
| Pathology | Artificial Neural Network | Distinction between malignant and benign pulmonary nodules | 40 | 7 chest radiologists, 14 other radiologists, and 7 residents | Chest X-ray | ||
| Awai et al. [ | 2004 | Radiological review | Artificial Neural Network | Pulmonary nodules | 50 | 5 board-certified radiologists and 5 residents | CT |
| Awai et al. [ | 2006 | Histology | Neural Network | Pulmonary nodules malignant/benign | 33 | 10 board-certified radiologists and 9 radiology residents | CT |
| Beyer et al. [ | 2007 | Radiologist identified and consensus vote | Commercially available (LungCAD prototype version, Siemens Corporate Research, Malvern, PA, USA) | Pulmonary nodules | 50 | 4 radiologists (2–11 years experience) | CT |
| Bogoni et al. [ | 2012 | Majority of agreement | Commercially available (Lung CAD VC20A, Siemens Healthcare, Malvern, PA, USA) | Pulmonary nodules | 43 | 5 fellowship-trained chest radiologists (1–10 years experience) | CT |
| Chae et al. [ | 2020 | Pathologically confirmed and radiologically reviewed | CT-lungNET (Deep Convolutional Neural Network) | Pulmonary nodules | 60 | 2 medical students, 2 residents, 2 non-radiology physicians, and 2 thoracic radiologists | CT |
| Chen et al. [ | 2007 | Surgery or biopsy | Deep Neural Network | Pulmonary nodules malignant/benign | 60 | 3 junior radiologists, 3 secondary radiologists, and 3 senior radiologists | CT |
| Fukushima et al. [ | 2004 | Pathological, bacteriological and clinical correlation | Single three-layer, feed-forward Artificial Neural Network with a back-propagation algorithm | Sarcoidose, diffuse panbronchioloitis, nonspecific interstitial pneumonia, lymphangitic carcinomatosis, usual interstitial pneumonia, silicosis, BOOP or chronic eopsinophilic pneumonia, pulmonary alveolar proteinosis, miliary tuberculosis, lymphangiomyomatosis, P, carinii pneumonia or cytomegalovirus pneumonia | 130 | 4 chest radiologists and 4 general radiologists | High Resolution CT |
| Hwang et al. [ | 2019 | Pathology, clinical or radiological | Deep Convolutional Neural Network with dense blocks | 4 different target diseases (pulmonary malignant neoplasms, tuberculosis, pneumonia, pneumothorax) classified in to binary classification of normal/abnormal | 200 | 5 thoracic radiologists, board-certified radiologists, and 5 non-radiology physicians | Chest X-ray |
| Kakeda et al. [ | 2004 | CT | Commercially available (Trueda, Mitsubishi Space Software, Tokyo, Japan) | Pulmonary nodules | 90 | 4 board-certified radiologists and 4 residents | Chest X-ray |
| Kasai et al. [ | 2008 | CT | Three Artificial Neural Networks | Pulmonary nodules | 41 | 6 chest radiologists and 12 general radiologists | Lateral chest X-ray only |
| Kligerman et al. [ | 2013 | Histology and CT | Commercially available (OnGuard 5.1; Riverain Medical, Miamisburg, OH, USA) | Lung cancer | 81 | 11 board-certified general radiologists (1–24 years experience) | Chest X-ray |
| Liu et al. [ | 2021 | Histology, CT, and biopsy/surgical removal | Convolutional Neural Networks | Pulmonary nodules malignant/benign | 879 | 2 senior chest radiologists, 2 secondary chest radiologists, and 2 junior radiologists | CT |
| Matsuki et al. [ | 2001 | Pathology and radiology | Three-layer, feed-forward Artificial Neural Network with a back-propagation algorithm | Pulmonary nodules | 50 | 4 attending radiologists, 4 radiology fellows, 4 residents | High Resolution CT |
| Nam et al. [ | 2019 | Pathologically confirmed and radiologically reviewed | Deep Convolutional Neural Networks with 25 layers and 8 residual connections | Pulmonary nodules malignant/benign | 181 | 4 thoracic radiologists, 5 board-certified radiologists, 6 residents, and 3 non-radiology physicians | Chest X-ray |
| Oda et al. [ | 2009 | Histology, cytology, and CT | Massive training Artificial Neural Network | Pulmonary nodules | 60 | 7 board-certified radiologists and 5 residents | Chest X-ray |
| Rao et al. [ | 2007 | Consensus and majority vote | LungCAD | Pulmonary nodules | 196 | 17 board-certified radiologists | MDCT |
| Schalekamp et al. [ | 2014 | Radiologically reviewed, pathology and clinical correlation | Commercially available (ClearRead +Detect 5.2; Riverain Technologies and ClearRead Bone Suppression 2.4; Riverain Technologies) | Pulmonary nodules | 300 | 5 radiologists and 3 residents | Chest X-ray |
| Sim et al. [ | 2020 | Biopsy, surgery, CT, and pathology | Commercially available (ALND, version 1.00; Samsung Electronics, Suwon, South Korea) | Cancer nodules | 200 | 5 senior chest radiologists, 4 chest radiologists, and 3 residents | Chest X-ray |
Figure 3Sensitivity and specificity (a) and AUC (b) without and with the aid of an AI-based device.
Sensitivity and specificity (a); accuracy and AUC (b); and time measurement results (c) for observer tests without and with AI-based devices as a concurrent reader.
|
|
|
|
|
| ||
|
|
|
|
| |||
|
| ||||||
| Bai et al. [ | 79 | 88 | 88 | 91 |
| |
| Beyer et al. [ | 56.5 | - | 61.6 | - |
| |
| de Hoop et al. [ | 56 * | - | 56 * | - |
| - |
| Dorr et al. [ | 47 | 79 | 61 | 75 |
| |
| Kim et al. [ | 73.9 | 88.7 | 82.2 | 98.1 |
| |
| Koo et al. [ | 92.4 | 93.1 | 95.1 | 97.2 |
| - |
| Kozuka et al. [ | 68 | 91.7 | 85.1 | 83.3 |
| |
| Lee et al. [ | 84 | - | 88 | - |
| - |
| Rajpurkar et al. [ | 70 | 52 | 73 | 61 |
| - |
| Singh et al. [ | 68 * | 77.5 * | 73 * | 74 * |
| - |
| Sung et al. [ | 80.1 | 89.3 | 88.9 | 96.6 |
| |
| Yang et al. [ | 89.5 | - | 94.2 | - |
| |
|
|
|
|
|
| ||
|
|
|
|
| |||
|
| ||||||
| Bai et al. [ | 85 | - | 90 | - |
| |
| Kim et al. [ | - | 0.871 | - | 0.916 |
| |
| Koo et al. [ | - | 0.93 | - | 0.96 |
| |
| Li et al. [ | - | 0.840 | - | 0.863 |
| |
| Li et al. [ | - | 0.807 | - | 0.867 |
| |
| Liu et al. [ | - | 0.66 * | - | 0.78 * |
| - |
| Nam et al. [ | 66.3 * | - | 82.4 * | - |
| |
| Rajpurkar et al. [ | 60 | - | 65 | - |
| |
| Singh et al. [ | - | 0.73 * | - | 0.74 * |
| Not statistically significant |
| Sung et al. [ | - | 0.93 | - | 0.98 |
| |
| Yang et al. [ | 94.1 | - | 95.1 | - |
| |
|
|
|
|
|
| ||
|
|
| |||||
|
| ||||||
| Beyer et al. [ | 294 s (1) | 337 s (1) |
| |||
| Kim et al. [ | 165 min (2) | 101 min (2) |
| - | ||
| Kozuka et al. [ | 373 min(2) | 331 min (2) |
| - | ||
| Liu et al. [ | 100.5 min (3) | 34 min (3) |
| |||
| Liu et al. [ | 15 min (1) | 5–10 min (1) |
| - | ||
| Martini et al. [ | 194 s (1) | 154 s (1) |
| |||
| Nam et al. [ | 2771.2 s * (1) | 1916 s * (1) |
| |||
| Sung et al. [ | 24 s (1) | 12 s (1) |
| |||
| Zhang et al. [ | 3.623 min (2) | 0.744 min (2) |
| - | ||
a: * our calculated average; ** for sensitivity only; - not applicable; ↑ positive change. b: * our calculated average; - not applicable; ↑ positive change. c: (1) per image/case reading time; (2) total reading time for multiple cases; (3) station survey time; * our calculated average; - not applicable; ↑ positive change; ↓ negative change.
Sensitivity and specificity (a); accuracy and AUC (b); and time measurement results (c) for sequential observer tests without and with AI-based devices as a second reader.
|
|
|
|
|
| ||
|
|
|
|
| |||
|
| ||||||
| Abe et al. [ | 64 | - | 81 | - |
| |
| Beyer et al. [ | 56.5 | - | 52.9 | - |
| |
| Bogoni et al. [ | 45.34 * | - | 59.34 * | - |
| |
| Chae et al. [ | 70 * | 69 * | 65 * | 84 * |
| Not statistically significant |
| Hwang et al. [ | 79 * | 93.2 * | 88.4 * | 94 * |
| |
| Kligerman et al. [ | 44 | - | 50 | - |
| |
| Sim et al. [ | 65.1 | - | 70.3 | - |
| |
|
|
|
|
|
| ||
|
|
|
|
| |||
|
| ||||||
| Abe et al. [ | - | 0.81 | - | 0.87 |
| |
| Abe et al. [ | - | 0.94 | - | 0.98 |
| |
| Abe et al. [ | - | 0.77 | - | 0.81 |
| |
| Awai et al. [ | - | 0.64 | - | 0.67 |
| |
| Awai et al. [ | - | 0.843 | - | 0.924 |
| |
| Chae et al. [ | 69 * | 0.005 * | 75 * | 0.13 * |
| Not statistically significant |
| Chen et al. [ | - | 0.84 * | - | 0.95 * |
| |
| Fukushima et al. [ | - | 0.972 * | - | 0.982 * |
| |
| Hwang et al. [ | - | 0.880 * | - | 0.934 * |
| |
| Kakeda et al. [ | - | 0.924 | - | 0.986 |
| |
| Kasai et al. [ | - | 0.804 | - | 0.816 |
| Not statistically significant |
| Kligerman et al. [ | - | 0.38 | - | 0.43 |
| |
| Liu et al. [ | - | 0.913 | - | 0.938 |
| |
| Matsuki et al. [ | - | 0.831 | - | 0.956 |
| |
| Nam et al. [ | - | 0.85 * | - | 0.89 * |
| |
| Oda et al. [ | - | 0.816 | - | 0.843 |
| |
| Rao et al. [ | 78 | - | 82.8 | - |
| |
| Schalekamp et al. [ | - | 0.812 | - | 0.841 |
| |
|
|
|
|
|
| ||
|
|
| |||||
|
| ||||||
| Beyer et al. [ | 294 s (1) | 274 s (1) |
| |||
| Bogoni et al. [ | 143 s (1) | 225 s (1) |
| - | ||
a:* our calculated average; - not applicable; ↑ positive change; ↓ negative change. b: * our calculated average; - not applicable; ↑ positive change. c: (1) per image/case reading time; - not applicable; ↑ positive change; ↓ negative change.