| Literature DB >> 35316363 |
Alexandre Ben Cheikh1,2, Guillaume Gorincour3,4, Hubert Nivet1,5,6, Julien May1, Mylene Seux1, Paul Calame7,8, Vivien Thomson1,2, Eric Delabrousse7,8, Amandine Crombé1,9,10.
Abstract
OBJECTIVES: To evaluate and compare the diagnostic performances of a commercialized artificial intelligence (AI) algorithm for diagnosing pulmonary embolism (PE) on CT pulmonary angiogram (CTPA) with those of emergency radiologists in routine clinical practice.Entities:
Keywords: Artificial intelligence; Computed tomography angiography; Predictive value of tests; Pulmonary embolism; Sensitivity and specificity
Mesh:
Year: 2022 PMID: 35316363 PMCID: PMC8938594 DOI: 10.1007/s00330-022-08645-2
Source DB: PubMed Journal: Eur Radiol ISSN: 0938-7994 Impact factor: 7.034
Fig. 1Example of an output provided by the artificial intelligence (AI) algorithm (AIDOC Medical) to detect pulmonary embolism (PE). A 64-year-old man presented with spontaneous unilateral pain in the lower limb, increased with palpation, and unilateral edema. The revised simplified Geneva score was 3 and the D-dimer dosage was positive. A Contrast-enhanced CT pulmonary angiogram (CTPA) demonstrated a PE in the left lower limb (blue arrow). B On the same cross-section, the AI algorithm highlighted the same location of the suspected PE through a color-encoded map.
Fig. 2Flow chart of the study. Abbreviations: AI, artificial intelligence; CTPA, CT pulmonary angiogram
Characteristics of cohort-2019
| Characteristics | Patients |
|---|---|
| Age (years) | |
| Mean (sd) | 65.4 ± 18.9 |
| Median (range) | 68.3 (18.1–101.9) |
| Sex | |
| Women | 689/1202 (57.3%) |
| Men | 513/1202 (42.7%) |
| Pulmonary embolism (: gold standard) | |
| Present | 190/1202 (15.8%) |
| Absent | 1012/1202 (84.2%) |
| Protocols | |
| CTPA | 1100/1202 (91.5%) |
| CTPA + Abdomen-pelvic | 78/1202 (6.5%) |
| Brain + CTPA | 13/1202 (1.1%) |
| CTPA in pregnant women | 8/1202 (0.7%) |
| Brain + CTPA + abdomen-pelvic | 3/1202 (0.2%) |
| Respiratory artefacts limiting interpretation | |
| Yes | 360/1202 (30%) |
| No | 842/1202 (30%) |
| Quality of the injection | |
| Good | 748/1202 (62.2%) |
| Average | 259/1202 (21.5%) |
| Poor | 67/1202 (5.6%) |
| No mention | 128/1202 (10.6%) |
Note. Results are number of patients with percentage in parentheses, except for age also given as mean ± standard deviation
Abbreviations: CTPA CT pulmonary angiogram, sd standard deviation
Summary of the subcohorts extracted from cohort-2019
| Name | Properties | No. of patients | Positivity rate for PE |
|---|---|---|---|
| Cohort-2019 | - | 1202/1202 (100%) | 190/1202 (15.8%) |
| Subcohort-1 | CTPAs with average injection quality | 259/1202 (21.5%) | 38/259 (14.7%) |
| Subcohort-2 | CTPAs with poor injection quality | 67/1202 (5.6%) | 5/67 (7.5%) |
| Subcohort-3 | CTPAs with average-to-poor injection quality | 326/1202 (27.1%) | 43/326 (13.2%) |
| Subcohort-4 | CTPAs with respiratory artefacts limiting the interpretation | 360/1202 (30%) | 44/360 (12.2%) |
| Subcohort-5 | CTPAs with respiratory artefacts limiting the interpretation AND average-to-poor injection quality | 168/1202 (14%) | 18/168 (10.7%) |
Note. Abbreviations: CTPA CT pulmonary angiogram, no. number; PE pulmonary embolism
Performance statistics of artificial intelligence (AI) and radiologists in cohort-2019 and its subcohorts depending on factors limiting the radiological interpretation
| Cohorts | Sensitivity (95%CI) | Specificity (95%CI) | PPV (95%CI) | NPV (95%CI) | Accuracy (95%CI) |
|---|---|---|---|---|---|
| Cohort-2019 | |||||
| Radiologists | 0.9 (0.848–0.939) | 0.981 (0.972–0.988) | |||
| AI | 0.958 (0.943–0.969) | 0.804 (0.753–0.846) | 0.953 (0.939–0.964) | ||
| Subcohort-1: Average injection quality | |||||
| Radiologists | 0.895 (0.752–0.971) | 0.98 (0.952–0.992) | |||
| AI | 0.932 (0.891–0.962) | 0.724 (0.615–0.811) | 0.934 (0.897–0.961) | ||
| Subcohort-2: Poor injection quality | |||||
| Radiologists | 0.6 (0.147–0.947) | 0.929 (0.817–0.975) | |||
| AI | 0.952 (0.865–0.99) | 0.697 (0.44–0.831) | |||
| Subcohort-3: Average-to-poor injection quality | |||||
| Radiologists | 0.86 (0.721–0.947) | 0.974 (0.947–0.988) | |||
| AI | 0.936 (0.901–0.962) | 0.738 (0.642–0.816) | 0.939 (0.907–0.962) | ||
| Subcohort-4: Limiting respiratory artifacts | |||||
| Radiologists | 0.909 (0.783–0.975) | 0.983 (0.958–0.993) | |||
| AI | 0.937 (0.904–0.961) | 0.734 (0.642–0.81) | 0.936 (0.906–0.959) | ||
| Subcohort-5: Limiting respiratory artifacts AND average-to-poor injection quality | |||||
| Radiologists | |||||
| AI | 0.92 (0.864–0.958) | 0.689 (0.56–0.794) | 0.989 (0.929–0.998) | 0.923 (0.871–0.958) | |
Note. Abbreviations: 95%CI 95% confidence interval, AI artificial intelligence, NPV negative predictive value, PPV positive predictive value, ns not significant; *: p < 0.05; **: p < 0.005; ***: p < 0.001. For each metrics and each subcohort; the highest value between AI and radiologists is indicated in boldface
Figure 3Performances of artificial intelligence (AI) and teleradiologists (TR) to diagnose pulmonary embolism on CT pulmonary angiogram (CTPA) in a multicentric emergency cohort. A Patients from the entire cohort-2019. B Patients from the subcohort 2 with poor quality injection. Abbreviations: 95%CI, 95% confidence interval; NPV, negative predictive value; PPV, positive predictive value. *: p < 0.05, **: p < 0.005; ***: p < 0.001
Discordances between radiologists, artificial intelligence (AI), and gold-standard in cohort-2019 and its subcohorts depending on factors limiting the radiological interpretation.
| Cohorts | Discordance between AI and radiologists | Discordance between AI and gold-standard | Discordance between radiologists and gold-standard | No. of TPs captured by AI and not by radiologists | No. of TPs captured by radiologists and not by AI |
|---|---|---|---|---|---|
| Cohort-2019 | 85/1202 (7.1%) | 57/1202 (4.7%) | 28/1202 (2.3%) | 19/190 (10%) | 14/190 (7.4%) |
| Subcohort 1 (average injection quality) | 23/259 (8.9%) | 17/259 (6.6%) | 6/259 (2.3%) | 4/38 (10.5%) | 2/38 (5.3%) |
| Subcohort 2 (poor injection quality) | 6/67 (9%) | 3/67 (4.5%) | 3/67 (4.5%) | 2/5 (40%) | 0/5 (0%) |
| Subcohort 3 (average or poor injection quality) | 29/326 (8.9%) | 20/326 (6.1%) | 9/326 (2.8%) | 6/43 (14%) | 2/43 (4.7%) |
| Subcohort 4 (respiratory artifacts) | 29/360 (8.1%) | 23/360 (6.4%) | 6/360 (1.7%) | 4/44 (9.1%) | 3/44 (6.8%) |
| Subcohort 5 (respiratory artifacts AND average or poor injection quality) | 15/168 (8.9%) | 13/168 (7.7%) | 2/168 (1.2%) | 1/18 (5.6%) | 1/18 (5.6%) |
Note. Abbreviations: AI artificial intelligence, FP false positive, TP true positive
Review of the true pulmonary embolism (PE) missed by artificial intelligence (AI) and radiologists
| Characteristics | PE missed by AI ( | PE missed by radiologists ( | |
|---|---|---|---|
| Radiologists’ experience | |||
| Junior | 4/14 (28.6%) | 3/19 (15.8%) | 0.6477 |
| Senior | 10/14 (71.4%) | 16/19 (84.2%) | |
| 23 ± 13 | 14.4 ± 7 | ||
| Patients’ sex | |||
| Women | 8/14 (57.1%) | 12/19 (63.2%) | 1 |
| Men | 6/14 (42.9%) | 7/19 (36.8%) | |
| 65.9 ± 18.7 | 75.3 ± 14.2 | 0.1353 | |
| Respiratory artifacts | |||
| Absent | 7/10 (70%) | 13/17 (76.5%) | 1 |
| Present | 3/10 (30%) | 4/17 (23.5%) | |
| 432.3 ± 188 | 440.6 ± 156.1 | 0.8555 | |
| Presence of other confusing abnormal findings | |||
| No | 10/14 (71.4%) | 5/19 (26.3%) | |
| Yes | 4/14 (28.6%) | 14/19 (73.7%) | |
| Details regarding the abnormal findings | |||
| Pleural effusion | 3/14 (21.4%) | 3/19 (15.8%) | 1 |
| Infectious disease | 4/14 (28.6%) | 7/19 (36.8%) | 0.9009 |
| Cardiac decompensation | 0/14 (0%) | 2/19 (10.5%) | 0.607 |
| Neoplasia | 1/14 (7.1%) | 2/19 (10.5%) | 1 |
| Atelectasia | 2/14 (14.3%) | 4/19 (21.1%) | 0.9669 |
| Emphysema | 0/14 (0%) | 3/19 (15.8%) | 0.3438 |
| Major dorsal kyphosis | 1/14 (7.1%) | 2/19 (10.5%) | 1 |
| Other | 1/14 (7.1%) | 1/19 (5.3%) | 1 |
| PE laterality | |||
| Right | 6/14 (42.9%) | 9/19 (47.4%) | 0.3701 |
| Left | 3/14 (21.4%) | 7/19 (36.8%) | |
| Bilateral | 5/14 (35.7%) | 3/19 (15.8%) | |
| 1.6 ± 1.1 | 1.2 ± 0.5 | 0.2144 | |
| Number of clot | |||
| Multiple | 7/14 (50%) | 5/19 (26.3%) | 0.3022 |
| Single | 7/14 (50%) | 14/19 (73.7%) | |
| Location of the most proximal clot | |||
| Proximal | 3/14 (21.4%) | 0/19 (0%) | 0.1802 |
| Lobar | 2/14 (14.3%) | 3/19 (15.8%) | |
| Segmental | 7/14 (50%) | 14/19 (73.7%) | |
| Sub-segmental | 2/14 (14.3%) | 2/19 (10.5%) | |
| Positioning of the clot in the vessel | |||
| Central | 13/14 (92.9%) | 15/19 (78.9%) | 0.5417 |
| Marginal | 1/14 (7.1%) | 4/19 (21.1%) | |
| Occlusive clot | |||
| No | 5/14 (35.7%) | 12/19 (63.2%) | 0.2276 |
| Yes | 9/14 (64.3%) | 7/19 (36.8%) | |
| 21.4 +/- 14.1 | 10.7 +/- 6.9 | ||
| Dilated pulmonary artery | |||
| No | 8/14 (57.1%) | 15/19 (78.9%) | 0.3351 |
| Yes | 6/14 (42.9%) | 4/19 (21.1%) | |
Note. Data are number of patients with percentage in parentheses except for (1), where data are mean ± standard deviation. The tests are chi-square or Fisher test for categorical characteristics and unpaired Wilcoxon test for numerical characteristics
Other abbreviations: HU, Hounsfield unit
*: p < 0.05
Figure 4Clinical examples. A 71-year-old patient with a medical history of cancer and recent surgery presented with heart rate > 95 beats per minute and a borderline saturation and underwent a contrast-enhanced CT pulmonary angiogram (CTPA) (A), which showed a segmental, sub-acute, pulmonary embolism (PE) in the right low limb, which was missed by the emergency radiologist during his on-call duty (red arrow). B On the same cross-section, the PE was correctly identified by the artificial intelligence (AI) algorithm (AIDOC Medical). Example of pulmonary embolism (PE) correctly diagnosed by the emergency radiologist and not by the artificial intelligence (AI) algorithm. Opposite example: An 85-year-old patient with a medical history of PE and a recent surgery presented with a heart rate between 75 and 94 beats per minute and acute dyspnea and underwent CTPA (C). Two segmental PEs were correctly diagnosed by the emergency radiologist but missed by the AI algorithm (white arrows)
Figure 5Use of artificial intelligence (AI) by radiologists for emergency clinical routine at Imadis. Qualitative assessment: results of the satisfaction survey sent 9 months after implementing AI in clinical workflow (A, B). Quantitative assessment (C): comparison of interpretation duration for a single CT pulmonary angiogram in 2018 (without AI) and 2020 (with AI) (lines inside the violin plots correspond to 1st quartile, median, and 3rd quartile). ***: p < 0.001