Literature DB >> 32694453

Impact of Confounding Thoracic Tubes and Pleural Dehiscence Extent on Artificial Intelligence Pneumothorax Detection in Chest Radiographs.

Johannes Rueckel1, Lena Trappmann1, Balthasar Schachtner, Philipp Wesp1, Boj Friedrich Hoppe1, Nicola Fink1, Jens Ricke1, Julien Dinkel, Michael Ingrisch1, Bastian Oliver Sabel1.   

Abstract

OBJECTIVES: We hypothesized that published performances of algorithms for artificial intelligence (AI) pneumothorax (PTX) detection in chest radiographs (CXRs) do not sufficiently consider the influence of PTX size and confounding effects caused by thoracic tubes (TTs). Therefore, we established a radiologically annotated benchmarking cohort (n = 6446) allowing for a detailed subgroup analysis.
MATERIALS AND METHODS: We retrospectively identified 6434 supine CXRs, among them 1652 PTX-positive cases and 4782 PTX-negative cases. Supine CXRs were radiologically annotated for PTX size, PTX location, and inserted TTs. The diagnostic performances of 2 AI algorithms ("AI_CheXNet" [Rajpurkar et al], "AI_1.5" [Guendel et al]), both trained on publicly available datasets with labels obtained from automatic report interpretation, were quantified. The algorithms' discriminative power for PTX detection was quantified by the area under the receiver operating characteristics (AUROC), and significance analysis was based on the corresponding 95% confidence interval. A detailed subgroup analysis was performed to quantify the influence of PTX size and the confounding effects caused by inserted TTs.
RESULTS: Algorithm performance was quantified as follows: overall performance with AUROCs of 0.704 (AI_1.5) / 0.765 (AI_CheXNet) for unilateral PTXs, AUROCs of 0.666 (AI_1.5) / 0.722 (AI_CheXNet) for unilateral PTXs smaller than 1 cm, and AUROCs of 0.735 (AI_1.5) / 0.818 (AI_CheXNet) for unilateral PTXs larger than 2 cm. Subgroup analysis identified TTs to be strong confounders that significantly influence algorithm performance: Discriminative power is completely eliminated by analyzing PTX-positive cases without TTs referenced to control PTX-negative cases with inserted TTs. Contrarily, AUROCs increased up to 0.875 (AI_CheXNet) for large PTX-positive cases with inserted TTs referenced to control cases without TTs.
CONCLUSIONS: Our detailed subgroup analysis demonstrated that the performance of established AI algorithms for PTX detection trained on public datasets strongly depends on PTX size and is significantly biased by confounding image features, such as inserted TTS. Our established, clinically relevant and radiologically annotated benchmarking cohort might be of great benefit for ongoing algorithm development.

Entities:  

Mesh:

Year:  2020        PMID: 32694453     DOI: 10.1097/RLI.0000000000000707

Source DB:  PubMed          Journal:  Invest Radiol        ISSN: 0020-9996            Impact factor:   6.016


  4 in total

1.  Hitting the Mark: Reducing Bias in AI Systems.

Authors:  Charles E Kahn
Journal:  Radiol Artif Intell       Date:  2022-08-24

2.  Assessment of the effect of a comprehensive chest radiograph deep learning model on radiologist reports and patient outcomes: a real-world observational study.

Authors:  Catherine M Jones; Luke Danaher; Michael R Milne; Cyril Tang; Jarrel Seah; Luke Oakden-Rayner; Andrew Johnson; Quinlan D Buchlak; Nazanin Esmaili
Journal:  BMJ Open       Date:  2021-12-20       Impact factor: 2.692

3.  Pneumothorax detection in chest radiographs: optimizing artificial intelligence system for accuracy and confounding bias reduction using in-image annotations in algorithm training.

Authors:  Johannes Rueckel; Christian Huemmer; Andreas Fieselmann; Florin-Cristian Ghesu; Awais Mansoor; Balthasar Schachtner; Philipp Wesp; Lena Trappmann; Basel Munawwar; Jens Ricke; Michael Ingrisch; Bastian O Sabel
Journal:  Eur Radiol       Date:  2021-03-27       Impact factor: 5.315

4.  Clinically focused multi-cohort benchmarking as a tool for external validation of artificial intelligence algorithm performance in basic chest radiography analysis.

Authors:  Jan Rudolph; Balthasar Schachtner; Nicola Fink; Vanessa Koliogiannis; Vincent Schwarze; Sophia Goller; Lena Trappmann; Boj F Hoppe; Nabeel Mansour; Maximilian Fischer; Najib Ben Khaled; Maximilian Jörgens; Julien Dinkel; Wolfgang G Kunz; Jens Ricke; Michael Ingrisch; Bastian O Sabel; Johannes Rueckel
Journal:  Sci Rep       Date:  2022-07-27       Impact factor: 4.996

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.