Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Clinical Explainability Failure (CEF) & Explainability Failure Ratio (EFR) - Changing the Way We Validate Classification Algorithms.

Literature DB >> 35249179

Clinical Explainability Failure (CEF) & Explainability Failure Ratio (EFR) - Changing the Way We Validate Classification Algorithms.

Vasantha Kumar Venugopal¹, Rohit Takhar², Salil Gupta², Vidur Mahajan².

Abstract

Adoption of Artificial Intelligence (AI) algorithms into the clinical realm will depend on their inherent trustworthiness, which is built not only by robust validation studies but is also deeply linked to the explainability and interpretability of the algorithms. Most validation studies for medical imaging AI report the performance of algorithms on study-level labels and lay little emphasis on measuring the accuracy of explanations generated by these algorithms in the form of heat maps or bounding boxes, especially in true positive cases. We propose a new metric - Explainability Failure Ratio (EFR) - derived from Clinical Explainability Failure (CEF) to address this gap in AI evaluation. We define an Explainability Failure as a case where the classification generated by an AI algorithm matches with study-level ground truth but the explanation output generated by the algorithm is inadequate to explain the algorithm's output. We measured EFR for two algorithms that automatically detect consolidation on chest X-rays to determine the applicability of the metric and observed a lower EFR for the model that had lower sensitivity for identifying consolidation on chest X-rays, implying that the trustworthiness of a model should be determined not only by routine statistical metrics but also by novel 'clinically-oriented' models.

Entities: Chemical

Keywords: AI (Artificial Intelligence); Deep learning; Explainable AI; Medical imaging

Mesh：

Year: 2022 PMID： 35249179 DOI： 10.1007/s10916-022-01806-2

Source DB: PubMed Journal: J Med Syst ISSN： 0148-5598 Impact factor: 4.460

Keyword Cloud
References

9 in total

1. Tackling the Radiological Society of North America Pneumonia Detection Challenge.

Authors: Ian Pan; Alexandre Cadrin-Chênevert; Phillip M Cheng
Journal: AJR Am J Roentgenol Date: 2019-05-23 Impact factor: 3.959

2. The Algorithmic Audit: Working with Vendors to Validate Radiology-AI Algorithms-How We Do It.

Authors: Vidur Mahajan; Vasantha Kumar Venugopal; Murali Murugavel; Harsh Mahajan
Journal: Acad Radiol Date: 2020-01 Impact factor: 3.173

3. Unboxing AI - Radiological Insights Into a Deep Neural Network for Lung Nodule Characterization.

Authors: Vasantha Kumar Venugopal; Kiran Vaidhya; Murali Murugavel; Abhijith Chunduru; Vidur Mahajan; Suthirth Vaidya; Digvijay Mahra; Akshay Rangasai; Harsh Mahajan
Journal: Acad Radiol Date: 2019-10-14 Impact factor: 3.173

4. A Deep Learning Mammography-based Model for Improved Breast Cancer Risk Prediction.

Authors: Adam Yala; Constance Lehman; Tal Schuster; Tally Portnoi; Regina Barzilay
Journal: Radiology Date: 2019-05-07 Impact factor: 11.105

5. A Deep Learning Model to Predict a Diagnosis of Alzheimer Disease by Using ¹⁸F-FDG PET of the Brain.

Authors: Yiming Ding; Jae Ho Sohn; Michael G Kawczynski; Hari Trivedi; Roy Harnish; Nathaniel W Jenkins; Dmytro Lituiev; Timothy P Copeland; Mariam S Aboian; Carina Mari Aparici; Spencer C Behr; Robert R Flavell; Shih-Ying Huang; Kelly A Zalocusky; Lorenzo Nardo; Youngho Seo; Randall A Hawkins; Miguel Hernandez Pampaloni; Dexter Hadley; Benjamin L Franc
Journal: Radiology Date: 2018-11-06 Impact factor: 29.146

6. Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

Authors: Pranav Rajpurkar; Jeremy Irvin; Robyn L Ball; Kaylie Zhu; Brandon Yang; Hershel Mehta; Tony Duan; Daisy Ding; Aarti Bagul; Curtis P Langlotz; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Francis G Blankenberg; Jayne Seekins; Timothy J Amrhein; David A Mong; Safwan S Halabi; Evan J Zucker; Andrew Y Ng; Matthew P Lungren
Journal: PLoS Med Date: 2018-11-20 Impact factor: 11.069

7. Deep Learning Localization of Pneumonia: 2019 Coronavirus (COVID-19) Outbreak.

Authors: Brian Hurt; Seth Kligerman; Albert Hsiao
Journal: J Thorac Imaging Date: 2020-05 Impact factor: 3.000

8. The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.

Authors: Davide Chicco; Giuseppe Jurman
Journal: BMC Genomics Date: 2020-01-02 Impact factor: 3.969

9. Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems.

Authors: Zhi Zhen Qin; Melissa S Sander; Bishwa Rai; Collins N Titahong; Santat Sudrungrot; Sylvain N Laah; Lal Mani Adhikari; E Jane Carter; Lekha Puri; Andrew J Codlin; Jacob Creswell
Journal: Sci Rep Date: 2019-10-18 Impact factor: 4.379

9 in total