Literature DB >> 35249179

Clinical Explainability Failure (CEF) & Explainability Failure Ratio (EFR) - Changing the Way We Validate Classification Algorithms.

Vasantha Kumar Venugopal1, Rohit Takhar2, Salil Gupta2, Vidur Mahajan2.   

Abstract

Adoption of Artificial Intelligence (AI) algorithms into the clinical realm will depend on their inherent trustworthiness, which is built not only by robust validation studies but is also deeply linked to the explainability and interpretability of the algorithms. Most validation studies for medical imaging AI report the performance of algorithms on study-level labels and lay little emphasis on measuring the accuracy of explanations generated by these algorithms in the form of heat maps or bounding boxes, especially in true positive cases. We propose a new metric - Explainability Failure Ratio (EFR) - derived from Clinical Explainability Failure (CEF) to address this gap in AI evaluation. We define an Explainability Failure as a case where the classification generated by an AI algorithm matches with study-level ground truth but the explanation output generated by the algorithm is inadequate to explain the algorithm's output. We measured EFR for two algorithms that automatically detect consolidation on chest X-rays to determine the applicability of the metric and observed a lower EFR for the model that had lower sensitivity for identifying consolidation on chest X-rays, implying that the trustworthiness of a model should be determined not only by routine statistical metrics but also by novel 'clinically-oriented' models.
© 2022. The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.

Entities:  

Keywords:  AI (Artificial Intelligence); Deep learning; Explainable AI; Medical imaging

Mesh:

Year:  2022        PMID: 35249179     DOI: 10.1007/s10916-022-01806-2

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  9 in total

1.  Tackling the Radiological Society of North America Pneumonia Detection Challenge.

Authors:  Ian Pan; Alexandre Cadrin-Chênevert; Phillip M Cheng
Journal:  AJR Am J Roentgenol       Date:  2019-05-23       Impact factor: 3.959

2.  The Algorithmic Audit: Working with Vendors to Validate Radiology-AI Algorithms-How We Do It.

Authors:  Vidur Mahajan; Vasantha Kumar Venugopal; Murali Murugavel; Harsh Mahajan
Journal:  Acad Radiol       Date:  2020-01       Impact factor: 3.173

3.  Unboxing AI - Radiological Insights Into a Deep Neural Network for Lung Nodule Characterization.

Authors:  Vasantha Kumar Venugopal; Kiran Vaidhya; Murali Murugavel; Abhijith Chunduru; Vidur Mahajan; Suthirth Vaidya; Digvijay Mahra; Akshay Rangasai; Harsh Mahajan
Journal:  Acad Radiol       Date:  2019-10-14       Impact factor: 3.173

4.  A Deep Learning Mammography-based Model for Improved Breast Cancer Risk Prediction.

Authors:  Adam Yala; Constance Lehman; Tal Schuster; Tally Portnoi; Regina Barzilay
Journal:  Radiology       Date:  2019-05-07       Impact factor: 11.105

5.  A Deep Learning Model to Predict a Diagnosis of Alzheimer Disease by Using 18F-FDG PET of the Brain.

Authors:  Yiming Ding; Jae Ho Sohn; Michael G Kawczynski; Hari Trivedi; Roy Harnish; Nathaniel W Jenkins; Dmytro Lituiev; Timothy P Copeland; Mariam S Aboian; Carina Mari Aparici; Spencer C Behr; Robert R Flavell; Shih-Ying Huang; Kelly A Zalocusky; Lorenzo Nardo; Youngho Seo; Randall A Hawkins; Miguel Hernandez Pampaloni; Dexter Hadley; Benjamin L Franc
Journal:  Radiology       Date:  2018-11-06       Impact factor: 29.146

6.  Deep learning for chest radiograph diagnosis: A retrospective comparison of the CheXNeXt algorithm to practicing radiologists.

Authors:  Pranav Rajpurkar; Jeremy Irvin; Robyn L Ball; Kaylie Zhu; Brandon Yang; Hershel Mehta; Tony Duan; Daisy Ding; Aarti Bagul; Curtis P Langlotz; Bhavik N Patel; Kristen W Yeom; Katie Shpanskaya; Francis G Blankenberg; Jayne Seekins; Timothy J Amrhein; David A Mong; Safwan S Halabi; Evan J Zucker; Andrew Y Ng; Matthew P Lungren
Journal:  PLoS Med       Date:  2018-11-20       Impact factor: 11.069

7.  Deep Learning Localization of Pneumonia: 2019 Coronavirus (COVID-19) Outbreak.

Authors:  Brian Hurt; Seth Kligerman; Albert Hsiao
Journal:  J Thorac Imaging       Date:  2020-05       Impact factor: 3.000

8.  The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation.

Authors:  Davide Chicco; Giuseppe Jurman
Journal:  BMC Genomics       Date:  2020-01-02       Impact factor: 3.969

9.  Using artificial intelligence to read chest radiographs for tuberculosis detection: A multi-site evaluation of the diagnostic accuracy of three deep learning systems.

Authors:  Zhi Zhen Qin; Melissa S Sander; Bishwa Rai; Collins N Titahong; Santat Sudrungrot; Sylvain N Laah; Lal Mani Adhikari; E Jane Carter; Lekha Puri; Andrew J Codlin; Jacob Creswell
Journal:  Sci Rep       Date:  2019-10-18       Impact factor: 4.379

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.