Literature DB >> 33479460

Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models.

Albert T Young1,2, Kristen Fernandez1,2, Jacob Pfau1,2, Rasika Reddy1,2, Nhat Anh Cao1, Max Y von Franque1, Arjun Johal1,2, Benjamin V Wu1, Rachel R Wu1, Jennifer Y Chen1, Raj P Fadadu1,2, Juan A Vasquez1, Andrew Tam1, Michael J Keiser3, Maria L Wei4,5,6.   

Abstract

Artificial intelligence models match or exceed dermatologists in melanoma image classification. Less is known about their robustness against real-world variations, and clinicians may incorrectly assume that a model with an acceptable area under the receiver operating characteristic curve or related performance metric is ready for clinical use. Here, we systematically assessed the performance of dermatologist-level convolutional neural networks (CNNs) on real-world non-curated images by applying computational "stress tests". Our goal was to create a proxy environment in which to comprehensively test the generalizability of off-the-shelf CNNs developed without training or evaluation protocols specific to individual clinics. We found inconsistent predictions on images captured repeatedly in the same setting or subjected to simple transformations (e.g., rotation). Such transformations resulted in false positive or negative predictions for 6.5-22% of skin lesions across test datasets. Our findings indicate that models meeting conventionally reported metrics need further validation with computational stress tests to assess clinic readiness.

Entities:  

Year:  2021        PMID: 33479460      PMCID: PMC7820258          DOI: 10.1038/s41746-020-00380-6

Source DB:  PubMed          Journal:  NPJ Digit Med        ISSN: 2398-6352


  22 in total

1.  Comparing artificial intelligence algorithms to 157 German dermatologists: the melanoma classification benchmark.

Authors:  Titus J Brinker; Achim Hekler; Axel Hauschild; Carola Berking; Bastian Schilling; Alexander H Enk; Sebastian Haferkamp; Ante Karoglan; Christof von Kalle; Michael Weichenthal; Elke Sattler; Dirk Schadendorf; Maria R Gaiser; Joachim Klode; Jochen S Utikal
Journal:  Eur J Cancer       Date:  2019-02-22       Impact factor: 9.162

2.  Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support.

Authors:  Paul A Harris; Robert Taylor; Robert Thielke; Jonathon Payne; Nathaniel Gonzalez; Jose G Conde
Journal:  J Biomed Inform       Date:  2008-09-30       Impact factor: 6.317

3.  PH² - a dermoscopic image database for research and benchmarking.

Authors:  Teresa Mendonca; Pedro M Ferreira; Jorge S Marques; Andre R S Marcal; Jorge Rozeira
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2013

4.  Deep learning outperformed 136 of 157 dermatologists in a head-to-head dermoscopic melanoma image classification task.

Authors:  Titus J Brinker; Achim Hekler; Alexander H Enk; Joachim Klode; Axel Hauschild; Carola Berking; Bastian Schilling; Sebastian Haferkamp; Dirk Schadendorf; Tim Holland-Letz; Jochen S Utikal; Christof von Kalle
Journal:  Eur J Cancer       Date:  2019-04-10       Impact factor: 9.162

Review 5.  Artificial Intelligence in Dermatology: A Primer.

Authors:  Albert T Young; Mulin Xiong; Jacob Pfau; Michael J Keiser; Maria L Wei
Journal:  J Invest Dermatol       Date:  2020-03-27       Impact factor: 8.551

6.  Human-computer collaboration for skin cancer recognition.

Authors:  Philipp Tschandl; Christoph Rinner; Zoe Apalla; Giuseppe Argenziano; Noel Codella; Allan Halpern; Monika Janda; Aimilios Lallas; Caterina Longo; Josep Malvehy; John Paoli; Susana Puig; Cliff Rosendahl; H Peter Soyer; Iris Zalaudek; Harald Kittler
Journal:  Nat Med       Date:  2020-06-22       Impact factor: 53.440

7.  Patient Perspectives on the Use of Artificial Intelligence for Skin Cancer Screening: A Qualitative Study.

Authors:  Caroline A Nelson; Lourdes Maria Pérez-Chada; Andrew Creadore; Sara Jiayang Li; Kelly Lo; Priya Manjaly; Ashley Bahareh Pournamdari; Elizabeth Tkachenko; John S Barbieri; Justin M Ko; Alka V Menon; Rebecca Ivy Hartman; Arash Mostaghimi
Journal:  JAMA Dermatol       Date:  2020-05-01       Impact factor: 10.282

8.  Keratinocytic Skin Cancer Detection on the Face Using Region-Based Convolutional Neural Network.

Authors:  Seung Seog Han; Ik Jun Moon; Woohyung Lim; In Suck Suh; Sam Yong Lee; Jung-Im Na; Seong Hwan Kim; Sung Eun Chang
Journal:  JAMA Dermatol       Date:  2020-01-01       Impact factor: 10.282

9.  A deep learning system for differential diagnosis of skin diseases.

Authors:  R Carter Dunn; David Coz; Yuan Liu; Ayush Jain; Clara Eng; David H Way; Kang Lee; Peggy Bui; Kimberly Kanada; Guilherme de Oliveira Marinho; Jessica Gallegos; Sara Gabriele; Vishakha Gupta; Nalini Singh; Vivek Natarajan; Rainer Hofmann-Wellenhof; Greg S Corrado; Lily H Peng; Dale R Webster; Dennis Ai; Susan J Huang; Yun Liu
Journal:  Nat Med       Date:  2020-05-18       Impact factor: 53.440

10.  A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.

Authors:  Xiaoxuan Liu; Livia Faes; Aditya U Kale; Siegfried K Wagner; Dun Jack Fu; Alice Bruynseels; Thushika Mahendiran; Gabriella Moraes; Mohith Shamdas; Christoph Kern; Joseph R Ledsam; Martin K Schmid; Konstantinos Balaskas; Eric J Topol; Lucas M Bachmann; Pearse A Keane; Alastair K Denniston
Journal:  Lancet Digit Health       Date:  2019-09-25
View more
  2 in total

Review 1.  The potential use of digital health technologies in the African context: a systematic review of evidence from Ethiopia.

Authors:  Tsegahun Manyazewal; Yimtubezinash Woldeamanuel; Henry M Blumberg; Abebaw Fekadu; Vincent C Marconi
Journal:  NPJ Digit Med       Date:  2021-08-17

Review 2.  Re-focusing explainability in medicine.

Authors:  Laura Arbelaez Ossa; Georg Starke; Giorgia Lorenzini; Julia E Vogt; David M Shaw; Bernice Simone Elger
Journal:  Digit Health       Date:  2022-02-11
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.