Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Man against machine reloaded: performance of a market-approved convolutional neural network in classifying a broad spectrum of skin lesions in comparison with 96 dermatologists working under less artificial conditions.

Literature DB >> 31912788

Man against machine reloaded: performance of a market-approved convolutional neural network in classifying a broad spectrum of skin lesions in comparison with 96 dermatologists working under less artificial conditions.

H A Haenssle¹, C Fink², F Toberer², J Winkler², W Stolz³, T Deinlein⁴, R Hofmann-Wellenhof⁴, A Lallas⁵, S Emmert⁶, T Buhl⁷, M Zutt⁸, A Blum⁹, M S Abassi¹⁰, L Thomas¹¹, I Tromme¹², P Tschandl¹³, A Enk², A Rosenberger¹⁴.

Abstract

BACKGROUND: Convolutional neural networks (CNNs) efficiently differentiate skin lesions by image analysis. Studies comparing a market-approved CNN in a broad range of diagnoses to dermatologists working under less artificial conditions are lacking.
MATERIALS AND METHODS: One hundred cases of pigmented/non-pigmented skin cancers and benign lesions were used for a two-level reader study in 96 dermatologists (level I: dermoscopy only; level II: clinical close-up images, dermoscopy, and textual information). Additionally, dermoscopic images were classified by a CNN approved for the European market as a medical device (Moleanalyzer Pro, FotoFinder Systems, Bad Birnbach, Germany). Primary endpoints were the sensitivity and specificity of the CNN's dichotomous classification in comparison with the dermatologists' management decisions. Secondary endpoints included the dermatologists' diagnostic decisions, their performance according to their level of experience, and the CNN's area under the curve (AUC) of receiver operating characteristics (ROC).
RESULTS: The CNN revealed a sensitivity, specificity, and ROC AUC with corresponding 95% confidence intervals (CI) of 95.0% (95% CI 83.5% to 98.6%), 76.7% (95% CI 64.6% to 85.6%), and 0.918 (95% CI 0.866-0.970), respectively. In level I, the dermatologists' management decisions showed a mean sensitivity and specificity of 89.0% (95% CI 87.4% to 90.6%) and 80.7% (95% CI 78.8% to 82.6%). With level II information, the sensitivity significantly improved to 94.1% (95% CI 93.1% to 95.1%; P < 0.001), while the specificity remained unchanged at 80.4% (95% CI 78.4% to 82.4%; P = 0.97). When fixing the CNN's specificity at the mean specificity of the dermatologists' management decision in level II (80.4%), the CNN's sensitivity was almost equal to that of human raters, at 95% (95% CI 83.5% to 98.6%) versus 94.1% (95% CI 93.1% to 95.1%); P = 0.1. In contrast, dermatologists were outperformed by the CNN in their level I management decisions and level I and II diagnostic decisions. More experienced dermatologists frequently surpassed the CNN's performance.
CONCLUSIONS: Under less artificial conditions and in a broader spectrum of diagnoses, the CNN and most dermatologists performed on the same level. Dermatologists are trained to integrate information from a range of sources rendering comparative studies that are solely based on one single case image inadequate.

Entities: Disease Species

Keywords: Moleanalyzer Pro; deep learning; dermoscopy; melanoma; neural network; skin cancer

Mesh：

Year: 2020 PMID： 31912788 DOI： 10.1016/j.annonc.2019.10.013

Source DB: PubMed Journal: Ann Oncol ISSN： 0923-7534 Impact factor: 32.976

Keyword Cloud
Cited

21 in total

Review 1. Designing deep learning studies in cancer diagnostics.

Authors: Andreas Kleppe; Ole-Johan Skrede; Sepp De Raedt; Knut Liestøl; David J Kerr; Håvard E Danielsen
Journal: Nat Rev Cancer Date: 2021-01-29 Impact factor: 60.716

Review 2. Lack of Transparency and Potential Bias in Artificial Intelligence Data Sets and Algorithms: A Scoping Review.

Authors: Roxana Daneshjou; Mary P Smith; Mary D Sun; Veronica Rotemberg; James Zou
Journal: JAMA Dermatol Date: 2021-11-01 Impact factor: 11.816

3. Classification of Basal Cell Carcinoma in Ex Vivo Confocal Microscopy Images from Freshly Excised Tissues Using a Deep Learning Algorithm.

Authors: Mercedes Sendín-Martín; Manuel Lara-Caro; Ucalene Harris; Matthew Moronta; Anthony Rossi; Erica Lee; Chih-Shan Jason Chen; Kishwer Nehal; Julián Conejo-Mir Sánchez; José-Juan Pereyra-Rodríguez; Manu Jain
Journal: J Invest Dermatol Date: 2021-10-23 Impact factor: 7.590

Review 4. The Importance of Incorporating Human Factors in the Design and Implementation of Artificial Intelligence for Skin Cancer Diagnosis in the Real World.

Authors: Claire M Felmingham; Nikki R Adler; Zongyuan Ge; Rachael L Morton; Monika Janda; Victoria J Mar
Journal: Am J Clin Dermatol Date: 2021-03 Impact factor: 7.403

5. Validation of artificial intelligence prediction models for skin cancer diagnosis using dermoscopy images: the 2019 International Skin Imaging Collaboration Grand Challenge.

Authors: Marc Combalia; Noel Codella; Veronica Rotemberg; Cristina Carrera; Stephen Dusza; David Gutman; Brian Helba; Harald Kittler; Nicholas R Kurtansky; Konstantinos Liopyris; Michael A Marchetti; Sebastian Podlipnik; Susana Puig; Christoph Rinner; Philipp Tschandl; Jochen Weber; Allan Halpern; Josep Malvehy
Journal: Lancet Digit Health Date: 2022-05

6. Effects of Label Noise on Deep Learning-Based Skin Cancer Classification.

Authors: Achim Hekler; Jakob N Kather; Eva Krieghoff-Henning; Jochen S Utikal; Friedegund Meier; Frank F Gellrich; Julius Upmeier Zu Belzen; Lars French; Justin G Schlager; Kamran Ghoreschi; Tabea Wilhelm; Heinz Kutzner; Carola Berking; Markus V Heppt; Sebastian Haferkamp; Wiebke Sondermann; Dirk Schadendorf; Bastian Schilling; Benjamin Izar; Roman Maron; Max Schmitt; Stefan Fröhling; Daniel B Lipka; Titus J Brinker
Journal: Front Med (Lausanne) Date: 2020-05-06

7. Prospective, comparative evaluation of a deep neural network and dermoscopy in the diagnosis of onychomycosis.

Authors: Young Jae Kim; Seung Seog Han; Hee Joo Yang; Sung Eun Chang
Journal: PLoS One Date: 2020-06-11 Impact factor: 3.240

8. Stress testing reveals gaps in clinic readiness of image-based diagnostic artificial intelligence models.

Authors: Albert T Young; Kristen Fernandez; Jacob Pfau; Rasika Reddy; Nhat Anh Cao; Max Y von Franque; Arjun Johal; Benjamin V Wu; Rachel R Wu; Jennifer Y Chen; Raj P Fadadu; Juan A Vasquez; Andrew Tam; Michael J Keiser; Maria L Wei
Journal: NPJ Digit Med Date: 2021-01-21

9. The Role of DICOM in Artificial Intelligence for Skin Disease.

Authors: Liam J Caffery; Veronica Rotemberg; Jochen Weber; H Peter Soyer; Josep Malvehy; David Clunie
Journal: Front Med (Lausanne) Date: 2021-02-10

Review 10. Integrating Patient Data Into Skin Cancer Classification Using Convolutional Neural Networks: Systematic Review.

Authors: Julia Höhn; Achim Hekler; Eva Krieghoff-Henning; Jakob Nikolas Kather; Jochen Sven Utikal; Friedegund Meier; Frank Friedrich Gellrich; Axel Hauschild; Lars French; Justin Gabriel Schlager; Kamran Ghoreschi; Tabea Wilhelm; Heinz Kutzner; Markus Heppt; Sebastian Haferkamp; Wiebke Sondermann; Dirk Schadendorf; Bastian Schilling; Roman C Maron; Max Schmitt; Tanja Jutzi; Stefan Fröhling; Daniel B Lipka; Titus Josef Brinker
Journal: J Med Internet Res Date: 2021-07-02 Impact factor: 5.428