Literature DB >> 31201137

Comparison of the accuracy of human readers versus machine-learning algorithms for pigmented skin lesion classification: an open, web-based, international, diagnostic study.

Philipp Tschandl1, Noel Codella2, Bengü Nisa Akay3, Giuseppe Argenziano4, Ralph P Braun5, Horacio Cabo6, David Gutman7, Allan Halpern8, Brian Helba9, Rainer Hofmann-Wellenhof10, Aimilios Lallas11, Jan Lapins12, Caterina Longo13, Josep Malvehy14, Michael A Marchetti8, Ashfaq Marghoob15, Scott Menzies16, Amanda Oakley17, John Paoli18, Susana Puig14, Christoph Rinner19, Cliff Rosendahl20, Alon Scope21, Christoph Sinz1, H Peter Soyer22, Luc Thomas23, Iris Zalaudek24, Harald Kittler25.   

Abstract

BACKGROUND: Whether machine-learning algorithms can diagnose all pigmented skin lesions as accurately as human experts is unclear. The aim of this study was to compare the diagnostic accuracy of state-of-the-art machine-learning algorithms with human readers for all clinically relevant types of benign and malignant pigmented skin lesions.
METHODS: For this open, web-based, international, diagnostic study, human readers were asked to diagnose dermatoscopic images selected randomly in 30-image batches from a test set of 1511 images. The diagnoses from human readers were compared with those of 139 algorithms created by 77 machine-learning labs, who participated in the International Skin Imaging Collaboration 2018 challenge and received a training set of 10 015 images in advance. The ground truth of each lesion fell into one of seven predefined disease categories: intraepithelial carcinoma including actinic keratoses and Bowen's disease; basal cell carcinoma; benign keratinocytic lesions including solar lentigo, seborrheic keratosis and lichen planus-like keratosis; dermatofibroma; melanoma; melanocytic nevus; and vascular lesions. The two main outcomes were the differences in the number of correct specific diagnoses per batch between all human readers and the top three algorithms, and between human experts and the top three algorithms.
FINDINGS: Between Aug 4, 2018, and Sept 30, 2018, 511 human readers from 63 countries had at least one attempt in the reader study. 283 (55·4%) of 511 human readers were board-certified dermatologists, 118 (23·1%) were dermatology residents, and 83 (16·2%) were general practitioners. When comparing all human readers with all machine-learning algorithms, the algorithms achieved a mean of 2·01 (95% CI 1·97 to 2·04; p<0·0001) more correct diagnoses (17·91 [SD 3·42] vs 19·92 [4·27]). 27 human experts with more than 10 years of experience achieved a mean of 18·78 (SD 3·15) correct answers, compared with 25·43 (1·95) correct answers for the top three machine algorithms (mean difference 6·65, 95% CI 6·06-7·25; p<0·0001). The difference between human experts and the top three algorithms was significantly lower for images in the test set that were collected from sources not included in the training set (human underperformance of 11·4%, 95% CI 9·9-12·9 vs 3·6%, 0·8-6·3; p<0·0001).
INTERPRETATION: State-of-the-art machine-learning classifiers outperformed human experts in the diagnosis of pigmented skin lesions and should have a more important role in clinical practice. However, a possible limitation of these algorithms is their decreased performance for out-of-distribution images, which should be addressed in future research. FUNDING: None.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2019        PMID: 31201137      PMCID: PMC8237239          DOI: 10.1016/S1470-2045(19)30333-X

Source DB:  PubMed          Journal:  Lancet Oncol        ISSN: 1470-2045            Impact factor:   41.316


  20 in total

Review 1.  Diagnostic accuracy of dermoscopy.

Authors:  H Kittler; H Pehamberger; K Wolff; M Binder
Journal:  Lancet Oncol       Date:  2002-03       Impact factor: 41.316

2.  Ugly Duckling Sign as a Major Factor of Efficiency in Melanoma Detection.

Authors:  Caroline Gaudy-Marqueste; Yanal Wazaefi; Yvane Bruneu; Raoul Triller; Luc Thomas; Giovanni Pellacani; Josep Malvehy; Marie-Françoise Avril; Sandrine Monestier; Marie-Aleth Richard; Bernard Fertil; Jean-Jacques Grob
Journal:  JAMA Dermatol       Date:  2017-04-01       Impact factor: 10.282

3.  The performance of SolarScan: an automated dermoscopy image analysis instrument for the diagnosis of primary melanoma.

Authors:  Scott W Menzies; Leanne Bischof; Hugues Talbot; Alex Gutenev; Michelle Avramidis; Livian Wong; Sing Kai Lo; Geoffrey Mackellar; Victor Skladnev; William McCarthy; John Kelly; Brad Cranney; Peter Lye; Harold Rabinovitz; Margaret Oliviero; Andreas Blum; Alexandra Varol; Alexandra Virol; Brian De'Ambrosis; Roderick McCleod; Hiroshi Koga; Caron Grin; Ralph Braun; Robert Johr
Journal:  Arch Dermatol       Date:  2005-11

4.  Management Reasoning: Beyond the Diagnosis.

Authors:  David A Cook; Jonathan Sherbino; Steven J Durning
Journal:  JAMA       Date:  2018-06-12       Impact factor: 56.272

5.  Incidence Estimate of Nonmelanoma Skin Cancer (Keratinocyte Carcinomas) in the U.S. Population, 2012.

Authors:  Howard W Rogers; Martin A Weinstock; Steven R Feldman; Brett M Coldiron
Journal:  JAMA Dermatol       Date:  2015-10       Impact factor: 10.282

6.  Dermatologist-level classification of skin cancer with deep neural networks.

Authors:  Andre Esteva; Brett Kuprel; Roberto A Novoa; Justin Ko; Susan M Swetter; Helen M Blau; Sebastian Thrun
Journal:  Nature       Date:  2017-01-25       Impact factor: 49.962

7.  The performance of MelaFind: a prospective multicenter study.

Authors:  Gary Monheit; Armand B Cognetta; Laura Ferris; Harold Rabinovitz; Kenneth Gross; Mary Martini; James M Grichnik; Martin Mihm; Victor G Prieto; Paul Googe; Roy King; Alicia Toledano; Nikolai Kabelev; Maciej Wojton; Dina Gutkowicz-Krusin
Journal:  Arch Dermatol       Date:  2010-10-18

8.  Results of the 2016 International Skin Imaging Collaboration International Symposium on Biomedical Imaging challenge: Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images.

Authors:  Michael A Marchetti; Noel C F Codella; Stephen W Dusza; David A Gutman; Brian Helba; Aadi Kalloo; Nabin Mishra; Cristina Carrera; M Emre Celebi; Jennifer L DeFazio; Natalia Jaimes; Ashfaq A Marghoob; Elizabeth Quigley; Alon Scope; Oriol Yélamos; Allan C Halpern
Journal:  J Am Acad Dermatol       Date:  2017-09-29       Impact factor: 11.527

9.  pROC: an open-source package for R and S+ to analyze and compare ROC curves.

Authors:  Xavier Robin; Natacha Turck; Alexandre Hainard; Natalia Tiberti; Frédérique Lisacek; Jean-Charles Sanchez; Markus Müller
Journal:  BMC Bioinformatics       Date:  2011-03-17       Impact factor: 3.307

10.  Clinical performance of the Nevisense system in cutaneous melanoma detection: an international, multicentre, prospective and blinded clinical trial on efficacy and safety.

Authors:  J Malvehy; A Hauschild; C Curiel-Lewandrowski; P Mohr; R Hofmann-Wellenhof; R Motley; C Berking; D Grossman; J Paoli; C Loquai; J Olah; U Reinhold; H Wenger; T Dirschka; S Davis; C Henderson; H Rabinovitz; J Welzel; D Schadendorf; U Birgersson
Journal:  Br J Dermatol       Date:  2014-10-19       Impact factor: 9.302

View more
  59 in total

1.  Reliable test of clinicians' mastery in skin cancer diagnostics.

Authors:  Niels Kvorning Ternov; T Vestergaard; L Rosenkrantz Hölmich; K Karmisholt; A L Wagenblast; H Klyver; M Hald; L Schøllhammer; L Konge; A H Chakera
Journal:  Arch Dermatol Res       Date:  2020-06-28       Impact factor: 3.017

2.  Number needed to biopsy ratio and diagnostic accuracy for melanoma detection.

Authors:  Michael A Marchetti; Ashley Yu; Japbani Nanda; Philipp Tschandl; Harald Kittler; Ashfaq A Marghoob; Allan C Halpern; Stephen W Dusza
Journal:  J Am Acad Dermatol       Date:  2020-04-29       Impact factor: 11.527

3.  Automatized Hepatic Tumor Volume Analysis of Neuroendocrine Liver Metastases by Gd-EOB MRI-A Deep-Learning Model to Support Multidisciplinary Cancer Conference Decision-Making.

Authors:  Uli Fehrenbach; Siyi Xin; Alexander Hartenstein; Timo Alexander Auer; Franziska Dräger; Konrad Froböse; Henning Jann; Martina Mogl; Holger Amthauer; Dominik Geisel; Timm Denecke; Bertram Wiedenmann; Tobias Penzkofer
Journal:  Cancers (Basel)       Date:  2021-05-31       Impact factor: 6.639

Review 4.  [New optical examination procedures for the diagnosis of skin diseases].

Authors:  K Sies; J K Winkler; M Zieger; M Kaatz; H A Haenssle
Journal:  Hautarzt       Date:  2020-02       Impact factor: 0.751

5.  Keratinocytic Skin Cancer Detection on the Face Using Region-Based Convolutional Neural Network.

Authors:  Seung Seog Han; Ik Jun Moon; Woohyung Lim; In Suck Suh; Sam Yong Lee; Jung-Im Na; Seong Hwan Kim; Sung Eun Chang
Journal:  JAMA Dermatol       Date:  2020-01-01       Impact factor: 10.282

Review 6.  Artificial intelligence in digital pathology - new tools for diagnosis and precision oncology.

Authors:  Kaustav Bera; Kurt A Schalper; David L Rimm; Vamsidhar Velcheti; Anant Madabhushi
Journal:  Nat Rev Clin Oncol       Date:  2019-08-09       Impact factor: 66.675

7.  Automated NLP Extraction of Clinical Rationale for Treatment Discontinuation in Breast Cancer.

Authors:  Matthew S Alkaitis; Monica N Agrawal; Gregory J Riely; Pedram Razavi; David Sontag
Journal:  JCO Clin Cancer Inform       Date:  2021-05

8.  Toward automated assessment of mole similarity on dermoscopic images.

Authors:  Yao Zhang; Kamil Ali; Jacob A George; Jason S Reichenberg; Matthew C Fox; Adewole S Adamson; James W Tunnell; Mia K Markey
Journal:  J Med Imaging (Bellingham)       Date:  2021-02-10

Review 9.  The Importance of Incorporating Human Factors in the Design and Implementation of Artificial Intelligence for Skin Cancer Diagnosis in the Real World.

Authors:  Claire M Felmingham; Nikki R Adler; Zongyuan Ge; Rachael L Morton; Monika Janda; Victoria J Mar
Journal:  Am J Clin Dermatol       Date:  2021-03       Impact factor: 7.403

10.  An introduction to machine learning for clinicians: How can machine learning augment knowledge in geriatric oncology?

Authors:  Erika Ramsdale; Eric Snyder; Eva Culakova; Huiwen Xu; Adam Dziorny; Shuhan Yang; Martin Zand; Ajay Anand
Journal:  J Geriatr Oncol       Date:  2021-03-29       Impact factor: 3.599

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.