Michael A Marchetti1, Noel C F Codella2, Stephen W Dusza1, David A Gutman3, Brian Helba4, Aadi Kalloo1, Nabin Mishra5, Cristina Carrera6, M Emre Celebi7, Jennifer L DeFazio1, Natalia Jaimes8, Ashfaq A Marghoob1, Elizabeth Quigley1, Alon Scope9, Oriol Yélamos1, Allan C Halpern10. 1. Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, New York. 2. IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, New York. 3. Departments of Neurology, Psychiatry, and Biomedical Informatics, Emory University School of Medicine, Atlanta, Georgia. 4. Kitware Inc, Clifton Park, New York. 5. Stoecker & Associates, Rolla, Missouri. 6. Melanoma Unit, Department of Dermatology, Hospital Clinic, Institut d'Investigacions Biomèdiques August Pi i Sunyer, CIBER de Enfermedades Raras, Instituto de Salud Carlos III, University of Barcelona, Barcelona, Spain. 7. Department of Computer Science, University of Central Arkansas, Conway, Arkansas. 8. Dermatology Service, Aurora Centro Especializado en Cáncer de Piel, Medellín, Colombia; Department of Dermatology and Cutaneous Surgery, University of Miami Miller School of Medicine, Miami, Florida. 9. Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, New York; Department of Dermatology, Sheba Medical Center, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel. 10. Dermatology Service, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, New York. Electronic address: halperna@mskcc.org.
Abstract
BACKGROUND: Computer vision may aid in melanoma detection. OBJECTIVE: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images. METHODS: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into "fusion" algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant. RESULTS: The average sensitivity and specificity of dermatologists in classification was 82% and 59%. At 82% sensitivity, dermatologist specificity was similar to the top challenge algorithm (59% vs. 62%, P = .68) but lower than the best-performing fusion algorithm (59% vs. 76%, P = .02). Receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P = .001). LIMITATIONS: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice. CONCLUSION: Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.
BACKGROUND: Computer vision may aid in melanoma detection. OBJECTIVE: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images. METHODS: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into "fusion" algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant. RESULTS: The average sensitivity and specificity of dermatologists in classification was 82% and 59%. At 82% sensitivity, dermatologist specificity was similar to the top challenge algorithm (59% vs. 62%, P = .68) but lower than the best-performing fusion algorithm (59% vs. 76%, P = .02). Receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P = .001). LIMITATIONS: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice. CONCLUSION:Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.
Authors: M Emre Celebi; Hassan A Kingravi; Bakhtiyar Uddin; Hitoshi Iyatomi; Y Alp Aslandogan; William V Stoecker; Randy H Moss Journal: Comput Med Imaging Graph Date: 2007-03-26 Impact factor: 4.790
Authors: Alon Scope; Michael A Marchetti; Ashfaq A Marghoob; Stephen W Dusza; Alan C Geller; Jaya M Satagopan; Martin A Weinstock; Marianne Berwick; Allan C Halpern Journal: J Am Acad Dermatol Date: 2016-06-17 Impact factor: 11.527
Authors: Andre Esteva; Brett Kuprel; Roberto A Novoa; Justin Ko; Susan M Swetter; Helen M Blau; Sebastian Thrun Journal: Nature Date: 2017-01-25 Impact factor: 49.962
Authors: Gary Monheit; Armand B Cognetta; Laura Ferris; Harold Rabinovitz; Kenneth Gross; Mary Martini; James M Grichnik; Martin Mihm; Victor G Prieto; Paul Googe; Roy King; Alicia Toledano; Nikolai Kabelev; Maciej Wojton; Dina Gutkowicz-Krusin Journal: Arch Dermatol Date: 2010-10-18
Authors: Laura K Ferris; Jan A Harkes; Benjamin Gilbert; Daniel G Winger; Kseniya Golubets; Oleg Akilov; Mahadev Satyanarayanan Journal: J Am Acad Dermatol Date: 2015-09-19 Impact factor: 11.527
Authors: Andreas Kleppe; Ole-Johan Skrede; Sepp De Raedt; Knut Liestøl; David J Kerr; Håvard E Danielsen Journal: Nat Rev Cancer Date: 2021-01-29 Impact factor: 60.716
Authors: Michael A Marchetti; Ashley Yu; Japbani Nanda; Philipp Tschandl; Harald Kittler; Ashfaq A Marghoob; Allan C Halpern; Stephen W Dusza Journal: J Am Acad Dermatol Date: 2020-04-29 Impact factor: 11.527
Authors: Philipp Tschandl; Noel Codella; Bengü Nisa Akay; Giuseppe Argenziano; Ralph P Braun; Horacio Cabo; David Gutman; Allan Halpern; Brian Helba; Rainer Hofmann-Wellenhof; Aimilios Lallas; Jan Lapins; Caterina Longo; Josep Malvehy; Michael A Marchetti; Ashfaq Marghoob; Scott Menzies; Amanda Oakley; John Paoli; Susana Puig; Christoph Rinner; Cliff Rosendahl; Alon Scope; Christoph Sinz; H Peter Soyer; Luc Thomas; Iris Zalaudek; Harald Kittler Journal: Lancet Oncol Date: 2019-06-12 Impact factor: 41.316
Authors: Samantha L Schneider; Indermeet Kohli; Iltefat H Hamzavi; M Laurin Council; Anthony M Rossi; David M Ozog Journal: J Am Acad Dermatol Date: 2018-12-04 Impact factor: 11.527
Authors: Michael A Marchetti; Konstantinos Liopyris; Stephen W Dusza; Noel C F Codella; David A Gutman; Brian Helba; Aadi Kalloo; Allan C Halpern Journal: J Am Acad Dermatol Date: 2019-07-12 Impact factor: 11.527
Authors: Philipp Tschandl; Cliff Rosendahl; Bengu Nisa Akay; Giuseppe Argenziano; Andreas Blum; Ralph P Braun; Horacio Cabo; Jean-Yves Gourhant; Jürgen Kreusch; Aimilios Lallas; Jan Lapins; Ashfaq Marghoob; Scott Menzies; Nina Maria Neuber; John Paoli; Harold S Rabinovitz; Christoph Rinner; Alon Scope; H Peter Soyer; Christoph Sinz; Luc Thomas; Iris Zalaudek; Harald Kittler Journal: JAMA Dermatol Date: 2019-01-01 Impact factor: 10.282
Authors: Cristian Navarrete-Dechent; Stephen W Dusza; Konstantinos Liopyris; Ashfaq A Marghoob; Allan C Halpern; Michael A Marchetti Journal: J Invest Dermatol Date: 2018-06-01 Impact factor: 8.551
Authors: C Muñoz-López; C Ramírez-Cornejo; M A Marchetti; S S Han; P Del Barrio-Díaz; A Jaque; P Uribe; D Majerson; M Curi; C Del Puerto; F Reyes-Baraona; R Meza-Romero; J Parra-Cares; P Araneda-Ortega; M Guzmán; R Millán-Apablaza; M Nuñez-Mora; K Liopyris; C Vera-Kellet; C Navarrete-Dechent Journal: J Eur Acad Dermatol Venereol Date: 2020-11-22 Impact factor: 6.166
Authors: Kivanc Kose; Alican Bozkurt; Christi Alessi-Fox; Melissa Gill; Caterina Longo; Giovanni Pellacani; Jennifer G Dy; Dana H Brooks; Milind Rajadhyaksha Journal: Med Image Anal Date: 2020-10-07 Impact factor: 8.545