Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.

Literature DB >> 33323251

A comparison of deep learning performance against health-care professionals in detecting diseases from medical imaging: a systematic review and meta-analysis.

Xiaoxuan Liu¹, Livia Faes², Aditya U Kale³, Siegfried K Wagner⁴, Dun Jack Fu⁵, Alice Bruynseels³, Thushika Mahendiran³, Gabriella Moraes⁵, Mohith Shamdas⁶, Christoph Kern⁷, Joseph R Ledsam⁸, Martin K Schmid⁹, Konstantinos Balaskas¹⁰, Eric J Topol¹¹, Lucas M Bachmann¹², Pearse A Keane¹³, Alastair K Denniston¹⁴.

Abstract

BACKGROUND: Deep learning offers considerable promise for medical diagnostics. We aimed to evaluate the diagnostic accuracy of deep learning algorithms versus health-care professionals in classifying diseases using medical imaging.
METHODS: In this systematic review and meta-analysis, we searched Ovid-MEDLINE, Embase, Science Citation Index, and Conference Proceedings Citation Index for studies published from Jan 1, 2012, to June 6, 2019. Studies comparing the diagnostic performance of deep learning models and health-care professionals based on medical imaging, for any disease, were included. We excluded studies that used medical waveform data graphics material or investigated the accuracy of image segmentation rather than disease classification. We extracted binary diagnostic accuracy data and constructed contingency tables to derive the outcomes of interest: sensitivity and specificity. Studies undertaking an out-of-sample external validation were included in a meta-analysis, using a unified hierarchical model. This study is registered with PROSPERO, CRD42018091176.
FINDINGS: Our search identified 31 587 studies, of which 82 (describing 147 patient cohorts) were included. 69 studies provided enough data to construct contingency tables, enabling calculation of test accuracy, with sensitivity ranging from 9·7% to 100·0% (mean 79·1%, SD 0·2) and specificity ranging from 38·9% to 100·0% (mean 88·3%, SD 0·1). An out-of-sample external validation was done in 25 studies, of which 14 made the comparison between deep learning models and health-care professionals in the same sample. Comparison of the performance between health-care professionals in these 14 studies, when restricting the analysis to the contingency table for each study reporting the highest accuracy, found a pooled sensitivity of 87·0% (95% CI 83·0-90·2) for deep learning models and 86·4% (79·9-91·0) for health-care professionals, and a pooled specificity of 92·5% (95% CI 85·1-96·4) for deep learning models and 90·5% (80·6-95·7) for health-care professionals.
INTERPRETATION: Our review found the diagnostic performance of deep learning models to be equivalent to that of health-care professionals. However, a major finding of the review is that few studies presented externally validated results or compared the performance of deep learning models and health-care professionals using the same sample. Additionally, poor reporting is prevalent in deep learning studies, which limits reliable interpretation of the reported diagnostic accuracy. New reporting standards that address specific challenges of deep learning could improve future studies, enabling greater confidence in the results of future evaluations of this promising technology. FUNDING: None.

Entities: Chemical

Mesh：

Year: 2019 PMID： 33323251 DOI： 10.1016/S2589-7500(19)30123-2

Source DB: PubMed Journal: Lancet Digit Health ISSN： 2589-7500

Keyword Cloud
Cited

172 in total

Review 1. Designing deep learning studies in cancer diagnostics.

Authors: Andreas Kleppe; Ole-Johan Skrede; Sepp De Raedt; Knut Liestøl; David J Kerr; Håvard E Danielsen
Journal: Nat Rev Cancer Date: 2021-01-29 Impact factor: 60.716

Review 2. Up-to-Date Role of CT/MRI LI-RADS in Hepatocellular Carcinoma.

Authors: Guilherme Moura Cunha; Victoria Chernyak; Kathryn J Fowler; Claude B Sirlin
Journal: J Hepatocell Carcinoma Date: 2021-05-31

Review 3. Artificial intelligence in paediatric radiology: Future opportunities.

Authors: Natasha Davendralingam; Neil J Sebire; Owen J Arthurs; Susan C Shelmerdine
Journal: Br J Radiol Date: 2020-09-17 Impact factor: 3.039

4. Prediction of patient disposition: comparison of computer and human approaches and a proposed synthesis.

Authors: Yuval Barak-Corren; Isha Agarwal; Kenneth A Michelson; Todd W Lyons; Mark I Neuman; Susan C Lipsett; Amir A Kimia; Matthew A Eisenberg; Andrew J Capraro; Jason A Levy; Joel D Hudgins; Ben Y Reis; Andrew M Fine
Journal: J Am Med Inform Assoc Date: 2021-07-30 Impact factor: 4.497

Review 5. Artificial intelligence and machine learning for medical imaging: A technology review.

Authors: Ana Barragán-Montero; Umair Javaid; Gilmer Valdés; Dan Nguyen; Paul Desbordes; Benoit Macq; Siri Willems; Liesbeth Vandewinckele; Mats Holmström; Fredrik Löfman; Steven Michiels; Kevin Souris; Edmond Sterpin; John A Lee
Journal: Phys Med Date: 2021-05-09 Impact factor: 2.685

Review 6. Reporting guidelines for clinical trial reports for interventions involving artificial intelligence: the CONSORT-AI extension.

Authors: Xiaoxuan Liu; Samantha Cruz Rivera; David Moher; Melanie J Calvert; Alastair K Denniston
Journal: Lancet Digit Health Date: 2020-09-09

7. Deep Learning Enables Accurate Diagnosis of Novel Coronavirus (COVID-19) With CT Images.

Authors: Ying Song; Shuangjia Zheng; Liang Li; Xiang Zhang; Xiaodong Zhang; Ziwang Huang; Jianwen Chen; Ruixuan Wang; Huiying Zhao; Yutian Chong; Jun Shen; Yunfei Zha; Yuedong Yang
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2021-12-08 Impact factor: 3.710

8. Machine Learning Models for Predicting Neonatal Mortality: A Systematic Review.

Authors: Cheyenne Mangold; Sarah Zoretic; Keerthi Thallapureddy; Axel Moreira; Kevin Chorath; Alvaro Moreira
Journal: Neonatology Date: 2021-07-14 Impact factor: 4.035

9. Selecting the most important self-assessed features for predicting conversion to mild cognitive impairment with random forest and permutation-based methods.

Authors: Jaime Gómez-Ramírez; Marina Ávila-Villanueva; Miguel Ángel Fernández-Blázquez
Journal: Sci Rep Date: 2020-11-26 Impact factor: 4.379

Review 10. Medical imaging and nuclear medicine: a Lancet Oncology Commission.

Authors: Hedvig Hricak; May Abdel-Wahab; Rifat Atun; Miriam Mikhail Lette; Diana Paez; James A Brink; Lluís Donoso-Bach; Guy Frija; Monika Hierath; Ola Holmberg; Pek-Lan Khong; Jason S Lewis; Geraldine McGinty; Wim J G Oyen; Lawrence N Shulman; Zachary J Ward; Andrew M Scott
Journal: Lancet Oncol Date: 2021-03-04 Impact factor: 41.316