Literature DB >> 32740477

Does Artificial Intelligence Outperform Natural Intelligence in Interpreting Musculoskeletal Radiological Studies? A Systematic Review.

Olivier Q Groot1, Michiel E R Bongers1, Paul T Ogink2, Joeky T Senders3, Aditya V Karhade1, Jos A M Bramer4, Jorrit-Jan Verlaan2, Joseph H Schwab1.   

Abstract

BACKGROUND: Machine learning (ML) is a subdomain of artificial intelligence that enables computers to abstract patterns from data without explicit programming. A myriad of impactful ML applications already exists in orthopaedics ranging from predicting infections after surgery to diagnostic imaging. However, no systematic reviews that we know of have compared, in particular, the performance of ML models with that of clinicians in musculoskeletal imaging to provide an up-to-date summary regarding the extent of applying ML to imaging diagnoses. By doing so, this review delves into where current ML developments stand in aiding orthopaedists in assessing musculoskeletal images. QUESTIONS/PURPOSES: This systematic review aimed (1) to compare performance of ML models versus clinicians in detecting, differentiating, or classifying orthopaedic abnormalities on imaging by (A) accuracy, sensitivity, and specificity, (B) input features (for example, plain radiographs, MRI scans, ultrasound), (C) clinician specialties, and (2) to compare the performance of clinician-aided versus unaided ML models.
METHODS: A systematic review was performed in PubMed, Embase, and the Cochrane Library for studies published up to October 1, 2019, using synonyms for machine learning and all potential orthopaedic specialties. We included all studies that compared ML models head-to-head against clinicians in the binary detection of abnormalities in musculoskeletal images. After screening 6531 studies, we ultimately included 12 studies. We conducted quality assessment using the Methodological Index for Non-randomized Studies (MINORS) checklist. All 12 studies were of comparable quality, and they all clearly included six of the eight critical appraisal items (study aim, input feature, ground truth, ML versus human comparison, performance metric, and ML model description). This justified summarizing the findings in a quantitative form by calculating the median absolute improvement of the ML models compared with clinicians for the following metrics of performance: accuracy, sensitivity, and specificity.
RESULTS: ML models provided, in aggregate, only very slight improvements in diagnostic accuracy and sensitivity compared with clinicians working alone and were on par in specificity (3% (interquartile range [IQR] -2.0% to 7.5%), 0.06% (IQR -0.03 to 0.14), and 0.00 (IQR -0.048 to 0.048), respectively). Inputs used by the ML models were plain radiographs (n = 8), MRI scans (n = 3), and ultrasound examinations (n = 1). Overall, ML models outperformed clinicians more when interpreting plain radiographs than when interpreting MRIs (17 of 34 and 3 of 16 performance comparisons, respectively). Orthopaedists and radiologists performed similarly to ML models, while ML models mostly outperformed other clinicians (outperformance in 7 of 19, 7 of 23, and 6 of 10 performance comparisons, respectively). Two studies evaluated the performance of clinicians aided and unaided by ML models; both demonstrated considerable improvements in ML-aided clinician performance by reporting a 47% decrease of misinterpretation rate (95% confidence interval [CI] 37 to 54; p < 0.001) and a mean increase in specificity of 0.048 (95% CI 0.029 to 0.068; p < 0.001) in detecting abnormalities on musculoskeletal images.
CONCLUSIONS: At present, ML models have comparable performance to clinicians in assessing musculoskeletal images. ML models may enhance the performance of clinicians as a technical supplement rather than as a replacement for clinical intelligence. Future ML-related studies should emphasize how ML models can complement clinicians, instead of determining the overall superiority of one versus the other. This can be accomplished by improving transparent reporting, diminishing bias, determining the feasibility of implantation in the clinical setting, and appropriately tempering conclusions. LEVEL OF EVIDENCE: Level III, diagnostic study.

Entities:  

Mesh:

Year:  2020        PMID: 32740477      PMCID: PMC7899420          DOI: 10.1097/CORR.0000000000001360

Source DB:  PubMed          Journal:  Clin Orthop Relat Res        ISSN: 0009-921X            Impact factor:   4.755


  55 in total

1.  Detecting intertrochanteric hip fractures with orthopedist-level accuracy using a deep convolutional neural network.

Authors:  Takaaki Urakawa; Yuki Tanaka; Shinichi Goto; Hitoshi Matsuzawa; Kei Watanabe; Naoto Endo
Journal:  Skeletal Radiol       Date:  2018-06-28       Impact factor: 2.199

Review 2.  Machine learning: Trends, perspectives, and prospects.

Authors:  M I Jordan; T M Mitchell
Journal:  Science       Date:  2015-07-17       Impact factor: 47.728

3.  Development of Machine Learning Algorithms for Prediction of 5-Year Spinal Chordoma Survival.

Authors:  Aditya V Karhade; Quirina Thio; Paul Ogink; Jason Kim; Santiago Lozano-Calderon; Kevin Raskin; Joseph H Schwab
Journal:  World Neurosurg       Date:  2018-08-08       Impact factor: 2.104

4.  A Machine Learning Algorithm to Predict the Probability of (Occult) Posterior Malleolar Fractures Associated With Tibial Shaft Fractures to Guide "Malleolus First" Fixation.

Authors:  Laurent A M Hendrickx; Garret L Sobol; David W G Langerhuizen; Anne Eva J Bulstra; Jeremy Hreha; Sheila Sprague; Michael S Sirkin; David Ring; Gino M M J Kerkhoffs; Ruurd L Jaarsma; Job N Doornberg
Journal:  J Orthop Trauma       Date:  2020-03       Impact factor: 2.512

5.  Explainable machine-learning predictions for the prevention of hypoxaemia during surgery.

Authors:  Scott M Lundberg; Bala Nair; Monica S Vavilala; Mayumi Horibe; Michael J Eisses; Trevor Adams; David E Liston; Daniel King-Wai Low; Shu-Fang Newman; Jerry Kim; Su-In Lee
Journal:  Nat Biomed Eng       Date:  2018-10-10       Impact factor: 25.671

6.  Predicting the Future - Big Data, Machine Learning, and Clinical Medicine.

Authors:  Ziad Obermeyer; Ezekiel J Emanuel
Journal:  N Engl J Med       Date:  2016-09-29       Impact factor: 91.245

7.  Automated detection and classification of the proximal humerus fracture by using deep learning algorithm.

Authors:  Seok Won Chung; Seung Seog Han; Ji Whan Lee; Kyung-Soo Oh; Na Ra Kim; Jong Pil Yoon; Joon Yub Kim; Sung Hoon Moon; Jieun Kwon; Hyo-Jin Lee; Young-Min Noh; Youngjun Kim
Journal:  Acta Orthop       Date:  2018-03-26       Impact factor: 3.717

8.  Prognostic value of serum alkaline phosphatase in spinal metastatic disease.

Authors:  Aditya V Karhade; Quirina C B S Thio; Megna Kuverji; Paul T Ogink; Marco L Ferrone; Joseph H Schwab
Journal:  Br J Cancer       Date:  2019-02-22       Impact factor: 7.640

9.  Diagnostic Efficacy and Therapeutic Decision-making Capacity of an Artificial Intelligence Platform for Childhood Cataracts in Eye Clinics: A Multicentre Randomized Controlled Trial.

Authors:  Haotian Lin; Ruiyang Li; Zhenzhen Liu; Jingjing Chen; Yahan Yang; Hui Chen; Zhuoling Lin; Weiyi Lai; Erping Long; Xiaohang Wu; Duoru Lin; Yi Zhu; Chuan Chen; Dongxuan Wu; Tongyong Yu; Qianzhong Cao; Xiaoyan Li; Jing Li; Wangting Li; Jinghui Wang; Mingmin Yang; Huiling Hu; Li Zhang; Yang Yu; Xuelan Chen; Jianmin Hu; Ke Zhu; Shuhong Jiang; Yalin Huang; Gang Tan; Jialing Huang; Xiaoming Lin; Xinyu Zhang; Lixia Luo; Yuhua Liu; Xialin Liu; Bing Cheng; Danying Zheng; Mingxing Wu; Weirong Chen; Yizhi Liu
Journal:  EClinicalMedicine       Date:  2019-03-17

10.  Near real-time intraoperative brain tumor diagnosis using stimulated Raman histology and deep neural networks.

Authors:  Todd C Hollon; Balaji Pandian; Arjun R Adapa; Esteban Urias; Akshay V Save; Siri Sahib S Khalsa; Daniel G Eichberg; Randy S D'Amico; Zia U Farooq; Spencer Lewis; Petros D Petridis; Tamara Marie; Ashish H Shah; Hugh J L Garton; Cormac O Maher; Jason A Heth; Erin L McKean; Stephen E Sullivan; Shawn L Hervey-Jumper; Parag G Patil; B Gregory Thompson; Oren Sagher; Guy M McKhann; Ricardo J Komotar; Michael E Ivan; Matija Snuderl; Marc L Otten; Timothy D Johnson; Michael B Sisti; Jeffrey N Bruce; Karin M Muraszko; Jay Trautman; Christian W Freudiger; Peter Canoll; Honglak Lee; Sandra Camelo-Piragua; Daniel A Orringer
Journal:  Nat Med       Date:  2020-01-06       Impact factor: 53.440

View more
  5 in total

1.  CORR Insights®: Does Artificial Intelligence Outperform Natural Intelligence in Interpretation of Musculoskeletal Radiological Studies? A Systematic Review.

Authors:  Raphaël Porcher
Journal:  Clin Orthop Relat Res       Date:  2020-12       Impact factor: 4.755

2.  Quality assessment standards in artificial intelligence diagnostic accuracy systematic reviews: a meta-research study.

Authors:  Shruti Jayakumar; Viknesh Sounderajah; Pasha Normahani; Leanne Harling; Sheraz R Markar; Hutan Ashrafian; Ara Darzi
Journal:  NPJ Digit Med       Date:  2022-01-27

3.  Surveillance of atypical femoral fractures in a nationwide fracture register.

Authors:  Hans Peter Bögl; Georg Zdolsek; Lukas Barnisin; Michael Möller; Jörg Schilcher
Journal:  Acta Orthop       Date:  2022-01-11       Impact factor: 3.717

Review 4.  Artificial intelligence and its impact on the domains of universal health coverage, health emergencies and health promotion: An overview of systematic reviews.

Authors:  Antonio Martinez-Millana; Aida Saez-Saez; Roberto Tornero-Costa; Natasha Azzopardi-Muscat; Vicente Traver; David Novillo-Ortiz
Journal:  Int J Med Inform       Date:  2022-08-17       Impact factor: 4.730

5.  Machine learning prediction models in orthopedic surgery: A systematic review in transparent reporting.

Authors:  Olivier Q Groot; Paul T Ogink; Amanda Lans; Peter K Twining; Neal D Kapoor; William DiGiovanni; Bas J J Bindels; Michiel E R Bongers; Jacobien H F Oosterhoff; Aditya V Karhade; F C Oner; Jorrit-Jan Verlaan; Joseph H Schwab
Journal:  J Orthop Res       Date:  2021-03-29       Impact factor: 3.102

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.