Literature DB >> 33651768

Can a Deep-learning Model for the Automated Detection of Vertebral Fractures Approach the Performance Level of Human Subspecialists?

Yi-Chu Li1, Hung-Hsun Chen2, Henry Horng-Shing Lu3, Hung-Ta Hondar Wu4,5, Ming-Chau Chang4,6, Po-Hsin Chou4,6.   

Abstract

BACKGROUND: Vertebral fractures are the most common osteoporotic fractures in older individuals. Recent studies suggest that the performance of artificial intelligence is equal to humans in detecting osteoporotic fractures, such as fractures of the hip, distal radius, and proximal humerus. However, whether artificial intelligence performs as well in the detection of vertebral fractures on plain lateral spine radiographs has not yet been reported. QUESTIONS/PURPOSES: (1) What is the accuracy, sensitivity, specificity, and interobserver reliability (kappa value) of an artificial intelligence model in detecting vertebral fractures, based on Genant fracture grades, using plain lateral spine radiographs compared with values obtained by human observers? (2) Do patients' clinical data, including the anatomic location of the fracture (thoracic or lumbar spine), T-score on dual-energy x-ray absorptiometry, or fracture grade severity, affect the performance of an artificial intelligence model? (3) How does the artificial intelligence model perform on external validation?
METHODS: Between 2016 and 2018, 1019 patients older than 60 years were treated for vertebral fractures in our institution. Seventy-eight patients were excluded because of missing CT or MRI scans (24% [19]), poor image quality in plain lateral radiographs of spines (54% [42]), multiple myeloma (5% [4]), and prior spine instrumentation (17% [13]). The plain lateral radiographs of 941 patients (one radiograph per person), with a mean age of 76 ± 12 years, and 1101 vertebral fractures between T7 and L5 were retrospectively evaluated for training (n = 565), validating (n = 188), and testing (n = 188) of an artificial intelligence deep-learning model. The gold standard for diagnosis (ground truth) of a vertebral fracture is the interpretation of the CT or MRI reports by a spine surgeon and a radiologist independently. If there were any disagreements between human observers, the corresponding CT or MRI images would be rechecked by them together to reach a consensus. For the Genant classification, the injured vertebral body height was measured in the anterior, middle, and posterior third. Fractures were classified as Grade 1 (< 25%), Grade 2 (26% to 40%), or Grade 3 (> 40%). The framework of the artificial intelligence deep-learning model included object detection, data preprocessing of radiographs, and classification to detect vertebral fractures. Approximately 90 seconds was needed to complete the procedure and obtain the artificial intelligence model results when applied clinically. The accuracy, sensitivity, specificity, interobserver reliability (kappa value), receiver operating characteristic curve, and area under the curve (AUC) were analyzed. The bootstrapping method was applied to our testing dataset and external validation dataset. The accuracy, sensitivity, and specificity were used to investigate whether fracture anatomic location or T-score in dual-energy x-ray absorptiometry report affected the performance of the artificial intelligence model. The receiver operating characteristic curve and AUC were used to investigate the relationship between the performance of the artificial intelligence model and fracture grade. External validation with a similar age population and plain lateral radiographs from another medical institute was also performed to investigate the performance of the artificial intelligence model.
RESULTS: The artificial intelligence model with ensemble method demonstrated excellent accuracy (93% [773 of 830] of vertebrae), sensitivity (91% [129 of 141]), and specificity (93% [644 of 689]) for detecting vertebral fractures of the lumbar spine. The interobserver reliability (kappa value) of the artificial intelligence performance and human observers for thoracic and lumbar vertebrae were 0.72 (95% CI 0.65 to 0.80; p < 0.001) and 0.77 (95% CI 0.72 to 0.83; p < 0.001), respectively. The AUCs for Grades 1, 2, and 3 vertebral fractures were 0.919, 0.989, and 0.990, respectively. The artificial intelligence model with ensemble method demonstrated poorer performance for discriminating normal osteoporotic lumbar vertebrae, with a specificity of 91% (260 of 285) compared with nonosteoporotic lumbar vertebrae, with a specificity of 95% (222 of 234). There was a higher sensitivity 97% (60 of 62) for detecting osteoporotic (dual-energy x-ray absorptiometry T-score ≤ -2.5) lumbar vertebral fractures, implying easier detection, than for nonosteoporotic vertebral fractures (83% [39 of 47]). The artificial intelligence model also demonstrated better detection of lumbar vertebral fractures compared with detection of thoracic vertebral fractures based on the external dataset using various radiographic techniques. Based on the dataset for external validation, the overall accuracy, sensitivity, and specificity on bootstrapping method were 89%, 83%, and 95%, respectively.
CONCLUSION: The artificial intelligence model detected vertebral fractures on plain lateral radiographs with high accuracy, sensitivity, and specificity, especially for osteoporotic lumbar vertebral fractures (Genant Grades 2 and 3). The rapid reporting of results using this artificial intelligence model may improve the efficiency of diagnosing vertebral fractures. The testing model is available at http://140.113.114.104/vght_demo/corr/. One or multiple plain lateral radiographs of the spine in the Digital Imaging and Communications in Medicine format can be uploaded to see the performance of the artificial intelligence model. LEVEL OF EVIDENCE: Level II, diagnostic study.
Copyright © 2021 by the Association of Bone and Joint Surgeons.

Entities:  

Mesh:

Year:  2021        PMID: 33651768      PMCID: PMC8208416          DOI: 10.1097/CORR.0000000000001685

Source DB:  PubMed          Journal:  Clin Orthop Relat Res        ISSN: 0009-921X            Impact factor:   4.755


  29 in total

Review 1.  Diagnosis of osteoporotic vertebral fractures: importance of recognition and description by radiologists.

Authors:  Leon Lenchik; Lee F Rogers; Pierre D Delmas; Harry K Genant
Journal:  AJR Am J Roentgenol       Date:  2004-10       Impact factor: 3.959

Review 2.  Schmorl's nodes.

Authors:  Kwaku A Kyere; Khoi D Than; Anthony C Wang; Shayan U Rahman; Juan M Valdivia-Valdivia; Frank La Marca; Paul Park
Journal:  Eur Spine J       Date:  2012-04-28       Impact factor: 3.134

3.  Underdiagnosis of vertebral fractures is a worldwide problem: the IMPACT study.

Authors:  Pierre D Delmas; Lex van de Langerijt; Nelson B Watts; Richard Eastell; Harry Genant; Andreas Grauer; David L Cahall
Journal:  J Bone Miner Res       Date:  2004-12-06       Impact factor: 6.741

4.  An estimate of the worldwide prevalence and disability associated with osteoporotic fractures.

Authors:  O Johnell; J A Kanis
Journal:  Osteoporos Int       Date:  2006-09-16       Impact factor: 4.507

5.  Systematic Radiation Dose Reduction in Cervical Spine CT of Human Cadaveric Specimens: How Low Can We Go?

Authors:  M Tozakidou; C Reisinger; D Harder; J Lieb; Z Szucs-Farkas; M Müller-Gerbl; U Studler; S Schindera; A Hirschmann
Journal:  AJNR Am J Neuroradiol       Date:  2017-12-21       Impact factor: 3.825

6.  Vertebral Body Compression Fractures and Bone Density: Automated Detection and Classification on CT Images.

Authors:  Joseph E Burns; Jianhua Yao; Ronald M Summers
Journal:  Radiology       Date:  2017-03-16       Impact factor: 11.105

7.  Vertebral fracture assessment using a semiquantitative technique.

Authors:  H K Genant; C Y Wu; C van Kuijk; M C Nevitt
Journal:  J Bone Miner Res       Date:  1993-09       Impact factor: 6.741

Review 8.  Back pain in osteoporotic vertebral fractures.

Authors:  R M Francis; T J Aspray; G Hide; A M Sutcliffe; P Wilkinson
Journal:  Osteoporos Int       Date:  2007-12-11       Impact factor: 4.507

9.  Assessment of osteoporotic vertebral fractures using specialized workflow software for 6-point morphometry.

Authors:  Giuseppe Guglielmi; Francesco Palmieri; Maria Grazia Placentino; Francesco D'Errico; Luca Pio Stoppino
Journal:  Eur J Radiol       Date:  2008-02-01       Impact factor: 3.528

Review 10.  Friend or foe: high bone mineral density on routine bone density scanning, a review of causes and management.

Authors:  Celia L Gregson; Sarah A Hardcastle; Cyrus Cooper; Jonathan H Tobias
Journal:  Rheumatology (Oxford)       Date:  2013-02-27       Impact factor: 7.580

View more
  5 in total

Review 1.  Real-world analysis of artificial intelligence in musculoskeletal trauma.

Authors:  Pranav Ajmera; Amit Kharat; Rajesh Botchu; Harun Gupta; Viraj Kulkarni
Journal:  J Clin Orthop Trauma       Date:  2021-08-27

Review 2.  Current understanding on artificial intelligence and machine learning in orthopaedics - A scoping review.

Authors:  Vishal Kumar; Sandeep Patel; Vishnu Baburaj; Aditya Vardhan; Prasoon Kumar Singh; Raju Vaishya
Journal:  J Orthop       Date:  2022-08-26

Review 3.  Current development and prospects of deep learning in spine image analysis: a literature review.

Authors:  Biao Qu; Jianpeng Cao; Chen Qian; Jinyu Wu; Jianzhong Lin; Liansheng Wang; Lin Ou-Yang; Yongfa Chen; Liyue Yan; Qing Hong; Gaofeng Zheng; Xiaobo Qu
Journal:  Quant Imaging Med Surg       Date:  2022-06

4.  A software program for automated compressive vertebral fracture detection on elderly women's lateral chest radiograph: Ofeye 1.0.

Authors:  Ben-Heng Xiao; Michael S Y Zhu; Er-Zhu Du; Wei-Hong Liu; Jian-Bing Ma; Hua Huang; Jing-Shan Gong; Davide Diacinti; Kun Zhang; Bo Gao; Heng Liu; Ri-Feng Jiang; Zhong-You Ji; Xiao-Bao Xiong; Lai-Chang He; Lei Wu; Chuan-Jun Xu; Mei-Mei Du; Xiao-Rong Wang; Li-Mei Chen; Kong-Yang Wu; Liu Yang; Mao-Sheng Xu; Daniele Diacinti; Qi Dou; Timothy Y C Kwok; Yì Xiáng J Wáng
Journal:  Quant Imaging Med Surg       Date:  2022-08

5.  A Classification Method for Thoracolumbar Vertebral Fractures due to Basketball Sports Injury Based on Deep Learning.

Authors:  XiaoGan Chen; Yu Liu
Journal:  Comput Math Methods Med       Date:  2022-10-05       Impact factor: 2.809

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.