| Literature DB >> 31667364 |
Junlin Yang1, Kai Zhang2,3, Hengwei Fan1, Zifang Huang4, Yifan Xiang2, Jingfan Yang1, Lin He3, Lei Zhang3, Yahan Yang2, Ruiyang Li2, Yi Zhu2,5, Chuan Chen2,5, Fan Liu3, Haoqing Yang3, Yaolong Deng1, Weiqing Tan6, Nali Deng6, Xuexiang Yu7, Xiaoling Xuan8, Xiaofeng Xie8, Xiyang Liu3, Haotian Lin2,9.
Abstract
Adolescent idiopathic scoliosis is the most common spinal disorder in adolescents with a prevalence of 0.5-5.2% worldwide. The traditional methods for scoliosis screening are easily accessible but require unnecessary referrals and radiography exposure due to their low positive predictive values. The application of deep learning algorithms has the potential to reduce unnecessary referrals and costs in scoliosis screening. Here, we developed and validated deep learning algorithms for automated scoliosis screening using unclothed back images. The accuracies of the algorithms were superior to those of human specialists in detecting scoliosis, detecting cases with a curve ≥20°, and severity grading for both binary classifications and the four-class classification. Our approach can be potentially applied in routine scoliosis screening and periodic follow-ups of pretreatment cases without radiation exposure.Entities:
Keywords: Bone; Translational research
Mesh:
Year: 2019 PMID: 31667364 PMCID: PMC6814825 DOI: 10.1038/s42003-019-0635-8
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Demographic information of the training, internal validation and external validation datasets
| Severity | Total number of subjects | Male | Female | Average severity | Average age (years old) |
|---|---|---|---|---|---|
| Training and internal validation datasets | |||||
| <10° | 745 | 306 | 439 | 6.3° | 15.2 |
| 10–19° | 938 | 312 | 626 | 15.2° | 13.9 |
| 20–44° | 780 | 197 | 583 | 26.9° | 15.4 |
| ≥45° | 777 | 214 | 563 | 56.2° | 16.1 |
| External validation dataset | |||||
| <10° | 100 | 43 | 57 | 6.3° | 14.6 |
| 10–19° | 100 | 32 | 68 | 16.1° | 14.1 |
| 20–44° | 100 | 41 | 59 | 28.4° | 15.3 |
| ≥45° | 100 | 45 | 55 | 51.3° | 15.7 |
Fig. 1Details of the methods. a The entire DLA workflow; b The architecture and workflow of Faster-RCNN; c The architecture of Resnet, where the yellow squares in Restnet are the convolutional kernels in the convolutional layers and the purple squares in Resnet are pooling operations in the pooling layers
Fig. 2ROC curves and the accuracy, specificity, and sensitivity of the binary classification for the internal validation dataset. a The ROC curve and AUC of the DLAs for discerning whether the severity is ≥10°; b The ROC curve and AUC of the DLAs for discerning whether the severity is ≥20°; c The confusion matrix of the DLAs for the four classes of classifications (0–9°, 10–19°, 20–44°, ≥45°). The rows and columns represent the ground-truth label (<10°, 10–19°, 20–44°, ≥45° from top to bottom) and the predicted label (<10°, 10–19°, 20–44°, ≥45° from left to right)
Fig. 3ROC curves and the accuracy, specificity, sensitivity, NPV, and PPV of the binary classification for the external validation dataset. a The ROC curve of the DLAs for discerning whether the severity is ≥10° and the performance of the human experts; b The accuracy, specificity, sensitivity, NPV, and PPV of the DLAs and human experts for discerning whether the severity is ≥10°; c The ROC curve of the DLAs for discerning whether the severity is ≥20° and the performance of the human experts; d The accuracy, specificity, sensitivity, NPV, and PPV of the DLAs and human experts for discerning whether the severity is ≥20°
Fig. 4The confusion matrix of the four classes of classification and their accuracy in the external validation dataset. The rows and columns represent the ground-truth label (<10°, 10–19°, 20–44°, ≥45° from top to bottom) and the predicted label (<10°, 10–19°, 20–44°, ≥45° from left to right). a The confusion matrix of the DLAs for classifying the severity (four classes of classifications: <10°, 10–19°, 20–44°, ≥45°); b–e The confusion matrix of the human experts for classifying the severity (four classes of classifications: <10°, 10–19°, 20–44°, ≥45°); f The accuracy of the DLAs and human experts in classifying the severity (four classes of classifications: <10°, 10–19°, 20–44°, ≥45°)
Fig. 5Heat maps illustrating which parts of the body contributed to the classification results. Heat maps suggested that the features contributing to intelligent discrimination by the DLAs were primarily in the scapular and lumbar regions. The level of trunk asymmetry revealed in the heat maps was associated with the spinal curves of the patients