| Literature DB >> 34674509 |
Sung Hye Kong1,2, Chan Soo Shin1,3.
Abstract
In this unprecedented era of the overwhelming volume of medical data, machine learning can be a promising tool that may shed light on an individualized approach and a better understanding of the disease in the field of osteoporosis research, similar to that in other research fields. This review aimed to provide an overview of the latest studies using machine learning to address issues, mainly focusing on osteoporosis and fractures. Machine learning models for diagnosing and classifying osteoporosis and detecting fractures from images have shown promising performance. Fracture risk prediction is another promising field of research, and studies are being conducted using various data sources. However, these approaches may be biased due to the nature of the techniques or the quality of the data. Therefore, more studies based on the proposed guidelines are needed to improve the technical feasibility and generalizability of artificial intelligence algorithms.Entities:
Keywords: Data science; Medical informatics; Osteoporosis
Mesh:
Substances:
Year: 2021 PMID: 34674509 PMCID: PMC8566132 DOI: 10.3803/EnM.2021.1111
Source DB: PubMed Journal: Endocrinol Metab (Seoul) ISSN: 2093-596X
Fig. 1The trend in the number and categories of machine learning-related publications per year in the field of bone and mineral research. The included publications were from PubMed until the search date (May 30th, 2021). Search strategies were (“Osteoporo-sis”[Mesh] OR “Osteoporotic Fractures”[Mesh] OR “Hip Frac-tures”[Mesh] OR “Spinal Fractures”[Mesh] OR “Humeral Frac-tures”[Mesh] OR “Bone Density”[Mesh] OR Osteoporos*[tiab] OR “fragility fractur*”[tiab] OR (Fractur*[tiab] AND (spin*[tiab] OR vertebra*[tiab] OR hip[tiab] OR humer*[tiab])) OR “bone mineral densit*”[tiab]) AND (“Artificial Intelligence”[Mesh:noexp] OR “machine learning”[Mesh] OR “Neural Networks, Computer” [Mesh] OR “artificial Intelligence”[tiab] OR “machine learning” [tiab] OR “deep learning”[tiab] OR “neural network*”[tiab]) AND English[la]).
Fig. 2Cross table of the relationship between the results of the algorithm and reference standard. AI, artificial intelligence; TP, true positive; FP, false positive; FN, false negative; TN, true negative.
Characteristics and Results of Key Studies Using Machine Learning
| Study | Tasks | Data type | Input data amount | Trained algorithm | Train/validation/test set | Main results | Clinical significance |
|---|---|---|---|---|---|---|---|
| Shim et al. [ | Screening osteoporosis | DB | 1,792 (34% OP) | ANN, RF, LR, SVM, KNN, DT, GBM | 76%/5-fold CV/24% | AUROC | Demonstrated performances of 7 ML models to accurately classify osteoporosis, and found ANN as most accurate methods |
| Yamamoto et al. [ | Screening osteoporosis | X-ray | 1,131 (53% OP) | ResNet-18, resNet-34, GoogleNet, EfficientNet b3, EfficientNet b4 | 80%/10%/10% | EfficientNet b3, accuracy 0.885, recall 0.887, NPV 0.865, F1 score 0.894, AUROC 0.937 | Addition of clinical covariates increased almost all performance metrics in CNN networks over the analysis of hip radiographs alone |
| Yasaka et al. [ | Screening osteoporosis | CT | 2,045 (% not reported) | CNN (4-layer) | 81%/9%/10% (external validation) | AUROC 0.97 | By applying a deep learning technique, the BMD of lumbar vertebrae can be estimated from noncontrast abdominal CT |
| Chung et al. [ | Fracture detection (humerus) | X-ray | 1,891 (69% fracture) | Resnet-152 | 90%/-/10% | AUROC 1.00, sensitivity 0.99, specificity 0.97 | CNN showed superior performance to that of physicians and orthopedists |
| Tomita et al. [ | Fracture detection (vertebra) | CT | 1,432 (50% fracture) | Resnet-LSTM | 80%/10%/10% | Accuracy 0.892, F1 score 0.908 | Accuracy and F1 score of CNN were similar to the radiologists’ performance in detecting fracture |
| Mutasa et al. [ | Fracture detection (hip) | X-ray | 1,063 (69% fracture) | CNN (21- layer) | 72%/18%/10% | AUROC 0.920, accuracy 0.923, sensitivity 0.910, specificity 0.930, PPV 0.960, NPV 0.860 | Data augmentation techniques of generative adversarial networks and digitally reconstructed radiographs showed better performances than those without augmentation |
| Su et al. [ | Fracture prediction (hip) | DB | 5,977 (3% fracture) | CART | 10-fold CV | AUROC 0.73 | Classification of a high-risk group for hip fractures using a classic ML method of CARTs showed a discrimination power similar to that of FRAX ≥3% |
| Almog et al. [ | Fracture prediction (osteoporotic, hip, vertebra) | DB | 630,445 (7% fracture) | Word2Vec, Doc2Vec, LSTM, XGBoost, ensemble | 70%/3-fold CV/30% | AUROC 0.82 | Development of a short-term incident fracture prediction model based on natural language processing methods |
| Muehlematter et al. [ | Fracture prediction (vertebra) | CT | 120 (50% fracture) | ANN, RF, SVM | 67%/10-fold CV/33% | AUROC 0.97 | Bone texture analysis combined with ML allows to identify patients at risk for vertebral fractures on CT scans with high accuracy |
DB, database; OP, osteoporosis; ANN, artificial neural network; RF, random forest; LR, logistic regression; SVM, support vector machine; KNN, k-nearest neighbors; DT, decision tree; GBM, gradient boosting machine; CV, cross validation; AUROC, area under the receiver operating characteristic curve; ML, machine learning; NPV, negative predictive value; CNN, convolutional neural network; CT, computed tomography; BMD, bone mineral density; DXA, dual X-ray absorptiometry; LSTM, long short-term memory; PPV, positive predictive value; CART, classification and regression tree; FRAX, Fracture Risk Assessment Tool.