| Literature DB >> 36229793 |
Cheng-Bin Huang1,2, Jia-Sen Hu1, Kai Tan1,2, Wei Zhang1, Tian-Hao Xu1,2, Lei Yang3,4.
Abstract
BACKGROUND: With rapid economic development, the world's average life expectancy is increasing, leading to the increasing prevalence of osteoporosis worldwide. However, due to the complexity and high cost of dual-energy x-ray absorptiometry (DXA) examination, DXA has not been widely used to diagnose osteoporosis. In addition, studies have shown that the psoas index measured at the third lumbar spine (L3) level is closely related to bone mineral density (BMD) and has an excellent predictive effect on osteoporosis. Therefore, this study developed a variety of machine learning (ML) models based on psoas muscle tissue at the L3 level of unenhanced abdominal computed tomography (CT) to predict osteoporosis.Entities:
Keywords: Computed tomography; Machine learning; Middle-aged and aged people; Osteoporosis; Psoas; Radiomics
Mesh:
Year: 2022 PMID: 36229793 PMCID: PMC9563158 DOI: 10.1186/s12877-022-03502-9
Source DB: PubMed Journal: BMC Geriatr ISSN: 1471-2318 Impact factor: 4.070
Fig. 1Flow chart showing analyses and model making process for the study. Abbreviations: ML, machine learning; CT, computed tomography; LASSO, least absolute shrinkage and selection operator; LR, Logistic regression; GBM, Gradient boosting machine; RF, Random forest; GNB, Gaussian naïve Bayes; XGBoost, Extreme gradient boosting; SVM, Support vector machines
Fig. 2The psoas muscle at the third lumbar level was segmented in A) transverse plane and B) coronal plane
Comparison of clinical characteristics between two groups
| Age (years) | 62 ± 9 | 63 ± 13 | 0.448 |
| BMI (kg/m2) | 24.07 ± 3.02 | 23.31 ± 3.63 | 0.136 |
| Gender | 0.432 | ||
| Female, n(%) | 59(65.6) | 49(59.8) | |
| Male, n(%) | 31(34.4) | 33(40.2) | |
| Hypertension, n(%) | 57(63.3) | 42(51.2) | 0.108 |
| Diabetes, n(%) | 65(72.2) | 64(78.0) | 0.378 |
| Current drinking, n(%) | 10(11.1) | 11(13.4) | 0.645 |
| Current smoking, n(%) | 12(13.3) | 12(14.6) | 0.806 |
Abbreviations: BMI body mass index
Fig. 3The least absolute shrinkage and selection operator algorithm was applied to select features
Fig. 4ROC curve analysis of machine learning algorithms for prediction of osteoporosis patients in the validation set. Abbreviations: LR, Logistic regression; GBM, Gradient boosting machine; RF, Random forest; GNB, Gaussian naïve Bayes; XGBoost, Extreme gradient boosting; SVM, support vector machines; ROC, receiver operating characteristic; AUC, area under the curve
Predictive performance comparison of the five types of machine learning algorithms in the validation sets
| Model | AUROC | Sensitivity | Specificity | Accuracy |
|---|---|---|---|---|
| LR | 0.85 | 0.73 | 0.86 | 0.80 |
| XGBoost | 0.82 | 0.70 | 0.75 | 0.72 |
| GNB | 0.80 | 0.73 | 0.86 | 0.80 |
| GBM | 0.86 | 0.70 | 0.92 | 0.81 |
| RF | 0.87 | 0.73 | 0.86 | 0.80 |
| SVM | 0.81 | 0.86 | 0.55 | 0.71 |
Abbreviations: LR Logistic regression, GBM Gradient boosting machine, RF Random forest, GNB Gaussian naïve Bayes, XGBoost Extreme gradient boosting, SVM Support vector machines, AUROC area under the receiver operating characteristic
parameters of all machine learning models in this study
| Model | parameters |
|---|---|
| LR | penalty = 'l2', dual = False, tol = 0.0001, C = 1.0, fit_intercept = True, intercept_scaling = 1, class_weight = None, random_state = None, solver = 'lbfgs', max_iter = 100, multi_class = 'auto', verbose = 0, warm_start = False, n_jobs = None, l1_ratio = None |
| XGBoost | n_estimators = 500, learning_rate = 0.5, objective = 'binary:logistic', use_label_encoder = True |
| GNB | priors = None, var_smoothing = 1e-09 |
| GBM | loss = 'deviance', learning_rate = 0.5, n_estimators = 500, subsample = 1.0, criterion = 'friedman_mse', min_samples_split = 2, min_samples_leaf = 1, min_weight_fraction_leaf = 0.0, max_depth = 3, min_impurity_decrease = 0.0, min_impurity_split = None, init = None, random_state = None, max_features = None, verbose = 0, max_leaf_nodes = None, warm_start = False, validation_fraction = 0.1, n_iter_no_change = None, tol = 0.0001, ccp_alpha = 0.0 |
| RF | n_estimators = 100, criterion = 'gini', max_depth = None, min_samples_split = 2, min_samples_leaf = 1, min_weight_fraction_leaf = 0.0, max_features = 'auto', max_leaf_nodes = None, min_impurity_decrease = 0.0, min_impurity_split = None, bootstrap = True, oob_score = False, n_jobs = None, random_state = None, verbose = 0, warm_start = False, class_weight = None, ccp_alpha = 0.0, max_samples = None |
| SVM | C = 2.33, kernel = 'rbf', degree = 3, gamma = 2.15e-04, coef0 = 0.0, shrinking = True, probability = False, tol = 0.001, cache_size = 200, class_weight = None, verbose = False, max_iter = -1, decision_function_shape = 'ovr', break_ties = False, random_state = None |
Abbreviations: LR Logistic regression, GBM Gradient boosting machine, RF Random forest, GNB Gaussian naïve Bayes, XGBoost Extreme gradient boosting, SVM Support vector machines