| Literature DB >> 35144529 |
Liyu Liu1, Meng Si2, Hecheng Ma2, Menglin Cong2, Quanzheng Xu1, Qinghua Sun1, Weiming Wu1, Cong Wang1, Michael J Fagan3, Luis A J Mur4, Qing Yang5, Bing Ji6.
Abstract
BACKGROUND: Osteoporosis is a common metabolic skeletal disease and usually lacks obvious symptoms. Many individuals are not diagnosed until osteoporotic fractures occur. Bone mineral density (BMD) measured by dual-energy X-ray absorptiometry (DXA) is the gold standard for osteoporosis detection. However, only a limited percentage of people with osteoporosis risks undergo the DXA test. As a result, it is vital to develop methods to identify individuals at-risk based on methods other than DXA.Entities:
Keywords: CT; Clinical data; Machine learning; Opportunistic screening; Osteoporosis
Mesh:
Year: 2022 PMID: 35144529 PMCID: PMC8829991 DOI: 10.1186/s12859-022-04596-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Schematic elaboration of the study design
Fig. 2Flow chart of participants selection
Fig. 3Sagittal CT images of participants with osteoporosis and normal BMD
Demographic characteristics of each subgroup
| Group | Condition | Number | Age | ||
|---|---|---|---|---|---|
| Clinical data group | Osteoporosis | 943 | 2188 | 65.67 | 0.000 |
| Normal | 1245 | 55.77 | |||
| Image group | Osteoporosis | 126 | 268 | 65.66 | 0.000 |
| Normal | 142 | 58.61 | |||
| Combination group | Osteoporosis | 117 | 246 | 65.68 | 0.000 |
| Normal | 129 | 58.50 | |||
Age was expressed as mean standard deviation and P-value was used to compare the difference between individuals with osteoporosis and normal BMD in age
Classification performance of each classifier on clinical data and CT images
| Features | LR | SVM | ANN | RF | XGBoost | Stacking | N | |
|---|---|---|---|---|---|---|---|---|
| DC | Training | 0.798 | 0.800 | 0.795 | 0.836 | 0.828 | 0.819 | 2188 |
| Test | 0.805 | 0.806 | 0.798 | 0.809 | 0.808 | |||
| RLT | Training | 0.696 | 0.725 | 0.702 | 0.856 | 0.828 | 0.819 | |
| Test | 0.677 | 0.680 | 0.687 | 0.687 | ||||
| CD | Training | 0.815 | 0.837 | 0.818 | 0.898 | 0.893 | 0.872 | |
| Test | 0.813 | 0.824 | 0.815 | 0.820 | 0.820 | |||
| TFs | Training | 0.970 | 0.971 | 0.942 | 0.989 | 0.976 | 0.976 | 268 |
| Test | 0.949 | 0.951 | 0.929 | 0.947 | 0.933 | |||
| SFs | Training | 0.869 | 0.882 | 0.829 | 0.945 | 0.905 | 0.892 | |
| Test | 0.855 | 0.875 | 0.850 | 0.867 | 0.853 | |||
| IFs* | Training | 0.977 | 0.979 | 0.932 | 0.978 | 0.973 | 0.982 | |
| Test | 0.950 | 0.957 | 0.931 | 0.938 | 0.959 |
DC Demographic Characteristics; RLT Routine Laboratory Tests; CD Clinical Data; TFs Texture Features; SFs Shape Features; IFs Image Features
The performances of each classifier were evaluated by the mean of five repeated experiments
The highest values among the six classifiers for each feature set in test set were highlighted in bold
*Image features included texture and shape features
Performance of proposed model based on each classifier
| Layers | LR | SVM | ANN | RF | XGBoost | Stacking | |
|---|---|---|---|---|---|---|---|
| 1st Layer* | Training | 0.848 | 0.849 | 0.823 | 0.899 | 0.897 | 0.883 |
| Test | 0.810 | 0.812 | 0.807 | 0.804 | 0.808 | ||
| 2nd Layer* | Training | 0.878 | 0.887 | 0.827 | 0.944 | 0.958 | 0.927 |
| Test | 0.838 | 0.816 | 0.849 | 0.837 | 0.846 | ||
| 3rd Layer* | Training | 0.983 | 0.980 | 0.909 | 0.995 | 0.993 | 0.989 |
| Test | 0.960 | 0.917 | 0.950 | 0.947 | 0.960 |
The highest values among the six classifiers for each feature set in test set were highlighted in bold
*The first and second layer utilized demographic characteristics and clinical data respectively, while the third layer utilized clinical data and texture features
Fig. 4ROC curves of the proposed three-layer model based on a LR, b SVM, c ANN, d RF, e XGBoost and f stacking respectively on combination group
Fig. 5Feature importance for a LR, b RF and c XGBoost. These features were represented briefly by the combination of category and number. The details are listed in Additional file 1: Table S6
Fig. 6Decision boundary of LR based on Texture 5 and 2 (shown in a) as well as Texture 8 and 3 (shown in b). The gray and light blue area represent osteoporosis and normal individuals, respectively. Red and blue dots represent the samples labelled by osteoporosis and non-osteoporosis, respectively. Samples marked with yellow stars (such as dots marked with 1 and 2) represent the ones that are incorrectly classified by LR in five repeated experiments. All incorrectly classified samples are marked with yellow stars in (a), while only two are marked in (b)