| Literature DB >> 35629187 |
Jong-Ho Kim1,2, So-Eun Lee3, Hee-Sun Jung3, Bo-Seok Shim3, Jong-Uk Hou3, Young-Suk Kwon1,2.
Abstract
Lumbar herniated nucleus pulposus (HNP) is difficult to diagnose using lumbar radiography. HNP is typically diagnosed using magnetic resonance imaging (MRI). This study developed and validated an artificial intelligence model that predicts lumbar HNP using lumbar radiography. A total of 180,271 lumbar radiographs were obtained from 34,661 patients in the form of lumbar X-ray and MRI images, which were matched together and labeled accordingly. The data were divided into a training set (31,149 patients and 162,257 images) and a test set (3512 patients and 18,014 images). Training data were used for learning using the EfficientNet-B5 model and four-fold cross-validation. The area under the curve (AUC) of the receiver operating characteristic (ROC) for the prediction of lumbar HNP was 0.73. The AUC of the ROC for predicting lumbar HNP in L (lumbar) 1-2, L2-3, L3-4, L4-5, and L5-S (sacrum)1 levels were 0.68, 0.68, 0.63, 0.67, and 0.72, respectively. Finally, an HNP prediction model was developed, although it requires further improvements.Entities:
Keywords: deep learning; herniated nucleus pulposus; lumbar X-ray; prediction
Year: 2022 PMID: 35629187 PMCID: PMC9145973 DOI: 10.3390/jpm12050767
Source DB: PubMed Journal: J Pers Med ISSN: 2075-4426
Each lumbar level distribution of lumbar herniated nucleus pulposus according to each fold in the training set.
| L1-2 | L2-3 | L3-4 | L4-5 | L5-S1 | ||
|---|---|---|---|---|---|---|
| Fold 0 | No-HNP | 39,331 | 38,166 | 36,058 | 28,765 | 30,559 |
| HNP | 1360 | 2525 | 4633 | 11,926 | 10,132 | |
| Sum | 40,691 | 40,691 | 40,691 | 40,691 | 40,691 | |
| Fold 1 | No-HNP | 39,306 | 38,174 | 35,929 | 28,981 | 30,532 |
| HNP | 1282 | 2414 | 4659 | 11,607 | 10,056 | |
| Sum | 40,558 | 40,558 | 40,558 | 40,558 | 40,558 | |
| Fold 2 | No-HNP | 39,108 | 37,779 | 35,902 | 28,619 | 30,393 |
| HNP | 1263 | 2592 | 4469 | 11,752 | 9978 | |
| Sum | 40,371 | 40,371 | 40,371 | 40,371 | 40,371 | |
| Fold 3 | No-HNP | 39,471 | 38,244 | 36,201 | 28,578 | 30,411 |
| HNP | 1135 | 2362 | 4405 | 12,028 | 10,195 | |
| Sum | 40,606 | 40,606 | 40,606 | 40,606 | 40,606 |
L, lumbar; S, sacrum; HNP, herniated nucleus pulposus.
Precision, recall, f-1 score, accuracy, and ROC AUC for HNP presence prediction.
| Fold 0 | Fold 1 | Fold 2 | Fold 3 | ||
|---|---|---|---|---|---|
| Precision | 0.67 | 0.68 | 0.66 | 0.65 | |
| No-HNP | Recall | 0.66 | 0.69 | 0.67 | 0.7 |
| F1-score | 0.67 | 0.68 | 0.66 | 0.67 | |
| Precision | 0.67 | 0.68 | 0.68 | 0.68 | |
| HNP | Recall | 0.68 | 0.67 | 0.66 | 0.63 |
| F1-score | 0.67 | 0.67 | 0.67 | 0.66 | |
| Accuracy | 62.3% | 66.1% | 58.8% | 65.2% | |
| AUC | 0.73 | 0.74 | 0.71 | 0.73 |
HNP, herniated nucleus pulposus; AUC, area under curve.
Results for predicting the presence of HNP in each level of lumbar through four-fold cross-validation.
| Fold 0 | Fold 1 | Fold 2 | Fold 3 | ||
|---|---|---|---|---|---|
| L1-2 | Precision | 0.06 | 0.07 | 0.06 | 0.05 |
| Recall | 0.7 | 0.64 | 0.72 | 0.73 | |
| F1-score | 0.11 | 0.12 | 0.12 | 0.1 | |
| AUC | 0.69 | 0.73 | 0.74 | 0.74 | |
| L2-3 | Precision | 0.11 | 0.12 | 0.13 | 0.12 |
| Recall | 0.62 | 0.64 | 0.58 | 0.65 | |
| F1-score | 0.19 | 0.2 | 0.12 | 0.2 | |
| AUC | 0.7 | 0.71 | 0.69 | 0.73 | |
| L3-4 | Precision | 0.2 | 0.2 | 0.19 | 0.19 |
| Recall | 0.56 | 0.57 | 0.62 | 0.57 | |
| F1-score | 0.3 | 0.29 | 0.29 | 0.29 | |
| AUC | 0.68 | 0.68 | 0.69 | 0.68 | |
| L4-5 | precision | 0.43 | 0.41 | 0.41 | 0.44 |
| Recall | 0.68 | 0.71 | 0.67 | 0.66 | |
| F1-score | 0.52 | 0.52 | 0.51 | 0.53 | |
| AUC | 0.7 | 0.7 | 0.68 | 0.71 | |
| L5-S1 | Precision | 0.4 | 0.39 | 0.39 | 0.41 |
| Recall | 0.68 | 0.73 | 0.64 | 0.64 | |
| F1-score | 0.51 | 0.5 | 0.48 | 0.5 | |
| AUC | 0.73 | 0.72 | 0.7 | 0.72 | |
| Accuracy | 65.5% | 67.2% | 66.7% | 67.0% |
L, lumbar; S, sacrum; AUC, area under curve.
Multitask results for predicting the presence of HNP in each level of lumbar through four-fold cross-validation.
| Fold 0 | Fold 1 | Fold 2 | Fold 3 | ||
|---|---|---|---|---|---|
| L1-2 | Precision | 0.05 | 0.05 | 0.06 | 0.05 |
| Recall | 0.66 | 0.66 | 0.64 | 0.71 | |
| F1-score | 0.09 | 0.1 | 0.1 | 0.1 | |
| AUC | 0.64 | 0.67 | 0.68 | 0.69 | |
| L2-3 | Precision | 0.09 | 0.09 | 0.11 | 0.08 |
| Recall | 0.74 | 0.74 | 0.54 | 0.77 | |
| F1-score | 0.16 | 0.15 | 0.19 | 0.15 | |
| AUC | 0.66 | 0.65 | 0.66 | 0.66 | |
| L3-4 | Precision | 0.18 | 0.16 | 0.16 | 0.17 |
| Recall | 0.55 | 0.59 | 0.62 | 0.51 | |
| F1-score | 0.28 | 0.25 | 0.26 | 0.26 | |
| AUC | 0.66 | 0.63 | 0.65 | 0.64 | |
| L4-5 | Presicion | 0.42 | 0.4 | 0.42 | 0.4 |
| Recall | 0.68 | 0.71 | 0.66 | 0.74 | |
| F1-score | 0.52 | 0.51 | 0.51 | 0.52 | |
| AUC | 0.7 | 0.69 | 0.69 | 0.69 | |
| L5-S1 | Precision | 0.4 | 0.39 | 0.39 | 0.37 |
| Recall | 0.63 | 0.73 | 0.67 | 0.65 | |
| F1-score | 0.49 | 0.5 | 0.49 | 0.47 | |
| AUC | 0.71 | 0.72 | 0.7 | 0.69 | |
| Accuracy | 61.5% | 60.1% | 65.0% | 60.5% |
L, lumbar; S, sacrum; AUC, area under curve.
Evaluation results of the final deep learning model using the test set for predicting HNP presence in each lumbar level.
| Precision | Recall | F1-Score | ROC AUC | Accuracy | |
|---|---|---|---|---|---|
| L1-2 | 0.05 | 0.6 | 0.09 | 0.68 | 69.1% |
| L2-3 | 0.09 | 0.65 | 0.15 | 0.68 | |
| L3-4 | 0.18 | 0.38 | 0.24 | 0.63 | |
| L4-5 | 0.41 | 0.47 | 0.43 | 0.67 | |
| L5-S1 | 0.36 | 0.55 | 0.43 | 0.72 |
HNP, herniated nucleus pulposus; ROC, receiver operating characteristic; AUC, area under the curve.
Figure 1Gradient-weighted class activation mapping for the prediction model of lumbar HNP. HNP, herniated nucleus pulposus; L, lumbar; S, sacrum. (a) Heatmap in lateral view of the disk and vertebrae area (HNP in L4-5 and L5-S1). (b) Heatmap in anteroposterior view of the disk and vertebrae area (HNP in L5-S1). (c) Heat map along the posterior vertebral line (HNP in L4-5 and L5-S1). (d) Heap map in the area unrelated with HNP (HNP in L4-5 and L5-S1).