| Literature DB >> 35136170 |
Koji Tamai1, Hidetomi Terai2, Masatoshi Hoshino2, Akito Yabu2, Hitoshi Tabuchi3,4, Ryo Sasaki2, Hiroaki Nakamura2.
Abstract
The cervical ossification of the posterior longitudinal ligament (cOPLL) is sometimes misdiagnosed or overlooked on radiography. Thus, this study aimed to validate the diagnostic yield of our deep learning algorithm which diagnose the presence/absence of cOPLL on cervical radiography and highlighted areas of ossification in positive cases and compare its diagnostic accuracy with that of experienced spine physicians. Firstly, the radiographic data of 486 patients (243 patients with cOPLL and 243 age and sex matched controls) who received cervical radiography and a computer tomography were used to create the deep learning algorithm. The diagnostic accuracy of our algorithm was 0.88 (area under curve, 0.94). Secondly, the numbers of correct diagnoses were compared between the algorithm and consensus of four spine physicians using 50 independent samples. The algorithm had significantly more correct diagnoses than spine physicians (47/50 versus 39/50, respectively; p = 0.041). In conclusion, the accuracy of our deep learning algorithm for cOPLL diagnosis was significantly higher than that of experienced spine physicians. We believe our algorithm, which uses different diagnostic criteria than humans, can significantly improve the diagnostic accuracy of cOPLL when radiography is used.Entities:
Mesh:
Year: 2022 PMID: 35136170 PMCID: PMC8826389 DOI: 10.1038/s41598-022-06140-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic data.
| OPLL group | Control group | ||
|---|---|---|---|
| Number of patients | 243 | 243 | |
| Average age | 63.5 ± 10.1 | 64.9 ± 11.2 | 0.891# |
| 0.850* | |||
| Female | 86 | 89 | |
| Male | 157 | 154 | |
| 1.000* | |||
| Institution A | 112 | 112 | |
| Institution B | 83 | 83 | |
| Institution C | 48 | 48 | |
| Continuous | 22 | – | |
| Segmental | 67 | – | |
| Mixed | 110 | – | |
| Localized | 44 | – | |
| Upper-to-middle | 65 | – | |
| Middle | 67 | – | |
| Middle-to-lower | 86 | – | |
| Whole cervical | 25 | – | |
#Mann–Whitney U test, *Chi-squared test.
OPLL, ossification of the posterior longitudinal ligament.
Diagnostic results of the deep learning algorithm (n = 486).
| TP | FP | FN | TN | Accuracy | Precision | Recall | |
|---|---|---|---|---|---|---|---|
| Overall | 219 | 34 | 24 | 209 | 0.88 | 0.86 | 0.90 |
| Institution A | 99 | 22 | 15 | 92 | 0.85 | 0.82 | 0.87 |
| Institution B | 79 | 7 | 4 | 76 | 0.93 | 0.92 | 0.95 |
| Institution C | 41 | 5 | 5 | 41 | 0.90 | 0.89 | 0.89 |
| Continuous | 20 | – | 2 | – | – | – | 0.91 |
| Segmental | 57 | – | 10 | – | – | – | 0.85 |
| Mixed | 106 | – | 4 | – | – | – | 0.96 |
| Localized | 36 | – | 8 | – | – | – | 0.82 |
| Upper to middle | 60 | – | 5 | – | – | – | 0.92 |
| Middle | 58 | – | 9 | – | – | – | 0.87 |
| Middle to lower | 76 | – | 10 | – | – | – | 0.88 |
| Whole cervical | 25 | – | 0 | – | – | – | 1.00 |
TP, true positive; FP, false positive; FN, false negative; TN, true negative; OPLL, ossification of the posterior longitudinal ligament.
Figure 1The ROC curve of the diagnostic accuracy of the deep learning algorithm is shown. ROC, receiver operating characteristic.
Figure 2Representative images used and created by the deep learning algorithm are shown. The left image shows the cervical plain radiograph used in the deep learning algorithm. Images created by our algorithm are shown on the center. The right image shows a sagittal slice of the computed tomography image used as the ground truth, but not used in the algorithm. The algorithm was designed to highlight areas of suspected ossification of the posterior longitudinal ligament (OPLL) when OPLL was identified in an image. (A) An image from a 47-year-old women with a continuous-type OPLL from C2–C4 is shown. (B) An image from a 56-year-old man with a small segmental OPLL at C5 and C6 is shown. (C) An image from a 63-year-old man without cervical OPLL is shown.
Diagnostic accuracy of the deep learning algorithm and four spine surgeons (n = 50).
| TP | FP | FN | TN | Accuracy | |
|---|---|---|---|---|---|
| Deep learning algorithm | 24 | 1 | 2 | 23 | 0.92 |
| Surgeon 1 (> 25 y exp.) | 22 | 3 | 7 | 18 | 0.80 |
| Surgeon 2 (> 20 y exp.) | 20 | 5 | 8 | 17 | 0.74 |
| Surgeon 3 (> 10 y exp.) | 21 | 4 | 8 | 17 | 0.76 |
| Surgeon 4 (> 5 y exp.) | 23 | 2 | 9 | 16 | 0.78 |
TP, true positive; FP, false positive; FN, false negative; TN, true negative; y, years; exp, experience.
Figure 3Images in which only the algorithm could identify an ossification of the posterior longitudinal ligament (OPLL) are shown. (A) An image from a 56-year-old woman with a small segmental OPLL at C5 is shown. (B) An image from a 72-year-old man with an OPLL at C5–C6 is shown.
Comparison of the diagnostic accuracy between the deep learning algorithm and the consensus of four spine physicians (n = 50).
| TP + TN | FP + FN | Accuracy | ||
|---|---|---|---|---|
| Deep learning algorithm | 47 | 3 | 0.041* | 0.92 |
| Surgeons’ consensus | 39 | 11 | 0.78 |
*Chi-square test.
TP, true positive; FP, false positive; FN, false negative; TN, true negative.
Figure 4Illustration of study process. Lateral cervical plain radiographies of all patients were extracted as jpeg files from the DICOM database. As annotation phase, an independent researcher manually painted the ossification area in the cases with OPLL on jpeg images of radiography with the reference of CT images. Subsequently, the painted image was divided into mask images for ground truth and original image, and both were used to construct the CNN. In the cases without OPLL on referenced CT image, all-black mask images were created as ground truth for CNN.