| Literature DB >> 35476649 |
Hee-Dong Chae1,2, Sung Hwan Hong1,2,3, Hyun Jung Yeoh1,2, Yeo Ryang Kang1,2, Su Min Lee1,2, Minyoung Kim1,2, Seok Young Koh1,2, Yongeun Lee4, Moo Sung Park4, Ja-Young Choi1,2,3, Hye Jin Yoo1,2.
Abstract
BACKGROUND: A high false-negative rate has been reported for the diagnosis of ossification of the posterior longitudinal ligament (OPLL) using plain radiography. We investigated whether deep learning (DL) can improve the diagnostic performance of radiologists for cervical OPLL using plain radiographs.Entities:
Mesh:
Year: 2022 PMID: 35476649 PMCID: PMC9045646 DOI: 10.1371/journal.pone.0267643
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Flow diagram of data inclusion and allocation.
Patient characteristics.
| Training-validation set | Test set | P value | ||
|---|---|---|---|---|
| with OPLL | without OPLL | |||
|
| 207 | 100 | 100 | |
|
| 915 | 100 | 100 | |
|
| 59.7 ± 9.9 | 59.4 ± 11.3 | 51.1 ± 16.6 | < 0.001 |
|
| 157 (75.8) | 68 (68) | 54 (54) | < 0.001 |
|
| 0.667 | |||
| Continuous | 38 | 16 | ||
| Segmental | 122 | 62 | ||
| Mixed | 30 | 17 | ||
| Circumscribed | 17 | 5 | ||
OPLL, ossification of the posterior longitudinal ligament.
Fig 2The architecture of the deep convolutional neural network used for OPLL segmentation.
OPLL, ossification of the posterior longitudinal ligament.
Results of the observer performance test, including subgroup analysis, according to the morphologic subtype of OPLL.
| Per-vertebra analysis | Patient level | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total | Continuous | Segmental | Mixed | Circumscribed | Total | |||||||
| AUC (95% CI) | AUC (95% CI) | AUC (95% CI) | AUC (95% CI) | AUC (95% CI) | AUC (95% CI) | |||||||
| DL model | 0.854 (0.828–0.880) | - | 0.897 (0.839–0.956) | - | 0.825 (0.786–0.864) | - | 0.881 (0.845–0.916) | - | 0.819 (0.637–1.001) | - | 0.851 (0.799–0.903) | - |
| 1st session (DL model vs. Observer) | ||||||||||||
| Staff 1 | 0.880 (0.846–0.914) | 0.163 | 0.980 (0.961–0.999) | 0.006 | 0.822 (0.772–0.872) | 0.928 | 0.941 (0.888–0.995) | 0.025 | 0.836 (0.751–0.920) | 0.812 | 0.876 (0.832–0.920) | 0.425 |
| Staff 2 | 0.847 (0.812–0.881) | 0.683 | 0.949 (0.905–0.993) | 0.125 | 0.788 (0.744–0.833) | 0.143 | 0.926 (0.882–0.970) | 0.065 | 0.637 (0.367–0.907) | 0.286 | 0.837 (0.788–0.886) | 0.683 |
| Fellow 1 | 0.868 (0.831–0.904) | 0.481 | 0.975 (0.946–1.004) | 0.009 | 0.807 (0.755–0.859) | 0.552 | 0.926 (0.863–0.990) | 0.145 | 0.763 (0.522–1.005) | 0.605 | 0.878 (0.831–0.925) | 0.407 |
| Fellow 2 | 0.788 (0.745–0.832) | 0.002 | 0.944 (0.898–0.990) | 0.150 | 0.707 (0.655–0.758) | < 0.001 | 0.852 (0.764–0.940) | 0.497 | 0.621 (0.374–0.867) | 0.101 | 0.847 (0.795–0.898) | 0.891 |
| Resident 1 | 0.821 (0.782–0.859) | 0.089 | 0.940 (0.913–0.966) | 0.174 | 0.763 (0.712–0.814) | 0.027 | 0.898 (0.853–0.943) | 0.442 | 0.664 (0.328–1.001) | 0.483 | 0.805 (0.745–0.864) | 0.182 |
| Resident 2 | 0.754 (0.707–0.801) | < 0.001 | 0.928 (0.871–0.986) | 0.417 | 0.633 (0.581–0.685) | < 0.001 | 0.879 (0.817–0.942) | 0.972 | 0.584 (0.349–0.819) | 0.102 | 0.804 (0.744–0.863) | 0.214 |
| Average Observers | 0.826 (0.772–0.881) | 0.292 | 0.953 (0.921–0.985) | 0.069 | 0.753 (0.677–0.830) | 0.073 | 0.904 (0.851–0.957) | 0.369 | 0.684 (0.507–0.861) | 0.291 | 0.841 (0.797–0.885) | 0.739 |
| 2nd session (without DL model vs. with DL model) | ||||||||||||
| Staff 1 | 0.903 (0.872–0.933) | < 0.001 | 0.990 (0.981–1.000) | 0.039 | 0.853 (0.808–0.899) | < 0.001 | 0.958 (0.909–1.006) | 0.013 | 0.862 (0.787–0.938) | < 0.001 | 0.921 (0.887–0.955) | < 0.001 |
| Staff 2 | 0.920 (0.896–0.944) | < 0.001 | 0.977 (0.955–0.998) | 0.029 | 0.887 (0.854–0.920) | < 0.001 | 0.961 (0.917–1.004) | 0.101 | 0.805 (0.607–1.004) | 0.002 | 0.936 (0.906–0.965) | < 0.001 |
| Fellow 1 | 0.912 (0.886–0.938) | 0.002 | 0.979 (0.954–1.005) | 0.253 | 0.861 (0.821–0.902) | 0.012 | 0.977 (0.956–0.999) | 0.091 | 0.789 (0.569–1.010) | 0.082 | 0.936 (0.901–0.970) | 0.005 |
| Fellow 2 | 0.876 (0.843–0.910) | < 0.001 | 0.978 (0.951–1.005) | 0.072 | 0.810 (0.759–0.860) | < 0.001 | 0.957 (0.916–0.997) | 0.004 | 0.636 (0.372–0.901) | 0.127 | 0.910 (0.873–0.948) | 0.001 |
| Resident 1 | 0.883 (0.854–0.913 | < 0.001 | 0.975 (0.959–0.990) | < 0.001 | 0.843 (0.803–0.883) | < 0.001 | 0.945 (0.913–0.978) | < 0.001 | 0.715 (0.326–1.104) | 0.485 | 0.877 (0.829–0.925) | 0.002 |
| Resident 2 | 0.861 (0.827–0.895) | < 0.001 | 0.975 (0.949–1.001) | 0.028 | 0.786 (0.737–0.834) | < 0.001 | 0.932 (0.882–0.982) | < 0.001 | 0.761 (0.536–0.986) | 0.025 | 0.884 (0.837–0.930) | 0.006 |
| Average Observers | 0.893 (0.862–0.924) | 0.001 | 0.979 (0.962–0.997) | 0.012 | 0.840 (0.795–0.885) | 0.002 | 0.955 (0.920–0.990) | 0.007 | 0.762 (0.582–0.941) | 0.061 | 0.911 (0.876–0.945) | < 0.001 |
* p values and 95% CI are from the fixed-reader, random-case analysis comparing AUC between the DL model and individual observers.
† p values and 95% CI are from the random-reader, random-case analysis comparing AUC between the DL model and average observers.
‡ p values and 95% CI are from the fixed-reader, random-case analysis comparing the AUC of individual radiologists between the 1st session (without DL model) and the 2nd session (with DL model).
§ p values and 95% CI are from the random-reader, random-case analysis comparing the AUC of individual radiologists between the 1st session (without DL model) and the 2nd session (with DL model).
DL, deep learning; AUC, area under the curve.
Fig 3A 65-year-old male patient with mixed-type OPLL.
(A) A lateral plain radiograph shows mixed-type OPLL along the posterior side of the vertebra. The lesion at the C7 level is obscured by the shoulder shadow. (b) In the sagittal image of cervical spine CT, ossifications ranging from the C2 to T1 level are clearly demonstrated (window width = 2000 HU, window level = 500 HU). (c) OPLL lesions annotated by a radiologist on the plain radiograph. (d) In the resulting image inferred by the deep-learning model, OPLL lesions at the C2-6 levels are well predicted, but the lesion located at the C7 level was not detected by the model. OPLL, ossification of the posterior longitudinal ligament.
Fig 4A 51-year-old female patient with segmental-type OPLL.
(a) A lateral plain radiograph shows segmental-type OPLL (arrow) at the C5-6 level. (b) A sagittal image of cervical spine CT also demonstrates the segmental ossifications (arrow) at C5-6 level (window width = 2000 HU, window level = 500 HU). (c) OPLL lesions annotated by a radiologist on the plain radiograph. (d) The deep-learning model correctly predicted the segmental-type OPLL, which was overlooked by two observers. OPLL, ossification of the posterior longitudinal ligament.
Fig 5Comparison of observer performances with and without the DL model (a) ROC curve of the DL model and average observers in per-patient analysis.
The AUC of the DL model alone was 0.851 (95% CI, 0.799–0.903), and that for average observers was 0.841 (95% CI, 0.781–0.901). The AUC of average observers improved to 0.911 (95% CI, 0.876–0.945) when referring to the results of the DL model. (b) Improved diagnostic performance of individual observers in per-patient analysis with the assistance of the DL model. ROC, receiver operating characteristics; DL, deep learning; AUC, area under the curve.
Subgroup analysis according to the vertebral level.
| Vertebral level | DL model | Average Observers | |
|---|---|---|---|
| C2 | 0.932 (0.862–1.001) | 0.927 (0.902–0.951) | 0.685 |
| C3 | 0.904 (0.840–0.968) | 0.904 (0.844–0.964) | 0.996 |
| C4 | 0.906 (0.862–0.949) | 0.846 (0.782–0.911) | 0.071 |
| C5 | 0.865 (0.812–0.918) | 0.829 (0.755–0.902) | 0.319 |
| C6 | 0.829 (0.765–0.893) | 0.773 (0.667–0.878) | 0.277 |
| C7 | 0.582 (0.487–0.677) | 0.793 (0.675–0.911) | < 0.001 |
Data are AUC and number in parentheses are 95% CI.
* p values and 95% CI are from the random-reader, random-case analysis comparing AUC between DL model and average observers.
DL, deep learning; AUC, area under the curve.