| Literature DB >> 27786306 |
Ferran Prados1,2, M Jorge Cardoso1,3, Marios C Yiannakas2, Luke R Hoy2, Elisa Tebaldi2, Hugh Kearney2, Martina D Liechti2, David H Miller2, Olga Ciccarelli2, Claudia A M Gandini Wheeler-Kingshott2,4, Sebastien Ourselin1,3.
Abstract
Axonal loss in the spinal cord is one of the main contributing factors to irreversible clinical disability in multiple sclerosis (MS). In vivo axonal loss can be assessed indirectly by estimating a reduction in the cervical cross-sectional area (CSA) of the spinal cord over time, which is indicative of spinal cord atrophy, and such a measure may be obtained by means of image segmentation using magnetic resonance imaging (MRI). In this work, we propose a new fully automated spinal cord segmentation technique that incorporates two different multi-atlas segmentation propagation and fusion techniques: The Optimized PatchMatch Label fusion (OPAL) algorithm for localising and approximately segmenting the spinal cord, and the Similarity and Truth Estimation for Propagated Segmentations (STEPS) algorithm for segmenting white and grey matter simultaneously. In a retrospective analysis of MRI data, the proposed method facilitated CSA measurements with accuracy equivalent to the inter-rater variability, with a Dice score (DSC) of 0.967 at C2/C3 level. The segmentation performance for grey matter at C2/C3 level was close to inter-rater variability, reaching an accuracy (DSC) of 0.826 for healthy subjects and 0.835 people with clinically isolated syndrome MS.Entities:
Mesh:
Year: 2016 PMID: 27786306 PMCID: PMC5082365 DOI: 10.1038/srep36151
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic representation of the proposed pipeline.
The spinal cord images were cropped only for visualisation purposes.
Figure 2Example of GM and CSA consensus masks obtained from the three raters masks intersection.
Consensus mask for a healthy control (rows 1 and 2) and a RRMS patient (rows 3 and 4). DSC respects the consensus mask is overlayed in each segmentation.
Demographic data for each group.
| Control (N = 25) | CIS (N = 19) | PPMS (N = 16) | SPMS (N = 15) | RRMS (N = 20) | |
|---|---|---|---|---|---|
| 7F:18M | 7F:12M | 9F:7M | 5F:10M | 8F:12M | |
| 40.4(10.1) | 36.0(4.6) | 52.5(11.0) | 55.2(7.4) | 41.8(9.1) | |
| — | 0.4(0.3) | 11.5(8.7) | 22.1(14.1) | 7.8(6.1) | |
| — | 1(0–3.5) | 6(2–7) | 6.5(4.5–7.5) | 2.5(0–6.5) |
First row: gender - female(F):male(M); second row: mean age in years; third row: mean disease duration in years; and last row: median EDSS score. Std: standard deviation.
Evaluation results for the CSA segmentation, with the mean (std) Dice score coefficient (DSC), mean surface distance (MSD), Hausdorff distance (HD) (all with respect to the consensus).
| Rater 1 | Rater 2 | Rater 3 | Proposed | |||
|---|---|---|---|---|---|---|
| 0.980 (0.009)* | 0.997 (0.004)* | 0.996 (0.003)* | 0.931 (0.021)* | 0.968 (0.008) | ||
| 0.130 (0.056)* | 0.096 (0.056)* | 0.101 (0.052)* | 0.265 (0.094)* | 0.153 (0.042) | ||
| 1.785 (0.544) | 1.494 (0.302)* | 1.448 (0.150)* | 1.809 (0.429) | 1.649 (0.349) | ||
| 0.983 (0.010)* | 0.996 (0.004)* | 0.995 (0.004)* | 0.917 (0.025)* | 0.971 (0.005) | ||
| 0.107 (0.051)* | 0.071 (0.043)* | 0.072 (0.036)* | 0.295 (0.088)* | 0.132 (0.037) | ||
| 1.598 (0.334) | 1.401 (0.182) | 1.393 (0.131)* | 1.673 (0.408) | 1.533 (0.282) | ||
| 0.979 (0.008)* | 0.997 (0.002)* | 0.990 (0.005)* | 0.932 (0.024)* | 0.965 (0.005) | ||
| 0.114 (0.046) | 0.066 (0.034)* | 0.080 (0.040)* | 0.230 (0.078)* | 0.133 (0.037) | ||
| 1.600 (0.344) | 1.383 (0.226) | 1.345 (0.143)* | 1.698 (0.405)* | 1.468 (0.188) | ||
| 0.981 (0.008)* | 0.996 (0.003)* | 0.992 (0.006)* | 0.943 (0.022)* | 0.963 (0.005) | ||
| 0.121 (0.036)* | 0.086 (0.037)* | 0.089 (0.039)* | 0.202 (0.074)* | 0.148 (0.026) | ||
| 1.585 (0.213) | 1.413 (0.129)* | 1.428 (0.115) | 1.677 (0.476) | 1.540 (0.236) | ||
| 0.978 (0.008)* | 0.998 (0.002)* | 0.993 (0.004)* | 0.921 (0.036)* | 0.967 (0.006) | ||
| 0.130 (0.034)* | 0.083 (0.033)* | 0.093 (0.030)* | 0.269 (0.093)* | 0.154 (0.037) | ||
| 1.936 (0.610)* | 1.429 (0.115)* | 1.420 (0.127)* | 1.981 (0.773) | 1.588 (0.305) | ||
| 0.980 (0.009)* | 0.997 (0.003)* | 0.994 (0.005)* | 0.928 (0.027)* | 0.967 (0.006) | ||
| 0.121 (0.047)* | 0.082 (0.044)* | 0.088 (0.041)* | 0.256 (0.091)* | 0.145 (0.037) | ||
| 1.717 (0.466)* | 1.430 (0.211)* | 1.411 (0.137)* | 1.778 (0.523)* | 1.565 (0.288) |
The script * represents significant differences (paired t-test with p < 0.05) between a rater/iNLS versus the proposed method.
Evaluation results for the GM segmentation, with the mean (std) Dice score coefficient (DSC), mean surface distance (MSD), Hausdorff distance (HD) (all with respect to the consensus).
| Rater 1 | Rater 2 | Rater 3 | Proposed | |||
|---|---|---|---|---|---|---|
| 0.894 (0.039)* | 0.927 (0.022)* | 0.931 (0.020)* | 0.776 (0.049)* | 0.826 (0.046) | ||
| 0.100 (0.035) | 0.063 (0.031)* | 0.067 (0.028)* | 0.106 (0.030)* | 0.089 (0.030) | ||
| 1.637 (0.274)* | 1.285 (0.128)* | 1.315 (0.135)* | 1.723 (0.420)* | 1.444 (0.204) | ||
| 0.908 (0.026)* | 0.930 (0.040)* | 0.933 (0.021)* | 0.789 (0.046)* | 0.835 (0.047) | ||
| 0.104 (0.044) | 0.065 (0.035)* | 0.064 (0.029)* | 0.104 (0.029)* | 0.083 (0.033) | ||
| 1.654 (0.285)* | 1.360 (0.251) | 1.307 (0.208) | 1.766 (0.475)* | 1.393 (0.207) | ||
| 0.907 (0.030)* | 0.905 (0.038)* | 0.911 (0.033)* | 0.735 (0.073)* | 0.780 (0.056) | ||
| 0.057 (0.027)* | 0.061 (0.027)* | 0.061 (0.023)* | 0.118 (0.043)* | 0.091 (0.039) | ||
| 1.320 (0.200) | 1.294 (0.134)* | 1.239 (0.088)* | 1.920 (0.536)* | 1.611 (0.517) | ||
| 0.914 (0.032)* | 0.894 (0.041)* | 0.903 (0.035)* | 0.745 (0.100) | 0.735 (0.069) | ||
| 0.054 (0.019)* | 0.055 (0.030)* | 0.052 (0.024)* | 0.091 (0.039) | 0.107 (0.056) | ||
| 1.339 (0.234) | 0.930 (0.117) | 1.198 (0.353)* | 1.517 (0.388) | 1.444 (0.503) | ||
| 0.923 (0.017)* | 0.925 (0.027)* | 0.906 (0.026)* | 0.754 (0.049)* | 0.793 (0.069) | ||
| 0.065 (0.024) | 0.063 (0.022)* | 0.063 (0.029)* | 0.088 (0.032) | 0.078 (0.031) | ||
| 1.327 (0.136) | 1.322 (0.186) | 1.213 (0.309)* | 1.821 (0.578) | 1.441 (0.340) | ||
| 0.908 (0.031)* | 0.918 (0.035)* | 0.919 (0.029)* | 0.762 (0.065)* | 0.799 (0.066) | ||
| 0.079 (0.038) | 0.062 (0.029)* | 0.062 (0.027)* | 0.101 (0.035)* | 0.089 (0.038) | ||
| 1.475 (0.280) | 1.306 (0.171)* | 1.261 (0.233)* | 1.753 (0.489)* | 1.461 (0.357) |
The script * represents significant differences (paired t-test with p < 0.05) between a rater/iNLS versus the proposed method.
Figure 3Quantitative analysis per patient group on GM and CSA segmentation of the spinal cord.
Figure 4GM segmentation results showing the worst (1–3) and the best (4–6) result for each patient group.
Rows 1 and 4 show one slice of the input image, rows 2 and 5 correspond to the consensus segmentation, and rows 3 and 6 are the obtained segmentation by the proposed method with DSC overlay.
Results from splitting the patients’ data into two groups depending on the presence of visible lesions at the C2/C3 level, with the mean (std) Dice score coefficient (DSC), mean surface distance (MSD), Hausdorff distance (HD) (all with respect to the consensus).
| With visible lesions (N = 45) | Without visible lesions (N = 25) | ||
|---|---|---|---|
| 0.965 (0.143)* | 0.970 (0.005)* | ||
| 0.143 (0.036) | 0.139 (0.035) | ||
| 1.534 (0.291) | 1.538 (0.196) | ||
| 0.762 (0.067)* | 0.838 (0.039)* | ||
| 0.096 (0.045)* | 0.076 (0.025)* | ||
| 1.525 (0.472) | 1.364 (0.176) |
The script * represents significant differences (paired t-test with p < 0.05) between groups.
Test-retest measurements obtained from C2-3 spinal cord level.
| Volume ( | Area ( | COV (%) | DSC | ||
|---|---|---|---|---|---|
| Manual | 1323.6 (92.6) | 88.2 (6.2) | 0.8 (0.5) | 0.965 (0.005) | |
| Proposed | 1321.4 (89.9) | 88.1 (6.0) | 1.2 (0.9) | ||
| Manual | 209.8 (20.8) | 14.0 (1.4) | 7.2 (1.9) | 0.791 (0.041) | |
| Proposed | 186.6 (21.2) | 12.4 (1.4) | 7.4 (4.9) |
Mean (std) for volume, area, coefficient of variation (COV) and Dice score coefficient (DSC).