| Literature DB >> 29527496 |
R Guerrero1, C Qin2, O Oktay2, C Bowles2, L Chen2, R Joules3, R Wolz4, M C Valdés-Hernández5, D A Dickie5, J Wardlaw5, D Rueckert2.
Abstract
White matter hyperintensities (WMH) are a feature of sporadic small vessel disease also frequently observed in magnetic resonance images (MRI) of healthy elderly subjects. The accurate assessment of WMH burden is of crucial importance for epidemiological studies to determine association between WMHs, cognitive and clinical data; their causes, and the effects of new treatments in randomized trials. The manual delineation of WMHs is a very tedious, costly and time consuming process, that needs to be carried out by an expert annotator (e.g. a trained image analyst or radiologist). The problem of WMH delineation is further complicated by the fact that other pathological features (i.e. stroke lesions) often also appear as hyperintense regions. Recently, several automated methods aiming to tackle the challenges of WMH segmentation have been proposed. Most of these methods have been specifically developed to segment WMH in MRI but cannot differentiate between WMHs and strokes. Other methods, capable of distinguishing between different pathologies in brain MRI, are not designed with simultaneous WMH and stroke segmentation in mind. Therefore, a task specific, reliable, fully automated method that can segment and differentiate between these two pathological manifestations on MRI has not yet been fully identified. In this work we propose to use a convolutional neural network (CNN) that is able to segment hyperintensities and differentiate between WMHs and stroke lesions. Specifically, we aim to distinguish between WMH pathologies from those caused by stroke lesions due to either cortical, large or small subcortical infarcts. The proposed fully convolutional CNN architecture, called uResNet, that comprised an analysis path, that gradually learns low and high level features, followed by a synthesis path, that gradually combines and up-samples the low and high level features into a class likelihood semantic segmentation. Quantitatively, the proposed CNN architecture is shown to outperform other well established and state-of-the-art algorithms in terms of overlap with manual expert annotations. Clinically, the extracted WMH volumes were found to correlate better with the Fazekas visual rating score than competing methods or the expert-annotated volumes. Additionally, a comparison of the associations found between clinical risk-factors and the WMH volumes generated by the proposed method, was found to be in line with the associations found with the expert-annotated volumes.Entities:
Keywords: CNN; Segmentation; Stroke; White matter hyperintensity
Mesh:
Year: 2017 PMID: 29527496 PMCID: PMC5842732 DOI: 10.1016/j.nicl.2017.12.022
Source DB: PubMed Journal: Neuroimage Clin ISSN: 2213-1582 Impact factor: 4.881
Fig. 1Proposed u-shaped residual network (uResNet) architecture for WMH segmentation and differentiation.
Fig. 2Training patch sampling.
MR imaging sequence details for the three acquisition protocols used.
| Protocols | Protocol 1 | Protocol 2 | Protocol 3 |
|---|---|---|---|
| TR/TE/TI (ms) T1 | 9/440 | 9.7/3.984/500 | |
| TR/TE/TI (ms) FLAIR | 9002/147/2200 | 9000/140/2200 | |
| Pixel bandwidth (KHz) | 125 (T1) | 15.63 (T1) | |
| 122.07(FLAIR) | 15.63 (FLAIR) | ||
| Matrix | 256 ×192 | 256 ×216 (T1) | 192 ×192 (T1) |
| 384 ×224 (FLAIR) | 256*256(FLAIR) | ||
| No. slices | 20 | 256 (T1) | 160 (T1) |
| 28 (FLAIR) | 40 (FLAIR) | ||
| Slice thickness (mm) | 5 | 1.02 (T1) | 1.3 (T1) |
| 5 (FLAIR) | 4 (FLAIR) | ||
| Inter-slice gap (mm) | 1.5 | 1 | 0 |
| Voxel size (mm3) | 0.94 ×0.94x6.5 | 1.02 ×0.9x1.02 (T1) | 1.3 ×1.3x1(T1) |
| 0.47 ×0.47x6 (FLAIR) | 1 ×1x4 (FLAIR) | ||
Mean Dice scores of WMH and stroke (standard deviation in parenthesis), correlation analysis between expert and automatic volumes (R2 and trend), and correlation with clinical variables.
| uResNet | DeepMedic | LPA | LGA | Expert | |
|---|---|---|---|---|---|
| WMH Dice (std) | 66.6(16.7) | 64.7(19.0) | 41.0(22.9) | − | |
| Stroke Dice (std) | 31.3(29.2) | − | − | − | |
| WMH | 0.943 | 0.855 | 0.687 | − | |
| Stroke | 0.688 | − | − | − | |
| WMH Trend | 0.89x+0.07 | 0.83x+0.28 | 0.51x+0.16 | − | |
| Stroke Trend | 0.52x- | − | − | − | |
| CC D-Fazekas | 0.769 | 0.746 | 0.630 | 0.774 | |
| CC PV-Fazekas | 0.778 | 0.777 | 0.718 | 0.765 | |
| CC Fazekas | 0.811 | 0.734 | 0.819 | ||
| CC MMSE | 0.364 | 0.369 | 0.389 | 0.372 |
Fig. 3Effect of different loss functions on uResNet trained using FLAIR images.
Fig. 4Different input channel exploration. F: FLAIR image, CS: cerebro-spinal track atlas, WM: white matter probability map, T1: T1 weighted image.
Fig. 5Automated versus expertly generated WMH volumes (as ICV %). The solid line indicates the linear trend f(x) of the comparison, while the dotted line indicates the ideal trend f(x) = 1.0x + 0.0.
Fig. 6Bland-Altman plots comparing expert annotations with all other methods in WMH segmentation.
Fig. 7Automated versus expertly generated stroke volumes. LPA and LGA are unable to distinguish between WMH and stroke, hence cannot generate these results. The solid line indicates the linear trend f(x) of the comparison, while the dotted indicates the ideal trend f(x) = 1.0x + 0.0.
Fig. 8Visual comparisons of all competing methods. Yellow lines delineate WMH, green lines stroke and white arrows point to interesting result areas. Best seen in color.
P-values of linear regression associations between volumes calculated with different methods and risk factors. Bold numbers indicate statistical significance above 0.05.
| uResNet | DeepMedic | LPA | LGA | Expert | |
|---|---|---|---|---|---|
| Age | 0.491 | 0.533 | < | 0.723 | 0.313 |
| Diabetes | 0.082 | 0.072 | 0.070 | 0.066 | |
| Hyperlipidaemia | 0.645 | 0.547 | 0.551 | 0.687 | 0.728 |
| Hypertension | 0.820 | 0.781 | 0.504 | 0.358 | 0.562 |
| Smoking | 0.497 | 0.560 | 0.216 | 0.719 | 0.767 |
| totalChl | 0.235 | 0.281 | 0.161 | 0.328 | 0.371 |
| BGPVS | < | < | < | < | < |
| deepAtrophyVol | < | < | 0.117 |
P-values of linear regression associations between volumes calculated with different methods and risk factors after residual outliers were removed. Bold numbers indicate statistical significance above 0.05.
| uResNet | DeepMedic | LPA | LGA | Expert | |
|---|---|---|---|---|---|
| Age | 0.905 | 0.993 | < | 0.685 | 0.407 |
| Diabetes | < | 0.177 | |||
| Hyperlipidaemia | 0.346 | 0.425 | 0.464 | 0.550 | 0.186 |
| Hypertension | 0.639 | 0.502 | 0.190 | 0.128 | 0.350 |
| Smoking | 0.069 | 0.084 | 0.107 | 0.673 | 0.343 |
| totalChl | 0.294 | 0.212 | 0.222 | 0.043 | 0.868 |
| BGPVS | < | < | < | < | < |
| deepAtrophyVol | < | < |
Fig. 9General linear model normal probability plots of residuals for all methods, with and without outliers.
Mean Dice scores of WMH and stroke (standard deviation in parenthesis), correlation analysis between expert and automatic volumes (R2 and trend), and correlation with clinical variables. No statistical significance between uResNet and uResNet2 was observed (p > 0.05), while there was a statistically significant difference (p < 0.001) between patch off-center sampling (uResNet) and regular no off-center sampling (uResNet_NoC).
| uResNet | uResNet2 | uResNet_NoC | Expert | |
|---|---|---|---|---|
| WMH Dice (std) | 69.5 | 66.9(18.1) | − | |
| Stroke Dice (std) | 40.0(25.2) | 28.9 | − | |
| WMH | 0.948 | − | ||
| Stroke | 0.761 | 0.710 | − | |
| WMH Trend | − | |||
| Stroke Trend | 0.55x- | 0.52x+0.07 | − | |
| CC D-Fazekas | 0.770 | 0.771 | 0.774 | |
| CC PV-Fazekas | 0.778 | 0.777 | 0.765 | |
| CC Fazekas | 0.824 | 0.823 | 0.819 | |
| CC MMSE | 0.364 | 0.373 | 0.366 | 0.372 |
Mean Dice scores of WMH and stroke, for different inputs with uResNet and DeepMedic. Difference in Dice score between the two methods is given in italics. F: FLAIR image, CS: cerebro-spinal track atlas, WM: white matter probability map, T1: T1 weighted image.
| Input | WMH | Stroke | ||||
|---|---|---|---|---|---|---|
| channels | uResNet | DeepMedic | Diff | uResNet | DeepMedic | Diff |
| F | 66.3 | 31.1 | ||||
| F-T1 | 67.6 | 34.3 | ||||
| F-CS | 66.6 | 35.1 | ||||
| F-WM | 68.2 | 33.0 | − | |||
| F-CS-WM | 68.0 | 37.8 | ||||
| F-T1-CS-WM | 68.4 | 36.0 | ||||