| Literature DB >> 32621103 |
Alexandra de Sitter1, Tom Verhoeven2, Jessica Burggraaff3, Yaou Liu2, Jorge Simoes2, Serena Ruggieri4,5, Miklos Palotai6, Iman Brouwer2, Adriaan Versteeg2, Viktor Wottschel2, Stefan Ropele7, Mara A Rocca8,9, Claudio Gasperini5, Antonio Gallo10, Marios C Yiannakas11, Alex Rovira12, Christian Enzinger13, Massimo Filippi8,9,14,15, Nicola De Stefano16, Ludwig Kappos17, Jette L Frederiksen18, Bernard M J Uitdehaag3, Frederik Barkhof2,19, Charles R G Guttmann6, Hugo Vrenken2.
Abstract
BACKGROUND: Deep grey matter (DGM) atrophy in multiple sclerosis (MS) and its relation to cognitive and clinical decline requires accurate measurements. MS pathology may deteriorate the performance of automated segmentation methods. Accuracy of DGM segmentation methods is compared between MS and controls, and the relation of performance with lesions and atrophy is studied.Entities:
Keywords: Atrophy; Automated segmentation methods; Deep grey matter; Multiple sclerosis
Mesh:
Year: 2020 PMID: 32621103 PMCID: PMC7674567 DOI: 10.1007/s00415-020-10023-1
Source DB: PubMed Journal: J Neurol ISSN: 0340-5354 Impact factor: 4.849
Demographics of the subjects
| Disease status | Number of cases (male/female) | Average age in years ± std | Median EDSS score (range) | Average DD year ± std |
|---|---|---|---|---|
| HC | 11 (3/8) | 37.6 ± 8.2 | n.a | n.a |
| MS | 21 (9/12) | 43.2 ± 10.1 | 3.5 (6.0) | 9.5 ± 6.9 |
| RRMS | 10 (4/6 | 39.8 ± 8.3 | 2.3 (2.5) | 8.0 ± 9.8 |
| SPMS | 5 (3/2) | 41.3 ± 8.9 | 4.0 (6.0) | 11.0 ± 5.3 |
| PPMS | 6 (2/4) | 49.4 ± 10.8 | 3.5 (4.5) | 14.0 ± 4.0 |
EDSS expended disability status scale, DD disease duration, std Standard deviation, HC healthy control, RR relapsing remitting, SP secondary progressive, PP primary progressive
Fig. 1Method for lesion load calculation within a set border. a a distance field is created around DGM structure, in this case the caudate nucleus. b Distance is set around the DGM structure, seen in grey. c An overlay is created between the DGM border and the lesion mask in T1 space. In grey the lesion border is shown, in red the lesions without overlap with the border and in yellow the lesions with an overlap in the border
For all structures and hemispheres, first the mean volume ± standard deviation (std) in millimeter of reference and the four automated segmentation software
| Method | Left Caudate | Right Caudate | ||||
|---|---|---|---|---|---|---|
| Volume | ICC | DSC | Volume | ICC | DSC | |
| Reference | 3.99 ± 0.64 | 3.98 ± 0.60 | ||||
| FSL-FIRST | 3.46 ± 0.45 | 0.69 | 0.84 ± 0.04 | 3.53 ± 0.49 | 0.81 | 0.84 ± 0.04 |
| FreeSurfer | 3.55 ± 0.52 | 0.74 | 0.76 ± 0.09 | 3.75 ± 0.58 | 0.68 | 0.77 ± 0.09 |
| GIF | 3.52 ± 0.42 | 0.50 | 0.83 ± 0.04 | 3.73 ± 0.47 | 0.60 | 0.83 ± 0.05 |
| volBrain | 3.55 ± 0.55 | 0.85 | 0.83 ± 0.07 | 3.57 ± 0.54 | 0.86 | 0.83 ± 0.07 |
Second, the intra-class correlation coefficient (ICC) and mean dice similarity coefficient (DSC) ± std between the reference and the segmentation of the automated segmentation software. N = amount of subjects
For all structures and hemispheres the spatial overlap of intra rater agreement. Spatial overlap is shown with the mean dice similarity coefficient ± standard deviation and is calculated over four subjects
| Rater | Left caudate | Right caudate | Left putamen | Right putamen | Left thalamus | Right thalamus |
|---|---|---|---|---|---|---|
| Expert 1 | 0.87 ± 0.031 | 0.87 ± 0.037 | 0.89 ± 0.047 | 0.91 ± 0.026 | 0.87 ± 0.035 | 0.88 ± 0.007 |
| Expert 2 | 0.87 ± 0.051 | 0.88 ± 0.032 | 0.85 ± 0.039 | 0.88 ± 0.018 | 0.89 ± 0.031 | 0.88 ± 0.022 |
| Expert 3 | 0.92 ± 0.004 | 0.92 ± 0.008 | 0.91 ± 0.022 | 0.92 ± 0.016 | 0.89 ± 0.022 | 0.91 ± 0.008 |
The average (± standard deviation) amount of voxels that were selected by one rater for both healthy control groups (HC) as patients (MS) group
| Structure | HC ( | MS ( |
|---|---|---|
| Caudate | 1356 ± 220 | 1413 ± 211 |
| Putamen | 1460 ± 290 | 1536 ± 436 |
| Thalamus | 2126 ± 443 | 2083 ± 526 |
Volumes differed between the Reference and all four automated methods for all structures (all p < 0.01), but there were no differences between the volumes for pairs of automated methods
Fig. 2T1 weighted images and segmentation of majority voting, FSL-FIRST, Freesurfer, GIF and volBrain. Segmentations of both left and right hemisphere and for all three structures; caudate, putamen and thalamus
Fig. 3Majority voting segmentation volume and volume by automatic segmentation are given for each deep gray matter structure and segmentation method. Volumes are given in milliliters
Fig. 4Dice similarity coefficients between segmentations from majority voting and each automated method per DGM structure for both healthy controls (HC) and patients (green)
For all structures and hemispheres the spatial overlap between the “Gold standard” and the automated segmentation methods for both control and patients group
| Method | Caudate nucleus | |||
|---|---|---|---|---|
| Controls ( | Patients ( | |||
| Left | Right | Left | Right | |
| FSL-FIRST | 0.84 ± 0.44 | 0.87 ± 0.03 | 0.86 ± 0.24 | 0.86 ± 0.04 |
| FreeSurfer | 0.85 ± 0.02 | 0.85 ± 0.02 | 0.82 ± 0.06 | |
| GIF | 0.85 ± 0.02 | 0.86 ± 0.01 | 0.83 ± 0.05 | |
| volBrain | 0.88 ± 0.02 | 0.88 ± 0.02 | 0.87 ± 0.03 | 0.87 ± 0.02 |
The spatial overlap is given as the mean ± standard deviation of the Dice Similarity Coefficient. Values of patients are bold if they are significantly diferent from those of controls (p-value < 0.05). N = amount of subjects
Spearman correlation between the dice similarity index and lesion load (LL), regional lesion load (RLL), normalized brain volume (NBV) and volume of region of interest (ROIV)
| Method | Left caudate | Right caudate | ||||||
|---|---|---|---|---|---|---|---|---|
| N = 21 | LL | RLL | NBV | ROIV | LL | RLL | NBV | ROIV |
| FSL-FIRST | − 0.31 | − 0.52* | − 0.88 | 0.45* | − 0.33 | − 0.41 | 0.36 | 0.53** |
| FreeSurfer | − 0.60** | − 0.49* | 0.20 | 0.47** | − 0.57** | − 0.62* | 0.25 | 0.37* |
| GIF | − 0.68** | − 0.57** | 0.25 | 0.17 | − 0.57** | − 0.63* | 0.34 | 0.38* |
| volBrain | − 0.34 | − 0.43 | 0.17 | 0.61** | − 0.57** | − 0.58** | 0.18 | 0.82** |
Correlation is measured for all structures, hemispheres and automated segmentation software. With α for * < 0.05 and ** < 0.01 for significant spearman correlation. N = amount of subjects
Fig. 5Dice similarity coefficients versus lesion load, represented per DGM structure and segmentation method and left (blue) and right (green) hemisphere
For all structures, hemispheres and automated segmentation method the p-value of students t test between dice similarity index of non-filled and filled T1 images before applying automated segmentation method
| Method | Left Caudate | Right Caudate | ||
|---|---|---|---|---|
| LEAP | LST-filling | LEAP | LST-filling | |
| FSL-FIRST | 0.70 | 0.90 | 0.79 | 0.73 |
| FreeSurfer | 0.73 | 0.75 | 0.16 | 0.56 |
| GIF | 0.84 | 0.76 | 0.78 | 0.54 |
| volBrain | 0.75 | 0.84 | 0.84 | 0.79 |
Filling is done with LEAP and LST
N = amount of subjects