| Literature DB >> 27881948 |
Dana L Tudorascu1, Helmet T Karim2, Jacob M Maronge3, Lea Alhilali4, Saeed Fakhran5, Howard J Aizenstein6, John Muschelli7, Ciprian M Crainiceanu7.
Abstract
We evaluated and compared the performance of two popular neuroimaging processing platforms: Statistical Parametric Mapping (SPM) and FMRIB Software Library (FSL). We focused on comparing brain segmentations using Kirby21, a magnetic resonance imaging (MRI) replication study with 21 subjects and two scans per subject conducted only a few hours apart. We tested within- and between-platform segmentation reliability both at the whole brain and in 10 regions of interest (ROIs). For a range of fixed probability thresholds we found no differences between-scans within-platform, but large differences between-platforms. We have also found very large differences between- and within-platforms when probability thresholds were changed. A randomized blinded reader study indicated that: (1) SPM and FSL performed well in terms of gray matter segmentation; (2) SPM and FSL performed poorly in terms of white matter segmentation; and (3) FSL slightly outperformed SPM in terms of CSF segmentation. We also found that tissue class probability thresholds can have profound effects on segmentation results. We conclude that the reproducibility of neuroimaging studies depends on the neuroimaging software-processing platform and tissue probability thresholds. Our results suggest that probability thresholds may not be comparable across platforms and consistency of results may be improved by estimating a probability threshold correspondence function between SPM and FSL.Entities:
Keywords: MRI reproducibility; healthy brain segmentation; segmentation bias
Year: 2016 PMID: 27881948 PMCID: PMC5101202 DOI: 10.3389/fnins.2016.00503
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1Inferior parietal ROI is shown on a single brain (native space) for the SPM (top row) and the FSL (bottom row).
Descriptive statistics for threshold volumes for each tissue type.
| 983.59 (100.05) | 984.14 (96.86) | 829.25 (72.81) | 826.21 (72.71) | ||
| 595.54 (62.17) | 595.81 (58.37) | 575.63 (59.10) | 575.77 (58.68) | ||
| 769.16 (146.98) | 775.11 (132.12) | 462.19 (35.79) | 465.09 (37.19) | ||
| 711.57 (66.21) | 711.35 (66.42) | 566.94 (53.67) | 561.90 (54.64) | ||
| 456.28 (49.72) | 454.90 (47.28) | 507.95 (55.74) | 507.84 (56.06) | ||
| 274.60 (73.02) | 276.50 (74.33) | 270.41 (23.73) | 274.76 (27.71) | ||
| 626 (55.82) | 624.55 (58.16) | 327.50 (34.42) | 323.32 (35.05) | ||
| 424.56 (47.42) | 423.19 (45.01) | 384.54 (46.90) | 384.19 (47.24) | ||
| 190.26 (60.58) | 190.65 (61.06) | 174.55 (17.03) | 176.77 (22.12) | ||
| 507.99 (45.13) | 504.54 (49.10) | 312.36 (32.23) | 308.38 (33.12) | ||
| 384.89 (44.51) | 383.54 (42.23) | 358.95 (42.93) | 358.88 (43.03) | ||
| 116.50 (43.70) | 115.72 (43.24) | 161.87 (17.17) | 164.24 (22.63) | ||
Repeated measures analysis results for gray matter by threshold.
| Intercept | 828.36 (18.71) | 44.28 (21.6) | <0.0001 | (789.92, 867.19) | |
| Method (FSL = ref) | 156.13 (5.17) | 30.20 (61) | <0.0001 | (145.80, 166.47) | |
| Scan (Scan1 = ref) | −1.24 (5.17) | −0.24 (61) | 0.81 | (−11.58, 9.09) | |
| Intercept | 565.73 (13.10) | 43.19 (21.4) | <0.0001 | (538.53, 592.93) | |
| Method (FSL = ref) | 147.04 (3.42) | 42.89 (61) | <0.0001 | (140.19, 153.90) | |
| Scan (Scan1 = ref) | −2.62 (3.42) | −0.77 (61) | 0.44 | (−9.47, 4.23) | |
| Intercept | 326.82 (10.09) | 32.40 (23.6) | <0.0001 | (305.98, 347.66) | |
| Method (FSL = ref) | 299.86 (4.07) | 73.66 (61) | <0.0001 | (291.72, 308) | |
| Scan (Scan1 = ref) | −2.81 (4.07) | −0.69 (61) | 0.49 | (−10.96, 5.32) | |
| Intercept | 312.23 (8.63) | 36.20 (24.7) | <0.0001 | (294.45, 330.00) | |
| Method (FSL = ref) | 195.89 (3.89) | 50.28 (61) | <0.0001 | (188.1, 203.68) | |
| Scan (Scan1 = ref) | −3.71 (3.89) | −0.95 (61) | 0.34 | (−11.50, 4.07) | |
β coefficient for the method represents the difference in mean estimates between SPM and FSL when scan is fixed; β.
Repeated measures analysis results for white matter by threshold.
| Intercept | 575.60 (12.89) | 44.65 (21.4) | <0.0001 | (548.81, 602.38) | |
| Method (FSL = ref) | 19.97 (3.34) | 5.98 (61) | <0.0001 | (13.30, 26.65) | |
| Scan (Scan1 = ref) | 0.21 (3.33) | 0.06 (61) | 0.95 | (−6.47, 6.88) | |
| Intercept | 508.27 (11.36) | 44.71 (20.7) | <0.0001 | (484.61, 531.93) | |
| Method (FSL = ref) | −52.31 (2.14) | −24.44 (61) | <0.0001 | (−56.58, −48.03) | |
| Scan (Scan1 = ref) | −0.74 (2.14) | −0.35 (61) | 0.73 | (-5.02, 3.54) | |
| Intercept | 384.80 (10.13) | 37.99 (20.8) | <0.0001 | (363.73, 405.88) | |
| Method (FSL = ref) | 39.50 (1.99) | 19.77 (61) | <0.0001 | (35.51,43.50) | |
| Scan (Scan1 = ref) | −0.87 (1.99) | −0.43 (61) | 0.67 | (−4.86, 3.13) | |
| Intercept | 359.27 (9.38) | 38.30 (20.7) | <0.0001 | (339.74, 378.79) | |
| Method (FSL = ref) | 25.30 (1.75) | 14.43 (61) | <0.0001 | (21.80, 28.81) | |
| Scan (Scan1 = ref) | −0.71 (1.75) | −0.40 (61) | 0.69 | (−4.21, 2.80) | |
β coefficient for the method represents the difference in mean estimates between SPM and FSL when scan is fixed.; β.
Figure 2GM ROI's mean volumes and 95% CI are plotted. The ROI's are: anterior cingulate cortex, middle frontal gyrus, superior frontal gyrus, paracentral lobule, parietal inferior, parietal superior, postcentral, precentral, superior motor, temporal superior, respectively.
Figure 3Good (bottom row) vs. poor (top row) GM segmentation. Red arrows indicate regions that are problematic/incorrectly classified as gray matter.
Figure 4White matter segmentation with the areas poorly rated (top row), rated as good (middle row), and MPRAGE of the same subject (bottom row) showing the same slice white matter without any potential problems. Red arrows point to areas of potential issues.