| Literature DB >> 35083815 |
Irene Brusini1,2, Michael Platten1,3,4, Russell Ouellette3,4, Fredrik Piehl4,5,6, Chunliang Wang1, Tobias Granberg3,4.
Abstract
BACKGROUND ANDEntities:
Keywords: atrophy; convolutional neural networks; corpus callosum; magnetic resonance imaging; multiple sclerosis; neurodegeneration
Mesh:
Year: 2022 PMID: 35083815 PMCID: PMC9304261 DOI: 10.1111/jon.12972
Source DB: PubMed Journal: J Neuroimaging ISSN: 1051-2284 Impact factor: 2.324
Cohort demographics of the training and testing dataset
| Training data ( | Testing data ( | |||
|---|---|---|---|---|
| Scanner | Aera ( | Avanto ( | Trio ( | Aera/Avanto/Trio, % 30/47/23 |
| Age in years, mean ± SD | 39 ± 11 | 39 ± 12 | 36 ± 12 | 43 ± 11 |
| Disease duration in years, mean ± SD | 6.7 ± 8.0 | 6.0 ± 5.7 | 4.8 ± 6.6 | 4.4 ± 5.4 |
| Sex, % F/M | 77/23 | 74/26 | 57/43 | 72/28 |
| Subtype, % RRMS/SPMS/PPMS/NA | 69/22/2.9/4.4 | 75/16/4.4/4.4 | 79/5.9/2.9/12 | 70/23/1.8/5.3 |
| Median EDSS within 6 months, IQR | 2.0 (1.0‐3.0) | 1.5 (0.0‐2.8) | 2.0 (1.5‐3.0) | 2.5 (1.5‐3.5) |
| ( | ( | ( | ( | |
| Median EDSS future score, IQR | 2.5 (1.5‐4.0) | 2.5 (1.5‐4.0) | 3.0 (1.0‐3.5) | 2.5 (1.5‐4.0) |
| ( | ( | ( | ( | |
| Median SDMT within 6 months, IQR | –0.59 (−1.6 to 0.12) | –0.70 (−1.41 to −0.05) | –0.93 (−2.0 to −0.17) | –0.69 (0.065 to −1.4) |
| ( | ( | ( | ( | |
| Median SDMT future score, IQR | –1.1 (−2.3 to −0.49) | –1.4 (−2.21 to −0.54) | –1.6 (−2.3 to −0.65) | –0.97 (−0.18 to −1.9) |
| ( | ( | ( | ( | |
Note: n signifies the number of patients.
Abbreviations: EDSS, Expanded Disability Status Scale; F, Female; IQR, interquartile range; M, Male; NA, not available; PPMS, primary‐progressing MS; RRMS, relapsing‐remitting MS; SD, standard deviation; SDMT, Symbol Digit Modalities Test; SPMS, secondary‐progressive MS.
Average number of years between scan and EDSS was 6.7 ± 2.6 years.
Average number of years between scan and SDMT was 5.8 ± 2.8 years.
MRI scanner settings
| T1‐weighted MPRAGE | T2‐weighted SPACE FLAIR | |||||
|---|---|---|---|---|---|---|
| Scanner model | Aera | Avanto | Trio | Aera | Avanto | Trio |
| Field strength | 1.5 | 1.5 | 3.0 | 1.5 | 1.5 | 3.0 |
| Voxel size | 1.0 × 1.0 × 1.5 | 1.0 × 1.0 × 1.5 | 1.0 × 1.0 × 1.5 | 1.0 × 1.0 × 1.0 | 1.0 × 1.0 × 1.0 | 1.0 × 1.0 × 1.0 |
| Echo time | 3.02 | 3.55 | 3.39 | 333 | 333 | 388 |
| Repetition time | 1900 | 1900 | 1900 | 5000 | 6000 | 6000 |
| Inversion time | 1100 | 1100 | 900 | 1800 | 2200 | 2100 |
| Flip angle, ° | 15 | 15 | 9 | 120 | 120 | 120 |
Note: All times are given as milliseconds. The main magnetic field strength is given as Tesla.
FLAIR, fluid‐attenuated inversion recovery; MPRAGE, magnetization‐prepared rapid gradient echo; SPACE, sampling perfection with application optimized contrasts using different flip angle evolution.
FIGURE 1Manual corpus callosum and intracranial segmentations of six MS patients. Both T1‐weighted and T2‐weighted FLAIR scans were segmented using ITK snap (v3.6.0, www.itksnap.org)
FIGURE 2Midslice selection pipeline. The input consists of individual slices from the MRI scan, with dimensions 256 × 256 × 1. The input subsequently goes through four convolutional layers, using a 3 × 3 kernel, ReLU activation, and 2 × 2 max pooling. At the end, there are two dense layers (128 and 1, respectively), resulting in a high‐probability output of the midsagittal slice. The number under each block signifies the number of filters applied in that convolutional layer
FIGURE 3Full midslice and segmentation pipeline. Initially the midslice selection algorithm picks the slice with the highest probability of being middle. This is then fed into the segmentation pipelines that segment both the intracranial and corpus callosum area separately. This U‐net is based on the model by Ronneberger et al., where each convolutional layer in the downsampling path applies twice as many filters as the previous layer. The numbers underneath each box represent the number of filters present. Concatenation is applied in order to retain spatial information in the upsampling path. A kernel size of 3 × 3, along with a max pooling of 2 × 2, was applied. CC, corpus callosum: IC, intracranial
Performance of the midslice selection algorithms using scanner‐wise cross‐validation
| Overall | Aera | Avanto | Trio | |||
|---|---|---|---|---|---|---|
| ( | ( | ( | ( | |||
| midCNNT1 | MAE | 1.16 mm | 1.11 mm | 1.11 mm | 1.28 | |
| Mean NAE | 0.32% | 0.31% | 0.29% | 0.35% | ||
| Max AE | 6 mm | 3 mm | 3 mm | 6 mm | ||
| N. AE ≤ 3 mm | 101 | 34 | 34 | 33 | ||
| midCNNFLAIR | MAE | 0.68 mm | 0.62 mm | 0.59 mm | 0.85 mm | |
| Mean NAE | 0.40% | 0.36% | 0.34% | 0.50% | ||
| Max AE | 4 mm | 2 mm | 3 mm | 4 mm | ||
| N. AE ≤ 3 mm | 101 | 34 | 34 | 33 | ||
| midCNNT1/FLAIR | On T1w scans | MAE | 1.14 mm | 1.07 mm | 0.98 mm | 1.41 mm |
| Mean NAE | 0.31% | 0.29% | 0.26% | 0.39% | ||
| Max AE | 6 mm | 3 mm | 3 mm | 6 mm | ||
| N. AE ≤ 3 mm | 99 | 34 | 34 | 31 | ||
| On FLAIR scans | MAE | 0.64 mm | 0.76 mm | 0.44 mm | 0.71 mm | |
| Mean NAE | 0.37% | 0.44% | 0.25% | 0.42% | ||
| Max AE | 2 mm | 2 mm | 1 mm | 2 mm | ||
| N. AE ≤ 3 mm | 102 | 34 | 34 | 34 |
Note: For this scanner‐wise cross‐validation, at each fold the data from one scanner (Aera, Avanto or Trio) were used as validation set, whereas those from the remaining scanners were used as training set. n signifies the number of patients. For midCNNT1/+FLAIR, the AEs are analyzed separately for each MRI sequence (T1‐weighted [T1w] and FLAIR) as this is an important metric for a multicontrast algorithm. No significant difference was found across scanners as tested by analysis of variance (ANOVA).
Abbreviations: CNN, convolutional neural network; MAE, mean absolute error (between ground‐truth and predicted midslice); Mean NAE, average normalized absolute error (obtained by dividing each AE by the total image size along the sagittal view); Max AE, maximum absolute error; N. AE ≤ 3 mm, number of cases that reported an error that was less or equal to 3 mm.
FIGURE 4A clustered boxplot showing how the segmentation accuracy significantly decreases with higher atrophy. All patients in the training cohort were split into one of three atrophy levels, based on whether their normalized CC area was in the top, middle, or bottom third of the cohort. *P‐value < .05; **P‐value < .01; CC, corpus callosum
FIGURE 5Segmentation output for each sequence and atrophy level. nCCA, normalized corpus callosum area
Performance of the IC and CC segmentation networks using scanner‐wise cross‐validation
| Overall | Aera | Avanto | Trio | ||
|---|---|---|---|---|---|
| ( | ( | ( | ( | ||
| T1 only | IC‐NetT1 | .974 ± .019 | .973 ± .011 | .976 ± .012 | .971 ± .029 |
| CC‐NetT1 | .902 ± .065 | .908 ± .049 | .902 ± .050 | .896 ± .088 | |
| FLAIR only | IC‐NetFLAIR | .965 ± .028 | .970 ± .014 | .957 ± .043 | .968 ± .013 |
| CC‐NetFLAIR | .828 ± .110 | .812 ± .144 | .846 ± .098 | .827 ± .094 | |
| T1 using T1/ FLAIR networks | IC‐NetT1/+FLAIR | .974 ± .013 | .975 ± .006 | .973 ± .011 | .973 ± .019 |
| CC‐NetT1/+FLAIR | .894 ± .062 | .911 ± .028 | .888 ± .067 | .884 ± .078 | |
| FLAIR using T1/FLAIR networks | IC‐NetT1/FLAIR | .967 ± .019 | .961 ± .029 | .970 ± .010 | .971 ± .010 |
| CC‐NetT1/FLAIR | .808 ± .149 | .742 ± .196 | .830 ± .128 | .852 ± .079 |
Note: For this scanner‐wise cross‐validation, at each fold the data from one scanner (Aera, Avanto or Trio) were used as validation set, whereas those from the remaining scanners were used as training set.
Abbreviations: CC, corpus callosum; FLAIR, fluid‐attenuated inversion recovery; IC, intracranial; n, number of patients; T1, T1‐weighted scan.
**P < .01, only the CC segmentation of FLAIR scans using the CC‐NetT1/+FLAIR algorithm showed a significant difference between scanners, as tested by analysis of variance (ANOVA). All other segmentations did not vary significantly across scanners.
Intraclass correlation coefficients of automatic and semi‐automatic pipelines
| Automatic midslice selection | Manual midslice selection | ||
|---|---|---|---|
| IC‐NetT1 + CC‐NetT1 | ICC = .942 | ICC = .883 | |
| 95% CI: .602‐.988 | 95% CI: .679‐.97 | ||
| IC‐NetFLAIR + CC‐NetFLAIR | ICC = .739 | ICC = .828 | |
| 95% CI: .421‐.925 | 95% CI: .501‐.956 | ||
| IC‐NetT1/FLAIR + CC‐NetT1/FLAIR | T1 data | ICC = .908 | ICC = .910 |
| 95% CI: .617‐.979 | 95% CI: .734‐.977 | ||
| FLAIR data | ICC = .753 | ICC = .633 | |
| 95% CI: .421‐.931 | 95% CI: .235‐.889 |
Note: Results presented do not significantly differ between algorithms, as tested by independent samples t‐tests.
Abbreviations: CC, corpus callosum; CI, confidence interval; FLAIR, fluid‐attenuated inversion recovery; IC, intracranial; ICC, intraclass correlation coefficient; T1, T1‐weighted scan.
Segmentation output correlation with FreeSurfer and clinical disability
| Pipeline | FreeSurfer nCCV ( | EDSS ± 6 months ( | EDSS future | SDMT ± 6 months ( | SDMT future | |
|---|---|---|---|---|---|---|
| FreeSurfer nCCV | N/A |
|
|
|
| |
| FLAIR‐specific |
|
|
|
|
| |
| T1w‐specific |
|
|
|
|
| |
| Combined T1w and FLAIR | FLAIR |
|
|
|
|
|
| T1w |
|
|
|
|
|
Abbreviations: nCCV, normalized corpus callosum volume; EDSS, Expanded Disability Status Scale; n, number of patients; r, Pearson's correlation coefficient; SDMT, Symbol Digit Modalities Test; T1w, T1‐weighted scan; ρ, Spearman's rank correlation coefficient.
*P < .05; **P < .01.
EDSS follow‐up time was 6.7 ± 2.6 years.
SDMT follow‐up time was 5.8 ± 2.8 years.