| Literature DB >> 35373881 |
Erica Balboni1,2, Luca Nocetti1, Chiara Carbone2,3, Nicola Dinsdale4,5, Maurilio Genovese6, Gabriele Guidi1, Marcella Malagoli6, Annalisa Chiari6, Ana I L Namburete5, Mark Jenkinson4,7,8, Giovanna Zamboni2,3,4.
Abstract
Research on segmentation of the hippocampus in magnetic resonance images through deep learning convolutional neural networks (CNNs) shows promising results, suggesting that these methods can identify small structural abnormalities of the hippocampus, which are among the earliest and most frequent brain changes associated with Alzheimer disease (AD). However, CNNs typically achieve the highest accuracy on datasets acquired from the same domain as the training dataset. Transfer learning allows domain adaptation through further training on a limited dataset. In this study, we applied transfer learning on a network called spatial warping network segmentation (SWANS), developed and trained in a previous study. We used MR images of patients with clinical diagnoses of mild cognitive impairment (MCI) and AD, segmented by two different raters. By using transfer learning techniques, we developed four new models, using different training methods. Testing was performed using 26% of the original dataset, which was excluded from training as a hold-out test set. In addition, 10% of the overall training dataset was used as a hold-out validation set. Results showed that all the new models achieved better hippocampal segmentation quality than the baseline SWANS model (ps < .001), with high similarity to the manual segmentations (mean dice [best model] = 0.878 ± 0.003). The best model was chosen based on visual assessment and volume percentage error (VPE). The increased precision in estimating hippocampal volumes allows the detection of small hippocampal abnormalities already present in the MCI phase (SD = [3.9 ± 0.6]%), which may be crucial for early diagnosis.Entities:
Keywords: Alzheimer disease; deep learning; hippocampus; magnetic resonance imaging; mild cognitive impairment; neural networks; transfer learning
Mesh:
Year: 2022 PMID: 35373881 PMCID: PMC9248306 DOI: 10.1002/hbm.25858
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.399
FIGURE 1Structure of the u‐net: black layers were kept fixed for the transfer learning, while red layers were trained. Each layer uses a ReLu activation function (Nicola K. Dinsdale et al., 2019). The final section of the network, the spatial transformer, is not shown but only has fixed parameters
Characteristics of the acquisition sequences
| DS | Scanner | Field | Voxel size | Field of view | Sequence | TE (ms) | TR (ms) | TI (ms) | FA | Ch. | Acquisition time (min) | Acceleration factor | SMS factor |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | General Electric Signa Architect | 3T | 0.5 mmisotropic | 328 X 512 X 340 | MP‐rage | 3.1 | 2,150 | 900 | 8° | 48 | 5.5 | 2 | 2 |
| 2 | Philips Achieva | 3T | 1 mm isotropic | 160 X 205 X 140 | MP‐rage | 4.6 | 9,360 | 900 | 15° | 8 | 4.7 | 2 | 1 |
| 3 | Siemens Magnetom | 3T | 1 mm isotropic | 192 X 174 X 192 | MP‐rage | 4.7 | 2040 | 900 | 8° | 32 | 5.56 | 1 | 1 |
Abbreviations: Ch., number of channels in the head coil; DS, dataset; FA, flip angle; SMS, simultaneous multislice imaging; TE, echo time; TR, repetition time.
FIGURE 2Boxplots of Dice coefficients for the SWANS network with the “Baseline” model (using separately labels from rater 1 and 2 as ground truth) on the left and for the manual raters (comparing the raters with each other) on the right, using the whole dataset of 146 hippocampi
Statistical values of the Dice coefficients for SWANS with the “Baseline” model (comparing SWANS separately to rater 1 and rater 2) and for the two raters
| SWANS “Baseline” | Raters | |
|---|---|---|
| Dice coefficient: | ||
| Median | 0.847 | 0.880 |
| Interquartile range | 0.826–0.862 | 0.866–0.896 |
| Mean | 0.838 | 0.880 |
|
| 0.04 | 0.03 |
FIGURE 3Bland Altman plot comparing hippocampal volumes from the SWANS network with the “Baseline” model to the gold standard (the average of the two manual labels). Positive values indicate that hippocampal volumes from SWANS were larger than gold standard and vice versa
FIGURE 4Boxplot of Dice coefficients for the SWANS network with different transfer learning models (considering separately labels from rater 1 and 2 as ground truth), using a hold‐out test dataset of 38 hippocampi
Statistical values of VPEs and Dice coefficients for each SWANS model
| Baseline | Transfer DS2 | Transfer DS1 | Transfer all | Training all | |
|---|---|---|---|---|---|
| Dice coefficient | |||||
| Median | 0.852 | 0.881 | 0.884 | 0.884 | 0.885 |
| 25th to 75th percentiles | 0.835 to 0.867 | 0.858 to 0.895 | 0.859 to 0.898 | 0.864 to 0.898 | 0.858 to 0.898 |
| Mean | 0.848 | 0.874 | 0.876 | 0.878 | 0.878 |
|
| 0.03 | 0.03 | 0.03 | 0.03 | 0.03 |
| VPE | |||||
| Median | −6.4% | 3.1% | 1.2% | 5.0% | 2.4% |
| 25th to 75th percentiles | −9.7% to −1.4% | 0.46% to 5.4% | −1.9% to 4.5% | 2.1% to 7.1% | 0.48% to 4.8% |
| Mean | −5.7% | 2.9% | 1.1% | 4.1% | 2.1% |
|
| 5.8% | 4.9% | 4.2% | 4.5% | 3.9% |
Note: The mean and median values of VPE highlight systematic difference with the gold standard, while SDs and quartile ranges (between the 25th and 75th percentiles) show the precision of the algorithm in estimating hippocampal volume.
FIGURE 5Boxplots of VPE values for the SWANS network with different transfer learning models, using a hold‐out test dataset of 38 hippocampi. The median values highlight a systematic difference with the gold standard, while the interquartile ranges show the precision of the algorithm in estimating hippocampal volumes
FIGURE 6Bland Altman plot comparing hippocampal volumes from the SWANS network, for the different transfer learning models, to the gold standard. Positive values indicate that hippocampal volumes from SWANS were larger than gold standard and vice versa
FIGURE 7Hippocampal segmentation maps produced by different models, compared to ones from the two raters. In the first row the red arrows highlight the presence of isolated voxel labels, in the second row they show incomplete portions of the hippocampal head, in third row they indicate incomplete portions in the lateral side, and in the fourth row they highlight an additional tract in the caudal region. Dice coefficient and VPE scores of each segmentation are showed below the corresponding image. Subjects in rows 1 and 2 come from dataset 2, subject in row 3 from dataset 1 and subject in row 4 from dataset 3; all views are sagittal and all views except row 4 are from the right hippocampus; rows 1 and 2 show medial slices and rows 3 and 4 show lateral slices