| Literature DB >> 35633447 |
Yun Wang1,2, Fateme Sadat Haghpanah3, Xuzhe Zhang4, Katie Santamaria2, Gabriela Koch da Costa Aguiar Alves2, Elizabeth Bruno2, Natalie Aw2, Alexis Maddocks5, Cristiane S Duarte2, Catherine Monk2,6, Andrew Laine4, Jonathan Posner7,8.
Abstract
Infant brain magnetic resonance imaging (MRI) is a promising approach for studying early neurodevelopment. However, segmenting small regions such as limbic structures is challenging due to their low inter-regional contrast and high curvature. MRI studies of the adult brain have successfully applied deep learning techniques to segment limbic structures, and similar deep learning models are being leveraged for infant studies. However, these deep learning-based infant MRI segmentation models have generally been derived from small datasets, and may suffer from generalization problems. Moreover, the accuracy of segmentations derived from these deep learning models relative to more standard Expectation-Maximization approaches has not been characterized. To address these challenges, we leveraged a large, public infant MRI dataset (n = 473) and the transfer-learning technique to first pre-train a deep convolutional neural network model on two limbic structures: amygdala and hippocampus. Then we used a leave-one-out cross-validation strategy to fine-tune the pre-trained model and evaluated it separately on two independent datasets with manual labels. We term this new approach the Infant Deep learning SEGmentation Framework (ID-Seg). ID-Seg performed well on both datasets with a mean dice similarity score (DSC) of 0.87, a mean intra-class correlation (ICC) of 0.93, and a mean average surface distance (ASD) of 0.31 mm. Compared to the Developmental Human Connectome pipeline (dHCP) pipeline, ID-Seg significantly improved segmentation accuracy. In a third infant MRI dataset (n = 50), we used ID-Seg and dHCP separately to estimate amygdala and hippocampus volumes and shapes. The estimates derived from ID-seg, relative to those from the dHCP, showed stronger associations with behavioral problems assessed in these infants at age 2. In sum, ID-Seg consistently performed well on two different datasets with an 0.87 DSC, however, multi-site testing and extension for brain regions beyond the amygdala and hippocampus are still needed.Entities:
Keywords: Amygdala; Behavioral problems; Convolutional neural networks; Deep learning; Hippocampus; Infant neuroimaging; Segmentation
Year: 2022 PMID: 35633447 PMCID: PMC9148335 DOI: 10.1186/s40708-022-00161-9
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Demographics and MRI sequence information
| Training DHCP* ( | ECHO-Dataset1ƒ ( | M-CRIB† ( | ECHO-Dataset2‡ ( | |
|---|---|---|---|---|
| PMA at scan, weeks | 40.65 ± 2.19 | 46.90 ± 4.14 | 39.78 ± 1.31 | 48.06 ± 4.69 |
| Sex | ||||
| Female, N(%) | 266 (43.8%) | 13 (65.0%) | 4 (40.0%) | 25 (50%) |
| Male, N (%) | 207 (56.2%) | 7 (35.0%) | 6 (60.0%) | 25 (50%) |
| MRI scanners | 3 Philips | 3T GE | 3T Siemens | 3T GE |
| MRI resolution (mm3) | [0.5, 0.5, 0.5] | [0.9, 0.9, 0.9] | [0.63, 0.63, 0.63] | [0.9, 0.9, 0.9] |
| MRI dimensions | [290,290,203] | [130,256,256] | [304,304,157] | [130, 256, 256] |
For quantitative variables, data are presented as mean ± standard deviation unless otherwise noted. PMA: postmenstrual age. *The large training DHCP dataset with corresponding dHCP labels was used to pre-train the model with sufficient data. ƒThe ECHO-Dataset1 dataset was used to test the model’s performance as an internal source. †The M-CRIB dataset was used as an external test dataset to further test the reliability of our proposed deep learning framework. ‡The proof-of-concept ECHO-Dataset2 was used to test the association between brain morphometric measures at birth and corresponding CBCL measures at age 2-year-old.
Fig. 1Overview of this study. a Using 3 independent infant MRI datasets through a transfer-learning approach, we trained, fine-tuned, and cross-validated a deep-learning segmentation framework (ID-Seg) for hippocampus and amygdala, both with internal and external datasets; b we further explored the prospective associations between morphometric measures (left and right hippocampus and amygdala) in infants and behavior problems at age 2. *Cyan color represents segmented hippocampus, and red represents segmented amygdala. LOOCV leave-one-out cross-validation
Segmentation evaluations and comparisons
| Dataset | Metric | Region | Segmentation method | F | p.adj | ||
|---|---|---|---|---|---|---|---|
| dHCP | ID-Seg without pre-training | ID-Seg with pre-training | |||||
| Internal dataset: ECHO-Dataset1 | DSC | L Amyg | 0.79 (0.12) | 0.76(0.07) | 0.86(0.03) | 8.81 | 0.001 |
| (n = 20) | L Hippo | 0.76 (0.09) | 0.75 (0.08) | 0.87 (0.03) | 14.10 | < 10−4 | |
| R Amy | 0.77 (0.14) | 0.76 (0.08) | 0.86 (0.03) | 6.31 | 0.003 | ||
| R Hippo | 0.74 (0.15) | 0.76 (0.08) | 0.87 (0.03) | 10.67 | < 10−4 | ||
| ICC | L Amyg | 0.87 (0.09) | 0.86 (0.05) | 0.92 (0.02) | 3.41 | 0.05 | |
| L Hippo | 0.86 (0.08) | 0.85 (0.06) | 0.93 (0.02) | 4.75 | 0.024 | ||
| R Amyg | 0.86 (0.13) | 0.86 (0.05) | 0.92 (0.02) | 3.79 | 0.05 | ||
| R Hippo | 0.84 (0.13) | 0.86 (0.05) | 0.93 (0.01) | 5.89 | 0.02 | ||
| ASD | L Amyg | 0.49 (0.34) | 0.41 (0.09) | 0.32 (0.11) | 5.89 | 0.007 | |
| L Hippo | 0.60 (0.37) | 0.61 (0.59) | 0.26 (0.11) | 10.38 | 0.001 | ||
| R Amyg | 0.53 (0.47) | 0.79 (1.3) | 0.31 (0.09) | 3.97 | 0.024 | ||
| R Hippo | 0.65 (0.43) | 0.66 (0.57) | 0.26 (0.11) | 6.43 | 0.006 | ||
| External dataset: M-CRIB | DSC | L Amyg | 0.73 (0.02) | 0.81 (0.03) | 0.88 (0.02) | 184.47 | < 10−4 |
| (n = 10) | L Hippo | 0.67 (0.04) | 0.81 (0.03) | 0.88 (0.03) | 156.2 | < 10−4 | |
| R Amyg | 0.67 (0.03) | 0.83 (0.04) | 0.87 (0.02) | 68.78 | < 10−4 | ||
| R Hippo | 0.60 (0.04) | 0.83 (0.03) | 0.87 (0.03) | 109.61 | < 10−4 | ||
| ICC | L Amyg | 0.86 (0.05) | 0.90 (0.01) | 0.93(0.01) | 195.48 | < 10−4 | |
| L Hippo | 0.83 (0.06) | 0.91 (0.02) | 0.94(0.01) | 393.73 | < 10−4 | ||
| R Amyg | 0.83 (0.06) | 0.90 (0.02) | 0.93(0.01) | 88.13 | < 10−4 | ||
| R Hippo | 0.78 (0.09) | 0.91 (0.01) | 0.93(0.02) | 373.74 | < 10−4 | ||
| ASD | L Amyg | 0.94 (0.13) | 0.55 (0.06) | 0.36 (0.11) | 190.95 | < 10−4 | |
| L Hippo | 2.5 (0.29) | 0.37 (0.17) | 0.25 (0.06) | 158.59 | < 10−4 | ||
| R Amyg | 1.1 (0.11) | 0.45 (0.09) | 0.41 (0.09) | 70.13 | < 10–4 | ||
| R Hippo | 3.3 (0.44) | 0.41 (0.18) | 0.28 (0.06) | 112.71 | < 10–4 | ||
DSC Dice similarity coefficients, ICC intra-class correlation, ASD average surface distance, measured in mm. Segmentation metrics for each method are shown in mean (standard deviation). Higher DSC and ICC, and lower ASD indicate better segmentation accuracy. Amyg Amygdala, Hippo hippocampus; L left; R right
Fig. 2Visual comparisons between the “ground-truth” manual, dHCP, and ID-Seg segmentations for the left hippocampus’s 3D shape using our internal (ECHO-Dataset1) and external (M-CRIB) datasets, respectively. Red arrow highlights the areas with notable differences
Fig. 3Brain–behavior relationships (in black rectangles) for a ID-Seg, b dHCP. X indicates that the p value of spearman correlation is not significant at the threshold of p = 0.05. t_total CBCL total problems T score, t_inter CBCL internalizing problems T score, t_exter CBCL externalizing problems T score, amyg amygdala, hippo hippocampus