| Literature DB >> 34566556 |
Liying Peng1,2, Lanfen Lin1, Yusen Lin3, Yen-Wei Chen4, Zhanhao Mo5, Roza M Vlasova2, Sun Hyung Kim2, Alan C Evans6, Stephen R Dager7, Annette M Estes8, Robert C McKinstry9, Kelly N Botteron9,10, Guido Gerig11, Robert T Schultz12, Heather C Hazlett2,13, Joseph Piven2,13, Catherine A Burrows14, Rebecca L Grzadzinski2,13, Jessica B Girault2,13, Mark D Shen2,13,15, Martin A Styner2,16.
Abstract
The infant brain undergoes a remarkable period of neural development that is crucial for the development of cognitive and behavioral capacities (Hasegawa et al., 2018). Longitudinal magnetic resonance imaging (MRI) is able to characterize the developmental trajectories and is critical in neuroimaging studies of early brain development. However, missing data at different time points is an unavoidable occurrence in longitudinal studies owing to participant attrition and scan failure. Compared to dropping incomplete data, data imputation is considered a better solution to address such missing data in order to preserve all available samples. In this paper, we adapt generative adversarial networks (GAN) to a new application: longitudinal image prediction of structural MRI in the first year of life. In contrast to existing medical image-to-image translation applications of GANs, where inputs and outputs share a very close anatomical structure, our task is more challenging as brain size, shape and tissue contrast vary significantly between the input data and the predicted data. Several improvements over existing GAN approaches are proposed to address these challenges in our task. To enhance the realism, crispness, and accuracy of the predicted images, we incorporate both a traditional voxel-wise reconstruction loss as well as a perceptual loss term into the adversarial learning scheme. As the differing contrast changes in T1w and T2w MR images in the first year of life, we incorporate multi-contrast images leading to our proposed 3D multi-contrast perceptual adversarial network (MPGAN). Extensive evaluations are performed to assess the qualityand fidelity of the predicted images, including qualitative and quantitative assessments of the image appearance, as well as quantitative assessment on two segmentation tasks. Our experimental results show that our MPGAN is an effective solution for longitudinal MR image data imputation in the infant brain. We further apply our predicted/imputed images to two practical tasks, a regression task and a classification task, in order to highlight the enhanced task-related performance following image imputation. The results show that the model performance in both tasks is improved by including the additional imputed data, demonstrating the usability of the predicted images generated from our approach.Entities:
Keywords: MRI; autism; generative adversarial networks; imputation; infant; longitudinal prediction; machine learning; postnatal brain development
Year: 2021 PMID: 34566556 PMCID: PMC8458966 DOI: 10.3389/fnins.2021.653213
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
Figure 1Overview of the proposed methods. Panels (A,C) are the architectures of PGAN and MPGAN, respectively. Panel (B) shows the discriminator and the feature extractor setup for the perceptual loss computation.
The network architecture of perceptual adversarial network.
|
| |
|---|---|
|
|
|
| (3 × 3 × 3) 20 Conv, IN, RL | (2 × 2 × 2)↑, (3 × 3 × 3) 160 Conv, IN, RL |
| (3 × 3 × 3) 40 Conv, IN, RL, (2 × 2 × 2)↓ | (3 × 3 × 3) 160 Conv, IN, RL |
| (3 × 3 × 3) 40 Conv, IN, RL | (2 × 2 × 2)↑, (3 × 3 × 3) 80 Conv, IN, RL |
| (3 × 3 × 3) 80 Conv, IN, RL, (2 × 2 × 2)↓ | (3 × 3 × 3) 80 Conv, IN, RL |
| (3 × 3 × 3) 80 Conv, IN, RL | (2 × 2 × 2)↑, (3 × 3 × 3) 40 Conv, IN, RL |
| (3 × 3 × 3) 160 Conv, IN, RL, (2 × 2 × 2)↓ | (3 × 3 × 3) 40 Conv, IN, RL |
| (3 × 3 × 3) 160 Conv, IN, RL | (1 × 1 × 1) 1 Conv, tanh |
| (3 × 3 × 3) 320 Conv, IN, RL | |
|
| |
| (4 × 4 × 4) 64 stride 2 Conv, LR, (4 × 4 × 4) 128 stride 2 Conv, IN, LR | |
| (4 × 4 × 4) 256 stride 2 Conv, IN, LR, (4 × 4 × 4) 512 stride 2 Conv, IN, LR | |
| (4 × 4 × 4) 1 stride 1 Conv, sigmoid | |
|
| |
| (3 × 3 × 3) 32 Conv, IN, RL, (3 × 3 × 3) 64 Conv, IN, RL, (2 × 2 × 2)↓ | |
| (3 × 3 × 3) 64 Conv, IN, RL, (3 × 3 × 3) 128 Conv, IN, RL, (2 × 2 × 2)↓ | |
| (3 × 3 × 3) 128 Conv, IN, RL, (3 × 3 × 3) 256 Conv, IN, RL, (2 × 2 × 2)↓ | |
| (3 × 3 × 3) 256 Conv, IN, RL, (3 × 3 × 3) 512 Conv, IN, RL | |
Conv, convolution; IN, Instance Normlization; RL, ReLu; LR, Leaky ReLU; ↓ and ↑, represent down- and upsampling, respectively; sigmoid, sigmoid activation function; tanh, tanh activation function.
Figure 2Examples of predicted MR images at 12 months (from 6 months MRI) compared across seven methods and the corresponding ground truth. (A) Axial view. (B) Coronal view.
Figure 3Examples of predicted MR images at 6 months (from 12 months MRI) compared across seven methods and the corresponding ground truth. (A) Axial view. (B) Coronal view.
Figure 4Quantitative comparison results of different methods based on Learned Perceptual Image Patch Similarity (LPIPS). A lower LPIPS value reflects a higher similarity between ground-truth and predicted images.
The average score results of human assessments based on the appearance of images predicted by different methods.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 6-month T1w | 1.48 ± 0.64 | 2.18 ± 0.68 | 2.58 ± 1.09 | 3.44 ± 1.11 | 5.24 ± 0.91 | 6.13 ± 0.84 |
|
| 6-month T2w | 1.87 ± 0.80 | 2.14 ± 0.78 | 3.29 ± 1.54 | 3.32 ± 0.84 | 5.34 ± 0.82 | 5.96 ± 0.76 |
|
| 12-month T1w | 2.83 ± 1.27 | 2.15 ± 1.46 | 3.68 ± 1.75 | 4.20 ± 1.36 | 3.92 ± 1.64 | 6.22 ± 0.71 |
|
| 12-month T2w | 2.13 ± 1.07 | 1.73 ± 0.88 | 2.63 ± 1.29 | 3.29 ± 1.06 | 4.01 ± 1.40 | 5.89 ± 0.64 |
|
The scores range from 1 to 7, with higher scores indicating a higher degree of realism and closer appearance to the ground truth. Methods with best performance are bolded for each setting (significantly better than other measurements, p < 0.5).
Figure 5Examples of the tissue segmentation results. The first four columns are the automatic segmentations of the predicted and the ground-truth images (SegGT). The last column is the reference manual segmentation (Ref).
Figure 6Examples of the subcortical segmentation results. The first four columns are the automatic segmentations of the predicted and the ground-truth images (SegGT). The last column is the reference manual segmentation (Ref).
Segmentation consistency across different approaches on subcortical segmentation task.
|
|
|
|
| |
|---|---|---|---|---|
| SegPredictGAN vs. SegGT | ||||
| SegPredictUnet( | ||||
| SegPredictGAN+ | ||||
| SegPredictCycleGAN vs. SegGT | ||||
| SegPredictUnet( | 0.555 ± 0.021 | 0.820 ± 0.044 | ||
| SegPredictPGAN vs. SegGT | 0.555 ± 0.030 | 0.820 ± 0.047 | ||
| SegPredictMPGAN vs. SegGT |
| 0.556 ± 0.025 | 0.820 ± 0.043 |
|
SegPredict.
Significantly different compared to MPGAN (p < 0.05).
Significantly different compared to PGAN (p < 0.05).
↓, lower is better. ↑, higher is better. The methods are sorted by the fused score. Methods with best performance are bolded for each metric (significantly better than other measurements, p < 0.5).
Segmentation consistency across different approaches on tissue segmentation task.
|
|
|
|
| |
|---|---|---|---|---|
| SegPredictGAN vs. SegGT | ||||
| SegPredictCycleGAN vs. SegGT | ||||
| SegPredictUnet( | ||||
| SegPredictGAN+ | ||||
| SegPredictUnet( | ||||
| SegPredictPGAN vs. SegGT | 2.958 ± 1.566 | |||
| SegPredictMPGAN vs. SegGT | 2.933 ± 1.404 |
|
|
|
SegPredict.
Significantly different compared to MPGAN (p < 0.05).
Significantly different compared to PGAN (p < 0.05).
↓, lower is better. ↑, higher is better. The methods are sorted by the fused score. Methods with best performance are bolded for each metric (significantly better than other measurements, p < 0.5).
Segmentation accuracy analysis on predicted images from different methods on subcortical segmentation task.
|
|
|
|
| |
|---|---|---|---|---|
| SegPredictUnet( | ||||
| SegPredictGAN vs. Ref | ||||
| SegPredictGAN+ | ||||
| SegPredictCycleGAN vs. Ref | ||||
| SegPredictUnet( | ||||
| SegPredictPGAN vs. Ref | 0.720 ± 0.110 | 0.771 ± 0.056 | ||
| SegPredictMPGAN vs. Ref |
| 0.721 ± 0.109 | 0.771 ± 0.051 |
|
| SegGT vs. Ref | 6.000 ± 4.396 | 0.666 ± 0.119 | 0.788 ± 0.055 | 1.902 ± 0.390 |
SegPredict.
Significantly different compared to MPGAN (p < 0.05).
Significantly different compared to PGAN (p < 0.05).
The methods are sorted by the fused core. Methods with best performance are bolded for each metric (significantly better than other measurements, p < 0.5).
Segmentation accuracy analysis on predicted images from different methods on tissue segmentation task.
|
|
|
|
| |
|---|---|---|---|---|
| SegPredictGAN vs. Ref | 4.090 ± 2.507 | |||
| SegPredictCycleGAN vs. Ref | 5.493 ± 2.908 | |||
| SegPredictUnet( | ||||
| SegPredictGAN+ | ||||
| SegPredictUnet( | 3.610 ± 2.109 | |||
| SegPredictPGAN vs. Ref | 3.817 ± 1.506 | |||
| SegPredictMPGAN vs. Ref | 3.948 ± 1.285 |
|
|
|
| SegGT vs. Ref | 4.158 ± 2.604 | 0.376 ± 0.075 | 0.871 ± 0.058 | 1.185 ± 0.270 |
SegPredict.
Significantly different compared to MPGAN (p < 0.05).
Significantly different compared to PGAN (p < 0.05).
The methods are sorted by the fused score. Methods with best performance are bolded for each metric (significantly better than other measurements, p < 0.5).
Effects of imputed longitudinal data on the ADOS-SA-CSS group classification task.
|
|
|
|
|
|---|---|---|---|
| Non-imputed (77 subjects) | 0.590 | 0.655 | 0.594 |
| Imputed (77+103 subjects) | 0.694 | 0.671 | 0.699 |
↑, higher is better.
Figure 7Confusion matrices of ADOS-SA-CSS group classification task. (A) “Non-imputed” setting. (B) “Imputed” setting.
Effects of imputed longitudinal data on the gestational age (in weeks) regression task.
|
|
|
|
|---|---|---|
| Non-imputed (134 subjects) | 2.559 | 0.971 |
| Imputed (134 + 76 subjects) | 2.327 | 0.886 |
↓, lower is better.