| Literature DB >> 32494091 |
Guang Yang1,2, Jun Chen3, Zhifan Gao4, Shuo Li4, Hao Ni5,6, Elsa Angelini7, Tom Wong1,2, Raad Mohiaddin1,2, Eva Nyktari1, Ricardo Wage1, Lei Xu8, Yanping Zhang9, Xiuquan Du9, Heye Zhang3, David Firmin1,2, Jennifer Keegan1,2.
Abstract
Three-dimensional late gadolinium enhanced (LGE) cardiac MR (CMR) of left atrial scar in patients with atrial fibrillation (AF) has recently emerged as a promising technique to stratify patients, to guide ablation therapy and to predict treatment success. This requires a segmentation of the high intensity scar tissue and also a segmentation of the left atrium (LA) anatomy, the latter usually being derived from a separate bright-blood acquisition. Performing both segmentations automatically from a single 3D LGE CMR acquisition would eliminate the need for an additional acquisition and avoid subsequent registration issues. In this paper, we propose a joint segmentation method based on multiview two-task (MVTT) recursive attention model working directly on 3D LGE CMR images to segment the LA (and proximal pulmonary veins) and to delineate the scar on the same dataset. Using our MVTT recursive attention model, both the LA anatomy and scar can be segmented accurately (mean Dice score of 93% for the LA anatomy and 87% for the scar segmentations) and efficiently ( ∼ 0.27 s to simultaneously segment the LA anatomy and scars directly from the 3D LGE CMR dataset with 60-68 2D slices). Compared to conventional unsupervised learning and other state-of-the-art deep learning based methods, the proposed MVTT model achieved excellent results, leading to an automatic generation of a patient-specific anatomical model combined with scar segmentation for patients in AF.Entities:
Keywords: Atrial fibrillation; Attention model; Deep learning; Deep learning interpretation of biomedical data; Late gadolinium enhancement; Medical image segmentation
Year: 2020 PMID: 32494091 PMCID: PMC7134530 DOI: 10.1016/j.future.2020.02.005
Source DB: PubMed Journal: Future Gener Comput Syst ISSN: 0167-739X Impact factor: 7.187
Fig. 1Overall workflow of our proposed MVTT recursive attention model that consists of three major subnetworks.
Fig. 2Architecture of the proposed sequential learning network with corresponding kernel size (k), number of feature maps (n) and stride (s) indicated for each convolutional layer.
Fig. 3Architecture of the proposed dilated residual network with corresponding kernel size (k), number of feature maps (n), stride (s) and dilation rate (d) indicated for each convolutional layer.
Fig. 4Architecture of the proposed dilated attention network with corresponding kernel size (k), number of feature maps (n), stride (s) and dilation rate (d) indicated for each convolutional layer.
Quantitative results (mean standard deviation) of the cross-validated LA and PV segmentation, compared to the performance using the WHS, 2D U-Net, 3D U-Net, 2D V-Net and 3D V-Net. AC: Accuracy, SE: Sensitivity, SP: Specificity and DI: Dice score.
| Methods | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|
| WHS | 99.62 ± 0.21 | 80.86 ± 18.07 | 99.88 ± 0.14 | 84.54 ± 15.11 |
| 2D U-Net | 98.60 ± 0.42 | 93.50 ± 3.73 | 99.11 ± 0.43 | 91.97 ± 2.42 |
| 3D U-Net | 98.48 ± 0.05 | 93.02 ± 3.35 | 99.04 ± 0.44 | 90.58 ± 2.64 |
| 2D V-Net | 98.36 ± 0.58 | 92.20 ± 4.91 | 98.98 ± 0.50 | 90.66 ± 3.15 |
| 3D V-Net | 98.47 ± 0.46 | 94.43 ± 3.33 | 98.89 ± 0.44 | 91.37 ± 2.48 |
| Vesal et al. | 98.30 ± 0.71 | 94.97 ± 3.02 | 98.65 ± 0.73 | 90.58 ± 3.40 |
| SV+CLSTM | 98.49 ± 0.40 | 92.41 ± 4.59 | 99.17 ± 0.45 | 91.67 ± 3.12 |
| MV | 98.04 ± 0.89 | 90.95 ± 4.69 | 98.76 ± 0.59 | 89.03 ± 4.14 |
| S-LA/PV | 98.55 ± 0.51 | 95.32 ± 3.08 | 98.88 ± 0.50 | 91.87 ± 2.68 |
| MVTT | 98.62 ± 0.46 | 92.92 ± 4.47 | 99.20 ± 0.38 | 92.11 ± 2.39 |
As Table 1, but using the independent testing dataset.
| Methods | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|
| WHS | 99.53 ± 0.21 | 80.31 ± 17.66 | 99.83 ± 0.14 | 82.94 ± 14.39 |
| 2D U-Net | 98.60 ± 0.36 | 90.86 ± 2.18 | 99.50 ± 0.19 | 93.08 ± 1.58 |
| 3D U-Net | 98.49 ± 0.26 | 93.22 ± 1.75 | 99.11 ± 0.19 | 92.74 ± 1.22 |
| 2D V-Net | 98.33 ± 0.51 | 89.94 ± 2.90 | 99.32 ± 0.24 | 91.84 ± 2.25 |
| 3D V-Net | 98.44 ± 0.29 | 89.33 ± 2.18 | 99.50 ± 0.14 | 92.21 ± 1.36 |
| Vesal et al. | 98.54 ± 0.25 | 91.54 ± 1.76 | 99.36 ± 0.19 | 92.81 ± 1.37 |
| SV+CLSTM | 98.49 ± 0.40 | 90.07 ± 2.51 | 99.48 ± 0.27 | 92.54 ± 1.69 |
| MV | 97.83 ± 0.64 | 89.19 ± 3.02 | 98.84 ± 0.44 | 89.48 ± 4.14 |
| S-LA/PV | 98.56 ± 0.44 | 90.81 ± 3.17 | 99.48 ± 0.20 | 90.28 ± 26.83 |
| MVTT | 98.59 ± 0.40 | 91.96 ± 2.11 | 99.36 ± 0.28 | 93.11 ± 1.86 |
Fig. 5Qualitative visualisation of the LA anatomy segmentations (via independent testing) in multiple slices from an example pre-ablation (a–g) and an example post-ablation (h–n) study. Red contour: manual delineated ground truth. Green contour: segmentation using MVTT.(For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Quantitative results (mean standard deviation) of the cross-validated LA scars delineation. For the LA scars delineation, we compared with SD based thresholding (2-SD), k-means, Fuzzy c-means, 2D U-Net, 3D U-Net, 2D V-Net and 3D V-Net. AC: Accuracy, SE: Sensitivity, SP: Specificity and DI: Dice score.
| Methods | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|
| 2-SD | 99.89 ± 0.05 | 76.37 ± 19.43 | 99.92 ± 0.06 | 57.84 ± 18.07 |
| K-means | 99.85 ± 0.04 | 74.78 ± 15.17 | 99.89 ± 0.05 | 49.80 ± 16.27 |
| Fuzzy c-means | 99.85 ± 0.04 | 78.67 ± 14.24 | 99.88 ± 0.06 | 49.95 ± 17.45 |
| 2D U-Net | 99.92 ± 0.04 | 87.94 ± 8.90 | 99.94 ± 0.03 | 77.86 ± 9.03 |
| 3D U-Net | 99.92 ± 0.04 | 77.43 ± 14.26 | 99.96 ± 0.03 | 74.39 ± 12.06 |
| 2D V-Net | 99.92 ± 0.04 | 84.51 ± 11.19 | 99.94 ± 0.03 | 75.01 ± 11.80 |
| 3D V-Net | 99.92 ± 0.04 | 83.97 ± 8.90 | 99.95 ± 0.03 | 74.28 ± 13.80 |
| Vesal et al. | 99.94 ± 0.03 | 77.18 ± 14.5 | 99.97 ± 0.02 | 75.25 ± 12.3 |
| MV+AT | 99.93 ± 0.04 | 82.21 ± 10.35 | 99.96 ± 0.03 | 79.38 ± 10.62 |
| MV+CLSTM | 99.40 ± 0.03 | 81.09 ± 12.58 | 99.97 ± 0.02 | 81.12 ± 9.79 |
| SV+CLSTM+AT | 99.93 ± 0.04 | 82.47 ± 10.42 | 99.96 ± 0.02 | 78.68 ± 9.26 |
| S-Scar | 99.95 ± 0.03 | 80.65 ± 9.77 | 99.97 ± 0.01 | 82.32 ± 8.36 |
| MVTT | 99.94 ± 0.03 | 85.88 ± 9.84 | 99.97 ± 0.02 | 82.58 ± 8.72 |
As Table 3, but using the independent testing dataset.
| Methods | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|
| 2-SD | 99.84 ± 0.06 | 81.38 ± 19.51 | 99.87 ± 0.07 | 51.66 ± 19.50 |
| K-means | 99.82 ± 0.06 | 75.92 ± 13.76 | 99.86 ± 0.06 | 47.78 ± 15.24 |
| Fuzzy c-means | 99.82 ± 0.04 | 77.78 ± 13.76 | 99.86 ± 0.05 | 48.09 ± 16.26 |
| 2D U-Net | 99.93 ± 0.03 | 89.14 ± 4.66 | 99.95 ± 0.01 | 80.21 ± 9.61 |
| 3D U-Net | 99.93 ± 0.04 | 75.22 ± 11.31 | 99.98 ± 0.01 | 79.41 ± 8.02 |
| 2D V-Net | 99.92 ± 0.03 | 86.39 ± 6.30 | 99.95 ± 0.02 | 79.45 ± 8.60 |
| 3D V-Net | 99.92 ± 0.04 | 77.07 ± 9.72 | 99.98 ± 0.01 | 77.05 ± 9.96 |
| Vesal et al. | 99.92 ± 0.03 | 83.71 ± 10.4 | 99.96 ± 0.03 | 76.13 ± 10.9 |
| MV+AT | 99.93 ± 0.04 | 70.83 ± 9.44 | 99.99 ± 0.01 | 79.05 ± 8.66 |
| MV+CLSTM | 99.30 ± 0.03 | 65.36 ± 11.17 | 99.99 ± 0.01 | 77.10 ± 9.26 |
| SV+CLSTM+AT | 99.94 ± 0.03 | 81.27 ± 7.88 | 99.97 ± 0.01 | 82.36 ± 6.51 |
| S-Scar | 99.94 ± 0.03 | 72.39 ± 8.88 | 99.99 ± 0.01 | 80.22 ± 0.01 |
| MVTT | 99.95 ± 0.02 | 86.77 ± 4.64 | 99.98 ± 0.01 | 86.59 ± 5.60 |
Fig. 6Qualitative visualisation of LA scars delineation (independent testing results) in an example pre-ablation (a–i) and post-ablation (j–r) study using different methods. Red manually segmentation (ground truth), green algorithm segmentation.(For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 73D visualisation for LA anatomy and LA scars of the independent testing results (DI_L represents the DI value for predicted LA anatomy. DI_S represents the DI value for predicted LA scars). (a–c) Ground truth and (d–f) Segmentation results of using our MVTT method.
Comparison of different parameter settings for the LA segmentation.
| Type | Method | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|---|
| 10-fold | AFT | ||||
| K5 | |||||
| NDC | |||||
| MVTT | |||||
| Independent testing | AFT | ||||
| K5 | |||||
| NDC | |||||
| MVTT | |||||
Comparison of different parameter settings for the scar segmentation.
| Type | Method | AC (%) | SE (%) | SP (%) | DI (%) |
|---|---|---|---|---|---|
| 10-fold | AFT | ||||
| K5 | |||||
| NDC | |||||
| MVTT | |||||
| Independent testing | ATF | ||||
| K5 | |||||
| NDC | |||||
| MVTT | |||||
Fig. 10Boxplot of the Dice scores for comparison studies on LA scars segmentation. Training/cross-validation on the pre-ablation (a) and post-ablation (b) cases. Independent testing on the pre-ablation (c) and post-ablation (d) cases.
Fig. 8Correlation between the estimated LA scars percentage (ESP) of our MVTT method and the LA scars percentage from the manual delineation (MSP) (diagonal lines represent lines of identity). (a) and (b) show the correlations for pre and post ablation studies in the training/cross-validation datasets, and (c) and (d) show the correlations for pre and post ablation studies in the independent testing datasets.
Fig. 9Bland–Altman plots for the calculated LA scars percentage of our MVTT method and the LA scars percentage of the manual delineation. (a) and (b) were calculated on the 170 LGE CMR images using training/cross-validation results. (c) and (d) were calculated on the 20 LGE CMR images using independent testing results. Horizontal lines show the mean difference and the 95% CI of limits of agreement (confidence limits of the bias), which are defined as the mean difference plus/minus 1.96 times the standard deviation of the differences. The mean differences are near the 0-line (bias [95% CI to 4%] and bias [95% CI to 5%] for the pre-ablation and post-ablation cases respectively via training/cross-validation and bias [95% CI to 1.7%] and bias [95% CI −2.3% to 2.6%] for the pre-ablation and post-ablation cases respectively via independent testing. In summary, no significant systematic differences between the two methods can be discerned. MSP: Manual Segmented Atrial Scar Percentage; ESP: Estimated Atrial Scar Percentage.