| Literature DB >> 34413789 |
Yinzhe Wu1,2, Zeyu Tang1,2, Binghuan Li2, David Firmin1,3, Guang Yang1,3.
Abstract
Segmentation of cardiac fibrosis and scars is essential for clinical diagnosis and can provide invaluable guidance for the treatment of cardiac diseases. Late Gadolinium enhancement (LGE) cardiovascular magnetic resonance (CMR) has been successful in guiding the clinical diagnosis and treatment reliably. For LGE CMR, many methods have demonstrated success in accurately segmenting scarring regions. Co-registration with other non-contrast-agent (non-CA) modalities [e.g., balanced steady-state free precession (bSSFP) cine magnetic resonance imaging (MRI)] can further enhance the efficacy of automated segmentation of cardiac anatomies. Many conventional methods have been proposed to provide automated or semi-automated segmentation of scars. With the development of deep learning in recent years, we can also see more advanced methods that are more efficient in providing more accurate segmentations. This paper conducts a state-of-the-art review of conventional and current state-of-the-art approaches utilizing different modalities for accurate cardiac fibrosis and scar segmentation.Entities:
Keywords: atrial fibrillation; cardiac magnetic resonance; deep learning; late gadolinium enhancement; myocardial infarction; scar segmentation
Year: 2021 PMID: 34413789 PMCID: PMC8369509 DOI: 10.3389/fphys.2021.709230
Source DB: PubMed Journal: Front Physiol ISSN: 1664-042X Impact factor: 4.566
List of challenges in segmentation of LV and LA anatomy and scar in LGE CMR.
| Year | Challenge/Dataset | Conference (MICCAI/IBSI etc.) | Modality (data size n) | Target | Pathology |
| 2012 | LV scar segmentation challenge ( | MICCAI | LGE MRI (30) | LV scar | MI |
| 2013 | LA scar segmentation challenge ( | ISBI | LGE MRI (30) | LA scar | AF |
| 2018 | LA segmentation challenge ( | MICCAI | LGE MRI (150) | LA cavity | AF |
| 2019 | Multi-sequence Cardiac MR Segmentation Challenge (MS-CMR) ( | MICCAI | LGE MRI, T2 MRI, bSSFP MRI (45, coregistered) | LV blood pool, RV blood pool, LV myocardium | MI |
| 2020 | Myocardial pathology segmentation combining multi-sequence CMR (MyoPS) ( | MICCAI | LGE MRI, T2 MRI, bSSFP MRI (45, coregistered) | LV blood pool, RV blood pool, LV normal myocardium, LV myocardial oedema, LV myocardial scar | MI |
FIGURE 1Flowchart to demonstrate the search criterion.
FIGURE 2Examples of LGE CMR images acquired at (A) LA and (B) LV, with the fibrosis/infarction regions highlighted in green. By comparing (A2) and (B2), we can see the fibrosis region in LA is rather more discrete and thinner compared to LV infarction, making LA fibrosis regions more difficult to be accurately fully localized and quantified. Image source: (A) was extracted from pre-ablation CMR images in ISBI 2013 cDERMIS dataset (http://www.cardiacatlas.org/challenges/left-atrium-fibrosis-and-scar-segmentation-challenge/). (B) was extracted from MICCAI 2012 Ventricular Infarct Segmentation challenge dataset (http://www.cardiacatlas.org/challenges/ventricular-infarct -segmentation/).
FIGURE 3Example images using different CMR sequences acquired by (A) LGE CMR (B) T2 CMR (C) bSSFP CMR. As denoted by the green arrows, we can see (A) LGE CMR accentuates the scar tissue by high intensities on the images; (B) T2 CMR accentuates myocardial oedema by high intensities on the image; and (C) bSSFP CMR shows the distinct endo- and epi-cardial boundary of the myocardium clearly on the image. Image source: (A–C) extracted from the MS-CMR open challenge dataset (MS-CMR Challenge, 2019).
Summary of representative conventional methodologies for segmentation of the myocardium on LGE-MRI.
| Reference | Modalities | Methodology description | Pros | Cons | Quantitative result (myocardium) | Dataset |
|
| LGE MRI, cine MRI | (1) Define LV border – non-rigid registration of cine and LGE MRI (2) LV pixel classification – SVM | Automatic segmentation of LGE-MRI with CINE-MRI information | No longitudinal axis (LAX) consideration, resulting in inter-slice misalignment; Need to register with other modality (CINE MRI) | Average contour pixel location error = 1.54 pixel | Private (LV LGE + cine MRI, |
|
| LGE MRI, cine MRI | 2D segmentation with a geometrical template (LGE only) and 3D mesh alignment (LGE + CINE) | Overcome non-homogeneous intensity of the myocardium in LGE infarcted regions | Meshes focus only on features in the SAX slices, no inter-slice consideration and thus inter-slice misalignment; Need to register with other modality (CINE MRI) | ASD = 2.2 mm (endocardial), 2.0 mm (epicardial) | Private (LV LGE + cine MRI, |
|
| LGE MRI, cine MRI | (1) Affine transformation estimation (2) non-rigid registration of LGE and cine MRI (3) myocardial contour generation by simplex mesh geometry | Utilize information better in connecting cine and LGE MRI | No LAX consideration, resulting in inter-slice misalignment; Need to register with other modality (CINE MRI) | Mean Dice = 0.8249; ASD = 0.97 pixel (endocardial), 0.93 pixel (epicardial) | Private (LV LGE + cine MRI, |
|
| LGE MRI, cine MRI | Translational registration of LGE and cine MRI data; 3D non-rigid deformation of the myocardial meshes by both short axis (SAX) and longitudinal axis (LAX) data | Consistent and robust segmentation; Consider both SAX and LAX data to reduce interslice misalignment | Need to register with other modality (CINE MRI) | Mean Dice = 0.9409; ASD = 0.67 mm (endocardial), 0.69 mm (epicardial) | Private (LV LGE + cine MRI, |
|
| LGE MRI | Slice-by-slice graph cuts (GC) with interslice and shape constraints | Impose morphological constraints that are common across MRI sequences – no need for subject-specific tuning or for user initialization and generalizable for other sequences (CINE-MRI); Achieve robustness to variations in grey-level appearance and to image inhomogeneities – more robust to the presence of abnormalities; Consider interslice interactions; No need to register with other modality (e.g., bSSFP cine MRI) | Give poorer result when generalized to CINE-MRI (due to many artefacts in the dataset tested) | Mean Dice = 0.81; ASD = 1.83 mm (endocardial), 2.38 mm (epicardial) | Private (LV LGE MRI, |
|
| LGE MRI | (1) LV localization – image registration (2) short axis estimation – principal component analysis (PCA) (3) endocardial refinement – a minimal cost path search (MCP) in polar space using the edge and scar information (4) epicardial refinement - by shape and inter-slice smoothness constraints (5) surface extraction – 3D mesh generation by marching cube algorithm ( | Fast speed and low computational workload by using simple texture features; Consider image data along the longitudinal axis in addition to the short axis, improving inter-slice smoothness and avoid inter-slice shift; No need to register with other modality (e.g., bSSFP cine MRI) | Poor performance in apex and LV outflow tract, poor accuracy in basal regions; Since this method is texture based, the distribution of scar and the small size of the atrium adversely affect its performance | Mean Dice = 0.92; ASD = 1.35 mm | Private (LV LGE MRI, |
|
| LGE MRI | (1) LV detection – circular Hough transforms (2) LV blood pool detection – morphological active contours approach without edges (MACWE) (3) endocardial boundary extraction – a minimal cost path search (MCP) in polar space using the edge and scar information (4) epicardial boundary extraction – by edge information while considering endocardial contour extracted | Fast speed and low computational workload by using simple texture features; No need to register with other modality (e.g., bSSFP cine MRI) | Poor performance in apex and LV outflow tract, poor accuracy in basal regions; Since this method is texture based, distribution of scar adversely affect its performance | Mean Dice = 0.85 (endocardial), 0.84 (epicardial); ASD = 2.54 mm (endocardial), 3.32 mm (epicardial) | Private (LV LGE MRI, |
|
| LGE MRI | (1) LV detection – circular Hough transforms, Otsu thresholding and circularity measures (2) ROI detection – morphological active contours approach without edges (MACWE) (3) endocardial boundary extraction – random forest classifier (4) epicardial boundary extraction – minimal cost path search to the boundary cost array in polar space | Fast speed and low computational workload by using simple texture features; No need to register with other modality (e.g., bSSFP cine MRI) | Poor performance in apex and LV outflow tract, resulting in poor accuracy in basal regions and poor ASD result | Mean Dice = 0.83 (endocardial), 0.83 (epicardial); ASD = 3.55 mm (endocardial), 4.12 mm (epicardial) | Private (LV LGE MRI, |
Summary of representative conventional methodologies for segmentation of cardiac scar and fibrosis regions on LGE-MRI.
| Type of method | Reference | Method Description | Pros | Cons | Quantitative result (scar/fibrosis) | Dataset |
| (A) Thresholding |
| Histogram analysis with constrained watershed segmentation | Automatic threshold determination; No training (supervision) needed; | Based on fixed models – mismatches occur for some cases | * | Private (LGE MRI, |
|
| Otsu thresholding ( | Automatic threshold determination; No training (supervision) needed; No specific density model assumed – no overfitting; Region growing technique can be useful for small MI | Connectivity filtering and region growing may not be suitable for discrete LA fibrosis regions | Mean Dice = 0.83 | Private (LV LGE MRI, | |
| Cates et al. (2013) (part of | Histogram analysis and simple thresholding | Simple and accurate processing | Time consuming (require manual work); Manual variance may be significant for the thin LA wall | Median Dice = 0.42 (pre-ablation); Median Dice = 0.78 (post-ablation) | ISBI cDERMIS 2013 ( | |
| Bai et al. (2013) (part of | Hysteresis thresholding ( | Coherent segmentation (adjacent faint scar sections can still be segmented) | Fixed parameterized model relying on empirical data | Median Dice = 0.37 (pre-ablation); Median Dice = 0.76 (post-ablation) | ISBI cDERMIS 2013 ( | |
| (B) Classification | Perry et al. (2013) (part of | K-means clustering | Relatively higher performance in pre-ablation fibrosis segmentation result benchmarking; No training (supervision) needed | Cluster number to be determined beforehand; Variance in LA scar segmented | Median Dice = 0.45 (pre-ablation); Median Dice = 0.72 (post-ablation) | ISBI cDERMIS 2013 ( |
| Markov random fields (MRF) model with graph-cuts | Relatively higher performance in pre-ablation fibrosis result benchmarking; | Require necessary post-processing steps to refine clustering | Median Dice = 0.30 (pre-ablation); Median Dice = 0.78 (post-ablation) | ISBI cDERMIS 2013 ( | ||
| Gao et al. (2013) (part of | Active contour with expectation-maximization (EM)-fitting | Counteract region leaking problem in region growing techniques | Fixed number of Gaussian mixtures in model | Median Dice = 0.42 (pre-ablation); Median Dice = 0.78 (post-ablation) | ISBI cDERMIS 2013 ( | |
|
| Graph cuts | Does not requires manual outlining of base-line healthy myocardium | Require additional modality (bSSFP) | * | Private (LA LGE + bSSFP MRI, | |
|
| Simple linear iterative Clustering (SLIC) + support vector machine | Fully automatic scar segmentation; Able to complement minor flaws in manual annotation | Require collection of b-SSFP modality; Supervised learning – need paired manual labels for training | Mean Dice = 0.79 | Private [LA LGE + bSSFP MRI, | |
|
| Fractal Analysis and Random Forest Classification | Utilize texture information in addition to clustering | Require accurate segmentation of the myocardium | Mean Dice = 0.66 | Private (LV LGE MRI, |
Summary of representative deep learning based methodologies for segmentation of the myocardium on LGE-MRI.
| Reference | Model backbone | Method description | Pros/cons | Quantitative result (myocardium) | Dataset | ||
|
| U-Net | Standard U-Net | Fast processing; deep latent network | Mean Dice = 0.8661 | Private (LV LGE MRI, | ||
|
| U-Net | U-Net with bidirectional convolutional LSTM | Process spatial sequential information | Mean Dice = 0.906 | LASC’18 ( | ||
|
| U-Net | U-Net with multiview sequential learning via convolutional LSTM and dilated residual learning | Process spatial sequential information on all 3 spatial axes | Mean Dice = 0.897 | Private (LA LGE MRI, | ||
|
| FCNN | Dual-path FCNN concerning both local and global view | Mitigate class imbalance; Less input image size – save GPU memory | Dice = 0.942 | Benchmarking (Dice) | Private [LA LGE MRI, | |
| U-Net ( | 0.642 | ||||||
| Dilated U-Net ( | 0.687 | ||||||
| VGGNet ( | 0.684 | ||||||
| Inception ( | 0.792 | ||||||
| ResNet ( | 0.804 | ||||||
| DCN-8 ( | 0.558 | ||||||
| DeconvNet ( | 0.500 | ||||||
| SegNet ( | 0.656 | ||||||
| V-Net ( | 0.696 | ||||||
| DeepOrgan ( | 0.632 | ||||||
|
| 0.821 | ||||||
|
| FCNN | 3D FCNN with 3D view fusion | Process spatial information on all 3 spatial axes volumetrically; Greater amount of GPU memory occupied | Dice = 0.912 | LASC’18 ( | ||
|
| Double-sided FCNN | Semi-supervised learning – discriminative feature learning via double-sided domain adaptation | Achieve a fusion of the feature spaces of labeled data and unlabeled data to achieve semi-supervision | Mean Dice = 0.9078 | Private (LA LGE MRI, two-center, n1 = 175, n2 = 94) | ||
Summary of representative deep learning based methodologies for segmentation of cardiac scar and fibrosis regions on LGE-MRI.
| LA/LV | Reference | Model backbone | Model description | Pros/Cons | Quantitative results (scar/fibrosis) | Dataset |
| (A) LA |
| Auto Encoder | Stacked Sparse Auto-Encoders | Significantly higher accuracy; Misenhancement in valves, etc. can cause false positive; Hyper-parameter sensitive | Mean Dice = 0.82 | Private [LA LGE MRI, |
|
| CNN | Graph-cuts framework based on multi-scale CNN | Multi-scale consideration enables both local and global feature extraction; Surface projection mitigate difficulty in accurate LA wall delineation; Require collection of b-SSFP | Mean Dice = 0.898 | Private [LA + bSSFP, LGE MRI, | |
| (B) LV |
| E-Net | E-Net on manually segmented myocardium region only | Significantly higher accuracy; Require manual intervention in myocardium segmentation | Dice = 0.86 | Private (LV LGE MRI, |
|
| FCNN | FCNN on manually segmented myocardium region only | Significantly higher accuracy; Require manual intervention in myocardium segmentation | Median Dice = 0.7125 | Private (LV LGE MRI, | |
|
| U-Net | Cascaded multi-view U-Net via majority vote multi-view fusion | Consider sequential spatial information on all three axes | Median Dice = 0.8861 | Private (LV LGE MRI, |
Summary of representative end-to-end deep learning based methodologies for segmentation of cardiac scar and fibrosis regions on LGE-MRI.
| LA/LV | Reference | Model backbone | Model description | Pros/Cons | Quantitative results (scar/fibrosis) | Dataset |
| (A) LA | ResNet | Multi-view based dilated attention and residual network with sequential learning via convolutional LSTM | Spatial sequential information processing; Attention network to tackle class imbalance | Mean Dice = 0.8258 | Private [LGE MRI, | |
|
| GAN | Adaptive attention cascade network for simultaneous estimation of unbalanced targets + joint discriminative network for adversarial regularization | Inter-cascade adversarial learning paradigm to tackle class imbalance and regularize the output | Mean Dice = 0.946 | Private [LGE MRI, | |
| (B) LV |
| E-Net | E-Net | Relatively low accuracy; Unable to tackle class imbalance well | Dice = 0.55 | Private (LV LGE MRI, |
|
| FCNN | FCNN | Relatively low accuracy; Unable to tackle class imbalance well | Median Dice = 0.5400 | Private (LV LGE MRI, | |
|
| CNN | Volume patch based 3D CNN | utilize small volume patches for accurate local view inspection | Mean Dice = 0.9363 | Private (LV LGE MRI, | |
|
| U-Net | U-Net based 3D CNN | Sub-volume design utilizes small volume patches for accurate local view inspection | Mean Dice = 0.54 | Private (LV LGE MRI, multi-vendor |
Result of a private benchmarking (Chen et al., 2021) of different algorithms on the LASC’18 dataset, reported in their mean ± SD.
| LA and PVs | LA scar | |||
| Dice Scores | ASD (mm) | Dice Scores | ASD (mm) | |
| 2D U-Net | 0.898 ± 0.034 | 3.38 ± 4.53 | 0.526 ± 0.118 | 1.83 ± 0.891 |
| 3D U-Net | 0.895 ± 0.032 | 3.81 ± 3.89 | 0.508 ± 0.106 | 1.90 ± 0.837 |
| MVTT ( | 0.902 ± 0.037 | 2.25 ± 1.39 | 0.613 ± 0.131 | 1.39 ± 1.03 |
| JAS-GAN ( | 0.913 ± 0.027 | 2.24 ± 2.73 | 0.621 ± 0.110 | 1.24 ± 1.04 |
Summary of representative machine learning/deep learning based scar segmentation in cine MRI for segmentation of cardiac scar regions on cine bSSFP MRI.
| Reference | Method description | Pros/Cons | Private Benchmarking Accuracy (%) ( | Dataset |
|
| (1) priori coarse tissue mask generation GAN, (2) condition LGE-equivalent image synthesis GAN, (3) fine segmentation GAN | Segment more than just LV scar – LV blood pool, myocardium and scar regions; Further improve temporal-spatial learning by a two-stream structure that includes a spatial perceptual pathway, a temporal perceptual pathway, and a multi-attention weighing unit. | 97.13 | Private [SAX cine bSSFP MRI, |
|
| (1) LV localization – ROI detection by CNN (2) Motion feature extraction (2.1) global motion feature – dense motion flow estimation (2.2) local motion feature – LSTM-RNN (3) infarction discrimination – FCNN | Combine both LSTM-RNN based local motion analysis and dense motion flow estimation based global motion analysis | 95.03 | |
|
| GAN (A) Generator: (A1) LV morphology and kinematic abnormalities – spatio-temporal feature extraction network through 3D successive convolution (A2) complementarity between segmentation and quantification - joint feature learning network for multitask learning; (B) Discriminator: (B1) intrinsic pattern between tasks – uses task relatedness network for adversarial learning | Introduce adversarial learning and task relatedness to reduce divergence | 96.77 | |
|
| (1) Heart localization – FAST R-CNN ( | Combine both ROI based local motion analysis and deep optical flow based global motion analysis | 94.93 | |
|
| Simple Linear Iterative Clustering (SLIC) based supervoxels ( | Only radial strain analyzed, excluding longitudinal and circumferential strains; K-means clustering used requires an empirical definition of the number of clusters | 86.47 | |
|
| Neighborhood approximation forests | Consider myocardial thickness and its temporal variations | 84.39 |