| Literature DB >> 35877637 |
Adi Wibowo1, Pandji Triadyaksa2, Aris Sugiharto1, Eko Adi Sarwoko1, Fajar Agung Nugroho1, Hideo Arai3, Masateru Kawakubo4.
Abstract
Cardiac cine magnetic resonance imaging (MRI) is a widely used technique for the noninvasive assessment of cardiac functions. Deep neural networks have achieved considerable progress in overcoming various challenges in cine MRI analysis. However, deep learning models cannot be used for classification because limited cine MRI data are available. To overcome this problem, features from cine image settings are derived by handcrafting and addition of other clinical features to the classical machine learning approach for ensuring the model fits the MRI device settings and image parameters required in the analysis. In this study, a novel method was proposed for classifying heart disease (cardiomyopathy patient groups) using only segmented output maps. In the encoder-decoder network, the fully convolutional EfficientNetB5-UNet was modified to perform the semantic segmentation of the MRI image slice. A two-dimensional thickness algorithm was used to combine the segmentation outputs for the 2D representation of images of the end-diastole (ED) and end-systole (ES) cardiac volumes. The thickness images were subsequently used for classification by using a few-shot model with an adaptive subspace classifier. Model performance was verified by applying the model to the 2017 MICCAI Medical Image Computing and Computer-Assisted Intervention dataset. High segmentation performance was achieved as follows: the average Dice coefficients of segmentation were 96.24% (ED) and 89.92% (ES) for the left ventricle (LV); the values for the right ventricle (RV) were 92.90% (ED) and 86.92% (ES). The values for myocardium were 88.90% (ED) and 90.48% (ES). An accuracy score of 92% was achieved in the classification of various cardiomyopathy groups without clinical features. A novel rapid analysis approach was proposed for heart disease diagnosis, especially for cardiomyopathy conditions using cine MRI based on segmented output maps.Entities:
Keywords: adaptive subspace classification; cardiac MRI; deep learning; few-shot learning; segmentation
Year: 2022 PMID: 35877637 PMCID: PMC9318676 DOI: 10.3390/jimaging8070194
Source DB: PubMed Journal: J Imaging ISSN: 2313-433X
Figure 1Proposed architecture consisting of 2D segmentation (top) and few-shot classification (bottom).
Figure 2Encoder–decoder UNet suitable for any encoder.
Figure 3Architecture detail on MobileNetV3 and EfficientNet B0-B5 as the encoder.
Segmentation on the validation set with the Dice coefficient.
| Segmentation Model | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Average | Standard |
|---|---|---|---|---|---|---|---|
| MobileNetV3-UNet | 0.891 | 0.880 | 0.889 | 0.896 | 0.890 | 0.889 | 0.0052 |
| EfficientNetB0-UNet | 0.884 | 0.879 | 0.885 | 0.888 | 0.894 | 0.886 | 0.0049 |
| EfficientNetB1-UNet | 0.897 | 0.881 | 0.888 | 0.890 | 0.893 | 0.890 | 0.0053 |
| EfficientNetB2-UNet | 0.890 | 0.888 | 0.883 | 0.887 | 0.890 | 0.888 | 0.0026 |
| EfficientNetB3-UNet | 0.898 | 0.888 | 0.894 | 0.891 | 0.893 | 0.893 | 0.0033 |
| EfficientNetB4-UNet | 0.889 | 0.889 | 0.888 | 0.893 | 0.890 | 0.890 | 0.0015 |
| EfficientNetB5-UNet | 0.896 | 0.891 | 0.893 | 0.894 | 0.889 | 0.893 | 0.0024 |
Segmentation on test leaderboard with the Dice coefficient.
| State-of-the-Art Methods | LV | RV | MYO | |||
|---|---|---|---|---|---|---|
| ED | ES | ED | ES | ED | ES | |
| Fabian Isensee [ | 0.967 | 0.935 | 0.951 | 0.904 | 0.904 | 0.923 |
| Fumin Guo [ | 0.968 | 0.935 | 0.955 | 0.894 | 0.906 | 0.923 |
| Georgios Simantris [ | 0.967 | 0.928 | 0.936 | 0.889 | 0.891 | 0.904 |
| Mahendra Khened [ | 0.964 | 0.917 | 0.935 | 0.879 | 0.889 | 0.898 |
| Ensemble B0-B5 (proposed) | 0.963 | 0.907 | 0.919 | 0.865 | 0.889 | 0.905 |
| Ensemble V3B5 (proposed) | 0.962 | 0.899 | 0.929 | 0.869 | 0.889 | 0.905 |
| EfficientNetB5-UNet (proposed) | 0.963 | 0.904 | 0.929 | 0.856 | 0.887 | 0.903 |
| MobileNetV3-UNet (proposed) | 0.960 | 0.887 | 0.885 | 0.858 | 0.885 | 0.898 |
Figure 4Segmentation output maps (a–f) of short-axis slices 1–6 of each ED (top) and ES (bottom) frames processed by the 2D thickness algorithm to produce 2D images (g) for few-shot model input.
Accuracy results of application of various encoder few-shot models to the validation set.
| Encoder | Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Average | Standard |
|---|---|---|---|---|---|---|---|
| Conv4 | 0.6912 | 0.6692 | 0.6916 | 0.6568 | 0.6620 | 0.6742 | 0.0146 |
| MobileNetV3 without pre-trained | 0.4064 | 0.4092 | 0.4944 | 0.4988 | 0.5568 | 0.4731 | 0.0577 |
| MobileNetV3 pre-trained | 0.6304 | 0.5951 | 0.6140 | 0.6375 | 0.7156 | 0.6385 | 0.0412 |
| EfficientNetB1 without pre-trained | 0.5036 | 0.3620 | 0.5532 | 0.4474 | 0.6304 | 0.4993 | 0.0913 |
| EfficientNetB1 pre-trained | 0.6375 | 0.6916 | 0.6916 | 0.7220 | 0.7320 | 0.6949 | 0.0329 |
Accuracy results of applying dropout and augmentation on the validation set.
| Encoder (Pre-Trained) | Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Average | Standard |
|---|---|---|---|---|---|---|---|
| EfficientNetB1 | 0.6375 | 0.6916 | 0.6916 | 0.7220 | 0.7320 | 0.6949 | 0.0329 |
| EfficientNetB1—dropout | 0.5836 | 0.6952 | 0.6468 | 0.5848 | 0.6712 | 0.6363 | 0.0452 |
| EfficientNetB1—augment | 0.7368 | 0.7735 | 0.6808 | 0.7423 | 0.6948 | 0.7256 | 0.0363 |
| EfficientNetB1—dropout-augment | 0.8083 | 0.7622 | 0.7710 | 0.7900 | 0.7801 | 0.7823 | 0.0159 |
Accuracy results of EfficientNetB1 (pre-trained, dropout, augment) with various shots on the validation set.
| Number of Shots | Experiment 1 | Experiment 2 | Experiment 3 | Experiment 4 | Experiment 5 | Average | Standard |
|---|---|---|---|---|---|---|---|
| 1-shot | 0.6360 | 0.5460 | 0.5620 | 0.5639 | 0.5460 | 0.5708 | 0.0348 |
| 2-shot | 0.7230 | 0.6780 | 0.7250 | 0.7200 | 0.6919 | 0.7076 | 0.0191 |
| 3-shot | 0.7007 | 0.7507 | 0.7120 | 0.7220 | 0.7080 | 0.7187 | 0.0174 |
| 4-shot | 0.7880 | 0.7150 | 0.7180 | 0.7410 | 0.7490 | 0.7422 | 0.0263 |
| 5-shot | 0.8083 | 0.7622 | 0.7710 | 0.7900 | 0.7801 | 0.7823 | 0.0159 |
Figure 5Examples of 2D thickness images for (a) 1-slice, (b) 2-slice, (c) 3-slice, (d) 4-slice, (e) 5-slice, and (f) 6-slice.
Accuracy results of EfficientNetB1 (pre-trained, dropout, augmented) with various short-axis slices on the validation set.
| Number of Slice(s) | Accuracy Score |
|---|---|
| 1 | 0.6928 |
| 2 | 0.7184 |
| 3 | 0.7272 |
| 4 | 0.7548 |
| 5 | 0.7632 |
| 6 | 0.8083 |
Accuracy results of five ensemble few-shot classifications on the test leaderboard.
| Method | Score on Leaderboard |
|---|---|
| Proposed Experiment 1 | 78 |
| Proposed Experiment 2 | 80 |
| Proposed Experiment 3 | 84 |
| Proposed Experiment 4 | 86 |
| Proposed Experiment 5 | 86 |
| Proposed Ensemble | 92 |
| [ | 100 |
| [ | 92 |
| [ | 92 |
| [ | 86 |
Strengths and weaknesses of the proposed method compared to previous methods.
| Method | Stage | Strengths | Weaknesses |
|---|---|---|---|
| Proposed | Segmentation | Segmentation becomes lighter with UNet-EfficientNetB5 for data per slice. | The segmentation performance has not outperformed the previous method. |
| Classification | Only considers slices and does not depend on the parameter settings of the tool and the number of slices obtained. Can be compared between the number of slices. | This approach is only suitable for morphological problems. Uncertainty is high because depending on the training in each episode, this is handled by the ensemble. | |
| Mahendra Khened [ | Segmentation | Segmentation using DenseNet which is suitable for limited data. | Segmentation using 2D UNet with dense block doesn’t outperform the ensemble of 2D and 3D DMR-UNet. |
| Classification | Classification becomes faster with Random Forest. | The classification most exclusively focus on end-diastole and end-systole features. | |
| Fabian Isensee [ | Segmentation | Segmentation by combining 2D and 3D UNets slightly improved. | The 3D UNet has large slice gap on the input images, it causes pooling and upscaling operations are carried out only in the short-axis plane. Moreover, the 3D network involves a smaller number of feature maps. |
| Classification | Perform ensemble classification by combining MLP and Random Forest. | The ensemble method does not outperform single Random Forest. | |
| Irem Cetin [ | Segmentation | The training data was manually segmented to produce accurate results. | They computed large number of computations manually. This method tends to overfitting. To prevent from overfitting, they selected the most discriminative features and used SVM for classification. |
| Classification | Classification using Support Vector Machines suitable for limited data. | The classification method does not outperform. | |
| Jelmer M Wolterink [ | Segmentation | The network was designed to contain a number of convolutional layers with increasing levels of dilatation to produce high resolution feature maps. | Convolutional neural network does not exhibit an encoder–decoder architecture. |
| Classification | Classification becomes faster with Random Forest. | Classification methods most exclusively focus on end-diastole and end-systole features. It does not outperform other Random Forest. | |
| Fumin Guo [ | Segmentation | Segmentation by combining UNet and Continuos Max-Flow. | Only for left ventricle. Other methods function for right ventricle and myocardium. |
| Classification | N/A | N/A | |
| Georgios Simantris [ | Segmentation | Networks trains quickly and efficiently without overfitting. | Does not outperform to the state of art featured in the ACDC. |
| Classification | N/A | N/A |
Figure 6Confusion matrix of ensemble few-shot model on test leaderboard.
Figure 7Comparison of results for all cardiac conditions (a) DCM, (b) HCM, (c) MINF, (d) normal, and (e) ARV, with H: height, W: weight, S: slices.