| Literature DB >> 36091529 |
Muhammad Ali Shoaib1,2, Khin Wee Lai3, Joon Huang Chuah1, Yan Chai Hum4, Raza Ali1,2, Samiappan Dhanalakshmi5, Huanhuan Wang6, Xiang Wu6.
Abstract
One of the primary factors contributing to death across all age groups is cardiovascular disease. In the analysis of heart function, analyzing the left ventricle (LV) from 2D echocardiographic images is a common medical procedure for heart patients. Consistent and accurate segmentation of the LV exerts significant impact on the understanding of the normal anatomy of the heart, as well as the ability to distinguish the aberrant or diseased structure of the heart. Therefore, LV segmentation is an important and critical task in medical practice, and automated LV segmentation is a pressing need. The deep learning models have been utilized in research for automatic LV segmentation. In this work, three cutting-edge convolutional neural network architectures (SegNet, Fully Convolutional Network, and Mask R-CNN) are designed and implemented to segment the LV. In addition, an echocardiography image dataset is generated, and the amount of training data is gradually increased to measure segmentation performance using evaluation metrics. The pixel's accuracy, precision, recall, specificity, Jaccard index, and dice similarity coefficients are applied to evaluate the three models. The Mask R-CNN model outperformed the other two models in these evaluation metrics. As a result, the Mask R-CNN model is used in this study to examine the effect of training data. For 4,000 images, the network achieved 92.21% DSC value, 85.55% Jaccard index, 98.76% mean accuracy, 96.81% recall, 93.15% precision, and 96.58% specificity value. Relatively, the Mask R-CNN outperformed other architectures, and the performance achieves stability when the model is trained using more than 4,000 training images.Entities:
Keywords: Convolutional Neural Network (CNN); deep learning; echocardiography; image processing; left ventricle (LV); segmentation
Mesh:
Year: 2022 PMID: 36091529 PMCID: PMC9453312 DOI: 10.3389/fpubh.2022.981019
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1FCN architecture.
Figure 2SegNet architecture.
Figure 3Mask R-CNN architecture.
Minimum, maximum, and mean values with a standard deviation of evaluation metrics for three models.
|
|
|
|
|
|
|
| ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| SegNet | 0.6492 | 0.8884 | 0.7651 ± 0.0401 | 0.4806 | 0.7992 | 0.6195 ± 0.0453 | 0.7524 | 0.9286 | 0.8455 ± 0.025 | 0.3261 | 0.7981 | 0.7486 ± 0.301 | 0.4813 | 0.8351 | 0.6519 ± 0.055 | 0.5824 | 0.8621 | 0.6891 ± 0.045 |
| FCN | 0.7106 | 0.9279 | 0.8386 ± 0.0342 | 0.5511 | 0.8654 | 0.7221 ± 0.0412 | 0.8627 | 0.9705 | 0.9193 ± 0.019 | 0.4568 | 0.9872 | 0.9649 ± 0.174 | 0.6013 | 0.8041 | 0.7238 ± 0.043 | 0.6871 | 0.8877 | 0.7891 ± 0.044 |
| Mask | 0.7567 | 0.9958 | 0.8831 ± 0.0356 | 0.6086 | 0.9916 | 0.7907 ± 0.0401 | 0.8903 | 0.9981 | 0.9457 ± 0.018 | 0.509 | 0.9910 | 0.9681 ± 0.216 | 0.6321 | 0.8182 | 0.7937 ± 0.031 | 0.7056 | 0.9021 | 0.8157 ± 0.033 |
Figure 4The top row represents the original Ultrasound image and ground truth binary mask. The bottom row shows the segmented results of the three architectures.
Figure 5Ground truth binary mask, segmented LV, and corresponding segmented binary masks (Using 2,000 images).
Figure 6Ground truth binary mask, segmented LV, and corresponding segmented binary masks (Using 3,000 images).
Figure 7Ground truth binary mask, segmented LV, and corresponding segmented binary masks (using 4,000 images).
Mean with standard deviation values of evaluation metrics using different training data size.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| 1,000 | 0.8831 ± 0.0356 | 0.7907 ± 0.0401 | 0.9457 ± 0.018 | 0.9681 ± 0.216 | 0.7937 ± 0.055 | 0.8057 ± 0.033 |
| 2,000 | 0.8945 ± 0.0365 | 0.8091 ± 0.0372 | 0.9581 ± 0.018 | 0.9712 ± 0.201 | 0.8398 ± 0.052 | 0.8462 ± 0.057 |
| 3,000 | 0.9071 ± 0.0281 | 0.8299 ± 0.0313 | 0.9703 ± 0.016 | 0.9809 ± 0.170 | 0.8730 ± 0.056 | 0.899 ± 0.041 |
| 4,000 | 0.9221 ± 0.0237 | 0.8555 ± 0.0294 | 0.9876 ± 0.015 | 0.9902 ± 0.165 | 0.9315 ± 0.049 | 0.9658 ± 0.040 |
| 5,000 | 0.9228 ± 0.0233 | 0.8566 ± 0.2899 | 0.9881 ± 0.015 | 0.9903 ± 0.163 | 0.9317 ± 0.050 | 0.9660 ± 0.042 |
Figure 8Average values of evaluation metrics for different data size used for training the Mask R-CNN model.