| Literature DB >> 34943615 |
Yin Dai1,2, Yumeng Song1,2, Weibin Liu1,2, Wenhe Bai1,2, Yifan Gao1,2,3, Xinyang Dong4, Wenbo Lv1,2.
Abstract
Parkinson's disease (PD) is a common neurodegenerative disease that has a significant impact on people's lives. Early diagnosis is imperative since proper treatment stops the disease's progression. With the rapid development of CAD techniques, there have been numerous applications of computer-aided diagnostic (CAD) techniques in the diagnosis of PD. In recent years, image fusion has been applied in various fields and is valuable in medical diagnosis. This paper mainly adopts a multi-focus image fusion method primarily based on deep convolutional neural networks to fuse magnetic resonance images (MRI) and positron emission tomography (PET) neural photographs into multi-modal images. Additionally, the study selected Alexnet, Densenet, ResNeSt, and Efficientnet neural networks to classify the single-modal MRI dataset and the multi-modal dataset. The test accuracy rates of the single-modal MRI dataset are 83.31%, 87.76%, 86.37%, and 86.44% on the Alexnet, Densenet, ResNeSt, and Efficientnet, respectively. Moreover, the test accuracy rates of the multi-modal fusion dataset on the Alexnet, Densenet, ResNeSt, and Efficientnet are 90.52%, 97.19%, 94.15%, and 93.39%. As per all four networks discussed above, it can be concluded that the test results for the multi-modal dataset are better than those for the single-modal MRI dataset. The experimental results showed that the multi-focus image fusion method according to deep learning can enhance the accuracy of PD image classification.Entities:
Keywords: Parkinson’s disease (PD); deep learning; multi-focus image fusion
Year: 2021 PMID: 34943615 PMCID: PMC8700359 DOI: 10.3390/diagnostics11122379
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Experimental process.
Figure 2(a) Magnetic resonance image (MRI) of patients. (b) MRI of normal people. (c) Positron emission tomography (PET) image of patients. (d) PET image of normal people.
Figure 3(a) Before image enhancement. (b) After image enhancement. (c) Region of interest.
Figure 4Convolutional neural network (CNN) model [27].
Figure 5Process of image fusion.
Figure 6Network structure of Densenet [35].
Figure 7Network structure of Efficientnet [37].
Figure 8Network structure of ResNeSt [38].
Data classification.
| True Patient | True Normal | |
|---|---|---|
| Predict Patient | TP | FP |
| Predict Normal | FN | TN |
Confusion matrix.
| True Patient | True Normal | |
|---|---|---|
| Predict Patient |
|
|
| Predict Normal |
|
|
Figure 9(a) Original MRI image. (b) Original PET image.
Figure 10(a) Focus detection. (b) Initial segmentation. (c) Initial decision map. (d) Final decision map.
Figure 11Image fusion. (a) The product of MRI and final decision map. (b) The product of PET and complementary set of final decision map. (c) Final fusion image.
Figure 12(a) Fusion results of Laplacian pyramid (LP). (b) Fusion results of ratio of low-pass pyramid (RP). (c) Fusion results of curvelet transform (CVT). (d) Fusion results of nonsubsampled contourlet transform (NSCT).
The results of image fusion.
| SSIM | SF | MI | STD | CC | |
|---|---|---|---|---|---|
| LP | 0.8176 | 4.14 | 5.466 | 38.81 | 0.5825 |
| RP | 0.7883 | 4.15 | 5.431 | 37.90 | 0.6039 |
| CVT | 0.7860 | 3.96 | 5.458 | 35.64 | 0.6411 |
| NSCT | 0.7939 | 4.04 | 5.468 | 35.94 | 0.6360 |
| This paper | 0.8189 | 4.26 | 6.338 | 63.27 | 0.6350 |
Experimental results.
| CNN | Dataset | Accuracy | Recall | Precision | Specificity | |
|---|---|---|---|---|---|---|
| Alexnet | Single-modal | 83.31% | 81.87% | 81.36% | 84.95% | 81.58% |
| Multi-modal | 90.52% | 83.74% | 94.79% | 87.65% | 88.90% | |
| Efficientnet | Single-modal | 86.44% | 84.36% | 85.46% | 87.19% | 84.88% |
| Multi-modal | 93.39% | 94.43% | 91.36% | 95.45% | 92.79% | |
| ResNest | Single-modal | 86.37% | 84.60% | 85.26% | 87.29% | 84.83% |
| Multi-modal | 94.15% | 94.36% | 93.07% | 95.25% | 93.63% | |
| Densenet | Single-modal | 87.76% | 86.45% | 86.49% | 88.86% | 86.78% |
| Multi-modal | 97.19% | 97.09% | 96.79% | 97.59% | 96.91% |
Figure 13The receiver operating characteristic curve (ROC) curve of the single-modal data set. (a) Alexnet (b) Densenet (c) ResNeSt (d) Efficientnet.
Figure 14The confusion matrix of the single-mode MRI data set. (a) Alexnet (b) Densenet (c) ResNeSt (d) Efficientnet.
Figure 15The ROC curve of the multi-modal data set. (a) Alexnet (b) Densenet (c) ResNeSt (d) Efficientnet.
Figure 16The confusion matrix of the multi-modal data set. (a) Alexnet (b) Densenet (c) ResNeSt (d) Efficientnet.