| Literature DB >> 36033610 |
Lu Tang1, Yu Hui1, Hang Yang1, Yinghong Zhao1, Chuangeng Tian2.
Abstract
Multimodal medical image fusion (MMIF) has been proven to effectively improve the efficiency of disease diagnosis and treatment. However, few works have explored dedicated evaluation methods for MMIF. This paper proposes a novel quality assessment method for MMIF based on the conditional generative adversarial networks. First, with the mean opinion scores (MOS) as the guiding condition, the feature information of the two source images is extracted separately through the dual channel encoder-decoder. The features of different levels in the encoder-decoder are hierarchically input into the self-attention feature block, which is a fusion strategy for self-identifying favorable features. Then, the discriminator is used to improve the fusion objective of the generator. Finally, we calculate the structural similarity index between the fake image and the true image, and the MOS corresponding to the maximum result will be used as the final assessment result of the fused image quality. Based on the established MMIF database, the proposed method achieves the state-of-the-art performance among the comparison methods, with excellent agreement with subjective evaluations, indicating that the method is effective in the quality assessment of medical fusion images.Entities:
Keywords: attention mechanism; conditional; generative adversarial networks; image quality assessment; medical image fusion
Year: 2022 PMID: 36033610 PMCID: PMC9400712 DOI: 10.3389/fnins.2022.986153
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 5.152
FIGURE 1The overall architecture of our proposed method.
FIGURE 2The diagrammatic sketch of SA-FB.
FIGURE 3An example of fused images generated by ten different MMIF algorithms. Algorithms include (A) discrete Tchebichef moments and pulse coupled neural network (DTM-PCNN) (Min et al., 2019), (B) convolutional sparse representation (CSR) (Liu et al., 2016), (C) pulse-coupled neural network with modified spatial frequency based on non-subsampled contourlet transform (PCNN-NSCT-SF) (Das and Kundu, 2012), (D) guided filtering (GFF) (Li et al., 2013), (E) cross-scale coefficient selection (CSCS) (Shen et al., 2013), (F) union Laplacian pyramid with multiple features (LAP-MF) (Du et al., 2016b), (G) Laplacian pyramid and sparse representation (LP-SR) (Liu et al., 2015), (H) parameter-adaptive pulse-coupled neural network (PA-PCNN) (Yin et al., 2019), (I) pulse coupled neural network using the multi-swarm fruit fly optimization algorithm (PCNN-MFOA) (Tang et al., 2019), and (J) reduced pulse-coupled neural network (RPCNN) (Das and Kundu, 2013).
Comparison of quality assessment performance of different models.
| Methods | SRCC | KRCC | PLCC | RMSE |
|
| 0.2545 | 0.3604 | 0.2772 | 0.3804 |
|
| 0.2647 | 0.3608 | 0.2920 | 0.4093 |
|
| 0.2488 | 0.3322 | 0.2444 | 0.2791 |
|
| 0.1801 | 0.2076 | 0.3126 | 0.2872 |
|
| 0.1884 | 0.2400 | 0.2503 | 0.4002 |
|
| 0.0960 | 0.1275 | 0.2235 | 0.2970 |
|
| 0.1093 | 0.1216 | 0.0803 | 0.3007 |
| Proposed |
|
|
|
|
The bold values are the results of our proposed method, which achieves the best performance.
Comparative results of ablation experiments.
| Methods | SRCC | KRCC | PLCC | RMSE |
| Early-FM | 0.7077 | 0.6208 | 0.6779 | 0.2425 |
| Late-FM | 0.7288 | 0.6427 | 0.6833 | 0.2417 |
| Proposed w/o SA | 0.7825 | 0.7113 | 0.7867 | 0.2020 |
| Proposed w SA |
|
|
|
|
The bold values are the results of our proposed method, which achieves the best performance.