| Literature DB >> 26309389 |
Yu Cao1, Shawn Steffey1, Jianbiao He2, Degui Xiao3, Cui Tao4, Ping Chen5, Henning Müller6.
Abstract
Medical imaging is becoming a vital component of war on cancer. Tremendous amounts of medical image data are captured and recorded in a digital format during cancer care and cancer research. Facing such an unprecedented volume of image data with heterogeneous image modalities, it is necessary to develop effective and efficient content-based medical image retrieval systems for cancer clinical practice and research. While substantial progress has been made in different areas of content-based image retrieval (CBIR) research, direct applications of existing CBIR techniques to the medical images produced unsatisfactory results, because of the unique characteristics of medical images. In this paper, we develop a new multimodal medical image retrieval approach based on the recent advances in the statistical graphic model and deep learning. Specifically, we first investigate a new extended probabilistic Latent Semantic Analysis model to integrate the visual and textual information from medical images to bridge the semantic gap. We then develop a new deep Boltzmann machine-based multimodal learning model to learn the joint density model from multimodal information in order to derive the missing modality. Experimental results with large volume of real-world medical images have shown that our new approach is a promising solution for the next-generation medical imaging indexing and retrieval system.Entities:
Keywords: content-based image retrieval; deep boltzmann machine; deep learning; extended probabilistic latent semantic analysis; multi-modal and content-based medical image retrieval
Year: 2015 PMID: 26309389 PMCID: PMC4533857 DOI: 10.4137/CIN.S14053
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Overview architecture of the proposed model in Step 1.
Figure 2Overview architecture of the proposed model based on DBM.
Figure 3Sample queries (textual; keywords and the query images) from data set 1 (ImageCLEF 2009 medical retrieval challenge).
Figure 4Sample queries (textual; keywords and the query images) from data set 2 (ImageCLEF 2013 medical retrieval challenge).
Results of the proposed approach for multimodal retrieval using the two data sets.
| rel_ret | map | gm_map | Rprec | Bpref | recip_rank |
|---|---|---|---|---|---|
| 1902 | 0.2909 | 0.2019 | 0.3101 | 0.3206 | 0.6421 |
| 0.5620 | 0.5510 | 0.5309 | 0.5270 | 0.4647 | 0.3281 |
Performance comparisons between the proposed approach and the image retrieval techniques with single modality.
| Techniques | proposed multimodal approach | Algorithm A (only using visual modality) | Algorithm B (only using textual modality) |
|---|---|---|---|
| 0.2909 | 0.0101 | 0.2013 |
Performance comparisons between the proposed approach and the state-of-the-art image retrieval techniques.
| Techniques | Proposed multimodal approach | Algorithm C (LSA-based technique | Algorithm D (multilayer pLSA-based technique | Algorithm E (techniques from the best performer at imageCLEF 2013 |
|---|---|---|---|---|
| MAP | 0.2909 | 0.0912 | 0.2825 | 0.3010 |
Results of the proposed approach when certain percentage of textual modality is missing.
| Percentage of missing textual modality | 10% | 15% | 20% | 25% | 30% |
|---|---|---|---|---|---|
| MAP | 0.2709 | 0.2601 | 0.2505 | 0.2459 | 0.2319 |