| Literature DB >> 24860817 |
Gang Zhang1, Jian Yin2, Xiangyang Su3, Yongjing Huang4, Yingrong Lao4, Zhaohui Liang4, Shanxing Ou5, Honglai Zhang4.
Abstract
Skin biopsy images can reveal causes and severity of many skin diseases, which is a significant complement for skin surface inspection. Automatic annotation of skin biopsy image is an important problem for increasing efficiency and reducing the subjectiveness in diagnosis. However it is challenging particularly when there exists indirect relationship between annotation terms and local regions of a biopsy image, as well as local structures with different textures. In this paper, a novel method based on a recent proposed machine learning model, named multi-instance multilabel (MIML), is proposed to model the potential knowledge and experience of doctors on skin biopsy image annotation. We first show that the problem of skin biopsy image annotation can naturally be expressed as a MIML problem and then propose an image representation method that can capture both region structure and texture features, and a sparse Bayesian MIML algorithm which can produce probabilities indicating the confidence of annotation. The proposed algorithm framework is evaluated on a real clinical dataset containing 12,700 skin biopsy images. The results show that it is effective and prominent.Entities:
Mesh:
Year: 2014 PMID: 24860817 PMCID: PMC3997873 DOI: 10.1155/2014/305629
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Example of skin biopsy images and their corresponding description in plain text.
Figure 2Dermatological terms and their corresponding regions.
15 considered annotation terms and their occurence frequency.
| Number | Name | Rate |
|---|---|---|
|
| Retraction space | 28.65% |
|
| Papillomatosis | 22.71% |
|
| Follicular plug | 1.8% |
|
| Hypergranulosis | 32.15% |
|
| Horn cyst | 4.14% |
|
| Basal cell liquefaction degeneration | 6.48% |
|
| Thin prickle cell layer | 2.61% |
|
| Infiltration of lymphocytes | 9.12% |
|
| Hyperpigmentation of Basal cell layer | 36.99% |
|
| Nevocytic nests | 18.56% |
|
| Munro microabscess | 7.72% |
|
| Acanthosis | 19.05% |
|
| Absent granular cell layer | 23.24% |
|
| Parakeratosis | 6.81% |
|
| Hyperkeratosis | 11.30% |
Figure 3Normalized cut with k = 11.
Figure 4Feature extraction for local regions.
Figure 5Padding pixels.
Figure 6Main steps of the proposed algorithm.
Data representation and their consistent methods.
| Method | Reference | Dataset |
|---|---|---|
|
| This work |
|
|
|
Zhang et al. [ |
|
|
| Caicedo et al. [ |
|
|
| Li et al. [ |
|
|
|
Chen and Wang [ |
|
Evaluation criteria for multilabel learning.
| Name | Equation |
|---|---|
|
| Evaluate the number of misclassified label pairs |
|
| |
|
| Evaluate the portion that a label of highest probability is not a correct label |
|
| |
|
| Evaluate the average distance to go down to find the proper label for a given image |
|
| |
|
| Evaluate the average fraction of label pair that are misordered in the ranking list |
Annotation result evaluated by accuracy.
| Term |
|
|
|
|
|
|---|---|---|---|---|---|
|
|
| 76.1% | 70.6% | 75.9% | 68.3% |
|
|
| 75.9% | 76.1% | 74.5% | 73.8% |
|
| 77.7% |
| 77.8% | 76.2% | 68.5% |
|
| 81.3% | 81.2% | 80.5% |
| 81.2% |
|
| 69.3% | 66.5% | 67.9% |
| 67.4% |
|
|
| 75.0% | 71.7% | 74.2% | 72.3% |
|
|
| 77.4% | 76.5% | 75.8% | 75.9% |
|
| 85.1% |
| 84.6% | 83.8% | 80.9% |
|
|
| 86.8% | 81.4% | 83.0% | 78.2% |
|
|
| 75.4% | 74.5% | 73.8% | 72.0% |
|
| 69.9% |
| 68.9% | 70.7% | 69.6% |
|
|
| 76.1% | 73.2% | 75.8% | 73.2% |
|
| 79.2% |
| 77.2% | 78.8% | 72.5% |
|
| 80.6% | 81.2 | 77.2% |
| 73.5% |
|
|
| 86.4% | 82.6% | 83.1% | 80.2% |
Figure 7Sample outputs of methods M1 and M5.
Figure 8Evaluation result of four criteria.
Figure 9Sparsity and number of basic learners.
Figure 10Accuracy and the size of training set.
Figure 11The result of normalized cut with different settings of k.