| Literature DB >> 24707215 |
Zheng Xu1, Xiangfeng Luo2, Yunhuai Liu1, Lin Mei1, Chuanping Hu1.
Abstract
Relatedness measurement between multimedia such as images and videos plays an important role in computer vision, which is a base for many multimedia related applications including clustering, searching, recommendation, and annotation. Recently, with the explosion of social media, users can upload media data and annotate content with descriptive tags. In this paper, we aim at measuring the semantic relatedness of Flickr images. Firstly, four information theory based functions are used to measure the semantic relatedness of tags. Secondly, the integration of tags pair based on bipartite graph is proposed to remove the noise and redundancy. Thirdly, the order information of tags is added to measure the semantic relatedness, which emphasizes the tags with high positions. The data sets including 1000 images from Flickr are used to evaluate the proposed method. Two data mining tasks including clustering and searching are performed by the proposed method, which shows the effectiveness and robustness of the proposed method. Moreover, some applications such as searching and faceted exploration are introduced using the proposed method, which shows that the proposed method has broad prospects on web based tasks.Entities:
Mesh:
Year: 2014 PMID: 24707215 PMCID: PMC3953465 DOI: 10.1155/2014/758089
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1The illustration of four kinds of correlations between concepts.
Figure 2The illustration of a pair of images from Flickr.
The variables and parameters used in the proposed computation model.
| Name | Description |
|---|---|
|
| An image |
|
| A tag |
|
| Tags set of an image |
|
| Semantic relatedness of two tags |
|
| Semantic relatedness of two images |
|
| Page counts of a tag |
|
| Set of page counts of an image |
| pos( | Position information of a tag |
Figure 3The illustration of the proposed method.
Figure 4Graphical representation of the assignment in bipartite graphs problem.
Algorithm 1MaxRel.
The detailed information of the data set.
| Group 1 | Average tags per image | Group 2 | Average tags per image |
|---|---|---|---|
| Car | 4.4 | Louis Vuitton | 3.1 |
| Phone | 3.5 | Dior | 3.2 |
| Flower | 2.2 | Gucci | 2.9 |
| Dog | 5.6 | Cartier | 2.8 |
| Boat | 3.1 | Chanel | 2.6 |
The selected tags of group 2 from Flickr.
| Group 2 | Tags | Tags | Tags | Tags | Tags |
|---|---|---|---|---|---|
| Louis Vuitton | “Louis Vuitton” | “Louis Vuitton” | “Louis Vuitton” | “Louis Vuitton” | “LV” |
| Dior | “DIOR” “lipstick” | “Dior” | “Dior” | “Dior” | “Dior” |
| Gucci | “Gucci” | “Gucci” | “Gucci” | “Replica” | “Gucci” |
| Cartier | “Cartier” | “CARTIER” | “Cartier” | “Calibre” | “Cartier Watch” |
| Chanel | “Chanel” | “Chanel” | “Coco Mademoiselle” | “Chanel” | “Chance” |
Figure 5The correlation of four selected functions.
Figure 6The clustering results of group 1 data sets.
Figure 7The clustering results of group 2 data sets.
The comparison of the cut-off point precision between the proposed method and Flickr.
| Cut-off point | Louis Vuitton | Gucci | Dior | Chanel | Cartier |
|---|---|---|---|---|---|
|
| 100% | 100% | 100% | 100% | 100% |
|
| 100% | 100% | 0 | 100% | 100% |
|
| 100% | 100% | 100% | 100% | 100% |
|
| 80% | 60% | 60% | 60% | 80% |
|
| 100% | 100% | 100% | 100% | 100% |
|
| 90% | 70% | 70% | 80% | 80% |