| Literature DB >> 23471473 |
Markus Strohmaier1, Christian Körner, Roman Kern.
Abstract
While recent progress has been achieved in understanding the structure and dynamics of social tagging systems, we know little about the underlying user motivations for tagging, and how they influence resulting folksonomies and tags. This paper addresses three issues related to this question. (1) What distinctions of user motivations are identified by previous research, and in what ways are the motivations of users amenable to quantitative analysis? (2) To what extent does tagging motivation vary across different social tagging systems? (3) How does variability in user motivation influence resulting tags and folksonomies? In this paper, we present measures to detect whether a tagger is primarily motivated by categorizing or describing resources, and apply these measures to datasets from seven different tagging systems. Our results show that (a) users' motivation for tagging varies not only across, but also within tagging systems, and that (b) tag agreement among users who are motivated by categorizing resources is significantly lower than among users who are motivated by describing resources. Our findings are relevant for (1) the development of tag-based user interfaces, (2) the analysis of tag semantics and (3) the design of search algorithms for social tagging systems.Entities:
Keywords: Social tagging systems; Tagging motivation; User goals; User motivation
Year: 2012 PMID: 23471473 PMCID: PMC3587461 DOI: 10.1016/j.websem.2012.09.003
Source DB: PubMed Journal: Web Semant ISSN: 1570-8268 Impact factor: 1.897
Overview of research on users’ motivation for tagging in social tagging systems.
| Authors | Categories of tagging motivation | Detection | Evidence | Reasoning | Systems investigated | # of users | Resources per user |
|---|---|---|---|---|---|---|---|
| Coates | Categorization, description | Expert judgment | Anecdotal | Inductive | Weblog | 1 | N/A |
| Hammond et al. | Self/self, self/others, others/self, others/others | Expert judgment | Observation | Inductive | 9 different tagging systems | N/A | N/A |
| Golder et al. | What it is about, what it is, who owns it, refining categories, identifying qualities, self-reference, task organizing | Expert judgment | Dataset | Inductive | Delicious | 229 | 300 (average) |
| Marlow et al. | Organizational, social, (and refinements) | Expert judgment | N/A | Deductive | Flickr | 10 (25,000) | 100 (minimum) |
| Xu et al. | Content-based, context-based, attribute-based, subjective, organizational | Expert judgment | N/A | Deductive | N/A | N/A | N/A |
| Sen et al. | Self-expression, organizing, learning, finding, decision support | Expert judgment | Prior experience | Deductive | Movielens | 635 (3366) | N/A |
| Wash and Rader | Later retrieval, sharing, social recognition, (and others) | Expert judgment | Interviews (semistruct.) | Inductive | Delicious | 12 | 950 (average) |
| Ames and Naaman | Self/organization, self/communication, social/organization, social/communication | Expert judgment | Interviews (in-depth) | Inductive | Flickr, ZoneTag | 13 | N/A |
| Heckner et al. | Personal information management, resource sharing | Expert judgment | Survey (M. Turk) | Deductive | Flickr, YouTube, Delicious, Connotea | 142 | 20 and 5 (minimum) |
| Nov et al. | Enjoyment, commitment, self-development, reputation | Expert judgment | Survey (e-mail) | Deductive | Flickr (PRO users only) | 422 | 2848.5 (average) |
Fig. 1Examples of tag clouds produced by different users: categorizer (left) vs. describer (right).
Differences between categorizers and describers.
| Categorizer (C) | Describer (D) | |
|---|---|---|
| Goal | Later browsing | Later retrieval |
| Change of vocab. | Costly | Cheap |
| Size of vocab. | Limited | Open |
| Tags | Subjective | Objective |
| Tag reuse | Frequent | Rare |
| Tag purpose | Mimic taxonomy | Descriptive labels |
Overview and statistics of social tagging datasets.
| Dataset | |||||
|---|---|---|---|---|---|
| ESP game | 290 | 29,834 | 99,942 | 1000 | 0.2985 |
| Flickr sets | 1419 | 49,298 | 1,966,269 | 500 | 0.0250 |
| Delicious | 896 | 184,746 | 1,089,653 | 1000 | 0.1695 |
| Flickr tags | 456 | 216,936 | 965,419 | 1000 | 0.2247 |
| Bibsonomy bookmarks | 84 | 29176 | 93,309 | 500 | 0.3127 |
| Bibsonomy publications | 26 | 11,006 | 23,696 | 500 | 0.4645 |
| CiteULike | 581 | 148,396 | 545,535 | 500 | 0.2720 |
| Diigo tags | 135 | 68,428 | 161,475 | 500 | 0.4238 |
| Movielens | 99 | 9983 | 7078 | 500 | 1.4104 |
Indicate synthetic personomies of extreme categorization/description behavior.
Fig. 2Overview of the introduced measures (from left to right: and ) over time for the two synthetic datasets (top and bottom row) and the Delicious dataset (middle row). The synthetic datasets form approximate upper and lower bounds for “real” tagging datasets.
Fig. 3and at for 9 datasets, including Pearson correlation and the mean value for . The top-left and the lower-right figure show the reference datasets for describers and categorizers respectively (designated with an asterisk).
Fig. 4at for 9 different datasets, binned in the interval . The synthetic Flickr sets dataset (back row) indicates extreme categorizer behavior, while the synthetic ESP game dataset (second back row) indicates extreme describer behavior (designated by an asterisk).
Fig. 5(a), (b) and (c) in the interval for 100 random users obtained from the Delicious dataset. The 100 users are split into two equal halves at according to the corresponding measure, with the upper half colored red (50 describers) and the lower half colored blue (50 categorizers). Due to our particular approach, it is obvious that all measures exhibit perfect separation at , but (right) appears to exhibit faster convergence and better separability in early phases of a user’s tagging history, especially for small (such as ). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Classification results for <500 resources according to the measure on the Delicious dataset. The identified user behavior at 500 resources was used as a ground truth for the previous steps.
| 0 | 100 | 200 | 300 | 400 | |
|---|---|---|---|---|---|
| Correctly identified user behavior | 50 | 77 | 84 | 90 | 91 |
| Incorrectly identified user behavior | 50 | 23 | 16 | 10 | 9 |
Tag agreement among Delicious describers and categorizers for 500 most popular resources. For all different , describers produce more agreed tags than categorizers. Results for are dominated by ties.
| 10 | 20 | 30 | 40 | 50 | 60 | 70 | 80 | 90 | |
|---|---|---|---|---|---|---|---|---|---|
| Desc. wins | 172 | 74 | 23 | ||||||
| Cat. wins | 56 | 12 | 5 | 7 | 5 | 3 | 4 | 4 | 0 |
| Ties | 66 | 25 | 24 | 41 | 115 | 211 |