| Literature DB >> 22112583 |
Tanja Bekhuis1, Marcos Kreinacke, Heiko Spallek, Mei Song, Jean A O'Donnell.
Abstract
BACKGROUND: An Internet mailing list may be characterized as a virtual community of practice that serves as an information hub with easy access to expert advice and opportunities for social networking. We are interested in mining messages posted to a list for dental practitioners to identify clinical topics. Once we understand the topical domain, we can study dentists' real information needs and the nature of their shared expertise, and can avoid delivering useless content at the point of care in future informatics applications. However, a necessary first step involves developing procedures to identify messages that are worth studying given our resources for planned, labor-intensive research.Entities:
Mesh:
Year: 2011 PMID: 22112583 PMCID: PMC3236668 DOI: 10.2196/jmir.1799
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Workflow for finding clinically relevant messages posted to an Internet mailing list.
Figure 2Cumulative distribution of messages posted by dental practitioners to an online discussion list.
Distribution of collocated phrases and keywords by category
| Category | n of collocated phrasesa (% of phrases)b | n of keywordsc (% of keywords)d |
| Systemic disease | 49 (15) | 21 (15) |
| Endodontics | 18 (6) | 9 (6) |
| Orthodontics | 8 (3) | 3 (2) |
| Periodontics | 12 (4) | 6 (4) |
| Restorative dentistry | 66 (20) | 20 (14) |
| Oral and maxillofacial surgery | 26 (8) | 18 (13) |
| Other oral diseases | 7 (2) | 4 (3) |
| Radiology | 7 (2) | 4 (3) |
| Causative agent | 20 (6) | 9 (6) |
| Medication | 36 (11) | 19 (13) |
| Materials | 44 (14) | 17 (12) |
| Basic sciences | 13 ( 4) | 6 (4) |
| Research | 19 (6) | 7 (5) |
| Total | 325 | 143 |
a Collocated phrases are bigrams and trigrams; selection based on pointwise mutual information score and clinical relevance.
b Percentage of phrases computed relative to the total number of phrases and rounded.
c Some keywords occur in more than one category. Thus, the total number of instances is greater than the number of unique keywords.
d Percentage of keywords computed relative to the total number of instances of keywords and rounded.
Number of messages with phrases or keywords retrieved for content analyses by selected category
| Selected categorya | n of messagesb (n of phrases)c | n of messages (n of keywords) | n of messages (n of phrases or keywords) |
| Systemic disease | 119 (164) | 284 (384) | 299 (548) |
| Periodontics | 14 (27) | 51 (51) | 54 (78) |
| Oral and maxillofacial surgery | 36 (40) | 106 (113) | 106 (153) |
| Other oral diseases | 17 (24) | 44 (56) | 48 (80) |
| Radiology | 1 (1) | 12 (12) | 12 (13) |
| Causative agent | 55 (78) | 79 (95) | 102 (173) |
| Medication | 70 (110) | 343 (377) | 363 (487) |
| Materials | 4 (4) | 44 (50) | 44 (54) |
| Basic sciences | 8 (12) | 157 (164) | 160 (176) |
| Research | 40 (60) | 89 (109) | 100 (169) |
| Total | 305 (520) | 948 (1411) | 996 (1931) |
a Categories selected from the full set by qualitative researchers.
b Number of messages after deduplication.
c Collocated phrases are bigrams and trigrams; selection based on pointwise mutual information score and clinical relevance.