| Literature DB >> 25954356 |
Noémie Elhadad1, Shaodian Zhang1, Patricia Driscoll1, Samuel Brody2.
Abstract
Online health communities play an increasingly prevalent role for patients and are the source of a growing body of research. A lexicon that represents the sublanguage of an online community is an important resource to enable analysis and tool development over this data source. This paper investigates a method to generate a lexicon representative of the language of members in a given community with respect to specific semantic types. We experiment with a breast cancer community and detect terms that belong to three semantic types: medications, symptoms and side effects, and emotions. We assess the ability of our automatically generated lexicons to detect new terms, and show that a data-driven approach captures the sublanguage of members in these communities, all the while increasing coverage of general-purpose terminologies. The code and the generated lexicons are made available to the research community.Entities:
Mesh:
Year: 2014 PMID: 25954356 PMCID: PMC4419934
Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN: 1559-4076