| Literature DB >> 24294131 |
Xin Wang1, Wanli Zuo, Ying Wang.
Abstract
Word sense disambiguation (WSD) is a fundamental problem in nature language processing, the objective of which is to identify the most proper sense for an ambiguous word in a given context. Although WSD has been researched over the years, the performance of existing algorithms in terms of accuracy and recall is still unsatisfactory. In this paper, we propose a novel approach to word sense disambiguation based on topical and semantic association. For a given document, supposing that its topic category is accurately discriminated, the correct sense of the ambiguous term is identified through the corresponding topic and semantic contexts. We firstly extract topic discriminative terms from document and construct topical graph based on topic span intervals to implement topic identification. We then exploit syntactic features, topic span features, and semantic features to disambiguate nouns and verbs in the context of ambiguous word. Finally, we conduct experiments on the standard data set SemCor to evaluate the performance of the proposed method, and the results indicate that our approach achieves relatively better performance than existing approaches.Entities:
Mesh:
Year: 2013 PMID: 24294131 PMCID: PMC3833093 DOI: 10.1155/2013/586327
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Algorithm 1Leveraging Topic Discriminative Term for Topic Identification.
Figure 1The topical-semantic association graph.
The performance of the topic identification based on extracting topic discriminative terms.
| Measure feature | Mono. | Poly. | All |
|---|---|---|---|
| TF | 78.3 | 76.1 | 78.0 |
| Pos + TF | 82.1 | 80.8 | 81.6 |
| SN |
| 65.5 | 74.0 |
| TS | 88.6 | 83.1 | 86.2 |
| All |
|
|
|
The performance of disambiguating through TSA versus other state-of-the art algorithms.
| Algo. | Nouns only | Verbs only | All words | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
| |
| TSA | 82.5 | 68.9 | 75.1 | 69.3 | 57.7 | 63.0 | 76.8 | 60.2 | 67.4 |
| ExtLesk | 80.5 | 62.1 | 70.1 | 60.7 | 47.9 | 53.5 | 71.5 | 50.9 | 59.5 |
| SSI | 81.6 | 65.2 | 72.5 | 63.5 | 54.6 | 58.7 | 74.0 | 58.7 | 65.5 |
| MFS | 74.7 | 74.7 | 74.7 | 59.1 | 59.1 | 59.1 | 68.4 | 68.4 | 68.4 |