| Literature DB >> 26000635 |
Yanyan Zhao1, Bing Qin2, Ting Liu2, Wei Yang3.
Abstract
Target extraction is an important task in opinion mining. In this task, a complete target consists of an aspect and its corresponding object. However, previous work has always simply regarded the aspect as the target itself and has ignored the important "object" element. Thus, these studies have addressed incomplete targets, which are of limited use for practical applications. This paper proposes a novel and important sentiment analysis task, termed aspect-object alignment, to solve the "object neglect" problem. The objective of this task is to obtain the correct corresponding object for each aspect. We design a two-step framework for this task. We first provide an aspect-object alignment classifier that incorporates three sets of features, namely, the basic, relational, and special target features. However, the objects that are assigned to aspects in a sentence often contradict each other and possess many complicated features that are difficult to incorporate into a classifier. To resolve these conflicts, we impose two types of constraints in the second step: intra-sentence constraints and inter-sentence constraints. These constraints are encoded as linear formulations, and Integer Linear Programming (ILP) is used as an inference procedure to obtain a final global decision that is consistent with the constraints. Experiments on a corpus in the camera domain demonstrate that the three feature sets used in the aspect-object alignment classifier are effective in improving its performance. Moreover, the classifier with ILP inference performs better than the classifier without it, thereby illustrating that the two types of constraints that we impose are beneficial.Entities:
Mesh:
Year: 2015 PMID: 26000635 PMCID: PMC4441432 DOI: 10.1371/journal.pone.0125084
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Example of the three types of sentences.
Statistics of the corpus.
| No. | Types | Digital camera |
|---|---|---|
| 1 | # of reviews | 200 |
| 2 | # of sentences | 8,042 |
| 3 | # of aspects | 2,017 |
| 4 | Average # of objects per review | 2.82 |
| 5 | # of pairwise ⟨ | 9,161 |
Fig 2Algorithm for the cascading rule-based approach (Baseline1).
Comparison of the results of our method and two baselines.
| Method |
|
|---|---|
| Baseline 1: cascading rule-based | 78.04 |
| Baseline 2: aspect-object alignment classifier | 81.80 |
| Our ILP inference method |
|
Results of using different feature-set combinations in the aspect-object alignment classifier.
| Feature set | Positive instances | Negative instances | ||||
|---|---|---|---|---|---|---|
| P | R | F | P | R | F | |
| No Basic features. | 69.24 | 60.04 | 64.15 | 89.17 | 92.52 | 90.80 |
| No Relational features. | 71.03 | 58.40 | 63.88 | 88.84 | 93.19 | 90.95 |
| No Special target features. | 81.47 | 77.92 | 79.50 | 93.82 | 94.97 | 94.38 |
| All features | 82.88 | 77.89 |
| 93.86 | 95.44 |
|
1 “Positive instances” is defined in Section 4.3.
2 Similarly, “negative instances” is also defined in Section 4.3.
Results of aspect-object alignment using different ILP constraints.
| Constraints | ILP constraints |
|
|---|---|---|
| Intra-sentence constraints | ILP-c1 | 81.80 |
| ILP-c2 | 81.85 | |
| ILP-c3 | 81.90 | |
| ILP-c4 | 82.65 | |
| Inter-sentence constraints | ILP-c5 | 82.45 |
| ILP-c6 | 82.05 | |
| All constraints | ILP-c1-6 |
|