| Literature DB >> 25054188 |
Atika Qazi1, Ram Gopal Raj1, Muhammad Tahir2, Erik Cambria3, Karim Bux Shah Syed4.
Abstract
Appropriate identification and classification of online reviews to satisfy the needs of current and potential users pose a critical challenge for the business environment. This paper focuses on a specific kind of reviews: the suggestive type. Suggestions have a significant influence on both consumers' choices and designers' understanding and, hence, they are key for tasks such as brand positioning and social media marketing. The proposed approach consists of three main steps: (1) classify comparative and suggestive sentences; (2) categorize suggestive sentences into different types, either explicit or implicit locutions; (3) perform sentiment analysis on the classified reviews. A range of supervised machine learning approaches and feature sets are evaluated to tackle the problem of suggestive opinion mining. Experimental results for all three tasks are obtained on a dataset of mobile phone reviews and demonstrate that extending a bag-of-words representation with suggestive and comparative patterns is ideal for distinguishing suggestive sentences. In particular, it is observed that classifying suggestive sentences into implicit and explicit locutions works best when using a mixed sequential rule feature representation. Sentiment analysis achieves maximum performance when employing additional preprocessing in the form of negation handling and target masking, combined with sentiment lexicons.Entities:
Mesh:
Year: 2014 PMID: 25054188 PMCID: PMC4099162 DOI: 10.1155/2014/879323
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Suggestive and comparative detection workflow schematic.
Figure 2Sentiment analysis workflow schematic.
Figure 3F 1 performance on suggestive classification.
Performance numbers on LRChi2 over all feature construction approaches for detecting suggestive reviews.
| Feature construction | Precision | Recall |
|
|---|---|---|---|
| Bow | 0.8806 | 0.7603 | 0.8129 |
| Bow + Cmp | 0.8797 | 0.7539 | 0.8093 |
| Bow + Sug | 0.8715 | 0.7668 | 0.8124 |
| Bow + Cmp + Sug | 0.8796 | 0.7733 | 0.8193 |
| Bow + Cmp+ Sug + Surf | 0.859 | 0.7572 | 0.8008 |
| Sug | 0.899 | 0.6535 | 0.7534 |
| Mix | 0.8388 | 0.7671 | 0.7975 |
| Sug + Cmp | 0.9126 | 0.6537 | 0.7568 |
| Sug + Cmp + Surf | 0.8937 | 0.6699 | 0.7586 |
| Mix + Sug + Cmp | 0.8271 | 0.78 | 0.8005 |
| Mix + Surf + Sug + Cmp | 0.8309 | 0.7703 | 0.7972 |
Figure 4F 1 performance on comparative classification.
Performance numbers on LR over all feature construction approaches for detecting comparative reviews.
| Feature construction | Precision | Recall |
|
|---|---|---|---|
| Bow | 0.7297 | 0.6658 | 0.6907 |
| Bow + Cmp | 0.8298 | 0.7013 | 0.7523 |
| Bow + Sug | 0.7983 | 0.65 | 0.7124 |
| Bow + Cmp + Sug | 0.7821 | 0.6961 | 0.731 |
| Sug + Per | 0 | 0 | 0 |
| Mix | 0.7611 | 0.7132 | 0.7302 |
| Bow + Per | 0.8175 | 0.6842 | 0.7409 |
| Mix + Per | 0.7861 | 0.6789 | 0.724 |
| Mix + Per + Sug | 0.7347 | 0.6947 | 0.7092 |
| Mix + Sug + Cmp | 0.7377 | 0.6816 | 0.7047 |
| Mix + Sug + Cmp + Per | 0.7551 | 0.6618 | 0.7003 |
Figure 5Performance numbers on sentiment analysis on suggestive sentences.
Performance of LRChi2 on classification of explicit and implicit locution suggestions.
| Feature construction | Precision | Recall |
|
|---|---|---|---|
| Bow | 0.878 | 0.876 | 0.875 |
| Bow + Cmp | 0.881 | 0.904 | 0.890 |
| Bow + Sug | 0.862 | 0.918 | 0.887 |
| Bow + Cmp + Sug | 0.843 | 0.896 | 0.867 |
| Sug + Per | 0.822 | 0.938 | 0.872 |
| Mix | 0.909 | 0.925 | 0.915 |
| Bow + Per | 0.901 | 0.897 | 0.897 |
| Mix + Per | 0.902 | 0.914 | 0.906 |
| Mix + Per + Sug | 0.913 | 0.903 | 0.906 |
| Mix + Sug + Cmp | 0.909 | 0.931 | 0.918 |
| Mix + Sug + Cmp + Per | 0.906 | 0.921 | 0.911 |
Figure 6Performance numbers on various algorithms on separation of explicit and implicit locutions suggestions.
Performance numbers on LR over all feature construction approaches for sentiment analysis of suggestive reviews.
| Feature construction | Precision | Recall |
|
|---|---|---|---|
| Bow | 0.636 | 0.679 | 0.645 |
| Bow + Neg | 0.632 | 0.657 | 0.641 |
| Bow + Neg + Msk | 0.628 | 0.648 | 0.634 |
| Bow + Neg + Msk + Lex | 0.655 | 0.693 | 0.662 |
| Bow + Lex | 0.628 | 0.658 | 0.639 |
| Lex | 0.581 | 0.665 | 0.568 |
| Mix | 0.629 | 0.678 | 0.628 |
| Mix + Lex | 0.632 | 0.669 | 0.638 |
| Mix + Msk | 0.616 | 0.662 | 0.629 |
| Mix + Neg | 0.608 | 0.652 | 0.621 |
| Mix + Neg + Msk | 0.634 | 0.666 | 0.641 |
| Mix + Neg + Msk + Lex | 0.626 | 0.672 | 0.635 |