| Literature DB >> 35271045 |
Meikang Chen1, Kurban Ubul1,2, Xuebin Xu1, Alimjan Aysa2, Mahpirat Muhammat3.
Abstract
As a research hotspot in the field of natural language processing (NLP), sentiment analysis can be roughly divided into explicit sentiment analysis and implicit sentiment analysis. However, due to the lack of obvious emotion words in the implicit sentiment analysis task and because the sentiment polarity contained in implicit sentiment words is not easily accurately identified by existing text-processing methods, the implicit sentiment analysis task is one of the most difficult tasks in sentiment analysis. This paper proposes a new preprocessing method for implicit sentiment text classification; this method is named Text To Picture (TTP) in this paper. TTP highlights the sentiment differences between different sentiment polarities in Chinese implicit sentiment text with the help of deep learning by converting original text data into word frequency maps. The differences between sentiment polarities are used as sentiment clues to improve the performance of the Chinese implicit sentiment text classification task. It does this by transforming the original text data into a word frequency map in order to highlight the differences between the sentiment polarities expressed in the implicit sentiment text. We conducted experimental tests on two common datasets (SMP2019, EWECT), and the results show that the accuracy of our method is significantly improved compared with that of the competitor's. On the SMP2019 dataset, the accuracy-improvement range was 4.55-7.06%. On the EWECT dataset, the accuracy was improved by 1.81-3.95%. In conclusion, the new preprocessing method for implicit sentiment text classification proposed in this paper can achieve better classification results.Entities:
Keywords: data preprocessing; image classification; implicit sentiment analysis; natural language processing; text classification
Mesh:
Year: 2022 PMID: 35271045 PMCID: PMC8915041 DOI: 10.3390/s22051899
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1TTP flow diagram.
Figure 2Effect diagram of sample after data enhancement. (Example in the figure: Today, two people around Jia Yueting successively confirmed to Sina technology that Jia Yueting is still in California because there are still some things to deal with, but it will not be long before he returns home.).
Figure 3Schematic diagram of numerical matrix.
Figure 4Final effect diagram after processing input samples.
The proportions of the implicit experimental dataset for SMP2019.
| Dataset | Subset | Positive | Negative |
|---|---|---|---|
| SMP2019 | Training | 3749 | 7425 |
| Development | 469 | 922 | |
| Testing | 463 | 494 |
The proportions of the implicit experimental dataset for EWECT.
| Dataset | Subset | Positive | Negative |
|---|---|---|---|
| EWECT | Training | 4568 | 12,767 |
| Development | 391 | 1017 | |
| Testing | 810 | 854 |
Implicit sentiment analysis on SMP2019 dataset. (Two classifications: positive, negative).
| Dataset | Model | Accuracy (%) |
|---|---|---|
| SMP2019 | TextCNN | 80.67 |
| TextRNN | 79.00 | |
| TextRNN + Attention | 78.58 | |
| TextRCNN | 81.09 | |
| FastText | 79.10 | |
| DPCNN | 80.36 | |
| Transformer | 79.21 | |
| TTP + FER-Net |
|
Implicit sentiment analysis on EWECT dataset. (Two classifications: positive, negative).
| Dataset | Model | Accuracy (%) |
|---|---|---|
| EWECT | TextCNN | 77.36 |
| TextRNN | 77.72 | |
| TextRNN + Attention | 76.28 | |
| TextRCNN | 78.00 | |
| FastText | 75.98 | |
| DPCNN | 78.12 | |
| Transformer | 77.06 | |
| TTP + FER-Net |
|
Results of ablation experiments on SMP2019 dataset. (Two classifications: positive, negative).
| Dataset | Model | Accuracy (%) |
|---|---|---|
| SMP2019 | FER-Net | 50.57 |
| TTP + FER-Net |
|
Results of ablation experiments on EWECT dataset. (Two classifications: positive, negative).
| Dataset | Model | Accuracy (%) |
|---|---|---|
| EWECT | FER-Net | 51.32 |
| TTP + FER-Net |
|