| Literature DB >> 34066133 |
Kia Dashtipour1, Mandar Gogate2, Ahsan Adeel3, Hadi Larijani4, Amir Hussain2.
Abstract
Sentiment analysis aims to automatically classify the subject's sentiment (e.g., positive, negative, or neutral) towards a particular aspect such as a topic, product, movie, news, etc. Deep learning has recently emerged as a powerful machine learning technique to tackle the growing demand for accurate sentiment analysis. However, the majority of research efforts are devoted to English-language only, while information of great importance is also available in other languages. This paper presents a novel, context-aware, deep-learning-driven, Persian sentiment analysis approach. Specifically, the proposed deep-learning-driven automated feature-engineering approach classifies Persian movie reviews as having positive or negative sentiments. Two deep learning algorithms, convolutional neural networks (CNN) and long-short-term memory (LSTM), are applied and compared with our previously proposed manual-feature-engineering-driven, SVM-based approach. Simulation results demonstrate that LSTM obtained a better performance as compared to multilayer perceptron (MLP), autoencoder, support vector machine (SVM), logistic regression and CNN algorithms.Entities:
Keywords: CNN; LSTM; classification; deep learning; sentiment analysis
Year: 2021 PMID: 34066133 PMCID: PMC8151596 DOI: 10.3390/e23050596
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Figure 1Proposed Framework.
Figure 2Persian Sentences Examples.
Figure 3CNN Classifier.
Figure 4Long Term Short Memory.
Deep Learning Classifiers Results on Persian Movie reviews.
| Classifier | Precision | Recall | F-Measure | Accuracy (%) |
|---|---|---|---|---|
| Ghasemi et al. [ | 0.63 | 0.62 | 0.63 | 63.94 |
| Asli et al. [ | 0.68 | 0.67 | 0.68 | 68.23 |
| Amiri et al. [ | 0.69 | 0.68 | 0.69 | 70.01 |
| Basiri et al. [ | 0.72 | 0.71 | 0.72 | 72.81 |
| MLP | 0.78 | 0.78 | 0.78 | 78.49 |
| MLP-Autoencoder | 0.8 | 0.8 | 0.8 | 80.08 |
| 2D-CNN | 0.82 | 0.82 | 0.82 | 82.47 |
| 1D-CNN | 0.84 | 0.83 | 0.83 | 82.86 |
| Minimal-RNN | 0.86 | 0.86 | 0.86 | 85.64 |
| Stacked-LSTM | 0.94 | 0.94 | 0.94 | 93.65 |
|
| 0.96 | 0.96 | 0.96 | 95.61 |
SVM vs. Logistic Regression (LR) on Persian Movie reviews.
| Feature | Precision | Recall | F-Measure | Accuracy (%) | ||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| Bigram | 0.65 | 0.66 | 0.61 | 0.67 | 0.53 | 0.65 | 61.27 | 66.59 |
| Trigram | 0.81 | 0.88 | 0.75 | 0.88 | 0.71 | 0.88 | 74.53 | 87.96 |
| Adjective | 0.72 | 0.87 | 0.85 | 0.85 | 0.78 | 0.79 | 84.57 | 85.07 |
| Adverb | 0.76 | 0.81 | 0.87 | 0.86 | 0.81 | 0.82 | 86.95 | 85.88 |
|
| 0.79 | 0.91 | 0.89 | 0.89 | 0.84 | 0.85 | 88.93 | 89.37 |
| Verb | 0.78 | 0.78 | 0.88 | 0.88 | 0.83 | 0.83 | 88.4 | 88.4 |
SVM vs. LR on Hotel reviews dataset.
| Feature | Precision | Recall | F-Measure | Accuracy (%) | ||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |
| Bigram | 0.71 | 0.72 | 0.71 | 0.72 | 0.71 | 0.72 | 71.25 | 72 |
| Trigram | 0.73 | 0.74 | 0.73 | 0.74 | 0.73 | 0.74 | 73.5 | 74 |
|
| 0.75 | 0.76 | 0.78 | 0.79 | 0.76 | 0.77 | 76.24 | 77.06 |
| Adverb | 0.63 | 0.64 | 0.62 | 0.63 | 0.62 | 0.62 | 62.72 | 62.78 |
| Noun | 0.67 | 0.69 | 0.68 | 0.70 | 0.67 | 0.69 | 68.21 | 69.08 |
| Verb | 0.68 | 0.69 | 0.69 | 0.71 | 0.68 | 0.69 | 68.54 | 69.97 |
Deep Learning Classifiers Results on Hotel reviews dataset.
| Classifier | Precision | Recall | F-Measure | Accuracy (%) |
|---|---|---|---|---|
| Stacked-BiLSTM | 0.53 | 0.91 | 0.67 | 74.49 |
| 1D-CNN | 0.73 | 0.80 | 0.76 | 78.02 |
| Stacked-LSTM | 0.79 | 0.81 | 0.80 | 81.04 |
|
| 0.89 | 0.89 | 0.89 | 89.76 |
Comparison of 1D-CNN Layers on Persian Movie reviews.
| Layer | Precision | Recall | F-Measure | Accuracy (%) | Time |
|---|---|---|---|---|---|
| 2 | 0.72 | 0.73 | 0.72 | 73.55 | 2 m 31 s |
| 3 | 0.74 | 0.74 | 0.74 | 74.26 | 3 m 12 s |
| 4 | 0.81 | 0.79 | 0.79 | 79.01 | 3 m 44 s |
|
| 0.84 | 0.83 | 0.83 | 82.86 | 4 m 22 s |
| 6 | 0.78 | 0.76 | 0.76 | 78.42 | 5 m 24 s |
Comparison of 2D-CNN Layers on Persian Movie reviews.
| Layer | Precision | Recall | F-Measure | Accuracy (%) | Time |
|---|---|---|---|---|---|
| 2 | 0.72 | 0.70 | 0.70 | 74.05 | 2 m 21 s |
| 3 | 073 | 0.71 | 0.71 | 74.22 | 3 m 23 s |
| 4 | 078 | 0.78 | 0.78 | 78.26 | 5 m 5 s |
|
| 0.82 | 0.82 | 0.82 | 82.47 | 6 m 33 s |
| 6 | 0.75 | 0.76 | 0.75 | 76.51 | 8 m 25 s |
Comparison of LSTM Layers on Persian Movie reviews.
| Layer | Precision | Recall | F-Measure | Accuracy (%) | Time |
|---|---|---|---|---|---|
|
| 0.94 | 0.94 | 0.94 | 93.65 | 6 m 49 s |
| 3 | 0.84 | 0.83 | 0.83 | 84.26 | 7 m 28 s |
Comparison of BiLSTM Layers on Persian Movie reviews.
| Layer | Precision | Recall | F-Measure | Accuracy (%) | Time |
|---|---|---|---|---|---|
|
| 0.96 | 0.96 | 0.96 | 95.61 | 7 m 23 s |
| 3 | 0.84 | 0.86 | 0.85 | 85.05 | 9 m 24 s |
Figure 5Deep Learning Classifiers’ Accuracy on Movie reviews.
Figure 6SVM vs. LR Classifier Accuracy on Movie reviews.