| Literature DB >> 35890903 |
Yen-Hao Hsieh1, Xin-Ping Zeng2.
Abstract
Sentiment analysis is one of the fields of affective computing, which detects and evaluates people's psychological states and sentiments through text analysis. It is an important application of text mining technology and is widely used to analyze comments. Bullet screen videos have become a popular way for people to interact and communicate while watching online videos. Existing studies have focused on the form, content, and function of bullet screen comments, but few have examined bullet screen comments using natural language processing. Bullet screen comments are short text messages of different lengths and ambiguous emotional information, which makes it extremely challenging in natural language processing. Hence, it is important to understand how we can use the characteristics of bullet screen comments and sentiment analysis to understand the sentiments expressed and trends in bullet screen comments. This study poses the following research question: how can one analyze the sentiments ex-pressed in bullet screen comments accurately and effectively? This study mainly proposes an ERNIE-BiLSTM approach for sentiment analysis on bullet screen comments, which provides effective and innovative thinking for the sentiment analysis of bullet screen comments. The experimental results show that the ERNIE-BiLSTM approach has a higher accuracy rate, precision rate, recall rate, and F1-score than other methods.Entities:
Keywords: BiLSTM; ERNIE; bullet screen comments; sentiment analysis
Mesh:
Year: 2022 PMID: 35890903 PMCID: PMC9318645 DOI: 10.3390/s22145223
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1The structure of BERT [23].
Figure 2Long Short—Term Memory Neural Network [26].
Parameters Setting.
| Parameter | Values |
|---|---|
| Learning Rate | 2 × 10−5 |
| Epoch | 10 |
| Optimizer | Adam |
| Numbers of Neurons | 384 |
| Batch Size | 32 |
Experiment results of BERT-BiLSTM approach.
| Number of Layer | Dropout Rate | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| 1 | 0 | 0.832 | 0.811 | 0.795 | 0.789 |
| 1 | 0.2 | 0.852 | 0.843 | 0.824 | 0.817 |
| 1 | 0.4 | 0.861 | 0.851 | 0.844 | 0.831 |
| 1 | 0.6 | 0.872 | 0.862 | 0.847 | 0.841 |
| 1 | 0.8 | 0.838 | 0.813 | 0.801 | 0.791 |
| 1 | 1 | 0.513 | 0.454 | 0.419 | 0.407 |
| 2 | 0 | 0.841 | 0.819 | 0.804 | 0.797 |
| 2 | 0.2 | 0.855 | 0.833 | 0.812 | 0.809 |
| 2 | 0.4 | 0.860 | 0.851 | 0.842 | 0.825 |
| 2 | 0.6 | 0.875 | 0.853 | 0.847 | 0.833 |
| 2 | 0.8 | 0.836 | 0.819 | 0.808 | 0.801 |
| 2 | 1 | 0.483 | 0.471 | 0.433 | 0.420 |
Experiment results of ERNIE-BiLSTM approach.
| Number of Layer | Dropout Rate | Accuracy | Precision | Recall | F1-Score |
|---|---|---|---|---|---|
| 1 | 0 | 0.827 | 0.801 | 0.779 | 0.771 |
| 1 | 0.2 | 0.847 | 0.835 | 0.815 | 0.796 |
| 1 | 0.4 | 0.863 | 0.851 | 0.844 | 0.841 |
| 1 | 0.6 | 0.859 | 0.849 | 0.838 | 0.829 |
| 1 | 0.8 | 0.827 | 0.806 | 0.793 | 0.781 |
| 1 | 1 | 0.493 | 0.478 | 0.436 | 0.428 |
| 2 | 0 | 0.839 | 0.837 | 0.821 | 0.814 |
| 2 | 0.2 | 0.868 | 0.867 | 0.843 | 0.835 |
| 2 | 0.4 | 0.873 | 0.868 | 0.859 | 0.853 |
| 2 | 0.6 | 0.889 | 0.871 | 0.853 | 0.848 |
| 2 | 0.8 | 0.860 | 0.859 | 0.839 | 0.822 |
| 2 | 1 | 0.513 | 0.487 | 0.445 | 0.427 |
Parameters Setting of Word2Vec.
| Parameter | Values |
|---|---|
| Learning Rate | 0.002 |
| Epoch | 10 |
| Optimizer | Adagrad |
| Numbers of Neurons | 384 |
| Batch Size | 32 |
Performance Comparison of Different Word Vector Models.
| Accuracy | Precision | Recall | F1 | |
|---|---|---|---|---|
| Word2vec + BiLSTM | 0.687 | 0.699 | 0.644 | 0.642 |
| Bert + BiLSTM | 0.875 | 0.853 | 0.847 | 0.833 |
| ERNIE + BiLSTM | 0.889 | 0.871 | 0.853 | 0.848 |
Performance Comparison of Different Feature Extraction Models.
| Accuracy | Precision | Recall | F1 | |
|---|---|---|---|---|
| Bert | 0.821 | 0.815 | 0.801 | 0.789 |
| ERNIE | 0.838 | 0.824 | 0.810 | 0.803 |
| Bert + BiLSTM | 0.875 | 0.853 | 0.847 | 0.833 |
| ERNIE + BiLSTM | 0.889 | 0.871 | 0.853 | 0.848 |
Definitions of SIR.
| Role | Definition |
|---|---|
| Susceptible | A user indicated first post belonging to positive sentiment |
| Infectious | A user indicated first post belonging to negative sentiment |
| Potential | A user indicated negative sentiment posts with lots of followers and responses |
| Recovered | A user indicated positive sentiment posts to respond to Potential |
Figure 3The variations of SIR.
Figure 4The trends of SIR.