| Literature DB >> 35497382 |
Alberto Fernández-Isabel1, Javier Cabezas1, Daniela Moctezuma2, Isaac Martín de Diego1.
Abstract
Intelligent systems have been developed for years to solve specific tasks automatically. An important issue emerges when the information used by these systems exhibits a dynamic nature and evolves. This fact adds a level of complexity that makes these systems prone to a noticeable worsening of their performance. Thus, their capabilities have to be upgraded to address these new requirements. Furthermore, this problem is even more challenging when the information comes from human individuals and their interactions through language. This issue happens more easily and forcefully in the specific domain of Sentiment Analysis, where feelings and opinions of humans are in constant evolution. In this context, systems are trained with an enormous corpus of textual content, or they include an extensive set of words and their related sentiment values. These solutions are usually static and generic, making their manual upgrading almost unworkable. In this paper, an automatic and interactive coaching architecture is proposed. It includes a ML framework and a dictionary-based system both trained for a specific domain. These systems converse about the outcomes obtained during their respective learning stages by simulating human interactive coaching sessions. This leads to an Active Learning process where the dictionary-based system acquires new information and improves its performance. More than 800, 000 tweets have been gathered and processed for experiments. Outstanding results were obtained when the proposed architecture was used. Also, the lexicon was updated with the prior and new words related to the corpus used which is important to reach a better sentiment analysis classification.Entities:
Keywords: Active learning; Automatic coaching; Combination of information; Continuous dynamical system; Sentiment analysis
Year: 2022 PMID: 35497382 PMCID: PMC9043891 DOI: 10.1007/s12559-022-10018-2
Source DB: PubMed Journal: Cognit Comput ISSN: 1866-9956 Impact factor: 4.890
Fig. 1Overview of the EmoWeb 2.0 framework architecture
Fig. 2Excerpt of the Data Processing Module architecture
Fig. 3Excerpt of the Sentiment Evaluation Module architecture
Example of how texts are transformed by µTC.
| maintain hashtags | The #covid19 makes meee crazy!!! :(, I read this http://siteurl |
| remove urls | The #covid19 makes meee crazy!!! :(, I read this |
| lowercase | the #covid19 makes meee crazy!!! :(, i read this |
| remove punctuation | the #covid19 makes meee crazy :( i read this |
| remove duplicates | the |
| maintain emojis | the #covid19 makes me crazy :( i read this |
Fig. 4Proposed architecture and major coaching activities
Fig. 5Coaching session and internal process followed
Fig. 6Complete set of experiments performed with EmoWeb 2.0
µTC results (train and test)
| Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Train | 95.46 | 95.84 | 95.96 | 93.07 | 94.97 | 96.39 | 93.41 | 96.72 | 95.53 | 92.72 | ||
| Test | 84.07 | 85.82 | 85.67 | 74.15 | 86.37 | 83.88 | 79.26 | 85.27 | 87.54 | 69.67 | ||
Parameters used for each of the experiments performed
| Upper_th | Lower_th | Days | CProb_th | CHigh_th | CNeu_th | CLow_th | CEmo_th | CN | ||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 0.4 | 0.3 | 0.2 | 2 | - | - | - | - | - | - |
| 2 | 0.4 | 0.3 | 0.2 | 2 | - | - | - | - | - | - |
| 5 | 0.4 | 0.3 | 0.2 | 2 | - | - | - | - | - | - |
| 6 | 0.4 | 0.3 | 0.2 | 2 | - | - | - | - | - | - |
| 3 | 0.7 | 0.3 | 0.2 | 2 | 0.45 | 0.3 | 0.025 | 0.3 | 0.4 | 30 |
| 4.1 | 0.7 | 0.3 | 0.2 | 2 | 0.45 | 0.3 | 0.025 | 0.3 | 0.4 | 30 |
| 4.2 | 0.1 | 0.3 | 0.2 | 2 | 0.45 | 0.2 | 0.025 | 0.2 | 0.6 | 30 |
| 4.3 | 0.1 | 0.3 | 0.2 | 2 | 0.1 | 0.05 | 0.025 | 0.05 | 0.9 | 30 |
| 7.1 | 0.7 | 0.3 | 0.2 | 2 | 0.45 | 0.3 | 0.025 | 0.3 | 0.4 | 30 |
| 7.2 | 0.1 | 0.3 | 0.2 | 2 | 0.45 | 0.2 | 0.025 | 0.2 | 0.6 | 30 |
| 7.3 | 0.1 | 0.3 | 0.2 | 2 | 0.1 | 0.05 | 0.025 | 0.05 | 0.9 | 30 |
Results for experiments 2 and 6 (train and test)
| Experiment 2 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | ||
| Train | 41.51 | 49.98 | 37.29 | 27.19 | 38.62 | 55.94 | 27.41 | 70.82 | 28.11 | 26.98 |
| Test | 42.34 | 51.12 | 37.91 | 27.45 | 40.19 | 55.92 | 19.17 | 70.54 | 28.78 | 27.63 |
Results for experiments 4.1 to 4.3 and 7.1 to 7.3 (training, coaching and testing)
| Experiment 4.1 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | Pos. | Neu. | Neg. | ||
| Train | 41.51 | 49.98 | 37.29 | 27.19 | 38.62 | 55.94 | 27.41 | 70.82 | 28.11 | 26.98 |
| Coaching | 97.37 | 96.85 | 97.77 | 97.15 | 94.86 | 99.25 | 96.74 | 98.94 | 96.33 | 97.56 |
| Test | 59.31 | 41.56 | 70.84 | 9.03 | 75.47 | 57.06 | 35.13 | 29.36 | 64.98 | 5.28 |