| Literature DB >> 34938715 |
Senthil Kumar Narayanasamy1, Kathiravan Srinivasan2, Saeed Mian Qaisar3, Chuan-Yu Chang4,5.
Abstract
The exponential growth of social media users has changed the dynamics of retrieving the potential information from user-generated content and transformed the paradigm of information-retrieval mechanism with the novel developments on the concept of "web of data". In this regard, our proposed Ontology-Based Sentiment Analysis provides two novel approaches: First, the emotion extraction on tweets related to COVID-19 is carried out by a well-formed taxonomy that comprises possible emotional concepts with fine-grained properties and polarized values. Second, the potential entities present in the tweet can be analyzed for semantic associativity. The extraction of emotions can be performed in two cases: (i) words directly associated with the emotional concepts present in the taxonomy and (ii) words indirectly present in the emotional concepts. Though the latter case is very challenging in processing the tweets to find the hidden patterns and extract the meaningful facts associated with it, our proposed work is able to extract and detect almost 81% of true positives and considerably able to detect the false negatives. Finally, the proposed approach's superior performance is witnessed from its comparison with other peer-level approaches.Entities:
Keywords: SPARQL; emotion ontology; latent Dirichlet allocation; natural language processing; sentiment analysis; twitter streams
Mesh:
Year: 2021 PMID: 34938715 PMCID: PMC8685242 DOI: 10.3389/fpubh.2021.798905
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Highlights of semantic similarity measures and dataset assignments.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | Dridi and Recupero ( | (2019) | Semantic similarity Association between the pairs of nouns | 40 | ⋎ = 0.784 |
| 2 | Mozafari and Tahayori ( | (2019) | Semantic relatedness exists among the pairs of nouns. | Two sets | ρ = 0.661 |
| 3 | Dragoni et al. ( | (2018) | Semantic proximity of word pairs on the basis of synonymy questions | 50 | NA |
| 4 | Chen et al. ( | (2017) | Semantic similarity evaluation of words based on analogy questions | 390 | NA |
| 5 | Li et al. ( | (2017) | Semantic Associativity between 25 medical words | 34 | ⋎ = 0.51 |
| 6 | Zhu and Iglesias ( | (2016) | Semantic similarity and proximity score of pairs of UMLS concepts (domain: medical) | 571 | |
| 7 | Bruni et al. ( | (2014) | Semantic relatedness of pairs of words | 2500 | ρ = 0.74 |
| 8 | Hill et al. ( | Semantic similarity of pairs of words | 999 | ρ = 0.7 |
Figure 1Proposed architecture for emotion extraction from Tweets.
Semantic tools and ontology design for CLI and LIB.
|
|
|
|
|
|
|---|---|---|---|---|
| SML | OWL,RDF, and OBO | CLI, LIB | P, G | Java |
| FastSemSim | OBO and others | CLI, LIB | P | Python |
| OntoSim | OWL, RDF | LIB | P | Java |
| YtexSemanticSimilarity | SEE DOC | LIB | P, G | Java |
| SimilarityLibrary | Wordnet, MeSH, GO | CLI, LIB | P, G | Java / Python |
| OWLSim | OWL, RDF, OBO | LIB | P | Java / Python |
Figure 2Fundamental emotions list for ontology population.
Figure 3Emotion ontology generation.
Tweet segmentation and classification for ontology matching.
|
|
|
|
|
|---|---|---|---|
| Tweet Seg + Ontology | 0.892 | 0.801 | 0.841 |
| Tweet Seg + Inferences | 0.761 | 0.684 | 0.692 |
| POS + First Order Logics | 0.561 | 0.504 | 0.539 |
| Tweet Seg + POS | 0.710 | 0.693 | 0.703 |
Total number of occurrences of the emotions tested for the training dataset.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| 187 | 562 | 491 | 688 | 490 | 294 | 579 | 463 |
Calculation of the percentage of TN, FP, FN, and TP from the confusion matrix.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| TN | 31.27 | 56.81 | 48.24 | 71.88 | 49.11 | 32.45 | 50.19 | 48.82 |
| TP | 22.10 | 19.21 | 24.11 | 8.99 | 1.90 | 6.33 | 14.28 | 2.19 |
| FN | 6.21 | 1.01 | 5.35 | 4.22 | 2.34 | 3.11 | 5.36 | 1.21 |
| FP | 4.10 | 0.19 | 2.98 | 2.81 | 1.16 | 1.69 | 2.55 | 0.61 |
Figure 4Percentage of TP, TN, FP, and FN.
Emotion classifier for each emotion extracted from COVID-19 datasets.
|
|
|
|
|
|
|---|---|---|---|---|
| Joy | 273 | 0.781 | 0.689 | 0.735 |
| Anger | 849 | 0.674 | 0.621 | 0.647 |
| Disgust | 783 | 0.738 | 0.685 | 0.711 |
| Fear | 959 | 0.811 | 0.783 | 0.797 |
| Surprise | 630 | 0.893 | 0.812 | 0.852 |
| Love | 384 | 0.874 | 0.823 | 0.848 |
| Sadness | 791 | 0.891 | 0.841 | 0.866 |
| Confusion | 768 | 0.783 | 0.738 | 0.760 |
| Macro-Average | 817 | 0.805 | 0.749 | 0.777 |
Figure 5ROC curve—emotions classifier for the COVID-19 dataset.
Comparative analysis with existing models and their limitations.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| Our Proposed Model | – | Ontological Framework | COVID-19 Pandemic related Tweets | Right utilization of Ontological Framework coupled with ML algorithms such as SVM and LDA | Can work only with English Text. | 0.831 |
| Matla and Badugu | 2020 | Machine Learning | Tweets | Deployed the Naive Bayes algorithm for classifying the Twitter messages into four emotional categories. | Absences of contextual words in the text. | 0.725 |
| Huang et al. ( | 2019 | Visual and semantic attention mechanism | Tweets | Deep Multimodal Attentive Fusion for multimodal sentiment analysis of 10 million Tweets. | The fine-granularity relation between image and text pairs is not yet explored. | 0.769 |
| Ragheb et al. ( | 2019 | Machine Learning | CLEF eRisk-2019 (signs of anorexia) T1 dataset | Categorized the emotions based on Attention based model. | Cannot detect the happy emotions from text. | 0.710 |
| Hasan et al. ( | 2019 | Hybrid | Tweets | Deployed a supervised learning system for automatically classifying emotion in text stream messages using ANEW lexicon. | Works well only for text messages. | 0.778 |
| Almanie et al. ( | 2018 | Rule Based | Tweets | Able to extract the prominent 5 emotions in a real time situation. | Low semantic orientation of textual context. | 0.719 |
| Badugu and Suhasini. ( | 2017 | Rule Based | Tweets | Detected only 4 primary emotions using the Rule Based approach. | This system focused only on English sentences. | 0.720 |
Algorithm: 1 Emotional Ontology-Based Entity Extraction and Weight Calculation.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| SELECT ?group ?emotion |
| WHERE |
| { |
| ?a rdf:type x:EmotionAboutEntityItem; |
| x:hasEmotion ?emotion; |
| x:hasEntityItem ?entityItem. |
| ?group x:contains ?entityItem. |
| } |
| ORDER BY ASC(?group) |