| Literature DB >> 31341466 |
Dan Gan1, Jiang Shen1, Man Xu2.
Abstract
The medical knowledge sharing community provides users with an open platform for accessing medical resources and sharing medical knowledge, treatment experience, and emotions. Compared with the recipients of general commodities, the recipients in the medical knowledge sharing community pay more attention to the intensity or overall evaluation of emotional vocabularies in the comments, such as treatment effects, prices, service attitudes, and other aspects. Therefore, the overall evaluation is not a key factor in medical service comments, but the semantics of the emotional polarity is the key to affect recipients of the medical information. In this paper, we propose an adaptive learning emotion identification method (ALEIM) based on mutual information feature weight, which captures the correlation and redundancy of features. In order to evaluate the proposed method's effectiveness, we use four basic corpus libraries crawled from the Haodf's online platform and employ Taiwan University NTUSD Simplified Chinese Emotion Dictionary for emotion classification. The experimental results show that our proposed ALEIM method has a better performance for the identification of the low-frequency words' redundant features in comments of the online medical knowledge sharing community.Entities:
Year: 2019 PMID: 31341466 PMCID: PMC6614987 DOI: 10.1155/2019/1604392
Source DB: PubMed Journal: Comput Intell Neurosci
Summary of typical previous studies for the emotion analysis challenges.
| Author | Year | Domain oriented | Challenge type | Review structure |
|---|---|---|---|---|
| Jia et al. [ | 2009 | Health/medical domain | Theoretical | Semi-structured |
| Hogenboom et al. [ | 2011 | Movie reviews | Theoretical | Unstructured |
| Alexandra and Ralf [ | 2009 | Online news reviews | Theoretical | Semistructured/unstructured |
| Mukherjee and Bhattacharyya [ | 2012 | Products | Technical | Semistructured |
| Chetan and Atul [ | 2014 | Tweets | Technical | Unstructured |
| Doaa and Osama [ | 2015 | Scientific papers | Theoretical + technical | Structured |
Figure 1The preparation process of experimental datasets in this paper.
The test data for emotion classification under different algorithms.
| The number of data | Positive | Negative | The number of feature | Used for |
|---|---|---|---|---|
| 100 | 70 | 30 | 37 | Training corpus |
| 150 | 100 | 50 | 39 | Training corpus |
| 200 | 120 | 80 | 41 | Training corpus |
| 300 | 180 | 120 | 41 | Training corpus |
| 400 | 200 | 200 | 42 | Test corpus |
Figure 2The overall flowchart of experiments in this paper.
Figure 3The accuracy of the four methods used in this paper.
The detailed significant test results of accuracy between MI and other methods.
| Datasets | Metrics | Methods | ||
|---|---|---|---|---|
| MI and emotion lexicon | MI and TI-IDF | MI and SVM (RBF) | ||
| 100 data |
| 0.0906 | 0.0063 | 0.1304 |
| 150 data | 0.0487 | 0.0197 | 0.0043 | |
| 200 data | 0.0435 | 0.0437 | 0.0226 | |
| 300 data | 0.0255 | 0.0432 | 0.0021 | |
Figure 4The precision of the four methods used in this paper.
The detailed significant test results of precision between MI and other methods.
| Datasets | Metrics | Methods | ||
|---|---|---|---|---|
| MI and emotion lexicon | MI and TI-IDF | MI and SVM (RBF) | ||
| 100 data |
| 0.0413 | 0.0043 | 0.0343 |
| 150 data | 0.0387 | 0.0667 | 0.0342 | |
| 200 data | 0.0234 | 0.0731 | 0.0106 | |
| 300 data | 0.0055 | 0.0902 | 0.0049 | |
Figure 5The recall of the four methods used in this paper.
The detailed significant test results of recall between MI and other methods.
| Datasets | Metrics | Methods | ||
|---|---|---|---|---|
| MI and emotion lexicon | MI and TI-IDF | MI and SVM (RBF) | ||
| 100 data |
| 0.0313 | 0.1025 | 0.0034 |
| 150 data | 0.0478 | 0.0706 | 0.0147 | |
| 200 data | 0.0443 | 0.0831 | 0.0321 | |
| 300 data | 0.0142 | 0.0502 | 0.0079 | |
Figure 6The difference between mutual information algorithm and TI-IDF algorithm.