| Literature DB >> 22879757 |
Hui Yang1, Alistair Willis, Anne de Roeck, Bashar Nuseibeh.
Abstract
We describe the Open University team's submission to the 2011 i2b2/VA/Cincinnati Medical Natural Language Processing Challenge, Track 2 Shared Task for sentiment analysis in suicide notes. This Shared Task focused on the development of automatic systems that identify, at the sentence level, affective text of 15 specific emotions from suicide notes. We propose a hybrid model that incorporates a number of natural language processing techniques, including lexicon-based keyword spotting, CRF-based emotion cue identification, and machine learning-based emotion classification. The results generated by different techniques are integrated using different vote-based merging strategies. The automated system performed well against the manually-annotated gold standard, and achieved encouraging results with a micro-averaged F-measure score of 61.39% in textual emotion recognition, which was ranked 1st place out of 24 participant teams in this challenge. The results demonstrate that effective emotion recognition by an automated system is possible when a large annotated corpus is available.Entities:
Keywords: emotion recognition; hybrid model; keyword-based model; machine-learning-based model; result integration
Year: 2012 PMID: 22879757 PMCID: PMC3409477 DOI: 10.4137/BII.S8948
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
The statistics information of the training and test data.
| #Document | 600 | 300 |
| #Document with annotated sentences | 595 | 299 |
| #Sentence | 4633 | 2086 |
| #Annotated sentences | 2173 (46.9%) | 1098 (52.6%) |
The distribution of emotion instances in different sentiment polarities.
| Positive | 483 (19.2%) | 317 (24.9%) |
| Negative | 924 (36.6%) | 469 (36.9%) |
| Neutral | 1115 (44.2%) | 486 (38.2%) |
| Total | 2522 | 1272 |
The distribution of emotion instances in individual emotions.
| Forgiveness | 6 | 8 |
| Pride | 15 | 9 |
| Happiness_peacefulness | 25 | 16 |
| Hopefulness | 47 | 38 |
| Thankfulness | 94 | 45 |
| Love | 296 | 201 |
| Abuse | 9 | 5 |
| Fear | 25 | 13 |
| Sorrow | 51 | 34 |
| Anger | 69 | 26 |
| Blame | 107 | 45 |
| Guilt | 208 | 117 |
| Hopelessness | 455 | 229 |
| Information | 295 | 104 |
| Instructions | 820 | 382 |
The distribution of sentiment polarity changes in the multi-emotion sentences.
| Positive | Positive | 14 | 16 |
| Positive | Negative | 89 | 66 |
| Positive | Neutral | 63 | 32 |
| Negative | Negative | 82 | 38 |
| Negative | Neutral | 94 | 34 |
| Neutral | Neutral | 67 | 15 |
Part of the frequent co-occurred emotion pairs in the multi-emotion sentences.
| Positive/Positive | Love | Thankfulness | 7 | 6 |
| Love | Hopefulness | 3 | 2 | |
| Positive/Negative | Love | Hopelessness | 31 | 16 |
| Love | Guilt | 18 | 20 | |
| Positive/Neutral | Love | Instructions | 40 | 23 |
| Thankfulness | Instructions | 7 | 4 | |
| Negative/Negative | Hopelessness | Guilt | 34 | 20 |
| Anger | Blame | 10 | 4 | |
| Negative/Neutral | Hopelessness | Instructions | 38 | 9 |
| Guilt | Instructions | 16 | 15 | |
| Neutral/Neutral | Information | Instructions | 67 | 15 |
Figure 1.The system architecture diagram.
Figure 2.The cue annotation rates for individual emotions.
The performance of the four ML-based models on the test dataset.
| Thankfulness | 0.6056 | 0.5856 | 0.5954 | 0.5881 | 0.6033 | 0.5956 | 0.5778 | 0.6500 | 0.6188 | 0.5263 | 0.7067 | 0.6033 |
| Love | 0.7620 | 0.5477 | 0.6373 | 0.6850 | 0.7016 | 0.6932 | 0.7762 | 0.5822 | 0.6653 | 0.7687 | 0.5124 | 0.6149 |
| Guilt | 0.5806 | 0.2538 | 0.3532 | 0.4684 | 0.3562 | 0.4046 | 0.3525 | 0.2282 | 0.2770 | 0.5833 | 0.2795 | 0.3779 |
| Hopelessness | 0.7353 | 0.5541 | 0.6319 | 0.6326 | 0.6503 | 0.6413 | 0.6867 | 0.5317 | 0.5993 | 0.7451 | 0.4619 | 0.5702 |
| Information | 0.5314 | 0.3515 | 0.4231 | 0.4096 | 0.5700 | 0.4766 | 0.4811 | 0.4008 | 0.4372 | 0.7619 | 0.3038 | 0.4343 |
| Instruction | 0.7634 | 0.5774 | 0.6574 | 0.6359 | 0.6789 | 0.6567 | 0.7091 | 0.5737 | 0.6342 | 0.7592 | 0.4796 | 0.5878 |
| Micro-average (6 emotions) | 0.7150 | 0.5093 | 0.5949 | 0.5998 | 0.6266 | 0.6129 | 0.6528 | 0.5130 | 0.5745 | 0.7228 | 0.4515 | 0.5558 |
The merged results after three different integration strategies.
| Thankfulness | 0.5257 | 0.8156 | 0.6393 | 0.6055 | 0.6567 | 0.6300 | 0.5500 | 0.7711 | 0.6420 |
| Love | 0.6867 | 0.7563 | 0.7198 | 0.7793 | 0.6399 | 0.7027 | 0.7458 | 0.6767 | 0.7095 |
| Guilt | 0.4655 | 0.4615 | 0.4634 | 0.5000 | 0.2137 | 0.2994 | 0.5052 | 0.4188 | 0.4579 |
| Hopelessness | 0.6410 | 0.7081 | 0.6728 | 0.8028 | 0.5522 | 0.6543 | 0.6627 | 0.6838 | 0.6730 |
| Information | 0.4090 | 0.6485 | 0.5016 | 0.4993 | 0.4181 | 0.4551 | 0.4524 | 0.5073 | 0.4782 |
| Instruction | 0.6849 | 0.7608 | 0.7208 | 0.8079 | 0.5488 | 0.6536 | 0.7032 | 0.7373 | 0.7198 |
| Micro-average (6 emotions) | 0.6106 | 0.7067 | 0.6551 | 0.7294 | 0.5195 | 0.6065 | 0.6489 | 0.6597 | 0.6540 |
The summary of the evaluation of the three submission runs (Expected—the gold-standard results; Predicted—the results that the system predicted).
| Run 1 | 1272 | 1403 | 810 | 0.5780 | 0.6375 | 0.6063 |
| Run 2 | 1272 | 1560 | 865 | 0.5545 | 0.6803 | 0.6108 |
| Run 3 | 1272 | 1419 | 826 | 0.5821 | 0.6493 | 0.6139 |
Emotion-based performance of the best submission (Run 3) (Expected—the gold-standard results; Predicted—the results that the system predicted).
| forgiveness | 8 | 4 | 2 | 0.5000 | 0.2500 | 0.3333 |
| pride | 9 | 10 | 2 | 0.2000 | 0.2222 | 0.2105 |
| happiness_peacefulness | 16 | 14 | 8 | 0.5714 | 0.5000 | 0.5333 |
| hopefulness | 38 | 34 | 10 | 0.2941 | 0.2632 | 0.2778 |
| thankfulness | 45 | 79 | 39 | 0.4937 | 0.8667 | 0.6290 |
| love | 201 | 205 | 147 | 0.7171 | 0.7313 | 0.7241 |
| abuse | 5 | 5 | 1 | 0.2000 | 0.2000 | 0.2000 |
| fear | 13 | 16 | 3 | 0.1875 | 0.2308 | 0.2069 |
| sorrow | 34 | 27 | 7 | 0.2593 | 0.2059 | 0.2295 |
| anger | 26 | 48 | 10 | 0.2083 | 0.3846 | 0.2703 |
| blame | 45 | 58 | 27 | 0.4655 | 0.6000 | 0.5243 |
| guilt | 117 | 123 | 55 | 0.4472 | 0.4701 | 0.4583 |
| hopelessness | 229 | 259 | 164 | 0.6332 | 0.7162 | 0.6721 |
| information | 104 | 150 | 66 | 0.4400 | 0.6346 | 0.5197 |
| instructions | 382 | 390 | 285 | 0.7308 | 0.7461 | 0.7383 |
Figure 3.Contribution of different models to the overall performance.