| Literature DB >> 22879768 |
Bart Desmet1, Véronique Hoste.
Abstract
This paper describes a system for automatic emotion classification, developed for the 2011 i2b2 Natural Language Processing Challenge, Track 2. The objective of the shared task was to label suicide notes with 15 relevant emotions on the sentence level. Our system uses 15 SVM models (one for each emotion) using the combination of features that was found to perform best on a given emotion. Features included lemmas and trigram bag of words, and information from semantic resources such as WordNet, SentiWordNet and subjectivity clues. The best-performing system labeled 7 of the 15 emotions and achieved an F-score of 53.31% on the test data.Entities:
Keywords: emotion classification; machine learning; suicide; suicide notes; topic classification
Year: 2012 PMID: 22879768 PMCID: PMC3409478 DOI: 10.4137/BII.S8960
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Distribution of emotions in training and test set: average number of annotations per 1000 sentences.
| Instructions | 177.0 | 183.1 |
| Hopelessness | 98.2 | 109.8 |
| Love | 63.9 | 96.4 |
| Information | 63.7 | 49.9 |
| Guilt | 44.9 | 56.1 |
| Blame | 23.1 | 21.6 |
| Thankfulness | 20.3 | 21.6 |
| Anger | 14.9 | 12.5 |
| Sorrow | 11.0 | 16.3 |
| Hopefulness | 10.1 | 18.2 |
| Happiness | 5.4 | 7.7 |
| Fear | 5.4 | 6.2 |
| Pride | 3.2 | 4.3 |
| Abuse | 1.9 | 2.4 |
| Forgiveness | 1.3 | 3.8 |
Note: Sorted by frequency in the training set.
Micro-averaged F-scores on the training and test set for all emotions, and the 7 best-performing emotions (pruned).
| All emotions | 49.11 | 51.19 |
| Pruned emotions | 51.04 | 53.31 |