| Literature DB >> 22879762 |
Ning Yu1, Sandra Kübler, Joshua Herring, Yu-Yin Hsu, Ross Israel, Charese Smiley.
Abstract
DUE TO THE COMPLEXITY OF EMOTIONS IN SUICIDE NOTES AND THE SUBTLE NATURE OF SENTIMENTS, THIS STUDY PROPOSES A FUSION APPROACH TO TACKLE THE CHALLENGE OF SENTIMENT CLASSIFICATION IN SUICIDE NOTES: leveraging WordNet-based lexicons, manually created rules, character-based n-grams, and other linguistic features. Although our results are not satisfying, some valuable lessons are learned and promising future directions are identified.Entities:
Keywords: character n-grams; dependency parsing; fusion
Year: 2012 PMID: 22879762 PMCID: PMC3409490 DOI: 10.4137/BII.S8949
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Training data distribution over classes (number of sentences).
| Abuse | 9 | 14 |
| Fear | 25 | 25 |
| Sorrow | 51 | 60 |
| Anger | 69 | 84 |
| Blame | 107 | 117 |
| Guilt | 206 | 223 |
| Hopelessness | 455 | 478 |
| Forgiveness | 6 | 6 |
| Pride | 15 | 18 |
| Happiness-peacefulness | 25 | 28 |
| Hopefulness | 47 | 48 |
| Thankfulness | 91 | 105 |
| Love | 290 | 311 |
| Information | 294 | 312 |
| Instructions | 813 | 863 |
| Other | 2460 | 2460 |
Performance of different classifiers trained on re-segmented training data.
| Ad-hoc (1) | 0.31 | 0.26 | 0.37 |
| Word trigrams (2) | 0.38 | 0.49 | 0.31 |
| Word trigrams w/placeholder (3) | 0.38 | 0.50 | 0.31 |
| POS tag trigrams (4) | 0.32 | 0.40 | 0.26 |
| Dependency relations (5) | 0.40 | 0.53 | 0.33 |
| Character 8-grams (6) | 0.41 | 0.32 | |
| Character 8-grams w/placeholder (7) | 0.41 | 0.32 | |
| Fusion: 1 + 5 | 0.38 | 0.32 | 0.46 |
| Fusion: 1 + 7 | 0.40 | 0.29 | |
| Fusion: 5 + 7 | 0.45 | 0.40 | |
| Fusion: 1 + 5 + 7 | 0.40 | 0.32 | 0.53 |
Performance of 8-gram character-based classifiers trained on original and re-segmented training data.
| Re-segmented | 0.41 | 0.57 | 0.32 |
| Original | 0.42 | 0.55 | 0.34 |
Performance of 8-gram character-based classifiers with and without the OTHER class.
| Without OTHER | 0.44 | 0.40 | 0.48 |
| With OTHER | 0.42 | 0.55 | 0.34 |