| Literature DB >> 22879776 |
Kirk Roberts1, Sanda M Harabagiu.
Abstract
In this paper we report on the approaches that we developed for the 2011 i2b2 Shared Task on Sentiment Analysis of Suicide Notes. We have cast the problem of detecting emotions in suicide notes as a supervised multi-label classification problem. Our classifiers use a variety of features based on (a) lexical indicators, (b) topic scores, and (c) similarity measures. Our best submission has a precision of 0.551, a recall of 0.485, and a F-measure of 0.516.Entities:
Keywords: sentiment classification; similarity method; statistical method; suicide notes
Year: 2012 PMID: 22879776 PMCID: PMC3409476 DOI: 10.4137/BII.S8958
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Representative phrases chosen from the i2b2 training data by statistical phrase discovery.
| I | notify | my funeral | I would like |
| H | helpless | any more | can’t go on |
| L | Bless | love Jane | you have been |
| I | debts | life insurance | in my purse |
| G | fault | Please forgive | God have mercy |
| B | wreck | trouble with | you have done |
Top words for each topic determined by LDA from the i2b2 training data.
| i | My | i | johnson | my | i | you | they | i | who |
| me | you | am | john | i | my | i | food | you | your |
| we | find | so | cincinnati | johnson | have | my | who | have | father |
| up | box | way | burnet | john | been | love | their | do | god |
| not | one | not | jane | take | has | john | where | know | pray |
| out | two | do | my | bill | life | jane | too | what | peter |
| my | car | sorry | smith | money | no | me | down | me | world |
| when | which | ca | ohio | pay | death | good | mountain | would | let |
| go | also | no | please | want | which | your | destroyed | not | lord |
| would | book | just | hospital | give | own | please | then | can | people |
| here | check | out | bill | have | one | have | just | could | days |
| over | dollars | have | children | funeral | time | forgive | many | only | where |
| after | get | more | mary | insurance | years | dear | business | want | man |
| should | other | me | ave | body | made | god | carry | one | other |
| time | purse | my | signed | they | last | so | family | did | soul |
| last | here | going | call | jane | long | much | up | than | these |
| years | back | want | january | please | since | very | warm | your | death |
| again | letter | take | give | not | being | mother | wear | like | light |
| night | bank | go | oh | can | things | mary | along | say | no |
| one | keys | know | phone | no | who | always | between | about | before |
Figure 1.System architecture.
Features selected through automatic feature selection.
| SentenceUnigrams |
| StatisticalLabeledPhrases(2, 2.0) |
| StatisticalLabeledPhrases(3, 3.0) |
| StatisticalUnlabeledWordSum( |
| StatisticalUnlabeledWordStrongest( |
| TopicScore(8) |
| Mostcommonemotion( |
| StrongestEmotionScore(LOVE, |
| StrongestEmotionScore(SORROW, |
| StrongestEmotionScore(SORROW, |
| StrongestEmotionScore(ANGER, |
| StrongestEmotionScore(NONE, |
Official results for our three submissions as well as the mean, median, and bag-of-words result.
| SVMBinary | 1120 | 0.55089 | 0.48506 | 0.51589 |
| SVMBinaryTop5 | 1048 | 0.55725 | 0.45912 | 0.50345 |
| SVMMulti | 1020 | 0.54020 | 0.43318 | 0.48080 |
| Mean submission | 0.4875 | |||
| Median submission | 0.5027 | |||
| Bag-of-words | 2108 | 0.34108 | 0.56525 | 0.42544 |
Detail per-emotion results for SVMBinary submission.
| A | 0 | 0 | 5 | 0 | 0 | 0 |
| A | 1 | 2 | 25 | 0.33 | 0.04 | 0.07 |
| B | 3 | 4 | 42 | 0.43 | 0.07 | 0.12 |
| F | 0 | 0 | 13 | 0 | 0 | 0 |
| F | 0 | 1 | 8 | 0 | 0 | 0 |
| G | 50 | 64 | 67 | 0.44 | 0.43 | 0.43 |
| H | 1 | 1 | 15 | 0.50 | 0.06 | 0.11 |
| H | 1 | 6 | 37 | 0.14 | 0.03 | 0.04 |
| H | 122 | 97 | 107 | 0.56 | 0.53 | 0.54 |
| I | 40 | 83 | 64 | 0.33 | 0.38 | 0.35 |
| I | 241 | 168 | 141 | 0.59 | 0.63 | 0.61 |
| L | 136 | 65 | 65 | 0.68 | 0.68 | 0.68 |
| P | 0 | 0 | 9 | 0 | 0 | 0 |
| S | 0 | 4 | 34 | 0 | 0 | 0 |
| T | 26 | 14 | 19 | 0.65 | 0.58 | 0.61 |