Fabon Dzogang, Marie-Jeanne Lesot, Maria Rifqi, Bernadette Bouchon-Meunier.
Abstract
We study the discrimination of emotions annotated in free texts at the sentence level: a sentence can be associated either with no emotion (neutral) or with multiple emotion labels. The proposed system relies on three characteristics. First, we implement an early fusion of n-grams of increasing order, transposing an approach successfully employed in the related task of opinion mining. Second, we apply a filtering process that extracts frequent n-grams and uses Shannon's entropy measure, respectively to keep dictionaries at balanced sizes and to retain emotion-specific features. Finally, the overall system is implemented as a 2-step decision process: a first classifier discriminates between neutral and emotion-bearing sentences, then one classifier per emotion is applied to the emotion-bearing sentences. The final decision is given by the classifier with the maximum confidence. Results obtained on the testing set are promising.
Keywords: emotion mining; entropy; fusion; n-grams; text analysis
Year: 2012 PMID: 22879769 PMCID: PMC3409481 DOI: 10.4137/BII.S8973
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
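The filtering step described in the abstract (frequent n-grams, then Shannon entropy to keep emotion-specific features) can be illustrated with a minimal sketch. The function names, the thresholds `min_freq` and `max_entropy`, and the single-label toy sentences are assumptions made for illustration; the abstract does not give the authors' exact procedure or parameter values.

```python
import math
from collections import Counter, defaultdict

def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) found in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def label_entropy(counts):
    """Shannon entropy (in bits) of an n-gram's distribution over labels."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values() if c)

def select_features(sentences, n=1, min_freq=2, max_entropy=0.5):
    """Keep n-grams that are frequent and concentrated on few labels.

    `sentences` is a list of (tokens, label) pairs. `min_freq` and
    `max_entropy` are hypothetical thresholds, not values from the paper.
    """
    per_gram = defaultdict(Counter)
    for tokens, label in sentences:
        for gram in ngrams(tokens, n):
            per_gram[gram][label] += 1
    return {
        gram for gram, counts in per_gram.items()
        if sum(counts.values()) >= min_freq and label_entropy(counts) <= max_entropy
    }

# Toy data (invented for this example, one label per sentence).
toy = [
    (["thank", "you", "dear"], "thankfulness"),
    (["thank", "you", "all"], "thankfulness"),
    (["forgive", "you", "please"], "guilt"),
    (["please", "forgive", "me"], "guilt"),
    (["please", "call", "him"], "instruction"),
]
selected = select_features(toy)
print(sorted(selected))
```

In this toy run, "thank" and "forgive" each occur twice under a single label (entropy 0) and are kept, whereas "you" and "please" are frequent but spread over two labels and are discarded.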
Number of occurrences of each emotion in the training set in decreasing order.
| Emotion | Occurrences |
| No emotion | 2460 |
| Instruction | 800 |
| Hopelessness | 455 |
| Love | 296 |
| Information | 295 |
| Guilt | 208 |
| Blame | 107 |
| Thankfulness | 94 |
| Anger | 69 |
| Sorrow | 51 |
| Hopefulness | 47 |
| Happiness/Peacefulness | 25 |
| Fear | 25 |
| Pride | 15 |
| Abuse | 9 |
| Forgiveness | 6 |
Figure 1. Architecture of the proposed 2-step system: pre-processed sentences are run through a neutral vs. emotion classifier. Emotion-bearing sentences are then run through M further classifiers, and the one with the highest confidence wins.
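The 2-step decision process in Figure 1 can be sketched as follows. The keyword-based scorers are toy stand-ins invented for this example (the feature table elsewhere in this record indicates the final classifiers are SVMs), and all function names are hypothetical.

```python
def two_step_predict(sentence, neutral_vs_emotion, emotion_classifiers):
    """Label a sentence with the 2-step scheme.

    `neutral_vs_emotion(sentence)` returns True for an emotion-bearing sentence.
    `emotion_classifiers` maps each emotion to a function returning a confidence.
    """
    # Step 1: filter out neutral sentences.
    if not neutral_vs_emotion(sentence):
        return "No emotion"
    # Step 2: run every per-emotion classifier and keep the most confident one.
    scores = {emotion: clf(sentence) for emotion, clf in emotion_classifiers.items()}
    return max(scores, key=scores.get)

def keyword_scorer(keywords):
    """Toy confidence: fraction of tokens that belong to `keywords`."""
    def score(sentence):
        tokens = sentence.lower().split()
        return sum(tokens.count(k) for k in keywords) / max(len(tokens), 1)
    return score

# Hypothetical stand-in classifiers; real ones would be trained models.
emotion_classifiers = {
    "Thankfulness": keyword_scorer({"thank", "appreciate"}),
    "Guilt": keyword_scorer({"sorry", "forgive"}),
    "Instruction": keyword_scorer({"call", "please", "notify"}),
}

def neutral_vs_emotion(sentence):
    """Toy first-stage classifier: emotion-bearing if any scorer fires."""
    return any(clf(sentence) > 0 for clf in emotion_classifiers.values())

first = two_step_predict("please forgive me i am sorry", neutral_vs_emotion, emotion_classifiers)
second = two_step_predict("the key is in the drawer", neutral_vs_emotion, emotion_classifiers)
print(first, "|", second)
```

Note that taking the single most confident classifier yields one label per emotion-bearing sentence, which matches the decision rule stated in the caption.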
Unigrams: averaged F1 score, precision and recall along with standard deviations (sorted by emotions’ frequencies).
| Emotion | F1 | Precision | Recall |
| No emotion | 0.68 ± 0.02 | 0.71 ± 0.02 | 0.66 ± 0.03 |
| Instruction | 0.85 ± 0.02 | 0.86 ± 0.03 | 0.84 ± 0.03 |
| Hopelessness | 0.69 ± 0.04 | 0.63 ± 0.06 | 0.76 ± 0.05 |
| Love | 0.76 ± 0.04 | 0.73 ± 0.05 | 0.8 ± 0.07 |
| Information | 0.54 ± 0.04 | 0.45 ± 0.04 | 0.68 ± 0.07 |
| Guilt | 0.52 ± 0.09 | 0.42 ± 0.08 | 0.66 ± 0.1 |
| Blame | 0.23 ± 0.09 | 0.19 ± 0.07 | 0.31 ± 0.15 |
| Thankfulness | 0.98 ± 0 | 0.99 ± 0.01 | 0.98 ± 0.01 |
| Anger | 0.17 ± 0.06 | 0.12 ± 0.04 | 0.29 ± 0.11 |
| Sorrow | 0.17 ± 0 | 0.14 ± 0.01 | 0.22 ± 0.03 |
| Hopefulness | 0.24 ± 0.11 | 0.18 ± 0.08 | 0.38 ± 0.16 |
| Happiness/Peacefulness | 0.19 ± 0.13 | 0.19 ± 0.11 | 0.2 ± 0.15 |
| Fear | 0.19 ± 0.08 | 0.19 ± 0.04 | 0.19 ± 0.12 |
| Pride | 0.11 ± 0.04 | 0.06 ± 0.02 | 0.4 ± 0.2 |
| Abuse | 0.02 ± 0 | 0.01 ± 0 | 0.44 ± 0.19 |
| Forgiveness | 0.26 ± 0.1 | 0.16 ± 0.06 | 0.83 ± 0.29 |
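As a reading aid, the F1 score is the harmonic mean of precision and recall. The sketch below recomputes it from the averaged precision and recall of two rows; since the table reports F1 averaged over runs, the recomputed value is only expected to agree approximately. The function name is an assumption made for this example.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; None when both are zero."""
    if precision + recall == 0:
        return None  # undefined, which would correspond to an N/A entry
    return 2 * precision * recall / (precision + recall)

# Values taken from the unigram table: (precision, recall).
no_emotion = f1_score(0.71, 0.66)
instruction = f1_score(0.86, 0.84)
print(round(no_emotion, 2), round(instruction, 2))
```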
Trigrams: averaged F1 score, precision and recall along with standard deviations (sorted by emotions’ frequencies).
| Emotion | F1 | Precision | Recall |
| No emotion | 0.6 ± 0.02 | 0.85 ± 0.02 | 0.47 ± 0.02 |
| Instruction | 0.53 ± 0.07 | 0.61 ± 0.09 | 0.47 ± 0.08 |
| Hopelessness | 0.8 ± 0.02 | 0.73 ± 0.03 | 0.88 ± 0.02 |
| Love | 0.22 ± 0.06 | 0.34 ± 0.15 | 0.17 ± 0.03 |
| Information | 0.53 ± 0.04 | 0.63 ± 0.09 | 0.47 ± 0.04 |
| Guilt | 0.37 ± 0.06 | 0.4 ± 0.04 | 0.35 ± 0.08 |
| Blame | 0.1 ± 0.01 | 0.05 ± 0 | 0.77 ± 0.02 |
| Thankfulness | 0.03 ± 0.01 | 0.02 ± 0.01 | 0.51 ± 0.15 |
| Anger | 0.4 ± 0.08 | 0.51 ± 0.07 | 0.33 ± 0.08 |
| Sorrow | 0.98 ± 0.01 | 0.97 ± 0 | 0.99 ± 0.01 |
| Hopefulness | N/A | 0 ± 0 | 0 ± 0 |
| Happiness/Peacefulness | 0.04 ± 0.01 | 0.02 ± 0 | 0.49 ± 0.08 |
| Fear | N/A | 0 ± 0 | 0 ± 0 |
| Pride | N/A | 0 ± 0 | 0 ± 0 |
| Abuse | N/A | 0 ± 0 | 0 ± 0 |
| Forgiveness | N/A | 0 ± 0 | 0 ± 0 |
Fusion of unigrams and bigrams: averaged F1 score, precision and recall along with standard deviations (sorted by emotion labels’ frequencies).
| Emotion | F1 | Precision | Recall |
| No emotion | 0.73 ± 0.04 | 0.8 ± 0.03 | 0.68 ± 0.05 |
| Instruction | 0.85 ± 0.02 | 0.85 ± 0.03 | 0.86 ± 0.03 |
| Hopelessness | 0.68 ± 0.04 | 0.68 ± 0.04 | 0.69 ± 0.06 |
| Love | 0.78 ± 0.06 | 0.78 ± 0.07 | 0.78 ± 0.08 |
| Information | 0.54 ± 0.08 | 0.52 ± 0.06 | 0.58 ± 0.12 |
| Guilt | 0.53 ± 0.07 | 0.51 ± 0.08 | 0.55 ± 0.08 |
| Blame | 0.32 ± 0.09 | 0.34 ± 0.11 | 0.31 ± 0.08 |
| Thankfulness | 0.99 ± 0 | 0.98 ± 0.01 | 0.99 ± 0.01 |
| Anger | 0.2 ± 0.1 | 0.18 ± 0.08 | 0.23 ± 0.14 |
| Sorrow | 0.16 ± 0.05 | 0.1 ± 0.03 | 0.39 ± 0.12 |
| Hopefulness | 0.23 ± 0.07 | 0.16 ± 0.05 | 0.38 ± 0.1 |
| Happiness/Peacefulness | 0.16 ± 0.12 | 0.12 ± 0.07 | 0.29 ± 0.29 |
| Fear | 0.21 ± 0.05 | 0.23 ± 0.06 | 0.2 ± 0.07 |
| Pride | 0.08 ± 0.02 | 0.05 ± 0.01 | 0.27 ± 0.12 |
| Abuse | 0.02 ± 0 | 0.01 ± 0 | 0.44 ± 0.19 |
| Forgiveness | 0.24 ± 0.1 | 0.14 ± 0.07 | 0.83 ± 0.29 |
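Early fusion of unigrams and bigrams means that n-grams of both orders are merged into a single feature representation before classification, rather than combining the outputs of separate classifiers. A minimal sketch, assuming a simple bag-of-n-grams count representation (the names and the example tokens are illustrative, not the authors' implementation):

```python
from collections import Counter

def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) found in a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

def early_fusion_features(tokens, max_order=2):
    """Merge the counts of all n-grams of order 1..max_order into one bag."""
    features = Counter()
    for n in range(1, max_order + 1):
        features.update(ngrams(tokens, n))
    return features

feats = early_fusion_features(["god", "bless", "you"])
print(feats)
```

A single classifier trained on such a bag sees unigram and bigram evidence jointly, which is what distinguishes early fusion from combining per-order classifiers after the fact.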
Best ranked SVM features (top 7 unigrams and top 7 bigrams) for final classifiers with F1 scores higher than 0.3. Sorted by classifiers’ performance in decreasing order.
| Emotion | Top 7 unigrams | Top 7 bigrams |
| Thankfulness | thank/appreciate/than/nice/effort/kindness/under | be swell/than you/you dear/appreciate it/too ./have be/for your |
| Instruction | cremate/call/please/sell/funeral/teach/notify | to be/forget me/be good/to have/bury me/dispose of/care of |
| Love | love/wonderful/bless/watch/beloved/most/loving | you ./do ./be wonderful/love you/god bless/your john/me on |
| Hopelessness | cancer/am/suffer/die/struggle/everybody/tired | without you/go on/dear jane/can not/. my/be ./of all |
| Information | bldg/insurance/key/paper/owe/ticket/in | of cincinnati/be pay/ohio ./in this/no ./and my/the key |
| Guilt | sorry/forgive/excuse/fail/hurt/could/burden | have be/forgive me/please forgive/have do/understand ./not to/to help |
| Blame | sorry/thank/love/please/give/wish/go | to be/cause you/of it/you ./you to/in the/to go |
Results of the final system on the testing set made of 300 notes.
| 0.47 | 0.46 | 0.49 |
Bigrams: averaged F1 score, precision and recall along with standard deviations (sorted by emotions’ frequencies).
| Emotion | F1 | Precision | Recall |
| No emotion | 0.72 ± 0.03 | 0.84 ± 0.03 | 0.63 ± 0.04 |
| Instruction | 0.82 ± 0.01 | 0.8 ± 0.02 | 0.84 ± 0.03 |
| Hopelessness | 0.64 ± 0.05 | 0.66 ± 0.04 | 0.62 ± 0.08 |
| Love | 0.74 ± 0.07 | 0.76 ± 0.08 | 0.72 ± 0.08 |
| Information | 0.47 ± 0.1 | 0.43 ± 0.09 | 0.53 ± 0.14 |
| Guilt | 0.5 ± 0.08 | 0.5 ± 0.08 | 0.5 ± 0.09 |
| Blame | 0.28 ± 0.1 | 0.27 ± 0.08 | 0.32 ± 0.14 |
| Thankfulness | 0.98 ± 0.01 | 0.98 ± 0.01 | 0.99 ± 0.01 |
| Anger | 0.14 ± 0.01 | 0.11 ± 0.01 | 0.2 ± 0.02 |
| Sorrow | 0.05 ± 0.02 | 0.03 ± 0.01 | 0.16 ± 0.09 |
| Hopefulness | 0.2 ± 0.1 | 0.2 ± 0.06 | 0.21 ± 0.13 |
| Happiness/Peacefulness | 0.15 ± 0.04 | 0.26 ± 0.21 | 0.12 ± 0.01 |
| Fear | 0.13 ± 0.06 | 0.11 ± 0.04 | 0.16 ± 0.08 |
| Pride | N/A | 0 ± 0 | 0 ± 0 |
| Abuse | N/A | 0 ± 0 | 0 ± 0 |
| Forgiveness | N/A | 0 ± 0 | 0 ± 0 |