| Literature DB >> 22879760 |
Richard Wicentowski1, Matthew R Sydes.
Abstract
An ensemble of supervised maximum entropy classifiers can accurately detect and identify sentiments expressed in suicide notes. Using lexical and syntactic features extracted from a training set of externally annotated suicide notes, we trained separate classifiers for each of fifteen pre-specified emotions. This formed part of the 2011 i2b2 NLP Shared Task, Track 2. The precision and recall of these classifiers related strongly with the number of occurrences of each emotion in the training data. Evaluating on previously unseen test data, our best system achieved an F(1) score of 0.534.Entities:
Keywords: emotion classification; natural language processing; suicide notes; text analysis
Year: 2012 PMID: 22879760 PMCID: PMC3409489 DOI: 10.4137/BII.S8972
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
List of the annotated emotions in the dataset, as well as the frequency of occurrence of each of the emotions in the training set and the test set and the annotation guidelines. The log of the test/train ratio illustrates whether the training set represented the test set. Positive numbers indicate an overrepresentation of the emotion in the test set, zero indicates equal representation and negative indicates underrepresentation.
| Instructions | 820 | 17.7% | 382 | 18.3% | 0.05 | Giving directions on what to do next ... |
| Hopelessness | 455 | 9.8% | 229 | 12.2% | 0.16 | Feels hopeless ... |
| Love | 296 | 6.4% | 201 | 9.6% | 0.59 | Feels love for someone ... |
| Information | 295 | 6.4% | 104 | 5.0% | −0.35 | Giving practical information where things stand ... |
| Guilt | 208 | 4.5% | 117 | 5.6% | 0.32 | Feels guilt ... |
| Blame | 107 | 2.3% | 45 | 2.2% | −0.10 | Is blaming someone ... |
| Thankfulness | 94 | 2.0% | 45 | 2.2% | 0.09 | Is thanking someone ... |
| Anger | 69 | 1.5% | 26 | 1.3% | −0.26 | Is angry with someone ... |
| Sorrow | 51 | 1.1% | 34 | 1.6% | 0.57 | Feels sorrow ... |
| Hopefulness | 47 | 1.0% | 38 | 1.8% | 0.84 | Has hope for future ... |
| Happiness | 25 | 0.5% | 16 | 0.8% | 0.51 | Is feeling happy or peaceful ... |
| Fear | 25 | 0.5% | 13 | 0.6% | 0.21 | Is afraid of something ... |
| Pride | 15 | 0.3% | 9 | 0.4% | 0.41 | Feels pride ... |
| Abuse | 9 | 0.2% | 5 | 0.2% | 0.30 | Was abused verbally, physically, mentally ... |
| Forgiveness | 6 | 0.1% | 8 | 0.4% | 1.57 | Is forgiving someone ... |
|
| ||||||
| Total sentences | 4633 | – | 2086 | – | – | |
| Labeled sentences | 2173 | 46.9% | 1098 | 52.6% | 0.17 | |
| Labels assigned | 2522 | 3.6% | 1272 | 4.0% | 0.16 | |
| Unannotated notes | 5 | 0.8% | 1 | 0.3% | −1.32 | |
Key to the features used in the classifiers. For example, the notation “W1–1/C0–0/L-/M-/D-/O-” indicates a feature set using only unigrams as features and “W1–2/C0–0/L+/M+/Dd/Oe” indicates a feature set using word unigrams and bigrams, spelling correction, stemming, dependency relations and a split point optimized for each emotion. See the Methods section for a complete description of these features.
| W | Word |
| C | Prefix/suffix character |
| L+/L− | Uses spelling correction and contraction normalization (L+) or not (L−) |
| M+/M− | Uses stemming (M+) or does not (M−) |
| Dd/Dv/D− | Uses dependency relations only (Dd), with variable dependency relations (Dv) or neither (D−) |
| Oa/Oe/O− | Optimizes a split point for all classifiers (Oa), for each emotion (Oe) or not at all (O−) |
For every feature set, fine tuning a single split point for all 15 emotion classifiers yielded a decrease in precision, an increase in recall and a marked increase in F1 in the cross-validated training set. Two representative feature sets are illustrated above without optimization (split point = 0.500) and with optimization.
|
| 0.500 | 935 | 731 | 1587 | 56.12 | 37.07 | 44.65 |
| 0.342 | 1113 | 1229 | 1409 | 47.52 | 44.13 | 45.76 | |
|
| 0.500 | 892 | 408 | 1630 | 68.62 | 35.37 | 46.68 |
| 0.170 | 1228 | 980 | 1294 | 55.62 | 48.69 | 51.92 |
Relative performance difference in the cross-validated training set of including a single new feature relative to the baseline system that used only unigrams, evaluated using the “Oa” optimization, sorted by F1. See Table 2 for an explanation of the notation used to describe the features.
| D- to Dv |
| 1175 | 1590 | 1347 | 42.50 | 46.59 | 44.45 |
| L− to L+ |
| 1054 | 1059 | 1468 | 49.88 | 41.79 | 45.48 |
| Baseline |
| 1113 | 1229 | 1409 | 47.52 | 44.13 | 45.76 |
| C0–0 to C2–6 |
| 1176 | 1378 | 1346 | 46.05 | 46.63 | 46.34 |
| D- to Dd |
| 1170 | 1264 | 1352 | 48.07 | 46.39 | 47.22 |
| M− to M+ |
| 1126 | 1121 | 1396 | 50.11 | 44.65 | 47.22 |
| W1–1 to W1–2 |
| 1168 | 1175 | 1354 | 49.85 | 46.31 | 48.02 |
Performance of the best single classifier, W1–2/C0–0/L+/M+/Dd, using 5-fold cross-validation on the training data, with a single optimized split point = 0.198 and sigma = 0.5, sorted by F1 compared with the performance of the same classifier when not providing labels for emotions with small amounts of training data. The latter is equivalent to system S1.
| Love | 203 | 125 | 93 | 61.89 | 68.58 | 65.06 | 203 | 125 | 93 | 61.89 | 68.58 | 65.06 |
| Instructions | 571 | 384 | 249 | 59.79 | 69.63 | 64.34 | 571 | 384 | 249 | 59.79 | 69.63 | 64.34 |
| Thankfulness | 44 | 16 | 50 | 73.33 | 46.81 | 57.14 | 44 | 16 | 50 | 73.33 | 46.81 | 57.14 |
| Hopelessness | 244 | 212 | 211 | 53.51 | 53.63 | 53.57 | 244 | 212 | 211 | 53.51 | 53.63 | 53.57 |
| Guilt | 87 | 98 | 121 | 47.03 | 41.83 | 44.27 | 87 | 98 | 121 | 47.03 | 41.83 | 44.27 |
| Information | 103 | 134 | 192 | 43.46 | 34.92 | 38.72 | 103 | 134 | 192 | 43.46 | 34.92 | 38.72 |
| Blame | 5 | 18 | 102 | 21.74 | 4.67 | 7.69 | 0 | 0 | 107 | – | 0.00 | – |
| Hopefulness | 1 | 6 | 46 | 14.29 | 2.13 | 3.70 | 0 | 0 | 47 | – | 0.00 | – |
| Abuse | 0 | 1 | 9 | 0.00 | 0.00 | 0.00 | 0 | 0 | 9 | – | 0.00 | – |
| Fear | 0 | 0 | 25 | – | 0.00 | – | 0 | 0 | 25 | – | 0.00 | – |
| Forgiveness | 0 | 0 | 6 | – | 0.00 | – | 0 | 0 | 6 | – | 0.00 | – |
| Anger | 0 | 4 | 69 | 0.00 | 0.00 | 0.00 | 0 | 0 | 69 | – | 0.00 | – |
| Pride | 0 | 0 | 15 | – | 0.00 | – | 0 | 0 | 15 | – | 0.00 | – |
| Happiness | 0 | 2 | 25 | 0.00 | 0.00 | 0.00 | 0 | 0 | 25 | 0.00 | 0.00 | 0.00 |
| Sorrow | 0 | 6 | 51 | 0.00 | 0.00 | 0.00 | 0 | 0 | 51 | – | 0.00 | – |
|
| ||||||||||||
| Total | 1258 | 1006 | 1264 | 55.57 | 49.88 | 52.57 | 1252 | 969 | 1270 | 56.37 | 49.64 | 52.79 |
Average relative performance difference on cross-validated training data between classifiers trained with and without the feature listed, sorted from least effective to most effective. Positive values indicate that, on average, the classifier improved with the addition of that feature.
| Spelling correction (L+) | −0.042 | 1.652 | 0.164 | 1.704 | 0.069 | 0.405 |
| Character n-grams (C2–6) | −0.511 | 2.664 | 1.313 | 2.142 | 0.461 | 1.495 |
| Stemming (M+) | 0.628 | 1.663 | 0.861 | 1.936 | 0.749 | 0.710 |
| Variable dependencies (Dv) | −1.521 | 2.699 | 2.887 | 1.837 | 0.751 | 1.196 |
| Dependencies (Dd) | 1.310 | 1.801 | 2.434 | 1.578 | 1.920 | 0.480 |
| Bigrams (W1–2) | 4.199 | 2.573 | 0.844 | 2.078 | 2.473 | 1.069 |
Note: All classifiers were evaluated using the “Oa” optimization.
Performance of the three classifiers used for official submissions. Sorted by F1 on the test data, results are presented for both using 5-fold cross-validation on the training data and on the test data. Note that the Memorize labels heuristic, used in S2 and S3 was not applied on cross-validation.
| Love | 203 | 125 | 93 | 61.89 | 68.58 | 65.06 | 135 | 63 | 66 | 68.18 | 67.16 | 67.67 |
| Thankfulness | 44 | 16 | 50 | 73.33 | 46.81 | 57.14 | 30 | 17 | 15 | 63.83 | 66.67 | 65.22 |
| Instructions | 571 | 384 | 249 | 59.79 | 69.63 | 64.34 | 253 | 158 | 129 | 61.56 | 66.23 | 63.81 |
| Hopelessness | 244 | 212 | 211 | 53.51 | 53.63 | 53.37 | 138 | 114 | 91 | 54.76 | 60.26 | 57.38 |
| Guilt | 87 | 98 | 121 | 47.03 | 41.83 | 44.27 | 39 | 36 | 78 | 52.00 | 33.33 | 40.62 |
| Information | 103 | 134 | 192 | 43.46 | 34.92 | 38.72 | 36 | 71 | 68 | 33.64 | 34.62 | 34.12 |
|
| ||||||||||||
| 0 | 0 | 354 | – | 0.00 | – | 0 | 0 | 194 | – | 0.00 | – | |
|
| ||||||||||||
| Total | 1252 | 969 | 1270 | 56.37 | 49.64 | 631 | 459 | 641 | 57.89 | 49.61 | ||
|
| ||||||||||||
| Love | 198 | 111 | 98 | 64.08 | 66.89 | 65.45 | 130 | 56 | 71 | 69.89 | 64.68 | 67.18 |
| Instructions | 523 | 281 | 297 | 65.05 | 63.78 | 64.41 | 224 | 112 | 158 | 66.67 | 58.64 | 62.40 |
| Thankfulness | 50 | 28 | 44 | 64.10 | 53.19 | 58.14 | 33 | 28 | 12 | 54.10 | 73.33 | 62.26 |
| Hopelessness | 250 | 216 | 205 | 53.65 | 54.95 | 54.29 | 137 | 117 | 92 | 53.94 | 59.83 | 56.73 |
| Guilt | 98 | 136 | 110 | 41.88 | 47.12 | 44.34 | 36 | 34 | 81 | 51.43 | 30.77 | 38.50 |
| Information | 109 | 149 | 186 | 42.25 | 36.95 | 39.42 | 38 | 74 | 66 | 33.93 | 36.54 | 35.19 |
|
| ||||||||||||
| 0 | 0 | 354 | – | 0.00 | – | 1 | 2 | 193 | 0.33 | 0.01 | 0.01 | |
|
| ||||||||||||
| Total | 1228 | 921 | 1294 | 57.14 | 48.69 | 599 | 423 | 673 | 58.61 | 47.09 | ||
|
| ||||||||||||
| Love | 164 | 63 | 132 | 72.25 | 55.41 | 62.72 | 110 | 27 | 91 | 80.29 | 54.73 | 65.09 |
| Instructions | 441 | 178 | 379 | 71.24 | 53.78 | 61.29 | 182 | 68 | 200 | 72.80 | 47.64 | 57.59 |
| Thankfulness | 29 | 9 | 65 | 76.32 | 30.85 | 43.94 | 22 | 12 | 23 | 64.71 | 48.89 | 55.70 |
| Hopelessness | 186 | 92 | 269 | 66.91 | 40.88 | 50.75 | 100 | 49 | 129 | 67.11 | 43.67 | 52.91 |
| Information | 70 | 52 | 225 | 57.38 | 23.73 | 33.57 | 24 | 29 | 80 | 45.28 | 23.08 | 30.57 |
| Guilt | 63 | 51 | 145 | 55.26 | 30.29 | 39.13 | 21 | 18 | 96 | 53.85 | 17.95 | 26.92 |
|
| ||||||||||||
| 0 | 0 | 354 | 0.00 | 0.00 | 0.00 | 1 | 2 | 193 | 0.33 | 0.01 | 0.01 | |
|
| ||||||||||||
| Total | 953 | 445 | 1569 | 68.17 | 37.79 | 460 | 205 | 812 | 69.17 | 36.16 | ||