Eric Yeh, William Jarrold, Joshua Jordan.
Abstract
We describe the submission entered by SRI International and UC Davis for the I2B2 NLP Challenge Track 2. Our system is based on a machine learning approach and employs a combination of lexical, syntactic, and psycholinguistic features. In addition, we model the sequence of emotions found in the notes and the locations at which they occur. We discuss the effect of these features on the emotion annotation task, as well as the nature of the notes themselves. We also explore the use of bootstrapping to help account for what appeared to be annotator fatigue in the data. We conclude with a discussion of future avenues for improving the approach, and of how annotations at the word span level may be more appropriate for this task than annotations at the sentence level.
Keywords: emotion detection; natural language processing; psycholinguistic resources; suicide note
Year: 2012 PMID: 22879772 PMCID: PMC3409487 DOI: 10.4137/BII.S8979
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Distribution of the number of emotion annotations observed per sentence.
| Annotations per sentence | Sentences | Percentage |
| --- | --- | --- |
| 0 | 2460 | 53% |
| 1 | 1871 | 40% |
| 2 | 266 | 5% |
| 3 | 27 | 0% |
| 4 | 7 | 0% |
| 5 | 2 | 0% |
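The percentages in this table can be recomputed from the counts. The sketch below does so under the assumption that each percentage is taken relative to the total number of sentences (the sum of the count column); the variable and function names are illustrative and do not come from the paper.

```python
# Sentence counts keyed by number of emotion annotations, copied from the table above.
counts = {0: 2460, 1: 1871, 2: 266, 3: 27, 4: 7, 5: 2}

# Assumption: percentages are relative to the total number of sentences.
total_sentences = sum(counts.values())

# Total number of annotations implied by the distribution.
total_annotations = sum(k * n for k, n in counts.items())


def share(n_annotations):
    """Fraction of sentences carrying exactly n_annotations labels."""
    return counts[n_annotations] / total_sentences


# Fraction of sentences with at least one emotion annotation.
annotated_share = 1.0 - share(0)

for k in sorted(counts):
    print(f"{k} annotations: {counts[k]:5d} sentences ({share(k):.1%})")
print(f"sentences: {total_sentences}, annotations: {total_annotations}")
print(f"at least one annotation: {annotated_share:.1%}")
```

Under this assumption the listed percentages correspond to the recomputed values rounded down to a whole percent, which is why the rows for three or more annotations show 0%. The implied annotation total (2522) also equals the sum of the frequencies in the emotion table.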
Emotion annotations, frequency and overall percentage.
| Emotion | Frequency | Percentage |
| --- | --- | --- |
| Instructions | 820 | 32% |
| Hopelessness | 455 | 18% |
| Love | 296 | 11% |
| Information | 295 | 11% |
| Guilt | 208 | 8% |
| Blame | 107 | 4% |
| Thankfulness | 94 | 3% |
| Anger | 69 | 2% |
| Sorrow | 51 | 2% |
| Hopefulness | 47 | 1% |
| Fear | 25 | 0% |
| Happiness/peacefulness | 25 | 0% |
| Pride | 15 | 0% |
| Abuse | 9 | 0% |
| Forgiveness | 6 | 0% |
Baseline system, performance by emotion and overall results.
| Emotion | Precision | Recall | F1 |
| --- | --- | --- | --- |
| NO EMOTION | 0.6319 | 0.6463 | 0.6390 |
| Abuse | 0 | 0 | 0.000 |
| Anger | 0.0377 | 0.0180 | 0.0244 |
| Blame | 0.1406 | 0.1047 | 0.1200 |
| Fear | 0 | 0 | 0.000 |
| Forgiveness | 0 | 0 | 0.000 |
| Guilt | 0.4340 | 0.3977 | 0.4150 |
| Happiness/peacefulness | 0 | 0 | 0.000 |
| Hopefulness | 0.0392 | 0.0303 | 0.0342 |
| Hopelessness | 0.4732 | 0.5187 | 0.4949 |
| Information | 0.4412 | 0.3557 | 0.3939 |
| Instructions | 0.6346 | 0.6855 | 0.6591 |
| Love | 0.6625 | 0.6262 | 0.6439 |
| Pride | 0 | 0 | 0.000 |
| Sorrow | 0 | 0 | 0.000 |
| Thankfulness | 0.4526 | 0.3924 | 0.4203 |
Notes: F1 = 0.3790; PRECISION = 0.3377; RECALL = 0.4318; N = 3225.
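The overall F1 in the notes line is consistent with the harmonic mean of the reported precision and recall. The sketch below checks this; the helper name `f1_score` is illustrative, and returning 0 when precision and recall are both 0 is an assumption about how the all-zero rows (for example Abuse or Pride) were scored.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall.

    Returns 0.0 when both inputs are zero (assumed convention for
    emotions the system never predicted correctly).
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Overall baseline figures from the notes line above.
overall_f1 = f1_score(0.3377, 0.4318)

# NO EMOTION row, assuming the first two columns are precision and recall.
no_emotion_f1 = f1_score(0.6319, 0.6463)

print(f"overall F1:    {overall_f1:.4f}")
print(f"NO EMOTION F1: {no_emotion_f1:.4f}")
```

Small differences in the last digit can appear for other rows because the table lists precision and recall already rounded to four places.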
Full system, performance by emotion and overall results, lift/loss over baseline given in parentheses.
| Emotion | Precision (lift) | Recall (lift) | F1 (lift) |
| --- | --- | --- | --- |
| NO EMOTION | 0.6255 (−0.0064) | 0.7649 (+0.1186) | 0.6882 (+0.0492) |
| Abuse | 0 (0) | 0 (0) | 0 (0) |
| Anger | 0.1212 (+0.0835) | 0.0374 (+0.0194) | 0.0571 (+0.0328) |
| Blame | 0.3117 (+0.1711) | 0.1491 (+0.0444) | 0.2017 (+0.0817) |
| Fear | 0.4 (+0.4) | 0.0571 (+0.0571) | 0.1 (+0.1) |
| Forgiveness | 0 (0) | 0 (0) | 0 (0) |
| Guilt | 0.512 (+0.078) | 0.3798 (−0.0179) | 0.4361 (+0.0211) |
| Happiness/peacefulness | 0 (0) | 0 (0) | 0 (0) |
| Hopefulness | 0.2222 (+0.183) | 0.0606 (+0.0303) | 0.0952 (+0.0611) |
| Hopelessness | 0.5592 (+0.086) | 0.5584 (+0.0397) | 0.5588 (+0.0639) |
| Information | 0.5889 (+0.1477) | 0.4569 (+0.1012) | 0.5146 (+0.1207) |
| Instructions | 0.6931 (+0.0585) | 0.6865 (+0.0011) | 0.6898 (+0.0308) |
| Love | 0.7443 (+0.0818) | 0.6586 (+0.0324) | 0.6988 (+0.055) |
| Pride | 0 (0) | 0 (0) | 0 (0) |
| Sorrow | 0 (0) | 0 (0) | 0 (0) |
| Thankfulness | 0.775 (+0.3224) | 0.4079 (+0.0155) | 0.5345 (+0.1141) |
Notes: F1 = 0.4378; PRECISION = 0.4621; RECALL = 0.4159; N = 2270.
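The parenthesized lift/loss values appear to be simple differences between a system's score and the corresponding baseline score. The sketch below reproduces two rows on that assumption; it also assumes the three columns are precision, recall, and F1 in that order. The dictionary and function names are illustrative.

```python
# (precision, recall, F1) per emotion, copied from the baseline and full-system tables.
# Column order is an assumption; the source rows carry no header.
baseline = {
    "Blame": (0.1406, 0.1047, 0.1200),
    "Hopelessness": (0.4732, 0.5187, 0.4949),
}
full_system = {
    "Blame": (0.3117, 0.1491, 0.2017),
    "Hopelessness": (0.5592, 0.5584, 0.5588),
}


def lift(system_scores, baseline_scores, ndigits=4):
    """Per-metric difference (system minus baseline), rounded for display."""
    return tuple(
        round(s - b, ndigits) for s, b in zip(system_scores, baseline_scores)
    )


lifts = {emotion: lift(full_system[emotion], baseline[emotion]) for emotion in baseline}

for emotion, (d_p, d_r, d_f1) in lifts.items():
    print(f"{emotion}: precision {d_p:+.4f}, recall {d_r:+.4f}, F1 {d_f1:+.4f}")
```

A positive value means the system improved on the baseline for that metric, and a negative value means it did worse.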
Baseline with full set of lexical features, lift/loss over baseline given in parentheses.
| Emotion | Precision (lift) | Recall (lift) | F1 (lift) |
| --- | --- | --- | --- |
| NO EMOTION | 0.6231 (−0.0088) | 0.7511 (+0.1048) | 0.6812 (+0.0421) |
| Abuse | 0 (0) | 0 (0) | 0 (0) |
| Anger | 0.125 (+0.0873) | 0.018 (0) | 0.0315 (+0.0071) |
| Blame | 0.3596 (+0.2189) | 0.1798 (+0.0751) | 0.2397 (+0.1197) |
| Fear | 0 (0) | 0 (0) | 0 (0) |
| Forgiveness | 0 (0) | 0 (0) | 0 (0) |
| Guilt | 0.5357 (+0.1018) | 0.3715 (−0.0262) | 0.4388 (+0.0237) |
| Happiness/peacefulness | 0 (0) | 0 (0) | 0 (0) |
| Hopefulness | 0.2857 (+0.2465) | 0.0656 (+0.0353) | 0.1067 (+0.0725) |
| Hopelessness | 0.5401 (+0.067) | 0.5197 (+0.0009) | 0.5297 (+0.0348) |
| Information | 0.485 (+0.0438) | 0.3973 (+0.0416) | 0.4368 (+0.0429) |
| Instructions | 0.6583 (+0.0237) | 0.7104 (+0.0249) | 0.6834 (+0.0243) |
| Love | 0.7386 (+0.0761) | 0.6185 (−0.0077) | 0.6732 (+0.0294) |
| Pride | 0 (0) | 0 (0) | 0 (0) |
| Sorrow | 0 (0) | 0 (0) | 0 (0) |
| Thankfulness | 0.6744 (+0.2219) | 0.3671 (−0.0253) | 0.4754 (+0.0551) |
Notes: F1 = 0.4217; PRECISION = 0.4341; RECALL = 0.4100; N = 2382.
Baseline system with psycholinguistic features, lift/loss over baseline given in parentheses.
| Emotion | Precision (lift) | Recall (lift) | F1 (lift) |
| --- | --- | --- | --- |
| NO EMOTION | 0.6258 (−0.0061) | 0.7313 (+0.0849) | 0.6744 (+0.0354) |
| Abuse | 0 (0) | 0 (0) | 0 (0) |
| Anger | 0.16 (+0.1223) | 0.0377 (+0.0197) | 0.0611 (+0.0367) |
| Blame | 0.2418 (+0.1011) | 0.1257 (+0.0211) | 0.1654 (+0.0454) |
| Fear | 0.3333 (+0.3333) | 0.1111 (+0.1111) | 0.1667 (+0.1667) |
| Forgiveness | 0 (0) | 0 (0) | 0 (0) |
| Guilt | 0.4772 (+0.0432) | 0.4237 (+0.026) | 0.4488 (+0.0338) |
| Happiness/peacefulness | 0 (0) | 0 (0) | 0 (0) |
| Hopefulness | 0.2143 (+0.1751) | 0.0882 (+0.0579) | 0.125 (+0.0908) |
| Hopelessness | 0.529 (+0.0558) | 0.529 (+0.0103) | 0.529 (+0.0341) |
| Information | 0.5029 (+0.0617) | 0.3772 (+0.0215) | 0.4311 (+0.0372) |
| Instructions | 0.6707 (+0.0361) | 0.6794 (−0.0061) | 0.675 (+0.016) |
| Love | 0.6977 (+0.0351) | 0.6613 (+0.0351) | 0.679 (+0.0351) |
| Pride | 0 (0) | 0 (0) | 0 (0) |
| Sorrow | 0 (0) | 0 (0) | 0 (0) |
| Thankfulness | 0.6588 (+0.2063) | 0.3709 (−0.0215) | 0.4746 (+0.0542) |
Notes: F1 = 0.4140; PRECISION = 0.4060; RECALL = 0.4223; N = 2623.
Baseline system with emotional sequence features, lift/loss over baseline given in parentheses.
| Emotion | Precision (lift) | Recall (lift) | F1 (lift) |
| --- | --- | --- | --- |
| NO EMOTION | 0.628 (−0.0039) | 0.7081 (+0.0618) | 0.6657 (+0.0266) |
| Abuse | 0 (0) | 0 (0) | 0 (0) |
| Anger | 0.0526 (+0.0149) | 0.0196 (+0.0016) | 0.0286 (+0.0042) |
| Blame | 0.2364 (+0.0957) | 0.1512 (+0.0465) | 0.1844 (+0.0644) |
| Fear | 0.25 (+0.25) | 0.0606 (+0.0606) | 0.0976 (+0.0976) |
| Forgiveness | 0 (0) | 0 (0) | 0 (0) |
| Guilt | 0.4465 (+0.0125) | 0.4492 (+0.0515) | 0.4479 (+0.0328) |
| Happiness/peacefulness | 0 (0) | 0 (0) | 0 (0) |
| Hopefulness | 0.3333 (+0.2941) | 0.087 (+0.0567) | 0.1379 (+0.1037) |
| Hopelessness | 0.5179 (+0.0447) | 0.5066 (−0.0122) | 0.5121 (+0.0172) |
| Information | 0.4836 (+0.0424) | 0.4488 (+0.0931) | 0.4655 (+0.0717) |
| Instructions | 0.6761 (+0.0415) | 0.6877 (+0.0022) | 0.6818 (+0.0228) |
| Love | 0.7056 (+0.0431) | 0.6559 (+0.0297) | 0.6799 (+0.036) |
| Pride | 0 (0) | 0 (0) | 0 (0) |
| Sorrow | 0 (0) | 0 (0) | 0 (0) |
| Thankfulness | 0.6598 (+0.2072) | 0.4129 (+0.0205) | 0.5079 (+0.0876) |
Notes: F1 = 0.4184; PRECISION = 0.4438; RECALL = 0.3957; N = 2249.
System performance over test set.
| System | Precision | Recall | F1 | N |
| --- | --- | --- | --- | --- |
| Baseline | 0.4623 | 0.4631 | 0.4627 | 1274 |
| Full system | 0.5000 | 0.4686 | 0.4838 | 1192 |
| Full system + Bootstrap | 0.5114 | 0.4764 | 0.4933 | 1185 |