| Literature DB >> 22879767 |
Aleksandar Kovačević1, Azad Dehghan, John A Keane, Goran Nenadic.
Abstract
We describe and evaluate an automated approach used as part of the i2b2 2011 challenge to identify and categorise statements in suicide notes into one of 15 topics, including Love, Guilt, Thankfulness, Hopelessness and Instructions. The approach combines a set of lexico-syntactic rules with a set of models derived by machine learning from a training dataset. The machine learning models rely on named entities, lexical, lexico-semantic and presentation features, as well as the rules that are applicable to a given statement. On a testing set of 300 suicide notes, the approach showed the overall best micro F-measure of up to 53.36%. The best precision achieved was 67.17% when only rules are used, whereas best recall of 50.57% was with integrated rules and machine learning. While some topics (eg, Sorrow, Anger, Blame) prove challenging, the performance for relatively frequent (eg, Love) and well-scoped categories (eg, Thankfulness) was comparatively higher (precision between 68% and 79%), suggesting that automated text mining approaches can be effective in topic categorisation of suicide notes.Entities:
Keywords: sentiment mining; suicide notes; text classification; text mining
Year: 2012 PMID: 22879767 PMCID: PMC3409492 DOI: 10.4137/BII.S8978
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Figure 1.System architecture.
Number of rules and dictionary entries per category.
| 25 | 61 | |
| 21 | 40 | |
| 8 | 0 | |
| 9 | 14 | |
| 21 | 30 | |
| 15 | 0 | |
| 6 | 13 | |
| 9 | 0 | |
| 6 | 0 | |
| 4 | 0 | |
| 5 | 0 | |
| 4 | 0 | |
| 4 | 0 | |
| 1 | 0 | |
| 2 | 11 |
Per category performance on the training data.
| 820 | 0.6093 | 0.6714 | 0.8120 | 0.5634 | 0.6652 | 0.6280 | 0.6772 | |||
| 455 | 0.6465 | 0.5626 | 0.6016 | 0.3912 | 0.5298 | 0.6639 | 0.5253 | 0.5865 | ||
| 296 | 0.6916 | 0.7196 | 0.7946 | 0.7208 | ||||||
| 208 | 0.5349 | 0.5529 | 0.5437 | 0.6333 | 0.3654 | 0.4634 | 0.5954 | 0.4952 | 0.5407 | |
| 295 | 0.3085 | 0.4375 | 0.7561 | 0.3153 | 0.4450 | 0.4256 | 0.5525 | 0.4808 | ||
| 107 | 0.8108 | 0.2804 | 0.4167 | 0.8108 | 0.2804 | 0.4167 | 0.8108 | 0.2804 | 0.4167 | |
| 94 | ||||||||||
| 47 | 0.4595 | 0.3617 | 0.4048 | 0.4595 | 0.3617 | 0.4048 | 0.4595 | 0.3617 | 0.4048 | |
| 51 | 0.4800 | 0.2353 | 0.3158 | 0.5000 | 0.2157 | 0.3014 | 0.4800 | 0.2353 | 0.3158 | |
| 69 | 0.8571 | 0.0870 | 0.1579 | 0.8571 | 0.0870 | 0.1579 | 0.8571 | 0.0870 | 0.1579 | |
| 25 | 0.3200 | 0.4444 | 0.7273 | 0.3200 | 0.4444 | 0.7273 | 0.3200 | 0.4444 | ||
| 25 | 0.5000 | 0.4400 | 0.4681 | 0.5000 | 0.4400 | 0.4681 | 0.5000 | 0.4400 | 0.4681 | |
| 15 | 0.5000 | 0.3333 | 0.4000 | 0.5000 | 0.3333 | 0.4000 | 0.5000 | 0.3333 | 0.4000 | |
| 6 | ||||||||||
| 9 | 0.5000 | 0.4444 | 0.4706 | 0.5000 | 0.4444 | 0.4706 | 0.5000 | 0.4444 | 0.4706 | |
| 0.6066 | 0.4372 | 0.4923 | 0.7031 | 0.3988 | 0.4907 | 0.6003 | 0.4371 | 0.4944 | ||
| 0.6339 | 0.5686 | 0.5995 | 0.7727 | 0.44370 | 0.5637 | 0.6477 | 0.5432 | 0.5909 | ||
Note: Frequency represents the number of lines in the training dataset.
Abbreviations: P, precision; R, recall; F, F-measure.
Micro-averaged results on the test data.
| Run 1 | 0.5661 | ||
| Run 2 | 0.3797 | 0.4851 | |
| Run 3 | 0.5900 | 0.4764 | 0.5271 |
Abbreviations: P, precision; R, recall; F, F-measure.
Per-category performance on the test data.
| 382 | 0.5509 | 0.6649 | 0.6026 | 0.7076 | 0.4372 | 0.5405 | 0.5692 | 0.6571 | 0.6100 | |
| 229 | 0.5775 | 0.5371 | 0.5566 | 0.7154 | 0.3843 | 0.5000 | 0.5950 | 0.5197 | 0.5548 | |
| 201 | ||||||||||
| 117 | 0.4857 | 0.4359 | 0.4595 | 0.6230 | 0.3248 | 0.4270 | 0.5294 | 0.3846 | 0.4455 | |
| 104 | 0.5000 | 0.2115 | 0.2973 | 0.5000 | 0.2115 | 0.2973 | 0.5000 | 0.2115 | 0.2973 | |
| 45 | 0.2381 | 0.1111 | 0.1515 | 0.2381 | 0.1111 | 0.1515 | 0.2381 | 0.1111 | 0.1515 | |
| 45 | ||||||||||
| 38 | 0.3077 | 0.1053 | 0.1569 | 0.3077 | 0.1053 | 0.1569 | 0.3077 | 0.1053 | 0.1569 | |
| 34 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | |
| 26 | 0.5000 | 0.0385 | 0.0714 | 0.5000 | 0.0385 | 0.0714 | 0.5000 | 0.0385 | 0.0714 | |
| 16 | ||||||||||
| 13 | 0.3636 | 0.3077 | 0.3333 | 0.3636 | 0.3077 | 0.3333 | 0.3636 | 0.3077 | 0.3333 | |
| 9 | 0.6667 | 0.2222 | 0.3333 | 0.6667 | 0.2222 | 0.3333 | 0.6667 | 0.2222 | 0.3333 | |
| 8 | 0.1250 | 0.2222 | 1.0000 | 0.1250 | 0.2222 | 1.0000 | 0.1250 | 0.2222 | ||
| 5 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | 0.0000 | |
| 0.4859 | 0.3064 | 0.3414 | 0.5223 | 0.2664 | 0.3301 | 0.4980 | 0.2940 | 0.3394 | ||
Note: Frequency represents the number of lines in the test dataset.
Abbreviations: P, precision; R, recall; F, F-measure.
Examples of FPs and FNs for Guilt.
| No annotation | ||
| No annotation | ||
| No annotation | ||
Examples of confusion between Instructions and Information.
Example FPs and FNs for the Blame category.
| No annotation | ||
Example of FPs and FNs for the Sorrow and Anger categories.
| No annotation | ||
| No annotation | ||
| No annotation | ||
| No annotation |