| Literature DB >> 22879773 |
Azadeh Nikfarjam1, Ehsan Emadzadeh, Graciela Gonzalez.
Abstract
The reasons that drive someone to commit suicide are complex and their study has attracted the attention of scientists in different domains. Analyzing this phenomenon could significantly improve the preventive efforts. In this paper we present a method for sentiment analysis of suicide notes submitted to the i2b2/VA/Cincinnati Shared Task 2011. In this task the sentences of 900 suicide notes were labeled with the possible emotions that they reflect. In order to label the sentence with emotions, we propose a hybrid approach which utilizes both rule based and machine learning techniques. To solve the multi class problem a rule-based engine and an SVM model is used for each category. A set of syntactic and semantic features are selected for each sentence to build the rules and train the classifier. The rules are generated manually based on a set of lexical and emotional clues. We propose a new approach to extract the sentence's clauses and constitutive grammatical elements and to use them in syntactic and semantic feature generation. The method utilizes a novel method to measure the polarity of the sentence based on the extracted grammatical elements, reaching precision of 41.79 with recall of 55.03 for an f-measure of 47.50. The overall mean f-measure of all submissions was 48.75% with a standard deviation of 7%.Entities:
Keywords: NLP; emotion classification; machine learning; polarity measurement; sentiment analysis
Year: 2012 PMID: 22879773 PMCID: PMC3409484 DOI: 10.4137/BII.S8981
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Figure 1.The overall system architecture (A) includes pre-processing steps and an Emotion Detector for each emotion (Emotion Detector 1… Emotion Detector 15), with an output of 1 (present) or 0 (not present) for each emotion. Each detector (B) consists of a rule engine and an SVM classifier. If a sentence is not covered by any of the rules for the emotion, it is passed to the classifier for a final decision.
Examples of emotional clues.
| If the continuation of life appears to be impossible for the person. | I can’t go on living like this. | |
| If the person describes himself/herself with a characteristic showing inadequateness | ||
| If the person ask forgiveness from others. | Please forgive me. | |
| If the person blame her/himself for doing or being bad with others or doing unpleasant actions to others. | I am such an anxiety to you. | |
| If the person blames other people for their current emotional state or actions they took. | You have killed everything in my soul. | |
| If the person describe him/herself happy and the polarity of the sentence is positive. | I am so happy to go. |
Examples of “hopelessness” clues and the corresponding rule.
| If the person express weakness in life. | If is_first_person(subj) & verb_aux is a member of [can, could, ca] & is_negated(verb) & verb_tense = = presnet & verb polarity > 0 then hopelessness exists. |
| If the person feels the continuation of life is impossible. | If is_first_person(subj) & verb is a member of [go on, continue, bear,…] & is_negated(verb) then hopelessness exists. |
Word polarity calculation.
| //The initial polarities are taken from Subjectivity Lexicon |
| //List of the modifiers(adjectives and adverbs) of the word are extracted by the system |
| |
| |
| |
| //The polarity of the non polar words are changed to their modifiers’ polarity |
| |
| //The polarity of a negative word is intensified by its modifier polarity eg. “truly sorry” |
| |
| |
| //The polarity of the positive words are increased by positive modifiers |
| |
| |
| //The polarity of the positive words is changed to negative by negative modifiers while it is intensified |
| |
| |
System result on test set for different experiments; the best micro-average f-measure achieved while using a combination of rule-based and machine learning for limited categories (Rule + ML1).
| Machine Learning (ML) only | 60.11 | 32.23 | 41.96 |
| Rule-based | 43.85 | 48.27 | 45.95 |
| Rule + ML2(ML used for all emotions) | 43.84 | 51.49 | 47.36 |
| Rule + ML1(ML used for selected emotions) |