Spyridon Samothrakis, Maria Fasli.
Abstract
Fiction, a prime form of entertainment, has evolved into multiple genres, which one can broadly attribute to different forms of stories. In this paper, we examine the hypothesis that works of fiction can be characterised by the emotions they portray. To investigate this hypothesis, we use the works of fiction in Project Gutenberg and attribute basic emotional content to each individual sentence using Ekman's model. A time-smoothed version of the emotional content for each basic emotion is used to train extremely randomized trees. We show through 10-fold cross-validation that the emotional content of each work of fiction can help identify its genre with significantly higher probability than random. We also show that the most important differentiator between genre novels is fear.
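The abstract states only that a "time-smoothed" version of the per-sentence emotional content is used as input to the classifier. As a rough illustration of what such a feature could look like, the sketch below smooths a per-sentence score series with a centred moving average and then samples a fixed number of points, so that books of different lengths yield vectors of the same size. The moving average, the window size, the number of sample points and the function name `smooth_and_resample` are all assumptions made for this example, not details taken from the paper.

```python
from typing import List


def smooth_and_resample(scores: List[float], window: int = 5, n_points: int = 10) -> List[float]:
    """Moving-average smooth a per-sentence series, then sample n_points evenly.

    Hypothetical helper: the smoothing method and both parameters are assumptions.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    half = window // 2
    smoothed = []
    for i in range(len(scores)):
        # Centred window, truncated at the start and end of the text.
        lo = max(0, i - half)
        hi = min(len(scores), i + half + 1)
        smoothed.append(sum(scores[lo:hi]) / (hi - lo))
    # Pick n_points positions spread evenly from first to last sentence.
    last = len(smoothed) - 1
    if n_points == 1:
        return [smoothed[last // 2]]
    return [smoothed[round(k * last / (n_points - 1))] for k in range(n_points)]


# Made-up per-sentence "fear" indicators (1 = sentence tagged with fear).
fear = [0, 0, 1, 0, 1, 1, 0, 1, 1, 1, 0, 1]
features = smooth_and_resample(fear, window=3, n_points=4)
print(features)
```

Under this assumed scheme, one such vector would be computed per basic emotion and the vectors concatenated into a single feature row per book.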
Year: 2015 PMID: 26524352 PMCID: PMC4629906 DOI: 10.1371/journal.pone.0141922
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of genres, number of instances found and unique id for each class (i.e., genre).
| Genre | Instances | Class id |
|---|---|---|
| | 648 | 0 |
| | 662 | 1 |
| | 346 | 2 |
| | 108 | 3 |
| | 1252 | 4 |
| | 387 | 5 |
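The class counts are clearly unbalanced, which matters when reading any accuracy figure. The following back-of-the-envelope sketch uses only the counts listed in the genre table to compute the share of each class, the accuracy of always predicting the largest class, and the expected accuracy of guessing labels at random in proportion to class frequency. These values are derived from the listed counts alone and need not coincide exactly with accuracies measured on the actual data set, which may have been computed on a slightly different sample or from a single random draw.

```python
# Instance counts per class id, copied from the genre table.
counts = {0: 648, 1: 662, 2: 346, 3: 108, 4: 1252, 5: 387}

total = sum(counts.values())
shares = {cid: n / total for cid, n in counts.items()}

# Accuracy obtained by always predicting the largest class.
majority_accuracy = max(shares.values())

# Expected accuracy of guessing at random with the class frequencies:
# guess and true label agree on class c with probability p_c * p_c.
stratified_expected = sum(p * p for p in shares.values())

print(f"total instances: {total}")
print(f"majority-class accuracy: {majority_accuracy:.3f}")
print(f"expected stratified accuracy: {stratified_expected:.3f}")
```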
Fig 1. Emotional content for Murder at Bridge, by Anne Austin, of class Mystery.
Fig 2. Emotional content for Frankenstein, by Mary W. Shelley, of class Horror.
Fig 3. Scaled confusion matrix.
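The caption does not say how the confusion matrix is scaled. One common convention, assumed here, is to divide each row by its total so that every row shows the fraction of a true class assigned to each predicted class; this keeps large classes from visually dominating small ones. The counts below are invented for illustration and the helper name `row_normalise` is hypothetical.

```python
from typing import List


def row_normalise(matrix: List[List[int]]) -> List[List[float]]:
    """Divide each row by its sum (rows = true class, columns = predicted class)."""
    scaled = []
    for row in matrix:
        total = sum(row)
        # An empty class (row total of zero) is left as all zeros.
        scaled.append([c / total if total else 0.0 for c in row])
    return scaled


# Invented 3-class example, not data from the paper.
raw_counts = [
    [8, 1, 1],
    [2, 6, 2],
    [0, 5, 5],
]
scaled = row_normalise(raw_counts)
for row in scaled:
    print(" ".join(f"{v:.2f}" for v in row))
```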
Baseline classifiers and extremely random forests accuracy.
Baseline classifiers' performance is on the whole training set.
| Classifier | Accuracy |
|---|---|
| Most Frequent | 0.369558779982 |
| Stratified | 0.223275096239 |
| Extremely Random Forests | 0.579646017699 |
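A comparison of this kind can be sketched with scikit-learn, whose `DummyClassifier` offers `most_frequent` and `stratified` strategies and whose `ExtraTreesClassifier` implements extremely randomized trees. The record does not state which software or hyperparameters were used, so the library choice, the number of trees, the random seeds and the synthetic data below are all assumptions. For simplicity the sketch also cross-validates the baselines, whereas the table reports baseline performance on the whole training set.

```python
import numpy as np
from sklearn.dummy import DummyClassifier
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import StratifiedKFold, cross_val_score

rng = np.random.default_rng(0)

# Synthetic stand-in data (not the paper's): 300 "books", 60 features, 6 classes.
n_books, n_features, n_classes = 300, 60, 6
y = rng.integers(0, n_classes, size=n_books)
X = rng.normal(size=(n_books, n_features))
X[:, :10] += y[:, None]  # make the first ten features informative

# 10-fold cross-validation with class proportions preserved in each fold.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
models = {
    "most_frequent": DummyClassifier(strategy="most_frequent"),
    "stratified": DummyClassifier(strategy="stratified", random_state=0),
    "extra_trees": ExtraTreesClassifier(n_estimators=100, random_state=0),
}
results = {
    name: cross_val_score(model, X, y, cv=cv, scoring="accuracy")
    for name, model in models.items()
}
for name, scores in results.items():
    print(f"{name}: mean accuracy {scores.mean():.3f} over {len(scores)} folds")
```

Because the data are synthetic, the printed accuracies say nothing about the reported results; the sketch only shows how the three rows of such a table could be produced.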
Fig 4. Feature importances, with α = 0.05 confidence interval bars.
Fig 5. Average strength of each emotion among all texts.
Statistical significance for α = 0.05.
Emotion importance matrix.
| | | | | | | |
|---|---|---|---|---|---|---|
| | -0.066 | -0.093 | 0.152 | -0.145 | 0.114 | -0.052 |
| | 0.069 | 0.014 | -0.56 | 0.282 | 0.248 | 0.432 |
| | -0.003 | 0 | 0.261 | 0.051 | 0.158 | -0.082 |
| | 0.017 | 0.064 | 0.27 | -0.094 | 0.055 | -0.182 |
| | -0.114 | 0.082 | 0 | -0.222 | -0.472 | 0.04 |
| | 0.179 | -0.015 | -0.082 | 0.145 | -0.15 | -0.007 |