| Literature DB >> 25977755 |
Annika Labs1, Theresa Reich1, Helene Schulenburg1, Manuel Boennen1, Gehrke Mareike1, Madleen Golz1, Benita Hartigs1, Nico Hoffmann1, Sebastian Keil1, Malú Perlow1, Anne Katrin Peukmann1, Lea Noell Rabe1, Franca-Rosa von Sobbe1, Michael Hanke1,2.
Abstract
Here we present a dataset with a description of portrayed emotions in the movie "Forrest Gump". A total of 12 observers independently annotated emotional episodes regarding their temporal location and duration. The nature of an emotion was characterized with basic attributes, such as arousal and valence, as well as explicit emotion category labels. In addition, annotations include a record of the perceptual evidence for the presence of an emotion. Two variants of the movie were annotated separately: 1) an audio-movie version of Forrest Gump that has been used as a stimulus for the acquisition of a large public functional brain imaging dataset, and 2) the original audio-visual movie. We present reliability and consistency estimates that suggest that both stimuli can be used to study visual and auditory emotion cue processing in real-life like situations. Raw annotations from all observers are publicly released in full in order to maximize their utility for a wide range of applications and possible future extensions. In addition, aggregate time series of inter-observer agreement with respect to particular attributes of portrayed emotions are provided to facilitate adoption of these data.Entities:
Keywords: Audio-visual stimulus; Emotional episodes; Emotional processing cues; Forrest Gump; fMRI
Year: 2015 PMID: 25977755 PMCID: PMC4416536 DOI: 10.12688/f1000research.6230.1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Categories for cues indicating the onset or offset of an emotion.
| Label | Description |
|---|---|
|
|
|
|
|
|
Consolidated character labels.
Any annotation of a portrayed emotion is associated with exactly one of these 36 labels. While main characters, their relatives, and famous characters have individual labels, all remaining characters in the movie are aggregated into generic categories that preserve information on gender, age group, and number.
| Character label | Description | Character label | Description |
|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
List of 22 emotion categories the observer could (optionally) use to further qualify the nature of a portrayed emotion.
| Label | Description |
|---|---|
| ADMIRATION
| Appreciation of another person or object
|
Position of the fMRI stimulus segments with respect to the movie annotation time.
The last column lists the position of the scene boundary used as a reference for the segment transition. All times are in seconds and refer to the full (unsegmented) stimulus, which is a shorted version of the original movie [1].
| # | Start | End | Duration | Boundary |
|---|---|---|---|---|
| 0
| 0.0
| 902.0
| 902.0
| 891.2
|
Figure 1. Annotation frequencies by character and emotion.
The minimum criterion for counting emotion episodes is a 50% inter-observer agreement. Only categories/characters with a minimum of five such episodes are shown. The left panels show the number of episodes for both movie variants; the right panels show the cumulative duration of emotion display across all considered episodes. The top panels show the annotation frequency by movie character; the bottom panels show the corresponding frequencies for emotion categories.
Intra-stimulus inter-observer consistency.
All values are Spearman correlations for 1 Hz modulations of the fraction of IOA (as, for example, depicted in figure 3) with respect to a particular emotion attribute across the entire duration of the movie. The specified range corresponds to the width of the 95% confidence interval of the mean correlation for all possible combinations of partitioning observers into two sub-groups (audio-visual 4 vs. 5; audio-only 1 vs. 2). Higher correlations indicate higher consistency of agreement modulations across observer sub-groups. Correlations higher than 0.5 (arbitrary threshold) are depicted in bold for visualization purposes. The all characters column indicates the agreement for a particular emotion attribute over time irrespective of the annotated character. The columns on the right show the corresponding correlations for the two main characters. n/a fields indicate an insufficient number of annotations to compute the consistency measure.
| All characters | Forrest-only | Jenny-only | ||||
|---|---|---|---|---|---|---|
| Audio-visual | Audio-only | Audio-visual | Audio-only | Audio-visual | Audio-only | |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Figure 3. Inter-observer agreement (IOA) time courses for selected emotion attributes across the full duration of the movie.
Extreme values indicate perfect agreement with respect to presence or absence of a particular emotion attribute at a given time. IOA time series for the two movie types are overlaid on top of each other. For the purpose of visualization, the depicted time series reflects IOA computed using a 10 s segment size — in contrast to 1 s used for all other statistics presented in this paper.
Figure 2. Number of emotion episodes for all distinguished onset cue categories (minimum 50% inter-rater agreement) for both stimulus types.
Figure 4. Intra-stimulus indicator correlation.
The lower triangular matrix depicts the correlations between the IOA time courses for the three primary bipolar emotion attributes (arousal [high/low], valence [pos/neg], and direction [self/other]), the 22 emotion categories, and the six emotion onset indicators for the audio-visual movie. The upper triangular matrix shows the corresponding correlations for the audio-only movie. There were no observations for the emotion “fears confirmed”, or the facial expression onset cue in the audio-only movie. Likewise, there were no observations for the “narrator” onset cue in the audio-visual movie. The corresponding undefined correlations are depicted as zeros.
Inter-stimulus indicator correlation.
All values are Spearman correlations of 1 Hz modulations of the fraction of IOA (as, for example, depicted in figure 3) for the two stimulus types (audio-visual and audio-only movie). The specified range corresponds to the width of the 95% confidence interval of the correlation (computed via Fisher transformation). Correlations higher than 0.5 (arbitrary threshold) are depicted in bold for visualization purposes. n/a fields indicate an insufficient number of annotations to compute the consistency measure.
| All characters | Forrest-only | Jenny-only | |
|---|---|---|---|
|
| 0.474 ±0.018
| 0.278 ±0.021
|
|
|
|
|
|
|
|
|
|
|
|