| Literature DB >> 26407322 |
Hamed R-Tavakoli1, Adham Atyabi2, Antti Rantanen3, Seppo J Laukka3, Samia Nefti-Meziani4, Janne Heikkilä1.
Abstract
Multimedia analysis benefits from understanding the emotional content of a scene in a variety of tasks such as video genre classification and content-based image retrieval. Recently, there has been an increasing interest in applying human bio-signals, particularly eye movements, to recognize the emotional gist of a scene such as its valence. In order to determine the emotional category of images using eye movements, the existing methods often learn a classifier using several features that are extracted from eye movements. Although it has been shown that eye movement is potentially useful for recognition of scene valence, the contribution of each feature is not well-studied. To address the issue, we study the contribution of features extracted from eye movements in the classification of images into pleasant, neutral, and unpleasant categories. We assess ten features and their fusion. The features are histogram of saccade orientation, histogram of saccade slope, histogram of saccade length, histogram of saccade duration, histogram of saccade velocity, histogram of fixation duration, fixation histogram, top-ten salient coordinates, and saliency map. We utilize machine learning approach to analyze the performance of features by learning a support vector machine and exploiting various feature fusion schemes. The experiments reveal that 'saliency map', 'fixation histogram', 'histogram of fixation duration', and 'histogram of saccade slope' are the most contributing features. The selected features signify the influence of fixation information and angular behavior of eye movements in the recognition of the valence of images.Entities:
Mesh:
Year: 2015 PMID: 26407322 PMCID: PMC4583411 DOI: 10.1371/journal.pone.0138198
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Emotional images and their visual classes.
| Image Category | Number of Images | |||
|---|---|---|---|---|
| unpleasant | pleasant | neutral | all | |
| Building | 0 | 0 | 2 | 2 |
| Food | 0 | 2 | 2 | 4 |
| Baby | 0 | 8 | 0 | 8 |
| Rotten | 8 | 0 | 0 | 8 |
| Abstract & Conceptual | 3 | 0 | 6 | 9 |
| Animals | 0 | 11 | 6 | 17 |
| Wild Animals | 8 | 2 | 7 | 17 |
| Nature | 4 | 15 | 7 | 26 |
| Objects | 3 | 0 | 24 | 27 |
| Nude & Porn | 0 | 9 | 19 | 28 |
| Activity | 6 | 7 | 19 | 32 |
| People & Daily Activity | 29 | 24 | 56 | 109 |
| Total | 61 | 78 | 148 | 287 |
The following valence category is obtained by the emotional mean valence values reported by IAPS.
Fig 1Analysis of gender and emotional distribution of images.
As visualized, there is a difference between the ratings of female and male observers. Each image is visualized in terms of its mean valence and standard deviation ratings in regard to the genders.
Fig 2Valence categories and genders.
To show the existence of differences between male and female genders, a fuzzy c-means is run to categorize the images based on their mean valence and standard deviation into three classes of unpleasant, neutral, and pleasant. Comparing gender specific results, the disagreement of genders on valence is evident.
Images used in the study.
| Valence | Image ID |
|---|---|
| Neutral | ‘2020’, ‘2102’, ‘2104’, ‘2130’, ‘2190’, ‘2200’, ‘2271’, ‘2272’, ‘2280’, ‘2210’, ‘2214’, ‘2215’, ‘2220’, ‘2221’, ‘2230’, ‘2305’, ‘2357’, ‘2372’, ‘2383’, ‘2385’, ‘2393’, ‘2396’, ‘2397’, ‘2435’, ‘2441’, ‘2485’, ‘2487’, ‘2491’, ‘2493’, ‘2495’, ‘2499’, ‘2512’, ‘2513’, ‘2516’, ‘2520’, ‘2595’, ‘2635’, ‘2690’, ‘2704’, ‘2749’, ‘2770’, ‘2780’, ‘2795’, ‘2830’, ‘2840’, ‘2870’, ‘7506’ |
| Pleasant | ‘1340’, ‘1999’, ‘2000’, ‘2010’, ‘2037’, ‘2091’, ‘2092’, ‘2154’, ‘2222’, ‘2304’, ‘2339’, ‘2340’, ‘2341’, ‘2358’, ‘2362’, ‘2391’, ‘2501’, ‘2530’, ‘2620’, ‘2650’, ‘4617’, ‘5410’, ‘7325’, ‘8497’ |
| Unpleasant | ‘2095’, ‘2110’, ‘2120’, ‘2141’, ‘2205’, ‘2276’, ‘2278’, ‘2490’, ‘2590’, ‘2691’, ‘2710’, ‘2750’, ‘3500’, ‘3530’, ‘4621’, ‘6243’, ‘6313’, ‘6315’, ‘6360’, ‘6370’, ‘6530’, ‘6550’, ‘6560’, ‘6561’ |
The 95 images from IAPS used in this study and their valence category.
Fig 3Distribution of images across the valence range in each class of unpleasant, neutral, and pleasant.
Fig 4Mean value features versus valence.
From left to right: mean fixation duration (m = 0.012, p = 1.32e-36), mean saccade duration (m = -0.008, p = 1.11e-26), mean saccade length (m = -0.027, p = 1.77e-25), mean saccade slope (m = -0.002, p = 1.49e-35), mean saccade velocity(m = 0.004, p = 0.14), mean saccade orientation (m = -0.017, p = 5.19e-37).
Fig 5Baseline mean performance for mean value features in terms of bookmaker for classification of images into three class of unpleasant, neutral, and pleasant.
The mean of classification accuracy across folds and repetitions (%) and their associated standard errors are also added to each bar as a second measurement unit.
Fig 6Baseline performance of individual histogram-based features in terms of bookmaker for classification of images into three class of unpleasant, neutral, and pleasant.
The mean of classification accuracy across folds and repetitions (%) and their associated standard errors are added to each bar as a second measurement unit.
Fig 7Performance of conventional and evolutionary based decomposition methods.
Mean of classification accuracy across folds and repetitions (%) and their associated standard errors are added to each bar as a second measurement unit.