| Literature DB >> 26205268 |
Jong-Suk Choi1, Jae Won Bang2, Hwan Heo3, Kang Ryoung Park4.
Abstract
Most previous research into emotion recognition used either a single modality or multiple modalities of physiological signal. However, the former method allows for limited enhancement of accuracy, and the latter has the disadvantages that its performance can be affected by head or body movements. Further, the latter causes inconvenience to the user due to the sensors attached to the body. Among various emotions, the accurate evaluation of fear is crucial in many applications, such as criminal psychology, intelligent surveillance systems and the objective evaluation of horror movies. Therefore, we propose a new method for evaluating fear based on nonintrusive measurements obtained using multiple sensors. Experimental results based on the t-test, the effect size and the sum of all of the correlation values with other modalities showed that facial temperature and subjective evaluation are more reliable than electroencephalogram (EEG) and eye blinking rate for the evaluation of fear.Entities:
Keywords: facial temperature; fear; nonintrusive multimodal measurement; subjective evaluation
Mesh:
Year: 2015 PMID: 26205268 PMCID: PMC4541947 DOI: 10.3390/s150717507
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Comparison between the previous methods and the proposed method of emotion recognition.
| Category | Method | Advantages | Disadvantage | |
|---|---|---|---|---|
| Using a single modality | Visible-light camera-based methods [ | User’s emotion is recognized based on facial expression in an image. | - Providing comfort to the user without the attachment of sensors. | - Analyzing emotion is difficult if the person has no expression. |
| Thermal camera-based methods [ | Measuring the change of facial temperature according to emotion. | - Providing comfort to the user without the attachment of sensors. | - More expensive method than visible-light camera-based method. | |
| Voice-based methods [ | Measuring the change of voice features according to emotion. | Less expensive method than bio-signal or thermal camera-based method. | - The performance can be affected by surrounding noises. | |
| Physiological signal-based methods [ | ECG [ | High accuracy of emotion detection and fast speed of data acquisition. | - More discomfort to the user because sensors are attached to the body. | |
| Using multiple modalities | Multiple physiological signal-based methods [ | - Using EEG, heart rate, SC, respiration, ST and psychometrical ratings [ | Higher accuracy of emotion detection compared to single modality-based methods. | - More discomfort to the user because many sensors are attached to the body. |
| Hybrid method using both physiological signals and non-intrusive camera-based methods (proposed method). | Using facial temperature, EEG, blinking rate and subjective evaluation for evaluating fear. | - Higher accuracy of emotion evaluation compared to single modality-based methods. | Larger amount of data to be processed after acquiring the image sequences by dual cameras. | |
Figure 1Flowchart of the experimental procedure of our research (BR is blinking rate and FT is facial temperature).
Figure 2Proposed system for evaluating fear.
Figure 3Dual (visible-light and thermal) cameras used in our method and their images.
Figure 4Commercial EEG device and locations of 16 electrodes based on the international 10–20 system. (a) Emotiv EPOC headset; (b) positions of 16 electrodes.
Figure 5Four corresponding (calibration) pairs of points produced by four NIR illuminators to obtain the geometric transform matrix and an example for measuring calibration accuracy. (a) Four pairs of corresponding (calibration) points; (b) pair of points for measuring calibration accuracy.
Figure 6(a) Detected face and facial feature regions in the visible-light image; (b) mapped face and facial feature regions in the thermal image after geometric transformation.
Figure 7Example of defined ROIs used to measure the change of facial temperature.
Figure 8Comparison of delta and beta waves before and after watching a horror movie. (a) Change in delta waves; (b) change in beta waves; (c) change in the ratio of delta to beta waves.
Figure 9Example of detecting pupil regions using sub-block-based template matching.
Figure 10Example of determining whether eyes are open and closed. (a) Open eyes; (b) closed eyes.
Figure 11Experiment for measuring the accuracy of the geometric transform. The top and bottom figures of (a–c) are images from the visible-light and thermal cameras, respectively. The NIR illuminator is positioned at example positions: (a) Position 1, (b) Position 5 and (c) Position 9.
Accuracy of geometric transform between visible-light and thermal images.
| Ground Truth Position | Calculated Position (by Geometric Transform Matrix) | RMS Error (Pixels) | |||
|---|---|---|---|---|---|
| Position | X | Y | X | Y | |
| 1 | 62 | 57 | 63 | 58 | 1.41 |
| 2 | 155 | 57 | 156 | 59 | 2.24 |
| 3 | 249 | 61 | 249 | 63 | 2 |
| 4 | 68 | 112 | 67 | 112 | 1 |
| 5 | 155 | 112 | 154 | 112 | 1 |
| 6 | 239 | 111 | 239 | 113 | 2 |
| 7 | 67 | 176 | 67 | 175 | 1 |
| 8 | 158 | 178 | 158 | 178 | 0 |
| 9 | 249 | 179 | 249 | 179 | 0 |
| Average | 1.18 | ||||
Figure 12Experimental procedure for measuring fear (BR is blinking rate, FT is facial temperature and SE means subjective evaluation).
Contents of the questionnaire for the subjective test.
| Questions for Subjective Test |
|---|
| I am having difficulty seeing |
| I am scared |
| I have a headache |
| I am anxious |
| I feel unpleasant |
Figure 13Comparison of subjective evaluation scores before and after watching the horror movie.
Average value and standard deviation of subjective evaluation scores.
| Before Watching the Horror Movie | After Watching the Horror Movie | |
|---|---|---|
| Average | 1.175 | 4.113 |
| Standard deviation | 0.272 | 2.208 |
Figure 14Comparisons of FTs of facial feature regions before and after watching the horror movie (FT is facial temperature).
Average, standard deviation and p-value of the facial temperature for each facial feature region.
| Region | Average of All Regions | Middle of the Forehead | Left Eye | Right Eye | ||||
|---|---|---|---|---|---|---|---|---|
| Before | After | Before | After | Before | After | Before | After | |
| Average | 15,111.08 | 15,039.47 | 15,101.48 | 15,038.56 | 15,119.34 | 15,050.73 | 15,112.23 | 15,039.24 |
| Standard deviation | 46.77801 | 47.57241 | 53.48507 | 56.4484 | 57.7577 | 45.40563 | 53.06459 | 48.92598 |
| 0.00017 | 0.00295 | 0.00085 | 0.00034 | |||||
| Region | Left cheek | Right cheek | ||||||
| Before | After | Before | After | |||||
| Average | 15,105.64 | 15,031.03 | 15,116.71 | 15,037.77 | ||||
| Standard deviation | 53.81195 | 55.63021 | 38.96623 | 53.48883 | ||||
| 0.00057 | 0.00006 | |||||||
Figure 15Comparisons of eye blinking rate before watching the horror movie and in the last 1 min of watching the movie (BR is blinking rate).
Average values and standard deviations of the eye blinking rate.
| Before Watching the Horror Movie | Last 1 Min While Watching the Horror Movie | |
|---|---|---|
| Average | 23.75 | 25.88 |
| Standard deviation | 14.45 | 11.92 |
Figure 16Ratios of delta band to beta band of EEG data before and after watching the horror movie.
Average, standard deviation and p-value of the EEG signal for all electrodes.
| Electrode | AF3 | AF4 | F3 | F4 | ||||
|---|---|---|---|---|---|---|---|---|
| Before | After | Before | After | Before | After | Before | After | |
| Average | 1.2417 | 1.1899 | 1.4300 | 1.2577 | 1.0548 | 1.0091 | 1.0962 | 0.9638 |
| Standard deviation | 0.3031 | 0.4990 | 0.5296 | 0.4095 | 0.3325 | 0.4124 | 0.2735 | 0.3069 |
| 0.7256 | 0.3120 | 0.7324 | 0.2074 | |||||
| Electrode | F7 | F8 | FC5 | FC6 | ||||
| Before | After | Before | After | Before | After | Before | After | |
| Average | 1.2765 | 1.2769 | 1.3056 | 1.1483 | 1.1958 | 1.0737 | 1.1592 | 1.0113 |
| Standard deviation | 0.3760 | 0.2711 | 0.3097 | 0.4150 | 0.4526 | 0.4058 | 0.3546 | 0.4092 |
| 0.9974 | 0.2344 | 0.4281 | 0.2836 | |||||
| Electrode | O1 | O2 | P7 | P8 | ||||
| Before | After | Before | After | Before | After | Before | After | |
| Average | 1.1204 | 0.9422 | 1.1249 | 1.0264 | 1.2184 | 1.0587 | 1.2211 | 1.0875 |
| Standard deviation | 0.3214 | 0.3023 | 0.4222 | 0.4904 | 0.3464 | 0.3433 | 0.4367 | 0.3492 |
| 0.1166 | 0.5473 | 0.2003 | 0.3473 | |||||
| Electrode | T7 | T8 | ||||||
| Before | After | Before | After | |||||
| Average | 1.2337 | 1.1158 | 1.2095 | 1.0458 | ||||
| Standard deviation | 0.5250 | 0.7247 | 0.3232 | 0.4044 | ||||
| 0.6026 | 0.2160 | |||||||
Calculated value of Cohen’s d before and after watching the horror movie (in the case of eye blinking rate, the calculated value of Cohen’s d is based on a comparison before and in the last 1 min while watching the horror movie, as shown in Figure 12).
| Cohen’s | Effect Size | |
|---|---|---|
| Eye blinking rate | 0.1605 | Small |
| EEG | 0.5713 | Medium |
| Subjective evaluation | 1.8675 | Large |
| Facial temperature | 1.6868 | Large |
Gradient, R2 and correlation values between two modalities.
| Gradient | R2 | Correlation | |
|---|---|---|---|
| EEG | −0.0063 | 0.00004 | −0.0061 |
| EEG | 0.5965 | 0.3113 | 0.5579 |
| EEG | 0.1139 | 0.0085 | 0.0921 |
| Blinking rate | 0.1329 | 0.0166 | 0.1289 |
| Blinking rate | 0.5952 | 0.2491 | 0.4991 |
| Facial temperature | 0.5765 | 0.2486 | 0.4986 |
Confusion matrix and the sum of correlation values between modalities.
| EEG | Blinking Rate | Facial Temperature | Subjective Evaluation | The Sum of All of the Correlation Values with Other Modalities | |
|---|---|---|---|---|---|
| EEG | 1 | −0.0061 | 0.5579 | 0.0921 | 0.6439 |
| Blinking rate | −0.0061 | 1 | 0.1289 | 0.4991 | 0.6219 |
| Facial temperature | 0.5579 | 0.1289 | 1 | 0.4986 | 1.1854 |
| Subjective evaluation | 0.0921 | 0.4991 | 0.4986 | 1 | 1.0898 |
Figure 17Comparison of subjective evaluation scores before and after watching the video clip of emotionally-neutral content to the subjects.
Figure 18Comparisons of the facial temperature of facial feature regions before and after watching the video clip of emotionally-neutral content to the subjects (FT is facial temperature).
Figure 19Comparisons of eye blinking rate before and in the last 1 min of watching the video clip of emotionally-neutral content to the subjects (BR is blinking rate).
Figure 20Ratios of delta band to beta band of EEG data before and after watching the video clip of emotionally-neutral content to the subjects.