Nicole Lazzeri, Daniele Mazzei, Alberto Greco, Annalisa Rotesi, Antonio Lanatà, Danilo Emilio De Rossi.
Abstract
Non-verbal signals expressed through body language play a crucial role in multi-modal human communication during social relations. Indeed, in all cultures, facial expressions are the most universal and direct signs of innate emotional cues. A human face conveys important information in social interactions and helps us to better understand our social partners and establish empathic links. Recent research shows that humanoid and social robots are becoming increasingly similar to humans, both esthetically and expressively. However, their visual expressiveness remains a crucial issue that must be improved to make these robots more realistic and intuitively perceivable by humans as not different from them. This study concerns the capability of a humanoid robot to exhibit emotions through facial expressions. More specifically, emotional signs performed by a humanoid robot were compared with the corresponding human facial expressions in terms of recognition rate and response time. The set of stimuli included standardized human expressions taken from an Ekman-based database and the same facial expressions performed by the robot. Furthermore, participants' psychophysiological responses were explored to investigate whether interpreting robot or human emotional stimuli induces differences. Preliminary results show a trend toward better recognition of expressions performed by the robot than of 2D photos or 3D models. Moreover, no significant differences in the subjects' psychophysiological state were found during the discrimination of facial expressions performed by the robot in comparison with the same task performed with 2D photos and 3D models.
Keywords: affective computing; emotion perception; expression recognition; facial expressions; humanoid robot; psychophysiological signals; social robots
Year: 2015 PMID: 26075199 PMCID: PMC4443734 DOI: 10.3389/fbioe.2015.00064
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Figure 1. (A) AU positions mapped on the robot; (B) major facial muscles involved in the facial expressions; and (C) servo motor positions corresponding to the AUs.
Figure 2. 2D photos and 3D models used in the experiment: (A) FACE expressions and (B) human expressions.
Comparison between the action unit configurations of the expressions: the Ekman AUs configuration (first column), the adapted AUs configuration for FACE (second column), and the adapted AUs configuration used in the Bosphorus database (third column).
| Expression | Action units: FACS based on humans | Action units: FACE | Action units: Bosphorus DB |
|---|---|---|---|
| Anger | 4 + 5 + 7 + 22 + 23 + 24 | 4B + 7A + 16E + 25C | 4C + 38A |
| Disgust | 9 + 10 + (25/26) | 4B + 7B + 9D + 10D + 16E + 25C | 1C + 4B + 7B + L10A + R11C + 20C + 25C |
| Fear | 1 + 2 + 4 + 5 + 20 + 26 | 1C + 2C + 4B + 5D + 20B + 26B | 1C + R2C + 5D + 25C + 26C + 38C |
| Happiness | 6 + 12 | 6E + 7B + 12E + 25D | 7B + 10C + 12C + 25D |
| Sadness | 1 + (4) + 15 + (17) | 1D + 2A + 4E + 7B + 15E | 1C + 4B + 7C + 11C |
| Surprise | 1 + 2 + 5 + 25/26 | 1E + 2E + 5D + 12C + 25D + 27D | 1B + 2B + 5B + 25C + 27C |
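The AU strings in the table above can be compared programmatically. The following is a minimal sketch, not the authors' method: the function names `parse_au_config` and `au_overlap` are hypothetical, and the Jaccard overlap is an illustrative choice of similarity measure that does not appear in the source. The parser assumes the plain `+`-separated notation with an optional side prefix (L/R) and an optional FACS intensity letter (A to E); optional or alternative units such as `(4)` or `25/26` are deliberately not handled.

```python
import re

# One coded action unit: optional side (L/R), AU number,
# optional FACS intensity letter (A = trace ... E = maximum).
AU_PATTERN = re.compile(r"^([LR]?)(\d+)([A-E]?)$")


def parse_au_config(config: str) -> dict[int, str]:
    """Parse a string such as '4B + 7A + 16E + 25C' into {4: 'B', 7: 'A', ...}.

    Side prefixes (L/R) are accepted but dropped; a missing intensity
    letter is stored as an empty string.
    """
    units = {}
    for token in config.split("+"):
        token = token.strip()
        match = AU_PATTERN.match(token)
        if match is None:
            raise ValueError(f"unrecognized AU token: {token!r}")
        _side, number, intensity = match.groups()
        units[int(number)] = intensity
    return units


def au_overlap(a: dict[int, str], b: dict[int, str]) -> float:
    """Jaccard overlap of the AU numbers in two configurations (intensity ignored)."""
    union = set(a) | set(b)
    return len(set(a) & set(b)) / len(union) if union else 1.0


# Fear row of the table: FACS reference, FACE, and Bosphorus DB.
fear_facs = parse_au_config("1 + 2 + 4 + 5 + 20 + 26")
fear_face = parse_au_config("1C + 2C + 4B + 5D + 20B + 26B")
fear_bosphorus = parse_au_config("1C + R2C + 5D + 25C + 26C + 38C")

print(au_overlap(fear_facs, fear_face))       # 1.0 (same six AUs)
print(au_overlap(fear_facs, fear_bosphorus))  # 0.5 (4 shared out of 8 distinct AUs)
```

For the fear row, the FACE configuration uses exactly the AUs of the FACS reference, while the Bosphorus configuration shares four of them.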
The set of possible answers for all phases, which includes the 7 basic expressions available in the database and reproduced by the robot (indicated by the * symbol).
| English |
|---|
| Pride |
| Happiness* |
| Embarrassment |
| Neutral* |
| Surprise* |
| Disgust* |
| Pain |
| Pity |
| Contempt |
| Sadness* |
| Interest |
| Shame |
| Fear* |
| Excitement |
| Anger* |
| I do not know |
Italian answers are in bold style, English translation is in normal style.
HRV and SCR features extracted from the subjects’ physiological signals.
| HRV time-domain feature | Description | HRV frequency-domain feature | Description | SCR feature | Description |
|---|---|---|---|---|---|
| Mean RR | Mean inter-beat interval (in ms) | VLF | Very low frequency (in Hz and in ms²) | nSCR | Number of SCRs in the response window |
| STD RR | Standard deviation of the inter-beat intervals (in ms) | LF | Low frequency (in Hz and in ms²) | MAX-phasic | Maximum value of the phasic component |
| RMSSD | Root mean square of the successive differences (in ms) | HF | High frequency (in Hz and in ms²) | Latency | Time interval between the stimulus and the SCR peak |
| NN50 | Number of pairs of successive beat-to-beat intervals (NNs) that differ by more than 50 ms (count) | LF/HF ratio | Ratio between the power of the LF and HF bands, indicating the balance between the sympathetic and parasympathetic systems | AUC-phasic | Area under the phasic curve over time |
| pNN50 | NN50 divided by the total number of NNs (in %) | LF (n.u.) | Ratio between the absolute value of LF and the difference between total power and VLF (in n.u., i.e., normalized units) | Mean-phasic | Mean value of the phasic component |
| HRV tri-index | Total number of all NN intervals divided by the height of the histogram of all NN intervals | HF (n.u.) | Ratio between the absolute value of HF and the difference between total power and VLF (in n.u., i.e., normalized units) | STD-phasic | SD of the phasic component |
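The time-domain definitions in the table translate directly into a few lines of code. The sketch below is an illustration under stated assumptions rather than the processing pipeline used in the study: the function names are hypothetical, the input is assumed to be a clean list of successive RR intervals in milliseconds, STD RR uses the sample standard deviation, and pNN50 follows the table's wording (division by the number of NN intervals), whereas some tools divide by the number of successive differences instead.

```python
import math


def hrv_time_domain(rr_ms: list[float]) -> dict[str, float]:
    """Time-domain HRV features from successive RR (inter-beat) intervals in ms."""
    if len(rr_ms) < 2:
        raise ValueError("need at least two RR intervals")
    n = len(rr_ms)
    mean_rr = sum(rr_ms) / n
    # Sample standard deviation (n - 1 in the denominator); an assumption.
    std_rr = math.sqrt(sum((x - mean_rr) ** 2 for x in rr_ms) / (n - 1))
    # Differences between each interval and the one that follows it.
    diffs = [b - a for a, b in zip(rr_ms, rr_ms[1:])]
    rmssd = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    nn50 = sum(1 for d in diffs if abs(d) > 50)
    # Table wording: NN50 divided by the total number of NNs, in percent.
    pnn50 = 100.0 * nn50 / n
    return {
        "mean_rr": mean_rr,
        "std_rr": std_rr,
        "rmssd": rmssd,
        "nn50": nn50,
        "pnn50": pnn50,
    }


def normalized_power(band: float, total: float, vlf: float) -> float:
    """Band power in normalized units: band / (total power - VLF)."""
    return band / (total - vlf)


# Hypothetical RR series, for illustration only (not study data).
features = hrv_time_domain([800, 810, 790, 860, 800])
print(features["mean_rr"], features["nn50"], features["pnn50"])  # 812.0 2 40.0
```

The frequency-domain band powers (VLF, LF, HF) require a spectral estimate of the RR series, which is outside the scope of this sketch; `normalized_power` only shows the normalization step given those powers.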
Confusion matrix (.
| Answer | Human 2D: A | Human 2D: D | Human 2D: F | Human 2D: N | Human 2D: Sa | Human 2D: Su | Human 3D: A | Human 3D: D | Human 3D: F | Human 3D: N | Human 3D: Sa | Human 3D: Su | Robot: A | Robot: D | Robot: F | Robot: Sa | Robot: Su |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anger | 13.3 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | ||
| Disgust | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 20.0 | 6.7 | 0.0 | 0.0 | 0.0 | 13.3 | 13.3 | 0.0 | 6.7 | ||
| Fear | 0.0 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | 6.7 | 20.0 | 26.7 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 20.0 | |
| Neutral | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | / | / | / | / | / | ||
| Sadness | 0.0 | 13.3 | 0.0 | 6.7 | 20.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 33.3 | 0.0 | |
| Surprise | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | |||||
| Pride | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 |
| Embarrass. | 0.0 | 6.7 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 13.3 | 6.7 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 |
| Pain | 6.7 | 26.7 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 20.0 | 0.0 | 13.3 | 0.0 | 0.0 | 13.3 | 0.0 | |
| Pity | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | ||
| Contempt | 20.0 | 0.0 | 0.0 | 6.7 | 6.7 | 0.0 | 6.7 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 20.0 | 20.0 | 0.0 | 0.0 | 0.0 |
| Interest | 20.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Shame | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Excitement | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| I do not know | 0.0 | 6.7 | 0.0 | 13.3 | 0.0 | 20.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | |
| No answer | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 |
The highest values are set in bold.
The column labels are A, anger; D, disgust; F, fear; N, neutral; Sa, sadness; Su, surprise.
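A confusion matrix of this kind can be built by counting, for each shown expression, how often each answer was given and converting the counts to percentages. The sketch below is illustrative only: `confusion_percentages` is a hypothetical helper and the response list is invented sample data, not data from the study. With 15 subjects, a single response corresponds to about 6.7%, which is consistent with the values in the table being multiples of 6.7.

```python
from collections import Counter


def confusion_percentages(responses):
    """Build {shown_expression: {answer: percentage}} from (shown, answer) pairs.

    Each pair is one subject's answer to one stimulus; percentages are
    rounded to one decimal place.
    """
    counts = {}
    for shown, answer in responses:
        counts.setdefault(shown, Counter())[answer] += 1
    matrix = {}
    for shown, answers in counts.items():
        total = sum(answers.values())
        matrix[shown] = {a: round(100.0 * c / total, 1) for a, c in answers.items()}
    return matrix


# Hypothetical answers of 15 subjects to one "anger" stimulus (not study data).
responses = (
    [("anger", "anger")] * 9
    + [("anger", "contempt")] * 3
    + [("anger", "interest")] * 2
    + [("anger", "pride")]
)
matrix = confusion_percentages(responses)
print(matrix["anger"])
# {'anger': 60.0, 'contempt': 20.0, 'interest': 13.3, 'pride': 6.7}
```

In the table, shown expressions are the columns and answers are the rows, so `matrix["anger"]` corresponds to one column.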
Figure 3. (A) Recognition rates (in percentage) and (B) response time (in seconds) of human 2D photos, human 3D models, and robot FACE expressions.
Means and SDs of the response time (in seconds) of 15 subjects in recognizing the facial expressions of human 2D photos, human 3D models, and the robot FACE.
| Expression | Human 2D: mean (s) | Human 2D: SD (s) | Human 3D: mean (s) | Human 3D: SD (s) | Robot: mean (s) | Robot: SD (s) |
|---|---|---|---|---|---|---|
| Anger | 4.09 | 0.60 | 8.42 | 5.71 | 8.53 | 4.43 |
| Disgust | 7.42 | 4.26 | 10.79 | 3.30 | 10.55 | 5.78 |
| Fear | 9.71 | 6.90 | 11.01 | 1.61 | 9.68 | 3.87 |
| Sadness | 15.40 | 8.76 | 9.78 | 5.45 | 16.02 | 0.96 |
| Surprise | 6.81 | 5.48 | 8.39 | 3.83 | 9.31 | 4.20 |
Confusion matrix (.
| Answer | FACE 2D: A | FACE 2D: D | FACE 2D: F | FACE 2D: N | FACE 2D: Sa | FACE 2D: Su | FACE 3D: A | FACE 3D: D | FACE 3D: F | FACE 3D: N | FACE 3D: Sa | FACE 3D: Su | Robot: A | Robot: D | Robot: F | Robot: Sa | Robot: Su |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Anger | 13.3 | 6.7 | 6.7 | 0.0 | 0.0 | 0.0 | 13.3 | 20.0 | 0.0 | 0.0 | 0.0 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | |
| Disgust | 33.3 | 13.3 | 0.0 | 0.0 | 13.3 | 26.7 | 33.3 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 13.3 | 0.0 | 6.7 | ||
| Fear | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 20.0 | |||
| Neutral | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | / | / | / | / | / | ||
| Sadness | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 33.3 | 0.0 | ||
| Surprise | 6.7 | 0.0 | 6.7 | 6.7 | 0.0 | 6.7 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | |||
| Pride | 0.0 | 0.0 | 0.0 | 20.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 |
| Embarrass. | 0.0 | 0.0 | 13.3 | 0.0 | 6.7 | 6.7 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 13.3 | 0.0 | 0.0 |
| Pain | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 13.3 | 0.0 | 13.3 | 0.0 | 0.0 | 13.3 | 0.0 |
| Pity | 0.0 | 0.0 | 6.7 | 0.0 | 6.7 | 0.0 | 0.0 | 26.7 | 0.0 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | ||
| Contempt | 13.3 | 6.7 | 6.7 | 6.7 | 6.7 | 0.0 | 6.7 | 0.0 | 0.0 | 20.0 | 20.0 | 0.0 | 0.0 | 0.0 | |||
| Interest | 0.0 | 0.0 | 0.0 | 20.0 | 6.7 | 0.0 | 13.3 | 0.0 | 0.0 | 26.7 | 6.7 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Shame | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| Excitement | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| I do not know | 6.7 | 6.7 | 6.7 | 6.7 | 0.0 | 6.7 | 0.0 | 0.0 | 6.7 | 6.7 | 26.7 | 13.3 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 |
| No answer | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 6.7 | 0.0 | 0.0 | 0.0 | 0.0 |
The highest values are set in bold.
The column labels are A, anger; D, disgust; F, fear; N, neutral; Sa, sadness; Su, surprise.
Figure 4. (A) Recognition rates (in percentage) and (B) response time (in seconds) of robot 2D photos, robot 3D models, and robot FACE expressions.
Means and SDs of the response time (in seconds) of 15 subjects in recognizing the facial expressions of robot 2D photos, robot 3D models, and the physical robot.
| Expression | FACE 2D: mean (s) | FACE 2D: SD (s) | FACE 3D: mean (s) | FACE 3D: SD (s) | Robot: mean (s) | Robot: SD (s) |
|---|---|---|---|---|---|---|
| Anger | 16.49 | 11.20 | 12.60 | 9.09 | 8.53 | 4.43 |
| Disgust | 9.80 | 9.31 | 7.46 | 2.98 | 10.55 | 5.78 |
| Fear | 7.75 | 5.90 | 10.25 | 8.01 | 9.68 | 3.87 |
| Sadness | 9.19 | 5.71 | 9.12 | 6.22 | 16.02 | 0.96 |
| Surprise | 12.86 | 3.10 | 10.28 | 5.83 | 9.31 | 4.20 |
Figure 5. (A) Recognition rates (in percentage) and (B) response time (in seconds) of positive/negative human expressions.
Figure 6. (A) Recognition rates (in percentage) and (B) response time (in seconds) of positive/negative robot expressions.
Figure 7. Recognition rates (in percentage) of the FACE expressions.
Figure 8. Statistical analysis of two features extracted from HRV and SCR during the interpretation task based on human 2D photos, human 3D models, and the physical robot. (A) HRV results: example of an intra-subject (subject 1) and an inter-subject statistical analysis result. The mean RR feature represents the mean value of the RR interval (ms). (B) SCR results: example of an intra-subject (subject 1) and an inter-subject statistical analysis result. The mean-phasic feature represents the mean value of the SCR signal (μS).